Software


  • Re-Tuning: a new method that enables large language models to solve compositional tasks.


  • Social: a new challenge dataset for testing how well large language models understand social norms.


  • STEM: a new challenge dataset for testing the STEM skills of neural models.



  • Zero-Shot Robustness: a comprehensive evaluation of the robustness of zero-shot multimodal models.



  • AgentInstruct: improves the zero-shot reasoning abilities of large language models on general language understanding tasks.



  • DeepStruct: a state-of-the-art pretrained language model for structure prediction.



  • DeepEx: a state-of-the-art zero-shot information extractor.



  • LASS: a state-of-the-art knowledge graph completion model.



  • PALT: a state-of-the-art parameter-lite transfer learning method for knowledge graph completion.



  • CodeSyntax: a large-scale benchmark for code syntax understanding.



  • Transformer on a Diet: a lightweight Transformer.



  • Language Models with Transformers: an approach that ensembles RNNs and Transformers.



  • TextHIN: semantic parsing over knowledge graphs.