Stars
Train transformer language models with reinforcement learning.
Code and data for "Lost in the Middle: How Language Models Use Long Contexts"
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.
Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"
[EMNLP 2024 Findings] Official Implementation of Full-Scale Matryoshka Representation Learning for Multimodal Recommendation (fMRLRec).
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
A library for mechanistic interpretability of GPT-style language models
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
How to think step-by-step: A mechanistic understanding of chain-of-thought reasoning
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
Evaluating the Ripple Effects of Knowledge Editing in Language Models
[WSDM 2024] Official PyTorch Implementation of Linear Recurrent Units for Sequential Recommendation (LRURec)
Large Language Models Are Reasoning Teachers (ACL 2023)
Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch
深度学习入门教程, 优秀文章, Deep Learning Tutorial
Handwritten Equations Decipherment with Abductive Learning