Reading List
This is my NLP paper reading list! I maintain a list of papers (and posts) that I consider important for understanding the fundamentals of NLP, plus the papers I've enjoyed most in different sub-domains. I try to update the list frequently.
Fundamentals - Neural Language Models, Transformers, BERT, etc.
- On the difficulty of training recurrent neural networks - Analyzes the vanishing and exploding gradient problems in RNNs. (2013)
- A super easy-to-read blog post for understanding LSTMs. (2023)
- Neural machine translation by jointly learning to align and translate - Introduces the attention mechanism. (2015)
- Attention Is All You Need - The super famous transformers paper! (2017)
- Understanding attention and transformer - Blog post 1, Blog post 2. (2021)
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding (2019)
- Roformer: Enhanced transformer with rotary position embedding - An excellent paper on positional embedding. (2021)
- Deep contextualized word representations - ELMo word embeddings (2018)
- Finetuned language models are zero-shot learners - Jason Wei’s paper on Instruction Tuning (FLAN) (2022)
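Since several entries above revolve around attention, here is a minimal single-head NumPy sketch of the scaled dot-product attention equation from the Transformer paper (no masking, no multi-head projections; variable names are my own):

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """softmax(Q K^T / sqrt(d_k)) V -- the core equation of 'Attention Is All You Need'."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                 # (n_queries, n_keys) similarity scores
    scores -= scores.max(axis=-1, keepdims=True)    # subtract row max for numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # each row is a distribution over keys
    return weights @ V, weights

# Toy example: 3 queries attend over 4 key/value pairs of dimension 8.
rng = np.random.default_rng(0)
Q, K, V = rng.normal(size=(3, 8)), rng.normal(size=(4, 8)), rng.normal(size=(4, 8))
out, weights = scaled_dot_product_attention(Q, K, V)
```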
Parameter-efficient Adaptation of LLMs
- LoRA: Low-Rank Adaptation of Large Language Models (2021)
- The Power of Scale for Parameter-Efficient Prompt Tuning (2021)
- QLoRA: Efficient Finetuning of Quantized LLMs (2023)
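A minimal NumPy sketch of the LoRA reparameterization from the paper above: the pretrained weight W stays frozen and only the low-rank factors A and B are trained, with B zero-initialized so the update starts at zero (the function name is my own):

```python
import numpy as np

def lora_forward(x, W, A, B, alpha=16.0, r=4):
    """y = x W^T + (alpha / r) * x A^T B^T: frozen weight W plus a trainable rank-r update B A."""
    return x @ W.T + (alpha / r) * (x @ A.T) @ B.T

d_out, d_in, r = 6, 5, 2                  # toy sizes; real LLMs use large d and small r
rng = np.random.default_rng(0)
W = rng.normal(size=(d_out, d_in))        # pretrained weight, kept frozen
A = rng.normal(size=(r, d_in)) * 0.01     # trainable, small Gaussian init
B = np.zeros((d_out, r))                  # trainable, zero init: the delta starts at zero
x = rng.normal(size=(3, d_in))
y = lora_forward(x, W, A, B, alpha=16.0, r=r)
```

Because B starts at zero, the adapted model initially matches the frozen one exactly; only 2 x r x d parameters are trained per layer.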
LLM Alignment and Reinforcement Learning
- Training language models to follow instructions with human feedback - RLHF Paper (2022)
- Constitutional AI: Harmlessness from AI Feedback (2022)
- An excellent, easy-to-read blog explaining the need for RLHF. (2023)
- RLAIF vs. RLHF: Scaling Reinforcement Learning from Human Feedback with AI Feedback (2023)
- Direct Preference Optimization: Your Language Model is Secretly a Reward Model (2023)
- Fine-tuning Language Models for Factuality (2023)
- Alignment for Honesty (2023)
- Are aligned neural networks adversarially aligned? (2023)
- Understanding the Effects of RLHF on LLM Generalisation and Diversity (2024)
- Huggingface’s blog post on DPO vs. IPO vs. KTO. (2024)
- Weak-to-strong extrapolation expedites alignment (2024)
- Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback (2024)
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning (2025)
- One of the best videos I’ve seen about DeepSeek-R1. (2025)
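The DPO paper above replaces the reward-model-plus-PPO pipeline with a simple classification-style loss on preference pairs. A minimal sketch of the per-pair loss, assuming you already have policy and reference log-probs for the chosen and rejected responses (names are mine):

```python
import math

def dpo_loss(pi_chosen, pi_rejected, ref_chosen, ref_rejected, beta=0.1):
    """Per-pair DPO loss: -log sigmoid(beta * implicit reward margin), where a response's
    implicit reward is its policy log-prob minus its reference-model log-prob."""
    margin = (pi_chosen - ref_chosen) - (pi_rejected - ref_rejected)
    return -math.log(1.0 / (1.0 + math.exp(-beta * margin)))

# Loss is small when the policy prefers the chosen response more than the reference does.
low  = dpo_loss(pi_chosen=-10.0, pi_rejected=-30.0, ref_chosen=-20.0, ref_rejected=-20.0)
high = dpo_loss(pi_chosen=-30.0, pi_rejected=-10.0, ref_chosen=-20.0, ref_rejected=-20.0)
```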
Evaluation, Factuality, and Long-context
- BLEURT: Learning Robust Metrics for Text Generation (2020)
- Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena (2023)
- FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation (2023)
- AVeriTeC: A Dataset for Real-world Claim Verification with Evidence from the Web (2023)
- Long-form factuality in large language models - SAFE Score (2024)
- BooookScore: A systematic exploration of book-length summarization in the era of LLMs (2024)
- Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference (2024)
- Automatic Metrics in Natural Language Generation: A Survey of Current Evaluation Practices (2024)
- Enabling Language Models to Implicitly Learn Self-Improvement (2024)
- VERISCORE: Evaluating the factuality of verifiable claims in long-form text generation (2024)
- On Positional Bias of Faithfulness for Long-form Summarization (2024)
- One Thousand and One Pairs: A “novel” challenge for long-context language models - Introduces the NoCha long-context benchmark. (2024)
- HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly (2024)
- Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation (2024)
- Retrieval or Global Context Understanding? On Many-Shot In-Context Learning for Long-Context Evaluation (2024)
- Evaluating large language models at evaluating instruction following (2024)
- Evaluating correctness and faithfulness of instruction-following models for question answering (2024)
- HALoGEN: Fantastic LLM Hallucinations and Where to Find Them (2025)
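The FActScore and VeriScore papers above both reduce long-form factuality to "decompose into atomic claims, then verify each one." The scoring step is just a precision; a sketch with the model/retrieval-based decomposition and verification stubbed out as pre-judged pairs (my own simplification):

```python
def fact_precision(judged_facts):
    """FActScore-style score: fraction of atomic facts judged supported by a knowledge
    source. In the papers, decomposition and verification are done by an LM plus
    retrieval; here they are stubbed out as (fact, supported) pairs."""
    if not judged_facts:
        return 0.0
    return sum(supported for _, supported in judged_facts) / len(judged_facts)

judged = [("Marie Curie was born in Warsaw.", True),
          ("She won three Nobel Prizes.", False),  # she won two
          ("She won the 1903 Nobel Prize in Physics.", True)]
score = fact_precision(judged)  # 2/3
```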
Creativity and Narrative Writing
- The Generative AI Paradox: “What It Can Create, It May Not Understand” (2023)
- It’s not Rocket Science: Interpreting Figurative Language in Narratives (2022)
- AI as Humanity’s Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text (2024)
- Are Large Language Models Capable of Generating Human-Level Narratives? (2024)
- Art or Artifice? Large Language Models and the False Promise of Creativity (2024)
- A Design Space for Intelligent and Interactive Writing Assistants (2024)
- Echoes in AI: Quantifying Lack of Plot Diversity in LLM Outputs (2024)
RAG
- REALM: Retrieval-Augmented Language Model Pre-Training (2020)
- FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation (2023)
- Evaluating Retrieval Quality in Retrieval-Augmented Generation (2024)
- RECOMP: Improving Retrieval-Augmented LMs with Context Compression and Selective Augmentation (2024)
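The common skeleton behind the RAG papers above is retrieve-then-generate: score documents against the query, stuff the top hits into the prompt. A toy sketch using bag-of-words cosine similarity as a stand-in for the learned dense retrievers these papers actually use (all names are mine):

```python
import math
from collections import Counter

def cosine(a, b):
    """Cosine similarity between two bags of words (Counters)."""
    dot = sum(a[t] * b[t] for t in a)
    norm = lambda c: math.sqrt(sum(v * v for v in c.values()))
    return dot / (norm(a) * norm(b)) if a and b else 0.0

def retrieve(query, docs, k=1):
    """Rank documents by similarity to the query and return the top k."""
    q = Counter(query.lower().split())
    return sorted(docs, key=lambda d: cosine(q, Counter(d.lower().split())), reverse=True)[:k]

docs = ["REALM pretrains a retriever jointly with the language model.",
        "LoRA adds low-rank adapters to frozen weights.",
        "FreshLLMs augments prompts with live search results."]
context = retrieve("how does realm train its retriever", docs, k=1)
prompt = f"Answer using the context.\nContext: {context[0]}\nQuestion: How does REALM train its retriever?"
```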
LLM Personalization
- LaMP: When Large Language Models Meet Personalization (2023)
- Are personalized stochastic parrots more dangerous? evaluating persona biases in dialogue systems (2023)
- LongLaMP: A Benchmark for Personalized Long-form Text Generation (2024)
- Exploring Safety-Utility Trade-Offs in Personalized Language Models (2024)
- Learning to Rewrite Prompts for Personalized Text Generation (2024)
- Learning Personalized Alignment in Evaluating Open-ended Text Generation (2024)
Prompt Engineering and In-context Learning
- Rethinking the role of demonstrations: What Makes In-Context Learning Work? (2022)
- Blog on different prompting strategies. (2023)
- What learning algorithm is in-context learning? Investigations with linear models (2023)
- LLMs Are In-Context Reinforcement Learners (2024)
- More Samples or More Prompts? Exploring Effective Few-Shot In-Context Learning for LLMs with In-Context Sampling (2024)
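What these papers mean by a "demonstration" is just input-label pairs concatenated in a fixed format, with the test input appended and its label slot left open. A minimal sketch (the sentiment template is my own illustration):

```python
def few_shot_prompt(demos, query):
    """Assemble an in-context learning prompt from (input, label) demonstrations,
    ending with the test input so the model completes the label."""
    blocks = [f"Review: {x}\nSentiment: {y}" for x, y in demos]
    blocks.append(f"Review: {query}\nSentiment:")
    return "\n\n".join(blocks)

demos = [("A gorgeous, moving film.", "positive"),
         ("Two hours I will never get back.", "negative")]
prompt = few_shot_prompt(demos, "I was pleasantly surprised.")
```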
Detecting LLM-generated Text
- A Watermark for Large Language Models (2023)
- Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense (2023)
- PostMark: A Robust Blackbox Watermark for Large Language Models (2024)
- Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text (2024)
- Red Teaming Language Model Detectors with Language Models (2024)
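A sketch of the detection side of the red/green-list scheme from "A Watermark for Large Language Models": a hash of the previous token pseudo-randomly splits the vocabulary into a green list (boosted at generation time) and a red list, and the detector checks whether green tokens are over-represented. This toy version hashes token strings rather than vocabulary ids, and skips the generation-time logit bias entirely:

```python
import hashlib
import math

def is_green(prev_token, token, gamma=0.5):
    """Pseudo-randomly assign `token` to the green list, seeded by the previous token.
    gamma is the fraction of the vocabulary that is green."""
    h = hashlib.sha256(f"{prev_token}|{token}".encode()).digest()
    return h[0] < gamma * 256

def z_score(tokens, gamma=0.5):
    """Detection statistic: standardized excess of green tokens over the gamma * T
    expected in unwatermarked text. A large positive z suggests watermarked text."""
    t = len(tokens) - 1
    greens = sum(is_green(a, b, gamma) for a, b in zip(tokens, tokens[1:]))
    return (greens - gamma * t) / math.sqrt(t * gamma * (1 - gamma))
```

The paraphrasing paper above attacks exactly this statistic: rewording shuffles which (previous, current) pairs occur, pushing z back toward zero.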
Reasoning and Chain-of-Thought
- Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought (2022)
- To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning (2024)
- Iterative reasoning preference optimization (2024)
- It’s Not Easy Being Wrong: Large Language Models Struggle with Process of Elimination Reasoning (2024)
- Language Models Still Struggle to Zero-shot Reason about Time Series (2024)
- Language models can improve event prediction by few-shot abductive reasoning (2024)
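One decoding trick that often comes up alongside these CoT papers (Wang et al.'s self-consistency, not itself listed above) is trivially small in code: sample several reasoning traces, parse out each final answer, and take a majority vote:

```python
from collections import Counter

def self_consistent_answer(sampled_answers):
    """Majority vote over final answers parsed from independently sampled CoT traces."""
    return Counter(sampled_answers).most_common(1)[0][0]

votes = ["18", "18", "17", "18", "20"]  # answers extracted from 5 sampled traces
answer = self_consistent_answer(votes)  # -> "18"
```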
Bio-NLP
- Paper Plain: Making Medical Research Papers Approachable to Healthcare Consumers with Natural Language Processing (2023)
- On-the-fly Definition Augmentation of LLMs for Biomedical NER (2024)
- JMLR: Joint Medical LLM and Retrieval Training for Enhancing Reasoning and Professional Question Answering Capability (2024)