CSE599H: Advances and Challenges in Language Models, Reasoning, and AI Agents
Spring 2024-2025
Mondays and Wednesdays, 3pm to 4:20pm
CSE2 G04
Gradescope | Ed
Office hours by appointment.
To contact course staff, please make an Ed post.
Language models, such as GPT-o3, DeepSeek-R1, and Deep Research, have demonstrated remarkable capabilities in natural language understanding, generation, and reasoning, with applications ranging from literature summarization to complex problem-solving tasks. However, as we will discuss, these models are not without limitations, such as susceptibility to hallucinations, poor capabilities in strategic exploration, and limitations in long-horizon planning. In this class, we will explore the latest research on language models, reasoning, and AI agents, discussing both the advances and challenges in these areas. We will examine the current state-of-the-art models, their limitations, and the ongoing efforts to address these challenges. Through this course, you will engage in paper discussions and gain a deeper understanding of the latest developments in the field and contribute to the ongoing discussions and research in this exciting area.
This is a seminar designed for PhD students. Students are expected to be able to read and understand the assigned papers on their own, and they should be familiar with ML and NLP concepts at the level of having taken advanced undergraduate classes.
Schedule
Weekly due dates:- By Monday 11:59pm: Slides for Wednesday's papers (presenters only)
- By Saturday 11:59pm: Slides for Monday's papers (presenters only)
Mar 31 (Mon) | Course overview (slides) |
Apr 2 (Wed) | Basic Pre-training and Post-training (slides) Optional reading |
Apr 7 (Mon) | Guest Lecture: Nathan Lambert (slides) Optional reading |
Apr 9 (Wed) | Guest Lecture: Kyle Lo (slides) Optional reading |
Apr 14 (Mon) | Scaling Laws of Language Models (slides) Optional reading |
Apr 16 (Wed) | Building Reasoning Models & Systems I (slides) |
Apr 21 (Mon) | Building Reasoning Models & Systems II (slides) |
Apr 23 (Wed) | Test-Time Scaling (slides) Optional reading |
Apr 28 (Mon) | AI Agents and Tool Use (slides) |
Apr 30 (Wed) | AI Agents for Coding (slides) Optional reading |
May 5 (Mon) | AI Agents for Computer Use and Web Browsing (slides) Optional reading |
May 7 (Wed) | AI Agents for Deep Research Optional reading |
May 12 (Mon) | Features and Limitations I Optional reading |
May 14 (Wed) | Features and Limitations II Optional reading |
May 19 (Mon) | Alternative Architectures Optional reading |
May 21 (Wed) | Efficiency and Scaling |
May 26 (Mon) | No class, Memorial Day |
May 28 (Wed) | TBD |
Jun 2 (Mon) | TBD |
Jun 4 (Wed) | TBD |