CSE599H: Advances and Challenges in Language Models, Reasoning, and AI Agents

Spring 2024-2025
Mondays and Wednesdays, 3pm to 4:20pm
CSE2 G04
Gradescope | Ed


Office hours by appointment.
To contact course staff, please make an Ed post.

Language models, such as GPT-o3, DeepSeek-R1, and Deep Research, have demonstrated remarkable capabilities in natural language understanding, generation, and reasoning, with applications ranging from literature summarization to complex problem-solving tasks. However, as we will discuss, these models are not without limitations, such as susceptibility to hallucinations, poor capabilities in strategic exploration, and limitations in long-horizon planning. In this class, we will explore the latest research on language models, reasoning, and AI agents, discussing both the advances and challenges in these areas. We will examine the current state-of-the-art models, their limitations, and the ongoing efforts to address these challenges. Through this course, you will engage in paper discussions and gain a deeper understanding of the latest developments in the field and contribute to the ongoing discussions and research in this exciting area.

This is a seminar designed for PhD students. Students are expected to be able to read and understand the assigned papers on their own, and they should be familiar with ML and NLP concepts at the level of having taken advanced undergraduate classes.

Schedule

Weekly due dates:
  • By Monday 11:59pm: Slides for Wednesday's papers (presenters only)
  • By Saturday 11:59pm: Slides for Monday's papers (presenters only)
Mar 31 (Mon) Course overview (slides)
Apr 2 (Wed) Basic Pre-training and Post-training (slides)
Optional reading
Apr 7 (Mon) Guest Lecture: Nathan Lambert (slides)
Optional reading
Apr 9 (Wed) Guest Lecture: Kyle Lo (slides)
Optional reading
Apr 14 (Mon) Scaling Laws of Language Models (slides)
Optional reading
Apr 16 (Wed) Building Reasoning Models & Systems I (slides)
Apr 21 (Mon) Building Reasoning Models & Systems II (slides)
Apr 23 (Wed) Test-Time Scaling (slides)
Optional reading
Apr 28 (Mon) AI Agents and Tool Use (slides)
Apr 30 (Wed) AI Agents for Coding (slides)
Optional reading
May 5 (Mon) AI Agents for Computer Use and Web Browsing (slides)
Optional reading
May 7 (Wed) AI Agents for Deep Research
Optional reading
May 12 (Mon) Features and Limitations I
Optional reading
May 14 (Wed) Features and Limitations II
Optional reading
May 19 (Mon) Alternative Architectures
Optional reading
May 21 (Wed) Efficiency and Scaling
May 26 (Mon) No class, Memorial Day
May 28 (Wed) TBD
Jun 2 (Mon) TBD
Jun 4 (Wed) TBD

Acknowledgements

We are grateful to Pang Wei Koh for sharing their website template with us.