※ veridical.net

Richard Sutton – Father of RL Thinks LLMs Are a Dead End

From the Dwarkesh Podcast:

Richard Sutton is the father of reinforcement learning, winner of the 2024 Turing Award, and author of The Bitter Lesson. …

After interviewing him, my steel man of Richard’s position is this: LLMs aren’t capable of learning on-the-job, so no matter how much we scale, we’ll need some new architecture to enable continual learning.

And once we have it, we won’t need a special training phase — the agent will just learn on-the-fly, like all humans, and indeed, like all animals.

In other words, Richard Sutton, the father of reinforcement learning, thinks large language models are a dead end.

Yeah, me too.


Published Nov 15, 2025 by Adam Wuerl

Tags: AI
