tags
about

Welcome to the Era of Experience - by David Silver, Richard S. Sutton

Sutton and his advisee Silver argue that the “era of human data,” dominated by supervised pre‑training and RL‑from‑human‑feedback, has hit diminishing returns; the future will belong to agents that

  • act continuously in real or simulated worlds,
  • generate and label their own training data through interaction
  • optimise rewards grounded in the environment rather than in human preference alone, and
  • refine their world‑models and plans over lifelong streams of experience.

#ML