AI Engineer July 7, 2025

Training Agentic Reasoners — Will Brown, Prime Intellect

Summary

The main theme is the merging of reasoning and agents in AI development, suggesting they are not separate concepts. Key subjects discussed include reinforcement learning (RL), specifically referencing DeepSeek and OpenAI's GPT-3.5, highlighting RL's effectiveness at scale. The practical takeaway is that reinforcement learning is crucial for building more robust and agentic models capable of handling complex systems and tasks, overcoming the brittleness of generic LM APIs.

View original episode ↗

Mobile experience coming soon

Training Agentic Reasoners — Will Brown, Prime Intellect

Summary