AI Engineer July 7, 2025

Training Agentic Reasoners — Will Brown, Prime Intellect

Summary

The talk's main theme is that reasoning and agents are converging in AI development and should not be treated as separate concepts. It covers reinforcement learning (RL), with specific reference to DeepSeek and OpenAI's GPT-3.5, and highlights how effective RL is at scale. The practical takeaway is that RL is crucial for building more robust, agentic models that can handle complex systems and tasks, overcoming the brittleness of generic LM APIs.
