AI Engineer June 27, 2025

Serving Voice AI at Scale — Arjun Desai (Cartesia) & Rohit Talluri (AWS)

Summary

The discussion centers on building real-time multimodal intelligence, particularly focusing on voice AI for enterprise applications. Key subjects include foundation models, cloud-based batch processing versus real-time edge deployment, and the critical importance of low latency and high quality for interactive voice experiences. The practical takeaway is that for effective voice AI, speed and quality are paramount, necessitating models that can run on any device in real-time, not just in the cloud.

View original episode ↗

Mobile experience coming soon

Serving Voice AI at Scale — Arjun Desai (Cartesia) & Rohit Talluri (AWS)

Summary