Aaron Zisk July 31, 2025

Qwen3 Coder M4 Max vs RTX 5090

Summary

The transcript discusses the availability of Quen 3 coder, an 8-bit MLX version running on different GPU configurations, including an Nvidia 5090 and an M4 Max MacBook Pro. Performance metrics show varying token generation speeds, with the MacBook Pro achieving 79 tokens per second compared to 48 tokens per second on the Nvidia GPU. The presenter is exploring the model's capabilities across different hardware setups, highlighting potential performance differences and memory usage considerations.

View original episode ↗

Mobile experience coming soon

Qwen3 Coder M4 Max vs RTX 5090

Summary