Qwen3 Coder M4 Max vs RTX 5090
Summary
The transcript discusses the availability of Qwen3 Coder and an 8-bit MLX version of it, tested on different hardware configurations, including an Nvidia RTX 5090 and an M4 Max MacBook Pro. In the reported measurements, the MacBook Pro generates 79 tokens per second, compared with 48 tokens per second on the Nvidia GPU. The presenter explores how the model behaves across these hardware setups, highlighting potential performance differences and memory usage considerations.
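To put the two throughput figures in perspective, the short Python sketch below computes the relative speedup and the time each setup would need to produce a response of a given length. The two rates come from the summary above. The 1,000-token response length and the helper names `speedup` and `seconds_for` are assumptions chosen for illustration, and the calculation ignores prompt processing time, which the summary does not report.

```python
# Token generation rates reported in the summary (tokens per second).
M4_MAX_TPS = 79.0
RTX_5090_TPS = 48.0

# Hypothetical response length, chosen only for illustration.
RESPONSE_TOKENS = 1000


def speedup(fast_tps: float, slow_tps: float) -> float:
    """Return how many times faster the first rate is than the second."""
    return fast_tps / slow_tps


def seconds_for(tokens: int, tps: float) -> float:
    """Return the generation time in seconds for `tokens` at a steady rate."""
    return tokens / tps


ratio = speedup(M4_MAX_TPS, RTX_5090_TPS)
m4_seconds = seconds_for(RESPONSE_TOKENS, M4_MAX_TPS)
rtx_seconds = seconds_for(RESPONSE_TOKENS, RTX_5090_TPS)

print(f"M4 Max vs RTX 5090 speedup: {ratio:.2f}x")
print(f"M4 Max time for {RESPONSE_TOKENS} tokens: {m4_seconds:.2f} s")
print(f"RTX 5090 time for {RESPONSE_TOKENS} tokens: {rtx_seconds:.2f} s")
```

On these numbers the MacBook Pro is roughly 1.6 times faster at generation. This is a comparison of the two reported rates only; it assumes a steady generation rate and says nothing about why the gap exists.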