Aaron Zisk February 19, 2026

Spark cluster with Qwen3.5

Summary

The transcript discusses the technical specifications of a large language model, Quen 3.5, with 397 billion parameters and approximately 807 gigabytes of data. The model requires significant computational resources, specifically noting it needs around a terabyte of VRAM and can be run across multiple Mac Studios clustered together. During a Llama Beni test, the model demonstrated impressive performance, processing 1,400 tokens per second and generating 23.5 tokens per second, while highlighting its state-of-the-art capabilities and potential for home computing.

View original episode ↗

Mobile experience coming soon

Spark cluster with Qwen3.5

Summary