Aaron Zisk March 23, 2026

This Shouldn’t Be Able to Run 120B Locally

Summary

The transcript discusses the emerging trend of compact, portable AI hardware that can run large language models locally, challenging the current paradigm of massive GPU clusters. The key focus is on the Tiny AI Pocket Lab, a small device capable of running 120 billion parameter models on limited memory, compared to traditional requirements of expensive, high-VRAM GPUs. By demonstrating the potential to run sophisticated AI models on small, portable devices with modest specifications, the discussion highlights a significant shift towards more accessible and decentralized AI computational capabilities.

View original episode ↗

Mobile experience coming soon

This Shouldn’t Be Able to Run 120B Locally

Summary