Aaron Zisk September 3, 2025

I ran LLM from a thumb drive… here’s how speed really scales

Summary

The transcript explores the performance impact of different storage devices on large language model (LLM) load times, demonstrating how storage speed significantly affects model initialization. The speaker uses LM Studio to test load times across various storage media, ranging from a slow thumb drive to progressively faster external drives like Thunderbolt and high-speed SSDs. The key practical takeaway is that choosing faster storage can dramatically reduce LLM load times, with results ranging from 228 seconds on a slow thumb drive to just 13 seconds on a faster external drive.

View original episode ↗

Mobile experience coming soon

I ran LLM from a thumb drive… here’s how speed really scales

Summary