Aaron Zisk March 14, 2025

why are we doing this

Summary

The transcript explores running local large language models (LLMs) for software developers, specifically demonstrating the use of the Quen 2.5 coder model with 7 billion parameters on a GPU-enabled setup. The speaker showcases a practical demonstration of generating code (in this case, a function to find prime numbers) using a local AI model that performs comparably to ChatGPT. The key takeaway is the growing feasibility and efficiency of running powerful AI coding assistants directly on local hardware, with performance and model selection being key considerations.

View original episode ↗

Mobile experience coming soon

why are we doing this

Summary