Aaron Zisk November 14, 2025

Local AI just leveled up... Llama.cpp vs Ollama

Summary

The transcript discusses running local large language models (LLMs) using various tools like Llama CPP, Olama, and LM Studio, with a specific focus on building Llama CPP from source on an Apple Silicon Mac. The speaker provides a detailed walkthrough of cloning the repository, building the software using CMake, and utilizing the Metal build for optimized performance, highlighting the ease of setup on Mac systems. Key points include using the -J flag for parallel compilation and the ability to run models on simple hardware with different tools. The practical takeaway is that developers can easily set up and run local LLMs on their machines with minimal configuration, particularly on Mac systems with Apple Silicon.

View original episode ↗

Mobile experience coming soon

Local AI just leveled up... Llama.cpp vs Ollama

Summary