NVIDIA users: QWEN3 is FREE, but you’ll pay double
Summary
A new open-source large language model called Qwen3 has been released, targeting developers and offered in multiple versions with varying parameter sizes and quantization levels. It comes in two primary versions: a massive 480-billion-parameter version that requires significant computational resources, and a more accessible 30-billion-parameter "flash" version designed for home use. Despite potential VRAM challenges on consumer GPUs, Qwen3 demonstrates improved efficiency, delivering faster token generation and better performance across different hardware configurations.
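To make the VRAM concern concrete, here is a rough back-of-the-envelope sketch of how much memory the weights alone would need at a few quantization levels. The bit widths (16, 8, 4) and the function name `weight_memory_gb` are illustrative assumptions, not figures from the summary. The estimate ignores the KV cache, activations, and runtime overhead, and real quantization formats usually store slightly more than their nominal bits per weight.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights alone, in GB (10^9 bytes).

    Hypothetical helper for illustration: parameters * bits / 8 gives bytes.
    """
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


if __name__ == "__main__":
    # Parameter counts come from the summary; bit widths are assumed examples.
    models = [("480B", 480), ("30B flash", 30)]
    for name, params in models:
        for bits in (16, 8, 4):
            gb = weight_memory_gb(params, bits)
            print(f"{name:>10} @ {bits:>2}-bit: ~{gb:,.0f} GB for weights")
```

Under these assumptions, the 30B version at 4-bit needs roughly 15 GB for weights, which is why it is borderline on many consumer GPUs, while the 480B version stays far beyond a single consumer card at any of these bit widths.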