Local AI is having its moment!
Below is the number of new GGUF models created each month over the past 8 months & insights from our HF internal agent (May is partial):
- 176,000 total public GGUF models on HF
- Two distinct regimes: Oct–Feb averaged ~5.1K new GGUF models/month. Then March–April jumped to ~9.2K/month — nearly double the previous rate.
- March was the inflection point (+55% MoM) — likely driven by a wave of new open-weight model releases being quantized to GGUF.
- April sustained the momentum at 9.7K, suggesting this isn't a one-off spike but a new baseline.
- The GGUF ecosystem is accelerating — the community is quantizing models faster than ever, likely thanks to better tooling (llama.cpp improvements, automated quantization pipelines, and more models supporting GGUF natively).
Let's go!