Pedro Cuenca (@pcuenq) — X Web Viewer

2026.05.12 09:24

My data point: working on two projects in parallel with Pi + llama.cpp + Qwen-3.6-35B-A3B (I prefer the MoE 🙈). This works on my M1 Max (64 GB), which I bought 4.5 years ago. "Works" as in "you can get work done", not just "runs for a demo".


Pedro Cuenca@pcuenq

2026.04.20 17:44

Kimi K2.6 was released 1h ago, and it looks amazing! Here it's running with MLX (mlx-vlm) on two M3 Ultras (full 1T param VLM) 🔥


Pedro Cuenca@pcuenq

2026.04.16 15:01

🔈 Every model added to transformers has to be available on Apple Silicon 🍎 at once. We built a Skill and test harness for mlx-lm to get us closer 🔥 It's designed to help contributors AND support reviewers. Read on to see what we did and why it matters.