Xuan-Son Nguyen (@ngxson)

Xuan-Son Nguyen@ngxson

2026.05.17 22:57

Qwen3.6-27B running 100% on WebGPU. Not the best speed but still 😁

Xuan-Son Nguyen Reposted

Xenova@xenovacom

2026.05.11 16:32

I think Reachy is the one who needs chess lessons… 😅 Robotics meets WebAI: Gemma 4 running fully offline on WebGPU with Transformers.js, controlling Reachy Mini over WebSerial. No internet, just a browser and a USB-C cable. What should Reachy play next?

Xuan-Son Nguyen Reposted

clem 🤗@ClementDelangue

2026.05.12 20:53

Surreal to see Reachy Mini on the cover of the last @LinusTech video!

Xuan-Son Nguyen Reposted

Pedro Cuenca@pcuenq

2026.05.12 09:24

My data point: working on two projects in parallel with Pi + llama.cpp + Qwen-3.6-35B-A3B (I prefer the MoE 🙈). This works on my M1 Max (64 GB), which I bought 4.5 years ago. "Works" as in "you can get work done", not just "runs for a demo".

Xuan-Son Nguyen@ngxson

2026.05.05 20:32

Normal activity at GOSIM Paris: ❌ Give or attend a talk ❌ Networking 🤔 Watch a cool Reachy Mini demo ✅ Show off your Home Assistant setup

Xuan-Son Nguyen@ngxson

2026.05.04 08:01

Last week, we had a playful yet efficient off-site between @huggingface and @ggml_org. We brainstormed many UI/UX-related subjects, with many more to tackle in the near future! It was a pleasure to meet everyone IRL and visit the beautiful capital of Bulgaria 🌹 @julien_c @victormustar @ggerganov and Alek

Xuan-Son Nguyen@ngxson

2026.05.04 07:35

Kinda funny to think about it, but a hash function is also deterministic. It's just not Turing complete.

solst/ICE of Astarte@IceSolst

2026.05.01 16:56

Interesting article on treating agent output like compiler output (and why)

Xuan-Son Nguyen@ngxson

2026.05.03 22:47

Come and watch our cool robot demo!

Alina Lozovskaya@ailozovskaya

2026.05.03 15:46

I'll be at GOSIM Paris May 5-6 with a Reachy Mini booth at @joinstationf, presenting the Reachy Mini Conversation App on May 6. Stop by and chat with the robot – don't forget to ask it to show you its dances! See you soon!

Xuan-Son Nguyen@ngxson

2026.04.27 10:39

I'm giving a talk at GOSIM 2026 about llama.cpp. It will be a high-level overview of what we achieved in the past year. Get your ticket here -->

Xuan-Son Nguyen@ngxson

2026.04.24 15:06

Taking my flight this weekend too, will try 😁

Julien Chaumond@julien_c

2026.04.24 12:03

This is where we are right now. And I'm not gonna lie, it feels pretty magical 🧚‍♀️ Qwen3.6 27B running inside of Pi coding agent via Llama.cpp on the MacBook Pro. For non-trivial tasks on the @huggingface codebases, this feels very, very close to hitting the latest Opus in Claude Code, or whatever shiny monopolistic closed source API of the day is. In full airplane mode. Most people haven't realized this yet. If you have, it means you have a huge head start on what I call the second revolution of AI. Powerful local models for efficiency, security, privacy, sovereignty 🔥

Xuan-Son Nguyen Reposted

clem 🤗@ClementDelangue

2026.04.21 17:06

Share this with your representative!

Xuan-Son Nguyen Reposted

Julien Chaumond@julien_c

2026.04.21 16:16

did you know that huggingface_hub (just the Python client) is sending almost 6B requests/week? wow 😮 @huggingface

Xuan-Son Nguyen Reposted

Lysandre@LysandreJik

2026.04.20 13:30

We're opening a Hugging Face office in Tokyo! Our goal: help open-source AI develop in Japan and grow the local community. Let's meet! The Hugging Face Tokyo office is now open! Our goal is to support the development of open-source AI in Japan and to grow the local community. We would love to meet you!

Xuan-Son Nguyen@ngxson

2026.04.16 18:01

I've stopped using Claude Code in all of my llama.cpp workflows for the past few days. The quality degradation is just too significant. I'm experimenting with a mix of Gemma 4 26B-A4B and Gemini 3.1 Pro, which so far is much better than what Anthropic can offer.

Simon Willison@simonw

2026.04.16 17:27

Shocking result on my pelican benchmark this morning: I got a better pelican from a 21GB local Qwen3.6-35B-A3B running on my laptop than I did from the new Opus 4.7! Qwen on the left, Opus on the right

Xuan-Son Nguyen Reposted

Julien Chaumond@julien_c

2026.04.16 15:19

opus 4.7 slightly more dangerous, slightly more expensive OR: run local models!

Xuan-Son Nguyen@ngxson

2026.04.13 23:06

Given the right harness, you can just do everything you want

clem 🤗@ClementDelangue

2026.04.08 18:58

"But here is what we found when we tested: We took the specific vulnerabilities Anthropic showcases in their announcement, isolated the relevant code, and ran them through small, cheap, open-weights models. Those models recovered much of the same analysis. Eight out of eight models detected Mythos's flagship FreeBSD exploit, including one with only 3.6 billion active parameters costing $0.11 per million tokens. A 5.1B-active open model recovered the core chain of the 27-year-old OpenBSD bug."

Xuan-Son Nguyen@ngxson

2026.04.13 22:47

Having a small break today! I'm taking a step back to reflect on my motivations and what I value when working on open source. Read my latest blog post 👇

Xuan-Son Nguyen@ngxson

2026.04.13 15:06

llama.cpp now supports Qwen3-ASR, Qwen3-Omni and Gemma 4 audio/vision input 🔥 Mixed modalities are the future 😼😼

Xuan-Son Nguyen@ngxson

2026.04.10 15:52

llama.cpp now supports various small OCR models that can run on low-end devices. These models are small enough to run on a GPU with 4 GB of VRAM, and some of them can even run on a CPU with decent performance. In this post, I will show you how to use these OCR models with llama.cpp 👇

Xuan-Son Nguyen@ngxson

2026.04.02 21:50

While working on the pre-release support of Gemma 4, I was surprised by its capabilities relative to its size. We're only scratching the surface here; there is much more to discover about Gemma 4. I'm excited to see what the community will do with it in the next few days 🚀🚀
