Some comments on Taalas HC1:
- It’s real. Try it yourself. At ~16k tokens/sec, the output is instantaneous.
- The current demo model is aggressively quantized (roughly 3–6 bits). The goal was to prove the system works end-to-end. Improving quantization quality, that's the easy