Benjamin Marie
@bnjmn_marie
Independent AI researcher (LLM, NLP). My blog, The Kaitchup - AI on a Budget:
Some comments on Taalas HC1:
- It’s real. Try it yourself. At ~16k tokens/sec, the output is instantaneous.
- The current demo model is aggressively quantized (roughly 3–6 bits). The goal was to prove the system works end-to-end. Improving quantization quality, that's the easy