Vals AI
@_valsai
Public LLM Evaluation // https://t.co/FjWabQY2jk
146 Following    1.4K Followers
Grok 3 Beta dominates on our proprietary benchmarks, setting the new SOTA on our Finance, Legal and Tax benchmarks. Congrats @xai @grok @elonmusk 🚀🚀🚀 We just released the benchmark results for xAI's new models: Grok 3 Beta & Grok 3 Mini Fast Beta (High & Low Reasoning) – this is what we found👇 (1/6)
Show more
0
345
1.4K
268