Vals AI
@_valsai
Public LLM Evaluation // https://t.co/FjWabQY2jk
Joined March 2024
146 Following    1.4K Followers
Grok 3 Beta dominates on our proprietary benchmarks, setting the new SOTA on our Finance, Legal and Tax benchmarks. Congrats @xai @grok @elonmusk ๐Ÿš€๐Ÿš€๐Ÿš€ We just released the benchmark results for xAI's new models: Grok 3 Beta & Grok 3 Mini Fast Beta (High & Low Reasoning) โ€“ this is what we found๐Ÿ‘‡ (1/6)
Show more
0
345
1.4K
268