Register and share your invite link to earn from video plays and referrals.

Tianyi Zhang
@mycharmspace
prev search post-training@xAI, Opinions are my own
435 Following    2.2K Followers
Today is my last day at xAI. I joined xAI a year ago and had the pleasure of leading the search and factuality post-training team. Over time, we developed so many recipe and engineering co-optimizations, making Grok the best AI for search and real-time agent. I am also particularly proud of working with a small group of talented people delivering the recent iterations of the instant mode of Grok - the one I personally liked and used the most. My thanks to all the friends and teammates for their support and help over the past year. They are among the brightest minds I’ve met in my career. I am sure the team will continue the mission to make better Grok and understand the universe.
Show more
Good performance with a model at this size. We have also updated its Fast version in which is great at everyday questions. Give it a try.
xAI has launched Grok 4.3, achieving 53 on the Artificial Analysis Intelligence Index with improved agentic performance, ~40% lower input price, and ~60% lower output price than Grok 4.20 The release of Grok 4.3 places @xAI just above Muse Spark and Claude Sonnet 4.6 on the Intelligence Index, and a 4 points ahead of the latest version of Grok 4.20. Grok 4.3 improves its Artificial Analysis Intelligence Index score while reducing cost to run the benchmark suite. Key Takeaways: ➤ Grok 4.3 improves on cost-per-intelligence relative to Grok 4.20 0309 v2: it scores higher on the Intelligence Index while costing less to run the full benchmark suite. Grok 4.3 costs $395 to run the Artificial Analysis Intelligence Index, around 20% lower than Grok 4.20 0309 v2, despite using more output tokens. This makes it one of the lower-cost models at its intelligence level ➤ Large increase in real world agentic task performance: The largest single benchmark improvement is on GDPval-AA, where Grok 4.3 scores an ELO of 1500, up 321 points from Grok 4.20 0309 v2’s score of 1179 Grok 4.3, surpassing Gemini 3.1 Pro Preview, Muse Spark, Gpt-5.4 mini (xhigh), and Kimi K2.5. Grok 4.3 narrows the gap to the leading model on GDPval-AA, but still trails GPT-5.5 (xhigh) by 276 Elo points, with an expected win rate of ~17% against GPT-5.5 (xhigh) under the standard Elo formula ➤ Grok 4.3’s performs strongly on instruction following and agentic customer support tasks. It gains 5 points on 𝜏²-Bench Telecom to reach 98%, in line with GLM-5.1. Grok 4.3 maintains an 81% IFBench score from Grok 4.20 0309 v2 ➤ Gains 8 points on AA-Omniscience Accuracy, but at the cost of lower AA-Omniscience Non-Hallucination Rate of 8 points, so Grok 4.20 0309 v2 still leads AA-Omniscience Non-Hallucination Rate, followed by MiMo-V2.5-Pro, in line with Grok 4.3 Congratulations to @xAI and @elonmusk on the impressive release!
Show more
We make Grok factual, helpful, and delightful. Proud to contribute to the best model for search & real time information.
Grok 4.20 beta1 (single agent) debuts #1# on Search Arena, and #4# overall in Text Arena! Highlights: - #1# in Search, scoring 1226, leading GPT-5.2 and Gemini-3 - #4# in Text, scoring 1492 on par with Gemini 3.1 Pro Congrats to the @xAI team and @elonmusk on this impressive milestone!
Show more
We optimized Grok 4.1 fast with tool calls and searches under minimal reasoning efforts to perform well in latency sensitive workloads.
*holds button on steering wheel to activate grok* “I need to go to Home Depot but I wanna supercharge after. Can you take me to whichever nearby Home Depot is close to a supercharger, and then take me to the supercharger after Home Depot? Also I’d like to go for a walk somewhere after. It’s warm af out. Find me a good park with paved walking and add that after” Grok “sure” *plans the exact route I wanted, finishes with a nearby park* Me *presses start self driving button and touches nothing for entire route* Once you get used to this shit it’s over man 😂 new bar has been set
Show more
Grok 4.1 Fast on xAI API is no doubt the best Deep Research Agent you can get from any API, it browses the internet and X with lightning speed. We cracked special recipe on high-elo real-world challenges that are ever changing with @ShuyangGao62860 , @LiTianleli , and @jqruan2025 . Obviously it won't be the best agent model without the great care and post-training from @RoverHM 's team. Plz let the GPU burn!!
Show more
Introducing Grok 4.1 Fast and the xAI Agent Tools API. Grok 4.1 Fast is our best tool-calling model to date. With a 2M context window, it shines in real-world use cases like customer support and deep research.
Show more
Proud to be part of the team contributing to the post-training of this model, especially on reducing its factual hallucinations for the fast-mode. The team has poured many innovations on the recipe. We will keep improving.
Show more
Introducing Grok 4.1, a frontier model that sets a new standard for conversational intelligence, emotional understanding, and real-world helpfulness. Grok 4.1 is available for free on and our mobile apps.
Show more
Grok 4 Fast is great at search and research, and we are pushing it even further by doubling down on AI-native knowledge base. Please apply if you are an expert on search, retrieval, indexing, etc.
We are hiring to build an AI-native knowledge base/search engine. Apply here if interested:
The Grok-4-fast journey has been incredible—kicking off right after the Grok 4 launch in July. None of it happens without the absolute GOAT @s_tworkowski our incredibly talented teammates @LiTianleli @mycharmspace , and the unwavering backing from @Yuhu_ai_ . This kind of opportunity is impossible anywhere else and i'm deeply grateful. Hoping Grok-4-fast is lighting up your world like it has mine: scouting the best eats, catching up on sports scores, even drafting emails to my doctor. We're iterating fast across the board—share your stories and feedback! And if you're fired up to pack max intelligence density, come build with us at xAI. 🚀
Show more
Best search model in the world! Super proud of the team's achievement
Grok-4-Fast excels at agentic search and sets the new intelligence standard for fast models. Super glad to colab with a talented team and contribute to its search trainings. Join us if you are interested to work on State-of-art AI search.
Show more
Introducing Grok 4 Fast, a multimodal reasoning model with a 2M context window that sets a new standard for cost-efficient intelligence. Available for free on iOS and Android apps, and OpenRouter.
Show more
We work hard to make Grok models the best search and research agent. Come and join us if you are interested in AI search.
Grok 4 Ranks #1# on FinSearchComp Benchmark This is the first expert-level benchmark for financial search & reasoning Grok 4 is unbelievably approaching human experts level
Show more
We invented so many innovative ways to feed the model challenging questions with right signals to unlock those compute and 🔥 the GPUs. This is the new beginning.
It will be an intelligent model.
Grok 4 release livestream on Wednesday at 8pm PT @xAI
0
53
1.4K
47
Forward to community
GOAT
After almost a decade, I have made the decision to leave OpenAI.  The company’s trajectory has been nothing short of miraculous, and I’m confident that OpenAI will build AGI that is both safe and beneficial under the leadership of @sama, @gdb, @miramurati and now, under the excellent research leadership of @merettm.  It was an honor and a privilege to have worked together, and I will miss everyone dearly.   So long, and thanks for everything. I am excited for what comes next — a project that is very personally meaningful to me about which I will share details in due time.
Show more
Seriously, Satya uses mail but not paid version of outlook?
Satya Nadella: "I want to use this tactically vs GOOG/AAPL" September 1, 2022
#ChatGPT# Connections between gradient vanishing in deep learning and long report chains in a company. @OpenAI