Register and share your invite link to earn from video plays and referrals.

Florian Brand
@xeophon
evals @PrimeIntellect | open models @interconnectsai
722 Following    13.2K Followers
We are excited to be partnering with @LangChain for deploying self-improving agents. Continual learning in your production environment unlocks compounding capability gains for model-product optimization. Your data. Your advantage.
Show more
the good thing about the bun rust port: there will be a dozen ai and rust haters searching bugs to dunk on them, which will improve the overall quality of the port
Say what you want: this is impressive.
Will be giving a talk titled “You should do RL for long-running agents (and use RLMs)” at 4pm on Sat at AI Engineer Singapore. Excited to see you all!
told you to not sleep on MiMo. what they have accomplished in such a short span is remarkable, their first (7B dense) llm was released exactly a year ago
BREAKING: MiMo V2.5 Pro (Thinking) takes 3rd overall out of open weights models on Design Arena. MiMo V2.5 Pro (Thinking) places 8 positions higher than MiMo-V2.5 on the overall leaderboard, landing in the same performance band as Claude Sonnet 4.6 on frontend coding tasks. Huge congratulations to the @XiaomiMiMo team on these improvements!
Show more
can confirm those feel just as good
the concept of vagueposting about things that are already on github
Poolside is hosting a 2-day model research hackathon in London. Join us to push an open-weight agent model as far as you can. RL and fine-tune Laguna XS.2, our latest-generation model, on Prime Intellect Lab. Dates: May 29–30 Partners: @nvidia + @PrimeIntellect + @huggingface Prize: NVIDIA DGX Spark Agents need better models. Better models need cracked researchers. Link below.
Show more
working at prime is just "ugh i had this gnarly problem, let’s fix it and then make it available to everyone" a ton of other things are coming, can’t wait to show it to yall :)
Introducing Renderers RL trainers work in tokens. Environments work in messages. Going back and forth corrupts sampled tokens, wasting compute on every agentic turn. With Renderers, we fix this mismatch. This unlocks >3x throughput on popular open models.
Show more
twitter is 4 months behind @jasminewsun‘s substack
if you really believed in agi you would be looksmaxxing as hard as possible rn
my favorite eval people (@maksym_andr @GregHBurnham @tmkadamcz @j_dekoninck) don't have premium so i have turned on notifs for them (go follow them all!)
New short blog post on MathArena! TL;DR: we tried to create new versions of Apex and Apex Shortlist, but could not because the models have gotten too strong. We will therefore slowly but surely deprecate our final-answer competitions.
Show more
Pulled 120+ malicious packages from @rubygems today. The target wasn't end users - it was RubyGems itself (XSS, data exfiltration). Reminder: sometimes the registry is the one under attack. #ruby# #rubygems# #security#
Show more
"yeah i can hard code this one value, surely it won't bite me in the future" well, dear reader, you can guess what happened in the future
Impressive performance by DeepSeek-v4-Flash, essentially equal to DeepSeek-v4-Pro, but much cheaper.
the open model ecosystem rejoices
Claude Code 2.1.139 added /goal You set a completion condition and Claude keeps working across turns until it's met Works in interactive, -p, and Remote Control 👏
🚨 UPDATE: Mini Shai-Hulud has crossed from @npmjs into @pypi and is still spreading. Newly confirmed compromised artifacts: @​opensearch-project/opensearch: 3.5.3, 3.6.2, 3.7.0, 3.8.0 (1.3M weekly downloads) mistralai: 2.4.6 on PyPI guardrails-ai: 0.10.1 on PyPI additional @​squawk/* packages on npm guardrails-ai 0.10.1 executes malicious code on import. On Linux, it downloads git-tanstack[.]com/transformers.​pyz, writes it to /tmp/transformers.​pyz, and runs it with python3 without integrity verification. The git-tanstack.​com domain displayed a message signed “With Love TeamPCP,” along with: “We've been online over 2 hours now stealing creds Regardless I just came to say hello :^)” The page also linked to a YouTube video and you can probably guess which one.
Show more
0
61
2.3K
489
Forward to community
The vibes in China's AI labs My blog about my recent trip to China is up, link in replies.