Register and share your invite link to earn from video plays and referrals.

AK
@_akhaliq
AI research paper tweets, ML @Gradio (acq. by @HuggingFace ๐Ÿค—) dm for promo ,submit papers here:
3.1K Following    497.7K Followers
I believe on-prem and local AI - based on @huggingface open-source models - will be an important answer to the GPU shortages this year (because they are cheaper, faster, safer than cloud APIs)! Great collaboration between @huggingface & @MichaelDell @Dell to make this a reality for enterprise today. Announced at the main keynote of Dell Technologies World.
Show more
"We give you model choice, without infrastructure chaos" โ€” @MichaelDell, live from #DellTechWorld# ๐ŸŽค Kimi K2.6, DeepSeek V4 Pro, GLM 5.1, MiniMax M2.7 & DeepSeek V4 Flash are now one click away on Dell Enterprise Hub, optimized for PowerEdge XE9780 with NVIDIA B300.
Show more
Introducing a revival of PapersWithCode! As @ilyasut said, we're back to the "age of research". Hence, it's important to share research and build on each other's work. > find SOTA per domain, not just LLMs > leaderboards > methods > all parsed at scale using AI agents.
Show more
Hello again, everyone! We've got another really fun 9b, this one specifically trained for tool calling and agentic coding workflows in @NousResearch Hermes agent. Happy to report that it crushes, and as a 9b it runs on super affordable hardware. We also hit this one with some coding domain-specific training, and it scored a 53.33% on SWE bench on a slice of 200 samples! To me, I was really shocked to see this high of a score on a 9B model in swe, correct me if I'm wrong, but I think that's nipping at the heels of the Gemma 4 series, much larger models on this particular benchmark, which is really incredible to see! It also crushes the HermesAgent-20 benchmark, scoring an 85 vs the base model's 71! Make sure to run it hot, --temp around 1, that seems to be the sweet spot for running these particular fine tunes in harnesses. If you have trouble, you can work your way down, but it does a much better job departing from base models, overthinking when you run it, high temp ~1. Please spin it up in Hermes and let us know your thoughts! Looking forward to hearing your feedback as always! Also, those of you waiting for Qwopus 3.6 27B, I have put together a preliminary evaluation for you in my HF repo, go check it out; we will be releasing the full model very soon! I will put the preliminary repo in the comments!
Show more
0
67
1.4K
130
Forward to community
Can fast generative models still be likelihood-based? Excited to share our new work @Apple MLR --Normalizing Trajectory Models a step toward high-quality few-step generation with exact trajectory likelihood, powered by normalizing flows. Paper: [1/9]
Show more
DeepSeek V4 Flash on a single RTX Pro 6000? ๐Ÿ‘€
Weโ€™re releasing a 30B-A3B reasoning model that reaches gold-medal level across both physics and math Olympiad evaluations: IPhO directly, and IMO/USAMO with test-time self-verification and refinement. A simple, unified scaling recipe for proof search.
Show more
0
19
1.3K
146
Forward to community
๐Ÿš€ DCI just hit #1# on Hugging Face Daily Papers! Try it Now! @HuggingPapers
NVIDIA just released a paper review dataset on Hugging Face APRES, Agents4Science, and Sakana v2 subsets covering human and AI-authored papers with real review decisions.
this week @huggingface crossed 1M datasets ๐Ÿš€ every open model you love was built on top of them next objective: more open coding session traces on Hub to push coding models even further ๐Ÿค help push the open frontier by uploading your traces!
Show more
We asked the CEO of HuggingFace @ClementDelangue what the risks of releasing powerful open source models are. He says restricting AI creates more risk than openness. "Six, seven years ago, at the time it was GPT-2, and there was already a lot of people saying that it was too dangerous to release in open source." "Mythos, when it was announced was crazy dangerous... In a few weeks or a few months, everyone is gonna be using Mythos, and not destroy the world as a result." "For cybersecurity, the biggest risk is that a few players have capabilities that other people don't have... If you make it more open, it's usually easier for defenders to react and make the whole system safer." "The idea of restricting a technology like AI based on risks is like saying, 'Some people can punch other people, so let's tie down everybody's hands.'" "Otherwise you slow down progress, you create massive gaps in terms of controls, in terms of capabilities, and you create actually additional risks."
Show more
0
44
554
107
Forward to community
Check out this fantastic video created by @_jong_hyun_park for his channel and Korean audience! It offers a great behind-the-scenes look at what we do at the LeRobot team. We had a wonderful time doing the interview, and the video is packed with interesting insights.
Show more