Search BEHIND_THEPOPUP on X

New York Post@nypost

2026.06.01 22:42

Behind the scenes look at iconic Disney ride unveiled as it malfunctions, lights turned on

0

3

0

Forward to community

Lay Zhang Studio@lay_studio

2026.05.28 08:51

Behind the scenes 📷 Two languages, one soundtrack. The Great Wall stands witness to everlasting bonds. Let the song carry us — @layzhang sings for China-Pakistan friendship atop this historic landmark! 🔗 #LAY# #LAYZHANG# #ZHANGYIXING# #张艺兴#

0

19

1.1K

563

Forward to community

Fuli Luo@_LuoFuli

2026.05.27 12:50

Behind the MiMo API Price Reduction: The deepest price cut, up to 99%, is for Input (Cache Hit). The core reason is our inference framework now supports hierarchical KV cache optimization for SWA. Production inference engine tests show this optimization increases cached token capacity by 5x, equivalent to an 80% reduction in caching costs. Combined with Cache Read Overlap among multiple Full Attention modules in the Hybrid model, actual costs are further reduced. Prices for Input (Cache Miss) and Output are also reduced by 60%-80%. This mainly benefits from the extreme 1:7 Full:SWA sparsity ratio brought by the model architecture (the prefill compute of the 70-layer MiMo-V2.5-Pro roughly equals a 10-layer GQA model). This kept our original inference costs well below the industry average, naturally leaving a 2x-3x profit margin in pricing. This price adjustment simply reflects our decision to pass these structural cost efficiencies directly to developers. Operating at these newly reduced API prices, our production inference engine is running at near full capacity, and we can still essentially break even. We previously advised LLM companies not to "blindly cut prices" precisely because very few model architectures and inference optimizations can keep API costs from running at a loss. If more architectures that save compute and KV cache emerge, along with better inference Infra to drive down API costs, this will form an excellent virtuous cycle in the industry. More crucially, affordable, high-performance model APIs will drive real, sustained, and at-scale inference demand. This upstream demand pulls forward the development of the entire AI infrastructure chain—including chips, servers, optical transceivers, PCBs, liquid cooling, power, energy storage, and data centers—serving as a strategic fulcrum for a systemic revaluation of AI hardware. In the long run, this injects more affordable and accessible compute into both training and inference pipelines, accelerating the parallel evolution of global AGI across multiple regions and technical routes. For more technical details, we will release a detailed Blog post later.

0

54

439

61

Forward to community

Giulia Bruno@prontoyyc

2026.05.25 01:33

Behind the scenes!! #pizza# #pasta# #italianfood# #dinnertime# #calgary# #entrepreneur# #viral#

0

19

692

16

Forward to community

໊@boomihoor

2026.05.24 03:40

behind every hot girl there is a deep history with the sims

0

108

37.3K

10K

Forward to community

18livefun@18livefun

2026.05.22 04:42

Behind The Scenes Of A Curvy Asian Girl’s Nude Dance Party In Her Bedroom Watch live 👉

0

2

539

274

Forward to community

The Will Cain Show@WillCainShow

2026.05.19 15:58

BEHIND-THE-SCENES AT THE WHITE HOUSE 👀 President Trump gives reporters a first-hand look at the White House ballroom construction project. 🇺🇸 Credit: Margo Martin

0

4

20

2

Forward to community

Debra Lea@thedebralea

2026.05.15 23:33

Behind the scenes of a Fox News appearance. All the things they don’t show you on TV 🎥

0

45

807

50

Forward to community

Liverpool FC@LFC

2026.05.15 19:49

Behind at half-time. #AVLLIV#

0

822

1.5K

155

Forward to community

Emily Goodin@Emilylgoodin

2026.05.14 16:25

Behind-the-scenes details of the many shouting matches/skirmishes between US and Chinese officials today - and why a camera man working for director Brett Ratner was capturing footage:

0

14

90

36

Forward to community