
Alexander Whedon
@alex_whedon
Building better algorithms. Co-Founder at @subquadratic
59 Following    24.5K Followers
We've partnered with Appen to evaluate the benchmarks we published last week. Results are in and we've actually improved across the board. Link below to the full report.
@AppenResearch independently evaluated @subquadratic's SSA kernel - a learned sparse attention mechanism designed to reduce the quadratic scaling limitations of full attention. Results at 1M-token context lengths:
- 56.2× wall clock speedup vs. FA2
- 62.8× FLOP reduction (validated via torch.profiler, <4% variance from theoretical)
- 95.6% average score across RULER tasks at 128K
- 86.2% average score on the hardest MRCR 8-needle bucket (512K–1M contexts)
- 81.8% SWE-Bench Verified resolved rate
Full report:
Hey, folks! We have been blown away by the response to SubQ and the SSA breakthrough over the last 48 hours. It is awesome to see how many people are responding to our mission of building more efficient algorithms for better models. We are working hard to firm up our release timeline and will share more very soon. We will also share additional data and third-party validation in our model card next week. If you have questions, please post them in the thread, and I'll do my best to respond! Above all, THANK YOU! The support, feedback, and discussion from this community have been inspiring.
Introducing SubQ - a major breakthrough in LLM intelligence. It is the first model built on a fully sub-quadratic sparse-attention architecture (SSA), and the first frontier model with a 12 million token context window, which is:
- 52x faster than FlashAttention at 1M tokens
- Less than 5% the cost of Opus
Transformer-based LLMs waste compute by processing every possible relationship between words (standard attention). Only a small fraction actually matter. @subquadratic finds and focuses only on the ones that do. That's nearly 1,000x less compute and a new way for LLMs to scale.
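To make the idea above concrete: the details of the SSA kernel are not public, so the sketch below uses generic top-k sparse attention, where each query attends only to its k highest-scoring keys instead of all n. The function name and the top-k selection rule are illustrative assumptions, not SubQ's actual mechanism.

```python
# Minimal sketch of sparse attention, assuming a simple top-k selection rule.
# Full attention scores all n*n query-key pairs; keeping only k keys per
# query reduces the softmax/weighting work to n*k pairs. This is NOT the
# SSA kernel -- just an illustration of the "focus on the pairs that
# matter" idea described in the post.
import numpy as np

def topk_sparse_attention(Q, K, V, k):
    """Attend each query to only its k highest-scoring keys."""
    n, d = Q.shape
    scores = Q @ K.T / np.sqrt(d)                       # (n, n) pair scores
    # Threshold at each row's k-th largest score; mask the rest to -inf.
    kth = np.partition(scores, -k, axis=1)[:, -k][:, None]
    masked = np.where(scores >= kth, scores, -np.inf)
    # Softmax over the surviving pairs (exp(-inf) = 0 drops masked keys).
    weights = np.exp(masked - masked.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)
    return weights @ V                                  # (n, d) outputs

rng = np.random.default_rng(0)
n, d, k = 16, 8, 4
Q, K, V = rng.standard_normal((3, n, d))
out = topk_sparse_attention(Q, K, V, k)
print(out.shape)  # (16, 8)
```

The compute ratio of full to sparse attention here is roughly n/k, so large savings like the "nearly 1,000x" figure quoted above would require keeping only a tiny fraction of pairs at very long contexts; real sub-quadratic kernels also need the selection step itself to avoid materializing the full n×n score matrix, which this toy version does not.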
#ICLR2026 is this week! Join Subquadratic during the conference to chat about research, the future of the industry, and the underlying assumptions of AI that need to be challenged. Drinks on us 🍻