will brown(@willccbb ):my take on subq is that it’s not that big of a deal that someone benchmaxxed a linear attention model on mrcr v2 and swebench people have already shown that you can take an oss model and linearize it without crazy perf loss pretty cheaply just not that useful in practice

will brown

@willccbb

reward hacking @primeintellect

Joined February 2015

1.3K Following 43.6K Followers

will brown@willccbb

2026.05.06 08:46

my take on subq is that it’s not that big of a deal that someone benchmaxxed a linear attention model on mrcr v2 and swebench people have already shown that you can take an oss model and linearize it without crazy perf loss pretty cheaply just not that useful in practice

243

Forward to community

Most Popular Users