Search results containing "SSA雪"
Another AI rival has arrived! SubQ has launched its first frontier LLM, built on a fully sub-quadratic sparse-attention (SSA) architecture, with a context window of 12 million tokens.

Key figures:
• Compute is roughly 1/1000 that of a conventional quadratic Transformer
• 1M-token prefill is 52x faster than FlashAttention
• Cost is less than 5% of Claude Opus

Benchmarks:
• SWE-Bench Verified: 81.8%
• RULER @128K: 95%

Early access and the SubQ Code agent are now open:
Introducing SubQ - a major breakthrough in LLM intelligence.

It is the first model built on a fully sub-quadratic sparse-attention architecture (SSA), and the first frontier model with a 12 million token context window, which is:
- 52x faster than FlashAttention at 1M tokens
- Less than 5% the cost of Opus

Transformer-based LLMs waste compute by processing every possible relationship between words (standard attention). Only a small fraction actually matter. @subquadratic finds and focuses only on the ones that do. That's nearly 1,000x less compute and a new way for LLMs to scale.
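To get a feel for why skipping most token pairs matters, here is a rough back-of-the-envelope sketch in Python. It only counts query-key scores: standard attention computes one score per token pair, while a generic sparse scheme lets each query attend to a fixed budget of keys. The function names and the `keys_per_query` budget are hypothetical and chosen for illustration; the post does not say how SubQ actually selects which relationships to keep, so this is not a description of their method.

```python
def dense_score_count(n_tokens: int) -> int:
    """Query-key scores computed by standard (quadratic) attention."""
    return n_tokens * n_tokens


def sparse_score_count(n_tokens: int, keys_per_query: int) -> int:
    """Scores computed when each query attends to a fixed budget of keys.

    keys_per_query is an assumed, illustrative budget, not a published
    SubQ parameter. It is capped at n_tokens for very short inputs.
    """
    return n_tokens * min(keys_per_query, n_tokens)


def reduction_factor(n_tokens: int, keys_per_query: int) -> float:
    """How many times fewer scores the sparse scheme computes."""
    dense = dense_score_count(n_tokens)
    sparse = sparse_score_count(n_tokens, keys_per_query)
    return dense / sparse


if __name__ == "__main__":
    # Context lengths mentioned in the post: 128K, 1M, and 12M tokens.
    for n in (128_000, 1_000_000, 12_000_000):
        factor = reduction_factor(n, keys_per_query=1_000)
        print(f"{n:>12,} tokens: {factor:,.0f}x fewer scores")
```

Under this simplified model the saving is just the context length divided by the per-query budget, so it grows linearly with context length. Real speedups also depend on memory traffic, kernel efficiency, and the cost of choosing which keys to keep, none of which this count captures.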