註冊並分享邀請連結,可獲得影片播放與邀請獎勵。

filipe
@filicroval
data eng @ledger | ai agents, devtools, and workflow
加入 May 2023
88 正在關注    153.9K 粉絲
the web got fast because of CDNs: instead of fetching the same page from across the planet every time, it's stored nearby and reused. ai inference never had that. so every time a model sees the same context, your long chat history, the same rag documents, the same system prompt, it recomputes all of it from scratch, burning gpu you pay for. a free open source layer finally built the CDN for ai: store what the model already worked out, reuse it everywhere. on repetitive workloads the project reports 3-10x faster responses.
顯示更多