Kimi.ai
@Kimi_Moonshot
Built by Moonshot AI to empower everyone to be superhuman. ⚡️API: @KimiProduct where we share cool use cases and prompts.
Joined December 2024
136 Following    169.4K Followers
We are excited to have @baseten as a day 0 launch partner for Kimi K2.6! Their inference stack brings KV-aware routing, NVFP4 on Blackwell, multi-modal hierarchical caching, and prefill-decode disaggregation, so K2.6 runs the way it's meant to in production. Try it out at:
Kimi K2.6 has landed, and it is live on Baseten! We have baked in multiple inference optimizations so that you can leverage Kimi K2.6 in production right away.

To run Kimi K2.6, Baseten uses:
-> The Baseten Inference Stack with advanced optimizations, including KV-aware routing
-> NVFP4 weights to unlock maximum performance on NVIDIA Blackwell GPUs
-> Multimodal hierarchical caching for low-latency vision input
-> Prefill-decode disaggregation for LLM inference optimization

Try it now at: