가입 후 초대 링크를 공유하면 동영상 재생 및 초대 보상을 받을 수 있습니다.

Baseten
@baseten
Inference is everything.
가입 March 2021
340 팔로잉 중    10.2K
Kimi K2.6 has landed, and it is live on Baseten! We have baked in multiple inference optimizations so that you can leverage Kimi K2.6 in production right away. To run Kimi K2.6, Baseten uses: -> The Baseten Inference Stack with advanced optimizations, including KV-aware routing -> NVFP4 weights to unlock maximum performance on NVIDIA Blackwell GPUs -> Multimodal hierarchical caching for low-latency vision input -> Prefill-decode disaggregation for LLM inference optimization. Try it now at:
더 보기