We’re excited to welcome Mooncake to the PyTorch Ecosystem!
Mooncake is designed to solve the “memory wall” in LLM serving. By integrating its high-performance KVCache transfer and storage capabilities with PyTorch-native inference engines such as SGLang, vLLM, and TensorRT-LLM, Mooncake unlocks new levels of throughput and scalability for large language model deployments.
Mooncake enables prefill-decode disaggregation, global KVCache reuse, and elastic expert parallelism, and serves as a fault-tolerant PyTorch distributed backend.
#PyTorch #OpenSourceAI #LLM #AIInfrastructure