註冊並分享邀請連結,可獲得影片播放與邀請獎勵。

Baseten
@baseten
Inference is everything.
加入 March 2021
340 正在關注    10.2K 粉絲
We serve Qwen3-TTS on vLLM-Omni at $3 per 1M characters. That's 90% lower in cost than comparable closed-source TTS APIs. Our engineers optimized a single-replica serving stack to get there. Details on the optimized stack and cost per concurrent stream here.
顯示更多