Perplexity
@perplexity_ai
Curiosity changes everything. Download our free app on iOS, Mac, Windows, and Android.
Joined December 2022
76 Following    487.9K Followers
We’ve developed our own inference engine, the Runtime-Optimized Serving Engine (ROSE), to serve models ranging from embeddings to trillion-parameter LLMs. With CuTeDSL integrated into ROSE, Perplexity can build specialized GPU kernels faster and bring models up to peak performance on NVIDIA Hopper and Blackwell GPUs.