注册并分享邀请链接,可获得视频播放与邀请奖励。

Perplexity
@perplexity_ai
Curiosity changes everything. Download our free app on iOS, Mac, Windows, and Android.
加入 December 2022
76 正在关注    487.9K 粉丝
We’ve developed our own inference engine Runtime-Optimized Serving Engine (ROSE) to serve models ranging from embeddings to trillion-parameter LLMs. With CuTeDSL integrated into our inference engine, Perplexity can build the specialized GPU kernels faster to bring models up to peak performance on NVIDIA Hopper and Blackwell GPUs.
显示更多
0
75
1.1K
120
转发到社区