anonymous
@youyouAllen
Nothing
Joined August 2014
1.2K Following    3.1K Followers
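My sense is that, as models evolve toward lower cost and higher efficiency, MoE and KV cache compression only deliver gains at the task level. At the token level, attention will shift from unidirectional toward bidirectional, combining autoregression with diffusion to cut costs and raise efficiency further.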