註冊並分享邀請連結,可獲得影片播放與邀請獎勵。

Qwen
@Alibaba_Qwen
Open foundation models for AGI.
加入 February 2024
6 正在關注    209.4K 粉絲
Today we’re releasing Qwen-Scope 🔭, an open suite of sparse autoencoders for the Qwen model family. It turns SAE features into practical tools: 🎯 Inference — Steer model outputs by directly manipulating internal features, no prompt engineering needed 📂 Data — Classify & synthesize targeted data with minimal seed examples, boosting long-tail capabilities 🏋️ Training — Trace code-switching & repetitive generation back to their source, fix them at the root 📊 Evaluation — Analyze feature activation patterns to select smarter benchmarks and cut redundancy We hope the community uses Qwen-Scope to uncover new mechanisms inside Qwen models and build applications beyond what we explored.Excited to see what you build! 🚀 🔗🔗 Blog: HuggingFace: ModelScope: Technical Report:
顯示更多
0
94
2.7K
361
轉發到社區