๊ฐ€์ž… ํ›„ ์ดˆ๋Œ€ ๋งํฌ๋ฅผ ๊ณต์œ ํ•˜๋ฉด ๋™์˜์ƒ ์žฌ์ƒ ๋ฐ ์ดˆ๋Œ€ ๋ณด์ƒ์„ ๋ฐ›์„ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

Qwen
@Alibaba_Qwen
Open foundation models for AGI.
๊ฐ€์ž… February 2024
6 ํŒ”๋กœ์ž‰ ์ค‘    210.5K ํŒฌ
Today weโ€™re releasing Qwen-Scope ๐Ÿ”ญ, an open suite of sparse autoencoders for the Qwen model family. It turns SAE features into practical tools๏ผš ๐ŸŽฏ Inference โ€” Steer model outputs by directly manipulating internal features, no prompt engineering needed ๐Ÿ“‚ Data โ€” Classify & synthesize targeted data with minimal seed examples, boosting long-tail capabilities ๐Ÿ‹๏ธ Training โ€” Trace code-switching & repetitive generation back to their source, fix them at the root ๐Ÿ“Š Evaluation โ€” Analyze feature activation patterns to select smarter benchmarks and cut redundancy We hope the community uses Qwen-Scope to uncover new mechanisms inside Qwen models and build applications beyond what we explored.Excited to see what you build! ๐Ÿš€ ๐Ÿ”—๐Ÿ”— Blog: HuggingFace: ModelScope: Technical Report:
๋” ๋ณด๊ธฐ