Anthropic (@AnthropicAI): AI models aren’t yet general-purpose alignment scientists. Progress isn’t as easy to verify on most alignment research tasks: our AARs would find “fuzzier” research much harder. But our experiment does show that Claude can increase the rate of experimentation and exploration.


We're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant @claudeai on

Joined January 2021

36 following, 1.3M followers

Anthropic@AnthropicAI

2026.04.14 19:39

