Michaël Gharbi
@m_gharbi
Joined May 2012
Today we launched Reve 2.0! We built it from the ground up with a completely new architecture. All visual generative models today use text as an intermediate representation, leveraging Large Language Models to plan their outputs before rendering any pixels. Natural language is expressive, but it is ambiguous. Ambiguity is the enemy of control. Two years ago, we made a different bet. We replaced text with a better, code-like semantic representation: a layout. Layout is the reason we can compete with models trained on 10× our compute. It opens up a whole new world of precise, non-verbal visual control. Very proud of the Reve team for this incredible achievement!
Today, we’re launching Reve 2.0, the best 4K image model in the world. We invented a new way to generate and edit any image using precise layouts. For the first time, it’s possible to create images you can touch.