Today we launched Reve 2.0! We built it from the ground up with a completely new architecture.
All visual generative models today use text as intermediate representation, leveraging Large Language Models to plan their outputs before rendering any pixels.
Natural language is expressive, but it is ambiguous. Ambiguity is the enemy of control.
Two years ago, we made a different bet. We replaced text with a better, code-like semantic representation — a layout.
Layout is the reason we can compete with models trained on 10× our compute. It opens up a whole new world of precise, non-verbal visual control.
Very proud of the Reve team for this incredible achievement!
顯示更多
Today, we’re launching Reve 2.0, the best 4K image model in the world.
We invented a new way to generate and edit any image using precise layouts. For the first time, it’s possible to create images you can touch.
顯示更多