The AI world is still figuring out how to deal with the amazing show of prowess that is DALL-E 2’s ability to draw/paint/imagine just about anything… but OpenAI isn’t the only one working on something like that. Google Research has rushed to publicize a similar model it’s been working on — which it claims is even better.
Imagen (get it?) is a text-to-image diffusion-based generator built on large transformer language models that… okay, let’s slow down and unpack that real quick.
Text-to-image models take text inputs like “a dog on a bike” and pro
コメント