Google has unveiled Veo, a video-generation model available in preview on Vertex AI, the company's model repository platform, similar to Amazon Bedrock.
The company says Veo can generate "high-quality, high-definition videos based on text or image prompts in a wide range of cinematic and visual styles with exceptional speed."
Veo can create videos "that can go beyond a minute" in up to 1080p resolution. It can accept both text prompts and images as inputs. The model supports a wide range of visual styles and "creates footage that's consistent and coherent."
Besides Veo, Google also announced Imagen 3, the next-generation update for its image generation model. It can generate "photorealistic, lifelike images," and is bundled with powerful editing tools that let you alter images with simple text prompts, selective (mask-based) editing, and upscale images to suit requirements.
It is now possible for companies to infuse their brand logos or styles into an image. Google says this will augment "the marketing process for advertising and marketing assets." Imagen 3 will be available next week in Vertex AI.
The competition for AI-based media generation tools is heating up, with everyone vying for the top spot in the enterprise market. Amazon just announced its Nova foundational models. OpenAI is close to releasing Sora, while Meta has Movie Gen.
Chinese companies are part of the skirmish too, with the likes of Alibaba, Tencent, and Kuaishou battling it out in their territory.