IBL News | New York
Google made its Veo 3 high-resolution video and synchronized audio available to developers via the Gemini API.
For now, the API is limited to text-to-video, but image-to-video support—already live in the Gemini app—is on the way.
To help developers get started, Google AI Studio offers an SDK template and a starter app for quick prototyping. Access requires an active Google Cloud project with billing enabled.
Veo 3 is Google’s first model that can generate high-resolution video and synchronized audio from a single text prompt. It creates visuals, dialogue, music, and sound effects simultaneously.
Veo 3 handles a range of video generation tasks, from cinematic narratives to dynamic character animations, and also incorporates audio elements such as dialogue, music, and sound effects. Additionally, the model can simulate real-world physics for motion.
Google posted several examples in Veo 3 in Google AI Studio.
It’s priced at $0.75 per second for video and audio output, supporting 720p, 24fps video with audio in 16:9 format and up to 8 seconds long, one of the most expensive options on the market for AI video.
Videos generated by Veo 3 models include a digital SynthID watermark.
Prompt: Fluffy Characters Stop Motion: Inside a brightly colored, cozy kitchen made of felt and yarn. Professor Nibbles, a plump, fluffy hamster with oversized glasses, nervously stirs a bubbling pot on a miniature stove, muttering, “Just a little more… ‘essence of savory,’ as the recipe calls for.” The camera is a mid-shot, capturing his frantic stirring. Suddenly, the pot emits a loud “POP!” followed by a comical “whoosh” sound, and a geyser of iridescent green slime erupts, covering the entire kitchen. Professor Nibbles shrieks, “Oh, dear! Not again!” and scurries away, leaving a trail of tiny, panicked squeaks.
Prompt: The sequence begins with an extreme close-up of a single gear, slowly turning and reflecting harsh sunlight. The camera gradually pulls back in a continuous movement, revealing this is but one component of a colossal, mechanical heart half-buried in a desolate, rust-colored desert. A sweeping aerial shot establishes its enormous scale and isolation in the barren landscape. The camera descends to capture pipes hissing steam and the rhythmic thumping that echoes across the empty plains. A subtle shake effect synchronizes with each massive heartbeat. A lateral tracking shot discovers tiny, robed figures scurrying across the metallic surface. The camera follows one such figure in a detailed tracking shot as they perform meticulous maintenance, polishing brass valves and tightening immense bolts. A complex movement circles the entire structure, capturing different maintenance teams working in precarious positions across its rusted exterior. The final shot begins tight on the meticulous work of one tiny figure before executing a dramatic pull-out that reveals the true scale of the heart and the minuscule size of its caretakers, tending to the vital organ of an unseen, sleeping giant that extends beyond the frame.