Video Generation Model Veo 3 Integrates with Gemini 3.5 for Action-Based Control

Google has introduced Veo 3, the latest iteration of its video generation model designed to handle more sophisticated motion and temporal consistency. This update integrates the model with Gemini 3.5, allowing creators to use natural language for complex scene orchestration.

Google has announced Veo 3, the latest version of its video generation technology, alongside the release of Gemini 3.5. The update focuses on improving the model's ability to interpret complex prompts and carry out specific actions within a shot. For filmmakers and digital creators, this marks a shift toward more predictable, controllable synthetic media and away from the erratic results common in earlier generative models.

What's new

Veo 3 introduces several technical improvements focused on temporal consistency and physical accuracy. The model can now handle longer sequences without the visual drifting or warping that often plagues AI-generated footage. By drawing on the reasoning capabilities of Gemini 3.5, the system is said to model the physics of motion more faithfully, so that objects move and interact with their environment more realistically.

Key updates include:

  • Enhanced spatial awareness, for better object placement and movement within the three-dimensional scene that a 2D frame depicts.
  • Improved instruction following, allowing users to specify camera angles, lighting changes, and character movements with higher precision.
  • Faster inference times, reducing the wait between prompt submission and the final video output.
  • Better rendering of fine details, such as skin textures, fabric movement, and environmental elements like smoke or water.

How it fits your workflow

For directors and cinematographers, Veo 3 serves as a sophisticated pre-visualization tool. Instead of relying on static storyboards or crude 3D animatics, teams can generate high-fidelity motion tests to communicate a specific vision to the crew. This is particularly useful for establishing the mood of a scene or testing complex camera movements before arriving on set.

Editors and VFX artists can use Veo 3 to generate b-roll or background plates that might otherwise require expensive stock footage or location shoots. While it does not yet replace high-end cinema cameras for primary photography, it functions as a visual scratchpad whose output can be refined into final assets. It competes directly with tools like Runway Gen-3 Alpha and Luma Dream Machine, offering an alternative for teams already working in the Google Workspace or Google Cloud ecosystems. The integration with Gemini 3.5 means the model can act as a creative partner, suggesting visual interpretations based on a script or a simple text description.

What it costs / how to try it

Veo 3 is currently being rolled out to select creators and enterprise partners through VideoFX and Google Cloud. Access is expected to expand as the model undergoes further safety testing. Interested users can join the waitlist on the Google DeepMind or labs.google websites to try the model as access becomes available.

Read the original announcement on Google Veo 3 ↗
