Rather than requiring a camera crew or actors, the new system lets a script be fed into the editor and then delivered on screen by one of a dozen digital presenters. Each avatar comes with its own look and voice, offering a quick way to create training guides or business explainers. The option was previewed earlier this year and is now broadly available to paying Workspace and education customers, as well as subscribers to Google’s AI plans.

The app also gains tools that handle some of the more tedious editing steps. Vids can now spot filler words such as “uh” or “ah” and cut them, closing up awkward pauses at the same time. Another addition, built on the Veo 3 model, can generate an eight-second animation from a single picture, making it possible to turn a static product image into a short moving clip.
For people who simply want to assemble slides, templates, or stock footage, Google has prepared a free consumer edition. This entry-level version includes the company’s design library and basic controls, but it leaves out the AI-driven functions being offered to business accounts.
Further upgrades are still on the roadmap. Noise cancellation, customizable backgrounds, and new output shapes such as portrait and square are all being worked on, though no release date has been given.
With rivals like Synthesia and D-ID already offering avatar-driven video production to companies, Google is now placing Vids as its own answer, balancing high-end AI options for enterprise users with a lighter tool that anyone can access.
Notes: This post was edited/created using GenAI tools.
Read next: Users Question Google’s AI Overviews After Spam Policy Update