modelgrep
Gemini Omni Flash sample output

Gemini Omni Flash

google/gemini-omni-flash/reference-to-video
image-to-videocommercial

Overview

Generates video with audio from combined multimodal references. Accepts text, images, audio, and video together as input to guide subject, motion, style, and sound in the output.

Frequently asked

What is Gemini Omni Flash?

Generates video with audio from combined multimodal references. Accepts text, images, audio, and video together as input to guide subject, motion, style, and sound in the output.

How can I use Gemini Omni Flash?

Gemini Omni Flash runs as a hosted API endpoint on fal.ai (google/gemini-omni-flash/reference-to-video), offered under a commercial license. No infrastructure needed — call it per generation.

More image to video models