Your all-in-one AI video pipeline — write, image, video & audio. Powered by Google Gemini + Azure.
One key for all Google tools — from Google AI Studio → Get API key. Text generation has a generous free tier.
Create a free Azure account → create a Speech resource (tier F0) → Keys and Endpoint → copy KEY 1 + Region.
In Azure, create a Translator resource (tier F0 = 2M chars/month free) → Keys and Endpoint → copy KEY 1 + the Location/Region. (This is a different resource from Speech.)
In Azure, create a Computer Vision (Azure AI Vision) resource → Keys and Endpoint → copy the Endpoint URL and KEY 1. Note: image captions need a region like East US, West US 2, or West Europe; OCR works in all regions.
Same key as the Video tab — from Google AI Studio → Get API key. Imagen is a paid feature (cheaper than Veo).
Go to Google AI Studio → Get API key. Veo video generation is a paid feature — you must enable billing on the key. Generation takes 1–3 minutes per clip.