Generate premium-quality videos from text descriptions with Google’s Veo 3.1 model (quality tier). This is the high-quality variant of the Veo 3.1 family, optimized for visual fidelity and detail.
Property         Value
Provider         Google
Model            Veo 3.1
Capability       Text to Video
Base Cost        200,000 micro-cents/second ($0.20/sec)
Processing Time  ~240 seconds
Request Body
model: Model slug. Use google/veo-3.1-quality/text-to-video for text-to-video generation.
input: Input parameters for text-to-video generation.
prompt: Text description of the video to generate (max 4000 characters).
aspect_ratio: Video aspect ratio. Default: 16:9. Options: 16:9, 9:16.
Seed for reproducible results.
resolution: Output resolution. Default: 720p. Options: 720p, 1080p, 4k.
duration: Video duration in seconds. Default: 8. Options: 4, 6, 8.
has_sound: Enable audio generation. Default: true. Options: true, false.
HTTPS URL to receive a webhook notification when the job completes or fails.
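The documented options can be checked locally before a job is submitted. The following is a minimal sketch, not part of the API: validate_input is a hypothetical helper, and its argument order (prompt, aspect_ratio, resolution, duration, has_sound) is an assumption that follows the field names used in the curl example.

```shell
# Hypothetical pre-flight check (not part of the API). Returns 0 when every
# value matches the documented options; otherwise prints a reason and returns 1.
# Usage: validate_input <prompt> <aspect_ratio> <resolution> <duration> <has_sound>
validate_input() {
  # ${#1} is the prompt length. Some shells count bytes rather than
  # characters, so multi-byte prompts may be rejected slightly early.
  if [ "${#1}" -gt 4000 ]; then
    echo "prompt exceeds 4000 characters" >&2
    return 1
  fi
  case "$2" in
    16:9|9:16) ;;
    *) echo "aspect_ratio must be 16:9 or 9:16" >&2; return 1 ;;
  esac
  case "$3" in
    720p|1080p|4k) ;;
    *) echo "resolution must be 720p, 1080p, or 4k" >&2; return 1 ;;
  esac
  case "$4" in
    4|6|8) ;;
    *) echo "duration must be 4, 6, or 8" >&2; return 1 ;;
  esac
  case "$5" in
    true|false) ;;
    *) echo "has_sound must be true or false" >&2; return 1 ;;
  esac
}

# Example: the values used in the curl request in this document.
validate_input "A golden retriever running through a sunlit meadow" \
  16:9 1080p 8 true && echo "input looks valid"
```

A check like this only mirrors the options listed above; the API remains the authority on what it accepts.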
Pricing
Base cost: 200,000 micro-cents per second ($0.20/sec)
finalCost = baseCost × duration × resolution × has_sound
Factor      Option  Multiplier
Duration    4       4x
Duration    6       6x
Duration    8       8x
Resolution  720p    1x
Resolution  1080p   1x
Resolution  4k      1.5x
Sound       false   1x
Sound       true    1.5x
Default cost: 8 seconds, 720p, with sound = 200,000 × 8 × 1 × 1.5 = 2,400,000 micro-cents ($2.40)
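The formula and multipliers above can be applied mechanically. The following is a minimal sketch of that arithmetic, assuming the multipliers are exactly as listed; estimate_cost is a hypothetical helper, not something the API provides, and the cost reported by the API for a job remains authoritative.

```shell
# Hypothetical helper (not part of the API): estimate a job's cost in micro-cents.
# Usage: estimate_cost <duration: 4|6|8> <resolution: 720p|1080p|4k> <has_sound: true|false>
estimate_cost() {
  base_cost=200000   # micro-cents per second, from the pricing section

  case "$1" in
    4|6|8) duration=$1 ;;
    *) echo "unsupported duration: $1" >&2; return 1 ;;
  esac

  # Multipliers are stored as tenths (10 = 1x, 15 = 1.5x) so that the
  # shell's integer arithmetic stays exact.
  case "$2" in
    720p|1080p) res_tenths=10 ;;
    4k)         res_tenths=15 ;;
    *) echo "unsupported resolution: $2" >&2; return 1 ;;
  esac

  case "$3" in
    false) sound_tenths=10 ;;
    true)  sound_tenths=15 ;;
    *) echo "has_sound must be true or false: $3" >&2; return 1 ;;
  esac

  # finalCost = baseCost x duration x resolution x has_sound
  echo $(( base_cost * duration * res_tenths * sound_tenths / 100 ))
}

estimate_cost 8 720p true    # documented default: prints 2400000
estimate_cost 4 4k false     # prints 1200000
```

Using this document's conversion of 200,000 micro-cents to $0.20, dividing the result by 1,000,000 gives the price in dollars.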
Response
Unique identifier for the submitted job.
Initial job status. Always "pending" on successful submission.
ISO 8601 timestamp of the estimated completion time.
The cost of the job in micro-cents.
Code Examples
curl -X POST https://api.muvi.video/v1/jobs/submit \
  -H "Authorization: Bearer $PIXELBYTE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "google/veo-3.1-quality/text-to-video",
    "input": {
      "prompt": "A golden retriever running through a sunlit meadow",
      "aspect_ratio": "16:9",
      "resolution": "1080p",
      "duration": "8",
      "has_sound": "true"
    }
  }'