Veo 3.1 — Text to Video (Quality)

curl --request POST \
  --url https://api.muvi.video/v1/jobs/submit \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "model": "<string>",
  "input": {
    "prompt": "<string>",
    "aspect_ratio": "<string>",
    "seed": 123,
    "resolution": "<string>",
    "duration": "<string>",
    "has_sound": "<string>"
  }
}
'

{
  "jobId": "<string>",
  "status": "<string>",
  "estimatedCompletionTime": "<string>",
  "costMicroCents": 123
}

Generate premium quality videos from text descriptions with Google’s Veo 3.1 model (quality tier). This is the high-quality variant of the Veo 3.1 family, optimized for maximum visual fidelity and detail.

Property	Value
Provider	Google
Model	Veo 3.1
Capability	Text to Video
Base Cost	200,000 micro-cents/second ($0.20/sec)
Processing Time	~240 seconds

Request Body

model (string, required)

Model slug. Use google/veo-3.1-quality/text-to-video for text-to-video generation.

input (object, required)

Input parameters for text-to-video generation.

prompt (string, required)

Text description of the video to generate (max 4000 characters).

aspect_ratio (string)

Video aspect ratio. Default: 16:9. Options: 16:9, 9:16.

seed (integer)

Seed for reproducible results.

resolution (string)

Output resolution. Default: 720p. Options: 720p, 1080p, 4k.

duration (string)

Video duration in seconds. Default: 8. Options: 4, 6, 8.

has_sound (string)

Enable audio generation. Default: true. Options: true, false.

webhookUrl (string)

HTTPS URL to receive a webhook notification when the job completes or fails.
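
The field rules above can be checked on the client before a request is sent. The following is a minimal sketch, not part of any official SDK: `build_payload` and the constant names are hypothetical, and rejecting an empty prompt is an assumption based on the field being required. `webhookUrl` is left out because this page does not make clear where in the body it belongs.

```python
# Allowed values, copied from the Request Body field descriptions.
MODEL_SLUG = "google/veo-3.1-quality/text-to-video"
ASPECT_RATIOS = {"16:9", "9:16"}
RESOLUTIONS = {"720p", "1080p", "4k"}
DURATIONS = {"4", "6", "8"}
MAX_PROMPT_CHARS = 4000


def build_payload(prompt, aspect_ratio="16:9", resolution="720p",
                  duration="8", has_sound=True, seed=None):
    """Hypothetical helper: validate options locally and return the request body."""
    # Assumption: an empty prompt is rejected because the field is required.
    if not prompt or len(prompt) > MAX_PROMPT_CHARS:
        raise ValueError("prompt must be between 1 and 4000 characters")
    if aspect_ratio not in ASPECT_RATIOS:
        raise ValueError(f"unsupported aspect_ratio: {aspect_ratio}")
    if resolution not in RESOLUTIONS:
        raise ValueError(f"unsupported resolution: {resolution}")
    duration = str(duration)  # duration is documented as a string
    if duration not in DURATIONS:
        raise ValueError(f"unsupported duration: {duration}")

    input_params = {
        "prompt": prompt,
        "aspect_ratio": aspect_ratio,
        "resolution": resolution,
        "duration": duration,
        # has_sound is documented as a string, so send "true" or "false".
        "has_sound": "true" if has_sound else "false",
    }
    if seed is not None:
        input_params["seed"] = int(seed)  # seed is documented as an integer
    return {"model": MODEL_SLUG, "input": input_params}
```

Calling `build_payload("A golden retriever running through a sunlit meadow", resolution="1080p")` returns a dictionary that can be serialized with `json.dumps` and used as the request body.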

Pricing

Base cost: 200,000 micro-cents per second ($0.20/sec)

finalCost = baseCost × duration multiplier × resolution multiplier × sound multiplier

Factor	Option	Multiplier
Duration	`4`	4x
	`6`	6x
	`8`	8x
Resolution	`720p`	1x
	`1080p`	1x
	`4k`	1.5x
Sound	`false`	1x
	`true`	1.5x

Default cost: 8 seconds, 720p, with sound = 200,000 × 8 × 1 × 1.5 = 2,400,000 micro-cents ($2.40)
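
The pricing table can also be expressed as a small calculation. This is a sketch for estimating cost on the client side; the function and constant names are hypothetical, and the dollar conversion follows this page's own convention that 200,000 micro-cents equals $0.20.

```python
BASE_COST_PER_SECOND = 200_000  # micro-cents per second, from the Pricing section

# Multipliers copied from the pricing table.
RESOLUTION_MULTIPLIER = {"720p": 1.0, "1080p": 1.0, "4k": 1.5}
SOUND_MULTIPLIER = {False: 1.0, True: 1.5}


def estimate_cost_micro_cents(duration=8, resolution="720p", has_sound=True):
    """Hypothetical helper: apply the documented multipliers to the base cost."""
    if duration not in (4, 6, 8):
        raise ValueError(f"unsupported duration: {duration}")
    cost = (BASE_COST_PER_SECOND * duration
            * RESOLUTION_MULTIPLIER[resolution]
            * SOUND_MULTIPLIER[has_sound])
    return round(cost)


def micro_cents_to_usd(micro_cents):
    # This page equates 200,000 micro-cents with $0.20, i.e. 1,000,000 per dollar.
    return micro_cents / 1_000_000


default_cost = estimate_cost_micro_cents()  # 8 seconds, 720p, with sound
print(default_cost, micro_cents_to_usd(default_cost))  # 2400000 2.4
```

The `costMicroCents` value returned by the API remains the authoritative figure; a local estimate like this is only useful for budgeting before submission.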

Response

jobId (string)

Unique identifier for the submitted job.

status (string)

Initial job status. Always "pending" on successful submission.

estimatedCompletionTime (string)

ISO 8601 timestamp of the estimated completion time.

costMicroCents (number)

The cost of the job in micro-cents.
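
A response with these four fields could be unpacked as sketched below. The sample body is made up for illustration (the job ID and timestamp are not real API output), `parse_submit_response` is a hypothetical helper, and the dollar conversion assumes this page's convention of 1,000,000 micro-cents per dollar.

```python
import json
from datetime import datetime


def parse_submit_response(body):
    """Hypothetical helper: turn the JSON response text into Python values."""
    data = json.loads(body)
    eta_text = data["estimatedCompletionTime"]
    # datetime.fromisoformat() rejects a trailing "Z" before Python 3.11,
    # so rewrite it as an explicit UTC offset first.
    eta = datetime.fromisoformat(eta_text.replace("Z", "+00:00"))
    return {
        "job_id": data["jobId"],
        "status": data["status"],
        "eta": eta,
        "cost_usd": data["costMicroCents"] / 1_000_000,
    }


# Illustrative sample only; the values are invented.
sample = json.dumps({
    "jobId": "job_123",
    "status": "pending",
    "estimatedCompletionTime": "2025-01-01T12:04:00Z",
    "costMicroCents": 2400000,
})
result = parse_submit_response(sample)
print(result["job_id"], result["status"], result["cost_usd"])
```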

Code Examples

curl -X POST https://api.muvi.video/v1/jobs/submit \
  -H "Authorization: Bearer $PIXELBYTE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "google/veo-3.1-quality/text-to-video",
    "input": {
      "prompt": "A golden retriever running through a sunlit meadow",
      "aspect_ratio": "16:9",
      "resolution": "1080p",
      "duration": "8",
      "has_sound": "true"
    }
  }'
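
The same request can be sent from Python using only the standard library. This is a sketch rather than an official client: `build_submit_request` and `submit_job` are hypothetical names, and the 30-second timeout is an arbitrary choice. The URL, headers, and body mirror the curl example above.

```python
import json
import urllib.request

SUBMIT_URL = "https://api.muvi.video/v1/jobs/submit"


def build_submit_request(api_key, payload):
    """Hypothetical helper: build the POST request without sending it."""
    return urllib.request.Request(
        SUBMIT_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def submit_job(api_key, payload, timeout=30):
    """Send the request and return the decoded JSON response."""
    request = build_submit_request(api_key, payload)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


payload = {
    "model": "google/veo-3.1-quality/text-to-video",
    "input": {
        "prompt": "A golden retriever running through a sunlit meadow",
        "aspect_ratio": "16:9",
        "resolution": "1080p",
        "duration": "8",
        "has_sound": "true",
    },
}

# To submit, read the key from the environment as the curl example does:
#   import os
#   job = submit_job(os.environ["PIXELBYTE_API_KEY"], payload)
#   print(job["jobId"], job["status"])
```

Separating request construction from sending keeps the network call in one place, which makes the payload and headers easy to inspect before anything is billed.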
