Discount on all models + if you follow us on twitter and hit us on dm you will get free credit to your email hurry up 🔥🔥🔥 click here
Any issues any issue at all join our discord or use the feedback system and report it we will solve it faster than you think ̲𝖢̲𝗅̲𝗂̲𝖼̲𝗄̲ ̲𝗁̲𝖾̲𝗋̲𝖾̲ it will redirect to discord server.

MODELS · ONE SUBSCRIPTION, ALL OF THEM

Every model. One bill.

getvivix runs 221+ frontier AI models for video, image, and audio in a single studio. Pick any model to see what it does, or start free.

Start free

Video models · 77

Kling VIDEO 3.0 4K

4K multimodal video generation with native audio and richer visual detail

kling-ai
Kling VIDEO O3 4K

4K variant of Kling O3 with native audio for premium production

kling-ai
LTX-2 Fast

High speed cinematic text to video with synced audio

lightricks
LTX-2 Pro

Cinematic LTX-2 Pro text and image to video generator

lightricks
PixVerse V6

Multi-shot cinematic video generation with native audio, 20+ camera controls, and character consistency

pixverse
Seedance 1.5 Pro

Native audio-visual cinematic AI video generation

bytedance
Seedance 2.0

Premium multimodal video generation with native audio and cinematic motion

bytedance
Seedance 2.0 Fast

Speed-optimized Seedance 2.0 for rapid iteration with native audio

bytedance
Veo 3.1 Fast

High speed Google Veo 3.1 Fast text to video generation

google
Wan2.7

Multimodal video generation with reference consistency, video editing, and native audio

alibaba
Aurora v1

High-quality audio-driven avatar video generation

creatify
Aurora v1 Fast

Fast audio-driven avatar video generation

creatify
Grok Imagine Video

AI video generation with synchronized audio from text and images

xai
Grok Imagine Video 1.5 Preview

Higher-tier Grok image-to-video from a single starting frame with longer durations

xai
HappyHorse 1.0

Alibaba text-to-video and image-to-video at 720p or 1080p with seeded generation and frame conditioning

alibaba
HeyGen Avatar IV

AI talking avatar video from a HeyGen avatar or your own photo, driven by a script or audio

heygen
HeyGen Avatar V

Talking digital twins with sharper identity and motion coherence

heygen
HeyGen Video Agent

AI-powered prompt-to-video production with avatars, B-roll, and motion graphics

heygen
Kling VIDEO 2.6 Pro

Kling VIDEO 2.6 Pro is a full audio-visual AI video model that combines cinematic-quality video generation with native audio (dialogue, sound effects, ambience), with optional Motion Control for precise character movement via the API.

kling-ai
Kling VIDEO 3.0 Pro

High-fidelity multimodal video generation with native audio and advanced editing

kling-ai
Kling VIDEO 3.0 Standard

Multimodal video generation with native audio and efficient performance

kling-ai
Kling VIDEO O3 Pro

Unified multimodal video generation with native audio and higher-fidelity renders

kling-ai
Kling VIDEO O3 Standard

Cost-efficient multimodal video generation with native audio and editing

kling-ai
KlingAI 1.6 Pro

High fidelity image to video model for dynamic 1080p clips

kling-ai
KlingAI 1.6 Standard

Mid-tier KlingAI 1.6 Standard text to video model

kling-ai
KlingAI 2.0 Master

KlingAI 2.0 Master for high control AI video generation

klingai
KlingAI 2.1 Master

Premium KlingAI 2.1 Master for high fidelity video

kling-ai
KlingAI 2.1 Pro

KlingAI 2.1 Pro for cinematic AI video generation

klingai
KlingAI 2.1 Standard

KlingAI 2.1 Standard for faster AI video generation

kling-ai
KlingAI 2.5 Turbo Pro

Cinematic text to video and image to video at scale

klingai
KlingAI 2.5 Turbo Standard

Fast cinematic image to video generation for creators

klingai
KlingAI Avatar 2.0 Pro

High fidelity avatar video generation with smoother motion and quality

kling-ai
KlingAI Avatar 2.0 Standard

Expressive avatar video generation from image and audio

kling-ai
KlingAI Lip-Sync

Accurate AI lip sync for character driven video content

kling-ai
LTX-2

Open-source AI video model with synchronized audio and high-fidelity output

lightricks
LTX-2 Retake

Segmented AI video retakes with precise in-shot control

lightricks
LTX-2.3

High-fidelity multimodal video generation with native audio

lightricks
LTX-2.3 Fast

Fast multimodal video generation optimized for rapid iteration

lightricks
MiniMax 01 Director

Cinematic text to video with precise camera control

minimax
MiniMax 01 Live

Anime video model for expressive character animation

minimax
MiniMax Hailuo 02

Cinematic AI video model for viral and commercial clips

minimax
MiniMax Hailuo 2.3

High fidelity AI video generation from text or images

minimax
MiniMax Hailuo 2.3 Fast

Fast MiniMax Hailuo 2.3 model for short cinematic video

minimax
OmniHuman-1.5

Cognitive avatar video from image, audio, and text

bytedance
P-Video

Real-time AI video generation with draft mode and native audio

prunaai
P-Video-Animate

Reference-image animation driven by the motion, timing, and camera movement of a source video

prunaai
P-Video-Replace

Swap the on-camera character in a video using a reference image, preserving motion, timing, camera and scene

prunaai
PixVerse LipSync

Realistic AI lip sync from audio for any video

pixverse
PixVerse V3.5

PixVerse V3.5 early text to video effects model

pixverse
PixVerse V4

PixVerse V4 AI text to video with pro camera control

pixverse
PixVerse V4.5

PixVerse V4.5 cinematic text and image to video model

pixverse
PixVerse V5

PixVerse V5 cinematic text to video and image to video

pixverse
PixVerse V5 Fast

Fast text to video and image to video generation for rapid iteration

pixverse
PixVerse V5.6

Enhanced cinematic video generation with improved lip-sync and audio realism

pixverse
Runway Aleph 2.0

Localized video editing that transforms an existing clip from a text prompt while keeping the rest stable

runway
Runway Gen-4 Turbo

High speed Gen-4 Turbo image to video generation

runway
Runway Gen-4.5

Advanced multimodal video generation with text and image input

runway
Seedance 1.0 Pro

Seedance 1.0 Pro high fidelity 1080p text and image to video

bytedance
Seedance 1.0 Pro Fast

Fast Seedance 1.0 Pro video generation for dance content

bytedance
SkyReels V4

Multimodal video-audio foundation model with 1080p cinematic output, inpainting, and video extension

skywork
Sora 2

Next generation AI video and audio model from OpenAI

openai
Sora 2 Pro

Premium Sora 2 Pro model for high fidelity AI video

openai
sync-3

Full-scene lip synchronization with global face understanding and obstruction handling

sync
Veo 2

High fidelity text to video generation with camera control

google
Veo 3

Cinematic video generation, now with native audio

google
Veo 3 Fast

Fast Google Veo 3 video generation with native audio

google
Veo 3.1

Veo 3.1 cinematic AI video with native audio

google
Vidu 2.0

Fast 1080p AI video generation with strong consistency

vidu
Vidu Q1

Vidu Q1 high fidelity reference to video generation model

vidu
Vidu Q2 Pro

High fidelity Vidu Q2 Pro model for cinematic AI video

vidu
Vidu Q2 Turbo

Faster Vidu Q2 video generation with advanced motion control

vidu
Vidu Q3

Multimodal video generation with native audio and intelligent shot planning

vidu
Vidu Q3 Turbo

Low-latency multimodal video generation with native audio

vidu
Wan2.2 A14B

MoE video generation from text or images at 480p to 720p

alibaba
Wan2.5-Preview

Wan2.5-Preview AI Text to Video with Native Audio

alibaba
Wan2.6

Multimodal video generation with multi-shot and native sound

alibaba
Wan2.6 Flash

Fast distilled image-to-video generation model

alibaba

Image models · 84

GPT Image 1.5

GPT Image 1.5 flagship image model with faster generation and enhanced editing

openai
GPT Image 2

OpenAI GPT Image 2 — high-fidelity generation and editing with up to 16 reference images

openai
Grok Imagine Image Pro

High fidelity AI image generation and editing with improved prompt control

xai
Kling IMAGE 3.0

2K to 4K image generation with improved realism and practical image-to-image editing

kling-ai
Kling IMAGE O3

4K Omni image generation with strong consistency and reference control

kling-ai
Nano Banana 2

Gemini 3.1 Flash Image fast high quality AI image generation and editing

google
Seedream 5.0 Lite

Responsive text-to-image generation with real-time search and precise prompt adherence

bytedance
Vivi

The getvivix signature model — instant images in any style

getvivix
Wan2.7 Image

Unified image generation and editing with avatar customization, color control, and multilingual text rendering

alibaba
Wan2.7 Image Pro

Premium image generation with enhanced composition stability and precise prompt comprehension

alibaba
Bria 3.2

Commercial-safe text to image model for production use

bria
Bria FIBO

Deterministic JSON native text to image for enterprises

bria
Bria FIBO Edit

Instruction-driven image editing with mask support

bria
Bria Fibo Edit Tools

Unified image editing foundation for recolor, relight, restore, blend, reseason, and sketch

bria
DALL·E 2

DALL·E 2 AI image generator for text guided creation

openai
DALL·E 3

DALL·E 3 high fidelity text to image generation API

openai
Exactly Bold Chromatics

Vibrant, high-contrast illustrative style with bold color palettes

exactly
Exactly Bright Pulse

Bright, energetic photographic style with vivid lighting

exactly
Exactly Dark Comics

Dark, gritty comic art style with heavy shadows and noir aesthetics

exactly
Exactly Distant Reality

Dreamy photographic style with surreal, distant atmosphere

exactly
Exactly Earthy Elegance

Warm, organic illustrative style with muted earth tones

exactly
Exactly Editorial Line

Clean, editorial-style line illustrations with refined detail

exactly
Exactly Extreme Contrast

High-contrast photographic style with dramatic light and shadow

exactly
Exactly Grain Film Look

Analog film photography style with natural grain and warm tones

exactly
Exactly Graphic Harmony

Balanced, harmonious graphic illustrations with cohesive composition

exactly
Exactly Graphic Novel

Comic book and graphic novel style with strong ink lines and dramatic shading

exactly
Exactly Graphite Creature

Textured graphite-style illustrations with creature and character focus

exactly
Exactly Journey

Travel and adventure photographic style with rich, cinematic tones

exactly
Exactly Monochrome Café

Monochromatic illustrative style with warm café-inspired tones

exactly
Exactly Muted Modern

Contemporary illustrative style with soft, muted color palettes

exactly
Exactly Playful Line Adventures

Whimsical, playful line art with an adventurous character

exactly
Exactly Warm Light

Soft, warm-lit photographic style with inviting golden tones

exactly
FLUX Virtual Try-On

Low-latency virtual try-on for transferring garments onto a person image with strong identity and garment fidelity

black-forest-labs
FLUX.1 [dev]

Open-weight 12B text to image model for rich visuals

black-forest-labs
FLUX.1 [schnell]

Ultra fast FLUX.1 text to image model for local use

black-forest-labs
FLUX.1 Kontext [dev]

Open image editing model for fast iterative workflows

black-forest-labs
FLUX.1 Kontext [max]

High fidelity FLUX.1 Kontext max for precise image edits

black-forest-labs
FLUX.1 Kontext [pro]

Context aware FLUX.1 image editing and generation model

black-forest-labs
FLUX.1 Krea [dev]

FLUX.1 Krea Dev for photorealistic open‑weight generation

black-forest-labs
FLUX.1.1 [pro]

FLUX.1.1 Pro high fidelity text to image generation

black-forest-labs
FLUX.1.1 [pro] Ultra

High speed 4MP FLUX image generation for production apps

black-forest-labs
FLUX.2 [dev]

FLUX.2 dev for controllable open text to image workflows

black-forest-labs
FLUX.2 [flex]

Configurable FLUX.2 Flex for precise text aligned images

black-forest-labs
FLUX.2 [klein] 4B

Fastest Klein model for real-time image generation and editing

black-forest-labs
FLUX.2 [klein] 4B Base

Compact undistilled model for efficient image generation and editing

black-forest-labs
FLUX.2 [klein] 9B Base

Undistilled foundation model for high-quality image generation and editing

black-forest-labs
FLUX.2 [klein] 9B KV

KV-cache accelerated image generation and editing for real-time multi-reference workflows

black-forest-labs
FLUX.2 [max]

The latest state-of-the-art model from Black Forest Labs, generating images grounded in live web information.

black-forest-labs
FLUX.2 [pro]

High control FLUX.2 Pro image generation and editing

black-forest-labs
GPT Image 1

GPT Image 1 high fidelity image generation for GPT-4o

openai
Grok Imagine Image

AI image generation from text and images

xai
Grok Imagine Image Quality

xAI's quality-focused image generation and editing — sharper realism, better text rendering, tighter prompt following

xai
HiDream-I1 Dev

HiDream-I1 Dev fast 17B text to image generation model

runware
HiDream-I1 Fast

HiDream-I1 Fast for low latency text to image generation

runware
HiDream-I1 Full

HiDream-I1 Full high fidelity text to image generator

runware
Ideogram 2.0

Ideogram 2.0 text to image model for sharp design work

ideogram
Ideogram 3.0

Ideogram 3.0 text to image model for sharp design visuals

ideogram
Ideogram 4.0

Design-focused text-to-image with strong typography, layout control, transparent backgrounds, and 2K output

ideogram
Imagen 3

High fidelity text to image generation with Imagen 3

google
Imagen 3 Fast

High speed Imagen 3 Fast model for rapid image generation

google
Imagen 4 Fast

High speed Imagen 4 Fast text to image generation

google
Imagen 4 Preview

High fidelity 2K text to image generation by Google

google
Imagen 4 Ultra

High fidelity text to image model with sharp typography

google
ImagineArt 1.5 Pro

Professional AI image generation with native 4K and refined visual control

imagineart
ImagineArt 2.0

Reasoning-based text to image generation with vibrant true-to-life color

imagineart
Juggernaut Lightning Flux by RunDiffusion

Ultra fast Flux-based model for high volume image generation

rundiffusion
Juggernaut Pro Flux by RunDiffusion

Photorealistic Flux based text to image model for pros

rundiffusion
Kandinsky 5.0 Image Lite

Efficient text-to-image and image-to-image editing model

runware
Krea 2 Large

Larger Krea 2 variant for rawer, more flexible outputs with stronger photorealism and weighted reference control

krea
Krea 2 Medium

Faster Krea 2 variant for stable, consistent generation with controllable prompt strength and weighted reference guidance

krea
Nano Banana

High quality multi image generation for complex visuals

google
P-Image

Real-time text-to-image model for production graphics

prunaai
P-Image-Edit

High precision multi image AI editor for fast workflows

prunaai
Qwen-Image

Qwen-Image high fidelity text aware image generation model

alibaba
Qwen-Image-2.0

Unified image generation and editing with professional text rendering

alibaba
Qwen‑Image‑Edit

High fidelity text guided image editing for Qwen

alibaba
Recraft V4

Professional text-to-image model for brand and marketing design

recraft
Recraft V4 Pro

Advanced design-focused image generation with enhanced control and fidelity

recraft
Seedream 4.0

High speed 4K AI image generation and editing model

bytedance
Stable Diffusion 3

Stable Diffusion 3 for sharper text and complex images

runware
Wan2.5-Preview Image

High fidelity Wan2.5 image generation for rich single frames

alibaba
Wan2.6 Image

High fidelity image generation built on the Wan2.6 visual stack

alibaba
Z-Image

Efficient high-quality image generation foundation model

alibaba
Z-Image-Turbo

Fast photorealistic image generator with text control

alibaba

Audio models · 19

ACE-Step v1.5 Base

Open-source music generation with voice cloning, lyric editing, and multilingual support

runware
ACE-Step v1.5 Turbo

Fast music generation optimized for speed with reduced inference steps

runware
Eleven Flash v2

Low-latency English TTS for real-time voice use-cases

elevenlabs
Eleven Flash v2.5

Real-time TTS for voice agents, 32 languages, ~75ms latency

elevenlabs
Eleven Monolingual v1

Legacy English-only TTS

elevenlabs
Eleven Multilingual v1

Legacy multilingual TTS across 9 languages

elevenlabs
Eleven Multilingual v2

High-fidelity multilingual TTS across 29 languages

elevenlabs
Eleven Music v1

Generate studio quality music tracks from text prompts

elevenlabs
Eleven Turbo v2

Low-latency English TTS for production

elevenlabs
Eleven Turbo v2.5

Fast multilingual TTS across 32 languages

elevenlabs
Eleven v3

Premium expressive TTS across 74 languages with audio tags

elevenlabs
Gemini 3.1 Flash TTS

Expressive text-to-speech with audio tags, multi-speaker dialogue, and 70+ languages

google
Inworld TTS-1.5 Max

High-fidelity expressive text-to-speech with rich prosody and multilingual support

inworld
Inworld TTS-1.5 Mini

Low-latency expressive text-to-speech optimized for real-time apps

inworld
MiniMax Speech 2.8

High-quality text-to-speech with expressive, natural voice synthesis

minimax
Qwen3-TTS 1.7B Base

High-quality multilingual text-to-speech with voice cloning and ultra-low latency

alibaba
Qwen3-TTS 1.7B CustomVoice

Text-to-speech with preset premium timbres and precise style control

alibaba
Qwen3-TTS 1.7B VoiceDesign

Text-to-speech with voice creation from natural language descriptions

alibaba
xAI Text-to-Speech

Expressive text-to-speech with five voices, speech tags, and multilingual support

xai

Text models · 23

Claude Haiku 4.5

Anthropic's fastest Claude — latency-optimized for agentic sub-tasks and high-volume work

anthropic
Claude Opus 4.7

Anthropic's flagship — demanding coding, agent orchestration, multimodal reasoning

anthropic
Claude Sonnet 4.6

Anthropic's daily-driver Sonnet — coding, agents, long-context reasoning, computer use

anthropic
DeepSeek V4 Flash

Budget-tier reasoning LLM with 1M context window and 384K max output

deepseek
Gemini 3 Flash

Advanced multimodal text and reasoning model

google
Gemini 3.1 Flash Lite

Advanced multimodal text and reasoning model

google
Gemini 3.1 Pro

Advanced multimodal text and reasoning model

google
GLM-4.7

Z.ai's affordable mid-range LLM — 200K context and 73.8% on SWE-bench

zai
GLM-5.1

Z.ai's flagship LLM — premium reasoning, 200K context, JSON mode, agentic strength

zai
GPT-5.4

Flagship reasoning LLM with 1M context, native computer use, and high factual accuracy

openai
GPT-5.4 Mini

Efficient reasoning LLM with 400K context for coding assistants and subagent workflows

openai
GPT-5.4 Nano

Ultra-low-latency LLM for high-volume classification, extraction, and lightweight automation

openai
GPT-5.5

OpenAI's newest flagship LLM — deepest reasoning, computer-use, 1M+ context

openai
Kimi K2.6

Moonshot AI multimodal LLM with native image and video understanding, 262K context

moonshotai
LLaVA-1.6-Mistral-7B

Vision-language model for image understanding and captioning

runware
MiniMax M2.5

State-of-the-art agentic coding and office-work model, optimized for speed and cost

minimax
MiniMax M2.7

Long‑context agentic coding and office productivity model for fast, reliable tool use

minimax
MiniMax M2.7 Highspeed

Faster throughput for agentic coding and tool‑driven automation

minimax
Open Age Detection

Facial age estimation model

runware
OpenAI CLIP ViT-L/14

Vision encoder for text-image representation and similarity

openai
Qwen2.5-VL-3B-Instruct

Instruction-tuned vision-language model for image and text understanding

alibaba
Qwen2.5-VL-7B-Instruct

Instruction-tuned multimodal vision-language model

alibaba
ViT Age Classifier

Vision transformer model for estimating age from facial images

runware

Utility models · 13

3D models · 5