Product Guide
Everything you need to know about using ModelPix.ai. Each guide includes step-by-step instructions, examples, tips, and credit costs.
Getting Started
ModelPix.ai gives you access to over 70 AI image models, 16+ AI video models, voice cloning, avatar training, and more through a single platform. This product guide covers every tool available, with step-by-step instructions, parameter references, example use cases, tips from experienced creators, and transparent credit costs so you know exactly what each generation will cost before you start.
Whether you are learning how to generate AI images from text, animate still photos into cinematic video, swap faces in photos and videos, clone a voice for narration, or train a custom avatar, each guide walks you through the full workflow from start to finished output. Guides are organized by category: AI Generation covers the core creation tools, Avatars and Voices covers personalization features, and Account and Billing explains the credit system and pricing tiers.
Every guide includes a detailed overview explaining how the underlying AI models work, a list of configurable parameters with descriptions, numbered steps for completing each workflow, real-world example use cases, and tips for getting the best results. Many guides also include frequently asked questions that address common issues creators encounter. If you are new to AI generation, start with the Getting Started guide below to understand the platform layout, credit system, and basic generation flow.
AI Image Generation
ModelPix includes one of the largest selections of AI image models available on any platform. With Text to Image, you describe a scene in plain language and the AI generates a high-resolution photo, illustration, or digital artwork in seconds. You can choose from over 70 models, including Flux 2, GPT-Image, Nano Banana 2, and dozens of community-trained style models that specialize in photorealism, anime, 3D renders, oil paintings, and more.
Each model handles prompts differently, so our guide covers prompt structure, negative prompts, aspect ratios, and seed values to help you get consistent, repeatable results. Image Edit lets you modify specific regions of an existing image using inpainting and outpainting. Upscale enhances any image to higher resolution using AI super-resolution.
AI Video Generation
The video generation tools on ModelPix turn static images into motion. Image to Video animates any still image into a short video clip using models like Veo 3.1, Kling 3.0, and WAN 2.6, with control over duration, motion intensity, and camera movement. Talking Photo takes a portrait and an audio file and generates a realistic video of the person speaking.
Motion Transfer extracts movement from a reference video and applies it to a target image, so a character in a still photo can dance, wave, or perform any captured action. Caption Removal cleanly erases hardcoded subtitles from video footage. Video Replace Background swaps backgrounds in real time, and Video Animate adds cinematic motion effects to product shots and portraits.
Face Swap and Head Swap
Face Swap replaces one face with another in any photo while preserving lighting, skin tone, and expression. The AI handles angle matching automatically, so the swapped face looks natural even in profile shots or angled compositions.
Head Swap goes further by replacing the entire head including hair, ears, and neck, which produces more convincing results when hairstyles differ significantly. Both tools work with the Face Library, where you can upload reference faces once and reuse them across unlimited generations.
Avatars, Voices, and Audio
Avatar Training lets you create a custom AI avatar from a short video. Once trained, your avatar generates talking-head videos from just a text script or audio file, without needing to record new footage. This is useful for creators who want to scale video production or create videos without being on camera.
Voice Cloning captures the unique characteristics of any voice from a short audio sample and creates a reusable voice profile. Dubbing translates spoken audio into another language while preserving the original speaker's voice. Think Sound generates custom sound effects and ambient audio from text descriptions.
Virtual Try-On and Photobook
Virtual Try-On places any garment onto a model photo with realistic fabric draping, wrinkle simulation, and body-aware fitting. Upload a flat-lay product image and the AI renders it onto your chosen model with accurate shadow casting and material texture.
Photobook generates a complete set of consistent character images from a single reference photo, maintaining the same face, body proportions, and style across multiple poses and settings. This is useful for character sheets, content series, or marketing assets where visual consistency matters.
Credits and Pricing
ModelPix uses a credit-based pay-per-use system with no recurring charges. New users receive free credits on sign-up to explore every tool before purchasing. Credits never expire, so you can buy a pack and use them at your own pace.
Each tool has a transparent credit cost listed in its guide and shown before every generation. Simple image operations such as face swap cost fewer credits than intensive tasks such as video generation. The Account and Billing guide explains how to check your balance and purchase additional credit packs.
