Best MCP servers for video and image generation

Generating and editing images and video used to mean leaving your workflow for a separate UI; with an MCP server the agent can produce visual assets inline, render a hero image, generate variations, upscale or transform an existing asset, as part of the task it's already doing. The right server depends on what you need: a broad model marketplace, a speed-optimized inference platform, a frontier image-model provider, or a design-grade generator. The recurring need is the same, give the agent programmatic access to image and video models with a current API. The servers below are real MCP servers with current, verified install configs.

Top pick

Replicate

Replicate

Official

Replicate's official MCP server: discover, compare, and run thousands of hosted AI models — image, video, audio, and language — straight from your agent.

ai-ml

Replicate's server exposes a huge catalog of community and commercial image and video models, the most flexible pick when you want to run many different models from one tool.

Pick 2

fal.ai

Raveen Beemsingh

Community

Community MCP server for fal.ai: generate and edit images, video, music, and audio with 600+ fast generative models from your agent.

ai-ml48

fal's server is built for fast, low-latency generative inference, strong when you need images or video produced quickly inside an interactive workflow.

Pick 3

Stability AI

Tadas Antanavicius

Community

Community MCP server for Stability AI: generate, edit, upscale, outpaint, and restyle images with Stable Diffusion from your agent.

ai-ml83

Stability AI's server gives an agent direct access to Stable Diffusion and related models, a solid default for high-quality image generation.

Pick 4

Recraft

Recraft

Official

Recraft's official MCP server: generate and edit raster and vector images, build reusable styles, vectorize, upscale, and swap backgrounds from your agent.

ai-ml

Recraft's server generates design-grade images and vector art with brand-style control, the pick when output needs to look like a designer made it.