Skip to content

Roadmap

img-mcp is becoming a general-purpose image manipulation MCP server. Generation is one tool; the main focus is fast, local, non-AI image operations powered by Node.js.

Shipped

  • [x] Text-to-image generation — create images from natural-language prompts.
  • [x] Reference-image generation — guide generation with base64 or data-URI images.
  • [x] Multiple aspect ratios and sizes — 1:1, 16:9, 9:16, 4K, and more.
  • [x] Transparent backgrounds — remove backgrounds via @imgly/background-removal-node.
  • [x] Zero-install usage — run with npx -y img-mcp.
  • [x] Bring-your-own-key — your Google API key stays in your environment.
  • [x] Resize — scale by width, height, percentage, or fit mode (cover, contain, fill, inside, outside).
  • [x] Convert formats — JPEG, PNG, WebP, AVIF, GIF.

Planned — Core image manipulation

The priority is robust, dependency-light Node.js operations that work without an API key.

  • [ ] Crop — crop by coordinates/dimensions or gravity-based smart crop.
  • [ ] Compress / optimize — quality, format-specific tuning, and target file-size optimization.
  • [ ] Rotate & flip — arbitrary rotation, horizontal/vertical flip, auto-orient from EXIF.
  • [ ] Blur & sharpen — gaussian blur, unsharp mask, motion blur.
  • [ ] Adjustments — brightness, contrast, saturation, hue, exposure, gamma.
  • [ ] Filters & effects — grayscale, sepia, invert, tint, rounded corners, drop shadow.
  • [ ] Metadata read/write — EXIF, IPTC, XMP extraction and stripping.
  • [ ] Batch operations — apply the same transform to multiple images in one call.

Planned — Advanced manipulation

  • [ ] Watermark / overlay — add text or image watermarks with positioning and opacity.
  • [ ] Collage / montage — combine multiple images into a grid or custom layout.
  • [ ] Sprite sheet generation — pack frames into a single image.
  • [ ] Image comparison — diff two images and return similarity metrics.
  • [ ] Thumbnails — generate preset thumbnail sizes with face/center gravity.
  • [ ] Animated image support — split/combine GIF/WebP frames.
  • [ ] SVG rasterization — render SVGs to PNG/JPEG at any size.

Research

Longer-term or optional capabilities that may rely on AI or heavier dependencies.

  • [ ] Additional generation providers — support image models beyond Google Nano Banana.
  • [ ] Upscale & super-resolution — increase resolution and recover detail.
  • [ ] Inpainting / outpainting — edit or extend image regions.
  • [ ] Background replacement — replace backgrounds via segmentation or color keying.
  • [ ] Style transfer & presets — apply consistent artistic styles and brand presets.
  • [ ] Image analysis feedback loop — let agents critique and iteratively refine images.

Share your ideas

Have a feature request? Open an issue on GitHub and let us know what you'd like to see next.

Released under the MIT License.