midjourney/v8.1/text-to-image

文生图

Midjourney V8.1 Text-to-Image API by MIDJOURNEY

midjourney/v8.1/text-to-image

Text-to-image

Midjourney V8.1 generates four images from a text prompt, with optional native 2K HD, a style reference, and aspect-ratio / stylize / chaos / weird controls.

输入

正在加载参数配置...

输出

空闲

等待中

每次运行将花费 $0.086。$10 可运行约 116 次。

你可以继续：

图生视频图生图

参数

代码示例
import requests
import time

# Step 1: Start image generation
generate_url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "midjourney/v8.1/text-to-image",
    "prompt": "A beautiful landscape with mountains and lake",
    "width": 512,
    "height": 512,
    "steps": 20,
    "guidance_scale": 7.5,
}

generate_response = requests.post(generate_url, headers=headers, json=data)
generate_result = generate_response.json()
prediction_id = generate_result["data"]["id"]

# Step 2: Poll for result
poll_url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"

def check_status():
    while True:
        response = requests.get(poll_url, headers={"Authorization": "Bearer $ATLASCLOUD_API_KEY"})
        result = response.json()

        if result["data"]["status"] == "completed":
            print("Generated image:", result["data"]["outputs"][0])
            return result["data"]["outputs"][0]
        elif result["data"]["status"] == "failed":
            raise Exception(result["data"]["error"] or "Generation failed")
        else:
            # Still processing, wait 2 seconds
            time.sleep(2)

image_url = check_status()

安装

安装所需的依赖包。

pip install requests

认证

所有 API 请求需要通过 API Key 进行认证。您可以在 Atlas Cloud 控制台获取 API Key。

export ATLASCLOUD_API_KEY="your-api-key-here"

HTTP 请求头

import os

API_KEY = os.environ.get("ATLASCLOUD_API_KEY")
headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {API_KEY}"
}

保护好您的 API Key

切勿在客户端代码或公开仓库中暴露您的 API Key。请使用环境变量或后端代理。

提交请求

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}
data = {
    "model": "your-model",
    "prompt": "A beautiful landscape"
}

response = requests.post(url, headers=headers, json=data)
print(response.json())

提交请求

提交一个异步生成请求。API 返回一个 prediction ID，您可以用它来检查状态和获取结果。

POST/api/v1/model/generateImage

请求体

import requests

url = "https://api.atlascloud.ai/api/v1/model/generateImage"
headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer $ATLASCLOUD_API_KEY"
}

data = {
    "model": "midjourney/v8.1/text-to-image",
    "prompt": "A beautiful landscape with mountains and lake"
}

response = requests.post(url, headers=headers, json=data)
result = response.json()

print(f"Prediction ID: {result['data']['id']}")
print(f"Status: {result['data']['status']}")

响应

{
  "code": 200,
  "data": {
    "id": "pred_abc123",
    "status": "processing",
    "model": "model-name",
    "created_at": "2025-01-01T00:00:00Z"
  }
}

检查状态

轮询 prediction 端点以检查请求的当前状态。

GET/api/v1/model/prediction/{prediction_id}

轮询示例

import requests
import time

prediction_id = "pred_abc123"
url = f"https://api.atlascloud.ai/api/v1/model/prediction/{prediction_id}"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

while True:
    response = requests.get(url, headers=headers)
    result = response.json()
    status = result["data"]["status"]
    print(f"Status: {status}")

    if status in ["completed", "succeeded"]:
        output_url = result["data"]["outputs"][0]
        print(f"Output URL: {output_url}")
        break
    elif status == "failed":
        print(f"Error: {result['data'].get('error', 'Unknown')}")
        break

    time.sleep(3)

状态值

processing请求仍在处理中。

completed生成完成，输出可用。

succeeded生成成功，输出可用。

failed生成失败，请检查 error 字段。

完成响应

{
  "data": {
    "id": "pred_abc123",
    "status": "completed",
    "outputs": [
      "https://storage.atlascloud.ai/outputs/result.png"
    ],
    "metrics": {
      "predict_time": 8.3
    },
    "created_at": "2025-01-01T00:00:00Z",
    "completed_at": "2025-01-01T00:00:10Z"
  }
}

上传文件

将文件上传到 Atlas Cloud 存储，获取可在 API 请求中使用的 URL。使用 multipart/form-data 上传。

POST/api/v1/model/uploadMedia

上传示例

import requests

url = "https://api.atlascloud.ai/api/v1/model/uploadMedia"
headers = { "Authorization": "Bearer $ATLASCLOUD_API_KEY" }

with open("image.png", "rb") as f:
    files = {"file": ("image.png", f, "image/png")}
    response = requests.post(url, headers=headers, files=files)

result = response.json()
download_url = result["data"]["download_url"]
print(f"File URL: {download_url}")

响应

{
  "data": {
    "download_url": "https://storage.atlascloud.ai/uploads/abc123/image.png",
    "file_name": "image.png",
    "content_type": "image/png",
    "size": 1024000
  }
}

Input Schema

以下参数在请求体中被接受。

总计: 0必填: 0可选: 0

暂无可用参数。

请求体示例

{
  "model": "midjourney/v8.1/text-to-image"
}

Output Schema

API 返回包含生成输出 URL 的 prediction 响应。

idstringrequired

Unique identifier for the prediction.

statusstringrequired

Current status of the prediction.

processingcompletedsucceededfailed

modelstringrequired

The model used for generation.

outputsarray[string]

Array of output URLs. Available when status is "completed".

errorstring

Error message if status is "failed".

metricsobject

Performance metrics.

predict_timenumber

Time taken for image generation in seconds.

created_atstringrequired

ISO 8601 timestamp when the prediction was created.

Format: date-time

completed_atstring

ISO 8601 timestamp when the prediction was completed.

Format: date-time

响应示例

{
  "id": "pred_abc123",
  "status": "completed",
  "model": "model-name",
  "outputs": [
    "https://storage.atlascloud.ai/outputs/result.png"
  ],
  "metrics": {
    "predict_time": 8.3
  },
  "created_at": "2025-01-01T00:00:00Z",
  "completed_at": "2025-01-01T00:00:10Z"
}

Atlas Cloud Skills

Atlas Cloud Skills 将 300+ AI 模型直接集成到您的 AI 编程助手中。一条命令安装，即可用自然语言生成图像、视频和与 LLM 对话。

支持的客户端

Claude Code

OpenAI Codex

Gemini CLI

Cursor

Windsurf

VS Code

Trae

GitHub Copilot

Cline

Roo Code

Amp

Goose

Replit

40+ 支持的客户端

安装

npx skills add AtlasCloudAI/atlas-cloud-skills

设置 API Key

从 Atlas Cloud 控制台获取 API Key，并将其设置为环境变量。

export ATLASCLOUD_API_KEY="your-api-key-here"

功能

安装后，您可以在 AI 助手中使用自然语言访问所有 Atlas Cloud 模型。

图像生成使用 Nano Banana 2、Z-Image 等模型生成图像。

视频创作使用 Kling、Vidu、Veo 等模型从文本或图像创建视频。

LLM 对话与 Qwen、DeepSeek 等大语言模型对话。

媒体上传上传本地文件用于图像编辑和图生视频工作流。

MCP Server

Atlas Cloud MCP Server 通过 Model Context Protocol 将您的 IDE 与 300+ AI 模型连接。支持任何兼容 MCP 的客户端。

支持的客户端

Cursor

VS Code

Windsurf

Claude Code

OpenAI Codex

Gemini CLI

Cline

Roo Code

100+ 支持的客户端

安装

npx -y atlascloud-mcp

配置

将以下配置添加到您的 IDE 的 MCP 设置文件中。

{
  "mcpServers": {
    "atlascloud": {
      "command": "npx",
      "args": [
        "-y",
        "atlascloud-mcp"
      ],
      "env": {
        "ATLASCLOUD_API_KEY": "your-api-key-here"
      }
    }
  }
}

可用工具

atlas_generate_image从文本提示生成图像。

atlas_generate_video从文本或图像创建视频。

atlas_chat与大语言模型对话。

atlas_list_models浏览 300+ 可用 AI 模型。

atlas_quick_generate一步式内容创建，自动选择最佳模型。

atlas_upload_media上传本地文件用于 API 工作流。

了解更多

github.com/AtlasCloudAI/mcp-server

API Schema

Schema 不可用

暂无可用示例

加载中...

1. Introduction

Midjourney V8.1 is the latest iteration of Midjourney's image-synthesis model. This README covers the two core generation endpoints:

midjourney/v8.1/text-to-image
midjourney/v8.1/image-to-video

It belongs to a larger Midjourney V8.1 family on this platform, which also includes midjourney/v8.1/image-to-image, midjourney/v8.1/blend, midjourney/v8.1/style-transfer, and midjourney/v8.1/remove-background (each documented separately).

Midjourney V8.1 is designed to produce high-aesthetic, prompt-faithful imagery at native 2K resolution with substantially faster generation than prior versions. It is built by Midjourney, an independent, self-funded San Francisco research lab (~11–50 staff) founded in August 2021 by David Holz, and is positioned as a speed- and quality-focused evolution of the company's image pipeline.

The V8 line is a full from-scratch rewrite of Midjourney's image model, accompanied by a migration from TPU-based to GPU-native PyTorch infrastructure. The model's defining methodology is a human-preference aesthetic tuning loop combined with per-user personalization, prioritizing visually compelling output over raw fidelity to a reference dataset. V8.1 entered alpha on April 14, 2026, reached general availability across web and Discord on April 30, 2026, and became Midjourney's default model on June 10, 2026. A few capabilities from the prior V7 model are not yet present in V8.1 (see below), but it is now the company's primary image model.

2. Key Features & Innovations

Native 2K HD output without a separate upscaler: V8.1 generates directly at 2048px resolution, eliminating the dedicated upscaling step required by earlier versions. HD renders take roughly 1.33 GPU-minutes and standard-definition renders under 1 GPU-minute, with HD running approximately 3× faster and cheaper than in V8.
~4–5× faster generation: The GPU-native PyTorch rewrite delivers an estimated four-to-fivefold speedup in generation time over previous Midjourney versions (a Midjourney-stated figure, not an independent benchmark), improving iteration speed for creative workflows.
Improved text rendering: V8.1 renders in-image text more reliably, with quoted strings in prompts used to specify the intended text — narrowing a long-standing weakness relative to text-specialized competitors.
Stronger prompt-following: The model adheres more closely to prompt instructions, improving controllability and reducing the prompt-engineering effort needed to achieve a target composition.
Restored image conditioning: Image prompts and image weights — absent in the V8.0 alpha — returned in V8.1, alongside backward compatibility with V7 style references (srefs), moodboards, and personalization profiles. (Image-driven generation is offered here as the dedicated midjourney/v8.1/image-to-image and blend models.)
Workflow tooling: V8.1 ships with a Prompt Shortener and an updated /describe command, and its aesthetic has been re-tuned "in the spirit of V7" to preserve the look users prefer.
Personalized aesthetic tuning: A human-preference (RLHF-style) aesthetic tuning loop combined with per-user personalization shapes outputs toward individually preferred visual styles.

3. Model Architecture & Technical Details

Midjourney V8.1 is a complete from-scratch rewrite of the company's image model. As part of the V8 program, Midjourney migrated from TPU-based infrastructure to a GPU-native PyTorch stack; David Holz has publicly stated that the original TPU choice "set research back a year." The underlying generative approach is understood to be latent diffusion, though Midjourney has not published a technical paper or model card, and the specific backbone, parameter count, and text encoder remain undisclosed.

Training details are not publicly documented. The dataset has never been disclosed and is the subject of active, unresolved copyright litigation — Disney Enterprises, Inc. v. Midjourney, Inc. (No. 2:25-cv-05275, U.S. District Court for the Central District of California), filed June 11, 2025 by a coalition of major studios including Disney, Marvel, Lucasfilm, Twentieth Century, Universal, and DreamWorks Animation. The studios' infringement claims are allegations in pending litigation and have not been adjudicated. The defining training methodology is a human-preference aesthetic tuning loop (an RLHF-style process) layered with per-user personalization, which together steer the model toward high-aesthetic, user-aligned outputs rather than optimizing for a single fixed objective.

Because V8.1 began as an alpha, several capabilities present in V7 were initially unavailable; the gaps still being closed include Omni Reference (--oref), Character Reference, the --no negative prompt, multi-prompts, the Niji model, Draft Mode, and Turbo mode. (Image prompts and image weights have returned; quality is supported at the 1 and 4 levels; speed is fixed to the fast tier.)

Regarding the midjourney/v8.1/image-to-video identifier: Midjourney's video capability is separately branded V1, launched June 18, 2025, and is image-to-video only (no text-to-video). It produces 5-second base clips at 24fps, extendable to roughly 21 seconds, with a 480p base resolution and 720p available on higher tiers. It offers Low/High Motion, Auto/Manual settings, and looping with end-frame control (added July 2025). No V8-native or "V8.1" video model has been confirmed, so the "v8.1" tag on the video endpoint reflects the model-family naming on this platform rather than a distinct V8.1 video model.

4. Performance Highlights

Midjourney has not published quantitative benchmarks, ELO scores, or arena rankings for V8.1, and the absence of an official public API limits the model's presence in third-party evaluation arenas. Performance is therefore best described qualitatively:

Speed and efficiency: Approximately 4–5× faster generation overall (Midjourney-stated), with native 2K HD rendering at ~1.33 GPU-minutes and SD under 1 GPU-minute.
Resolution: Direct 2048px output with no separate upscaling pass.
Text fidelity: Materially improved in-image text rendering versus prior Midjourney versions.
Prompt adherence: Stronger instruction-following and controllability.
Aesthetics: Re-tuned to preserve the visual character of V7 while improving fidelity.

The table below summarizes the competitive landscape for context. No directly comparable arena scores are available across these systems.

Category	Model	Developer	Notable Strength
Text-to-image	Midjourney V8.1	Midjourney	Aesthetics, native 2K HD, speed
Text-to-image	Flux 2	Black Forest Labs	Photorealism, open weights
Text-to-image	Imagen 4	Google	In-image text
Text-to-image	Ideogram v3	Ideogram	In-image text
Text-to-image	GPT Image / DALL·E	OpenAI	Instruction-following
Text-to-image	Firefly 3	Adobe	Commercial licensing
Video	Sora	OpenAI	Text-to-video
Video	Veo	Google	High-fidelity video
Video	Runway / Kling / Luma	Various	Motion control, length

As a rule of thumb, V8.1 is preferred for speed, HD resolution, and text rendering.

5. Intended Use & Applications

Concept art & pre-production: Rapid generation of high-resolution concept imagery for games, film, and product design, accelerating early ideation with fast 2K output.
Marketing & social content: Production of on-brand visuals and social media assets at scale, leveraging improved text rendering for graphics that include words and short phrases.
Film storyboarding & previsualization: Creation of storyboard frames and previs imagery, optionally animated into short clips via Midjourney's separate V1 image-to-video pipeline (midjourney/v8.1/image-to-video).
Brand & graphic design: Exploration of visual identities, typography-inclusive layouts, and stylistic directions using image prompts, style references, and moodboards.
Personalized creative iteration: Per-user aesthetic personalization tailors outputs to an individual's preferred visual style, supporting consistent look-and-feel across a body of work.

For image-guided workflows, see the companion models: image-to-image (generate from a reference), blend (fuse multiple images), style-transfer (restyle while preserving composition), and remove-background (isolate a subject on transparency).

Midjourney V8.1 Text-to-Image API by MIDJOURNEY

输入

输出

参数

代码示例

安装

认证

HTTP 请求头

提交请求

提交请求

请求体

响应

检查状态

轮询示例

状态值

完成响应

上传文件

上传示例

响应

Input Schema

请求体示例

Output Schema

响应示例

Atlas Cloud Skills

支持的客户端

安装

设置 API Key

功能

MCP Server

支持的客户端

安装

配置

可用工具

API Schema

1. Introduction

2. Key Features & Innovations

3. Model Architecture & Technical Details

4. Performance Highlights

5. Intended Use & Applications

探索类似模型

Openai GPT Image 2 Text-to-Image

Openai GPT Image 2 Edit

GPT Image 2 Developer Edit

GPT Image 2 Developer Text-to-Image

Seed3D 2.0 Image-to-3D

Hunyuan 3D Rapid Image-to-3D

Hunyuan 3D Rapid Text-to-3D

Hunyuan 3D Pro Image-to-3D

Hunyuan 3D Pro Text-to-3D

Nano Banana 2 Reference to Image

Nano Banana 2 Reference to Image Developer

Grok Imagine Image Quality Edit

Grok Imagine Image Quality Text-to-Image

Baidu ERNIE Image Turbo Text-to-image

Wan-2.7 Pro Text-to-image

Wan-2.7 Image-to-image

一个 API，畅享全模态 AI。

Join our Discord community

输入

输出

参数

代码示例

安装

认证

HTTP 请求头

提交请求

提交请求

请求体

响应

检查状态

轮询示例

状态值

完成响应

上传文件

上传示例

响应

Input Schema

请求体示例

Output Schema

响应示例