AiHubMix Documentation Hub

Prise en charge de l’interface multi-fonctions de l’API Openai Responses ; les fonctions suivantes ont été lancées :

Text input : entrée texte
Image input : entrée image
Streaming : streaming
Web search : recherche web
Deep research : pour les tâches complexes d’analyse et de recherche
Reasoning : contrôle de la profondeur de raisonnement, prend en charge 4 niveaux (minimal / low / medium / high). Seule la série gpt-5 prend en charge minimal.
Verbosity : longueur de sortie ; la série gpt-5 prend en charge 3 niveaux (low / medium / high)
Functions : fonctions
Utilisation de l’outil image_generation : la création et la génération d’images sont facturées sous gpt-image-1.
Code Interpreter : permet aux modèles d’écrire et d’exécuter du Python pour résoudre des problèmes. reasoning.effort ‘minimal’ n’est pas pris en charge lors de l’utilisation du code interpreter avec gpt-5.
Remote MCP : appel à un serveur MCP distant
Computer Use : Computer Use

Utilisation (appel Python) :

Identique à la méthode d’appel officielle d’OpenAI, il suffit de remplacer api_key et base_url pour la redirection. La Chine continentale peut y accéder directement.

client = OpenAI(
    api_key="AIHUBMIX_API_KEY", # Replace with the key you generated in AiHubMix
    base_url="https://aihubmix.com/v1"
)

For inference models, the output inference summary can be controlled using the following parameter, with the detail richness of the summary ranked as detailed > auto > None, where auto provides the best balance.

"summary": "auto"

Optional deep reasoning models: ‎⁠o3-deep-research⁠ and ‎⁠o4-mini-deep-research⁠, only supported on the ‎⁠responses⁠ endpoint.
The gpt-5 series focuses on stable reasoning and consistent outputs, and no longer supports the temperature and top_p parameters for controlling randomness. If you need more freedom, you can try gpt-5-chat-latest, which supports temperature.
Reasoning models (o series / gpt-5 series) have deprecated max_tokens. Please use max_completion_tokens for completions or max_output_tokens for responses to explicitly set the output token limit.

from openai import OpenAI

client = OpenAI(
    api_key="sk-***", # Replace with the key generated in your AIHubMix dashboard
    base_url="https://aihubmix.com/v1"
)

response = client.responses.create(
    model="gpt-5", # gpt-5, gpt-5-chat-latest, gpt-5-mini, gpt-5-nano
    input="Why is tarot divination effective? What are the underlying principles and transferable methods? Output format: Markdown", # GPT-5 does not output in Markdown format by default, so you need to explicitly specify it.
    reasoning={
        "effort": "minimal" # Reasoning depth – Controls how many reasoning tokens the model generates before producing a response. Value can be "minimal", "low", "medium" or "high". Default is "medium".
    },
    text={
        "verbosity": "low" # Output length – Verbosity determines how many output tokens are generated. Value can be "low", "medium", or "high". Models before GPT-5 defaulted to "medium" verbosity.
    },
    stream=True
)

for event in response:
  print(event)

from openai import OpenAI

client = OpenAI(
    api_key="AIHUBMIX_API_KEY", # Your Key "sk-***"
    base_url="https://aihubmix.com/v1"
)

response = client.responses.create(
  model="gpt-4o-mini", # codex-mini-latest AVAILABLE
  input="Tell me a three sentence bedtime story about a unicorn."
)

print(response)

from openai import OpenAI

client = OpenAI(
    api_key="AIHUBMIX_API_KEY", # Your Key "sk-***"
    base_url="https://aihubmix.com/v1"
)

response = client.responses.create(
    model="gpt-4o-mini", # codex-mini-latest AVAILABLE
    input=[
        {
            "role": "user",
            "content": [
                { "type": "input_text", "text": "what is in this image?" },
                {
                    "type": "input_image",
                    "image_url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg"
                }
            ]
        }
    ]
)

print(response)

from openai import OpenAI

client = OpenAI(
    api_key="AIHUBMIX_API_KEY", # Your Key "sk-***"
    base_url="https://aihubmix.com/v1"
)

response = client.responses.create(
  model="gpt-4o-mini", # codex-mini-latest AVAILABLE
  instructions="You are a helpful assistant.",
  input="Hello!",
  stream=True
)

for event in response:
  print(event)

from openai import OpenAI

client = OpenAI(
    api_key="AIHUBMIX_API_KEY", # Your Key "sk-***"
    base_url="https://aihubmix.com/v1"
)

response = client.responses.create(
  model="gpt-4o-mini",
  tools=[{ "type": "web_search_preview" }],
  input="What was a positive news story from today?",
)

print(response)

from openai import OpenAI

client = OpenAI(
    api_key="AIHUBMIX_API_KEY", # Your Key "sk-***"
    base_url="https://aihubmix.com/v1",
    timeout=3600
)

input_text = """
Research the economic impact of semaglutide on global healthcare systems.
Do:
- Include specific figures, trends, statistics, and measurable outcomes.
- Prioritize reliable, up-to-date sources: peer-reviewed research, health
  organizations (e.g., WHO, CDC), regulatory agencies, or pharmaceutical
  earnings reports.
- Include inline citations and return all source metadata.

Be analytical, avoid generalities, and ensure that each section supports
data-backed reasoning that could inform healthcare policy or financial modeling.
"""

response = client.responses.create(
  model="o3-deep-research", # o4-mini-deep-research
  input=input_text,
  tools=[
    {"type": "web_search_preview"},
    {"type": "code_interpreter", "container": {"type": "auto"}},
  ],
)

print(response.output_text)

from openai import OpenAI

client = OpenAI(
    api_key="AIHUBMIX_API_KEY", # Your Key "sk-***"
    base_url="https://aihubmix.com/v1/"
)

response = client.responses.create(
    model="o4-mini", # codex-mini-latest, o4-mini, o3-mini, o3, o1
    input="How much wood would a woodchuck chuck?",
    reasoning={
        "effort": "medium", # low, medium, high
        "summary": "auto", # resoning summary
    }
)

print(response)

from openai import OpenAI

client = OpenAI(
    api_key="AIHUBMIX_API_KEY", # Your Key "sk-***"
    base_url="https://aihubmix.com/v1"
)

tools = [
    {
        "type": "function",
        "name": "get_current_weather",
        "description": "Get the current weather in a given location",
        "parameters": {
          "type": "object",
          "properties": {
              "location": {
                  "type": "string",
                  "description": "The city and state, e.g. San Francisco, CA",
              },
              "unit": {"type": "string", "enum": ["celsius", "fahrenheit"]},
          },
          "required": ["location", "unit"],
        }
    }
]

response = client.responses.create(
  model="gpt-4o-mini", # codex-mini-latest AVAILABLE
  tools=tools,
  input="What is the weather like in Boston today?",
  tool_choice="auto"
)

print(response)

from openai import OpenAI
import base64

client = OpenAI(
    api_key="AIHUBMIX_API_KEY", # Your Key "sk-***"
    base_url="https://aihubmix.com/v1"
)

response = client.responses.create(
    model="gpt-4.1-mini",
    input="Generate an image of gray tabby cat hugging an otter with an orange scarf",
    tools=[
        {
            "type": "image_generation",
            "background": "opaque", 
            "quality": "high",
        }
    ],
)

# Save the image to a file
image_data = [
    output.result
    for output in response.output
    if output.type == "image_generation_call"
]

if image_data:
    image_base64 = image_data[0]
    with open("cat_and_otter.png", "wb") as f:
        f.write(base64.b64decode(image_base64))

from openai import OpenAI

client = OpenAI(
    api_key="AIHUBMIX_API_KEY", # Your Key "sk-***"
    base_url="https://aihubmix.com/v1"
)

instructions = """
You are a personal math tutor. When asked a math question, 
write and run code using the python tool to answer the question.
"""

resp = client.responses.create(
    model="gpt-4.1",
    tools=[
        {
            "type": "code_interpreter",
            "container": {"type": "auto"}
        }
    ],
    instructions=instructions,
    input="I need to solve the equation 3x + 11 = 14. Can you help me?",
)

print(resp.output)

from openai import OpenAI

client = OpenAI(
    api_key="AIHUBMIX_API_KEY", # Your Key "sk-***"
    base_url="https://aihubmix.com/v1"
)

resp = client.responses.create(
    model="gpt-4.1",
    tools=[{
        "type": "mcp",
        "server_label": "deepwiki",
        "server_url": "https://mcp.deepwiki.com/mcp",
        "require_approval": "never",
        "allowed_tools": ["ask_question"],
    }],
    input="What transport protocols does the 2025-03-26 version of the MCP spec (modelcontextprotocol/modelcontextprotocol) support?",
)

print(resp.output_text)

Note:

The latest codex-mini-latest does not support search.
The Computer use feature requires integration with Playwright. It’s recommended to refer to the official repository.

Known issues:

Use cases are complex to invoke
Takes many screenshots, which is time-consuming and often unreliable
May trigger CAPTCHA or Cloudflare human verification, potentially leading to infinite loops

Dernière mise à jour : 2026-06-01

Démarrage

Fonctionnalités passerelle

Capacités des modèles

Référence du protocole

Gestion de la plateforme

Confidentialité et conditions

Prise en charge de l'API Openai Responses

Utilisation (appel Python) :

​Utilisation (appel Python) :

Utilisation (appel Python) :