Create model response (Responses format)
Create a model response in OpenAI Responses API format with multi-turn and tool-calling support
Models you can use
| Model ID | Notes |
|---|---|
| gpt-5.4 | GPT-5 flagship: top reasoning / coding / agentic, 1M context |
| gpt-5.4-mini | Lightweight and balanced; suited to high-volume and fallback use |
| deepseek-v4-pro | DeepSeek cost-effective reasoning model |
Call GET /v1/models for the full list.
Multi-turn continuation
Pass the id from a previous response as previous_response_id to
continue the conversation without resending the full history.
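A minimal sketch of how a first turn and a follow-up turn might be built. The base URL, the `/v1/responses` path, and the Bearer-token header are assumptions based on the OpenAI Responses format, not values confirmed by this page. The helper names (`build_first_turn`, `build_follow_up`, `post_response`) are hypothetical.

```python
import json
import urllib.request

BASE_URL = "https://api.example.com"  # hypothetical host, replace with your own
API_KEY = "YOUR_API_KEY"              # placeholder credential


def build_first_turn(model, text):
    """Request body for the opening turn of a conversation."""
    return {"model": model, "input": text}


def build_follow_up(model, previous_id, text=None):
    """Request body that continues from a prior response.

    Only the prior response id is required; the earlier history is not resent.
    New input is attached only when the caller supplies it.
    """
    payload = {"model": model, "previous_response_id": previous_id}
    if text is not None:
        payload["input"] = text
    return payload


def post_response(payload):
    """POST a request body and return the decoded JSON response.

    Assumes a /v1/responses path and Bearer authentication.
    """
    request = urllib.request.Request(
        f"{BASE_URL}/v1/responses",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {API_KEY}",
        },
        method="POST",
    )
    with urllib.request.urlopen(request) as reply:
        return json.load(reply)
```

In use, the `id` field of the first reply would be passed as `previous_id` when building the next turn, for example `build_follow_up("gpt-5.4", first["id"], "And in French?")`.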
Reasoning control
For reasoning-capable models, use reasoning.effort (low / medium /
high) to set reasoning depth, and reasoning.summary (auto /
concise / detailed) to control how much reasoning detail is returned.
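As a rough illustration, the reasoning options could be checked on the client before sending. The allowed values come from the text above. The helper name `build_reasoning` and the nesting under a top-level `reasoning` key are assumptions based on the OpenAI Responses format.

```python
EFFORT_LEVELS = {"low", "medium", "high"}
SUMMARY_MODES = {"auto", "concise", "detailed"}


def build_reasoning(effort="medium", summary=None):
    """Return a reasoning config dict, rejecting values outside the documented sets."""
    if effort not in EFFORT_LEVELS:
        raise ValueError(f"effort must be one of {sorted(EFFORT_LEVELS)}, got {effort!r}")
    config = {"effort": effort}
    if summary is not None:
        if summary not in SUMMARY_MODES:
            raise ValueError(f"summary must be one of {sorted(SUMMARY_MODES)}, got {summary!r}")
        config["summary"] = summary
    return config


# Example request body: deep reasoning, short summary of the reasoning returned.
payload = {
    "model": "gpt-5.4",
    "input": "Plan a three-step database migration.",
    "reasoning": build_reasoning("high", "concise"),
}
```

Validating locally is a design choice rather than a requirement: it surfaces a typo such as `"hi"` before a request is spent on it.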
Context truncation
Set truncation to auto to let the system automatically drop older
context when the window is exceeded. Set it to disabled to return an
error instead.
Authorizations
Body
OpenAI Responses API request body.
Model ID, e.g. gpt-5.4. See GET /v1/models for the full list.
"gpt-5.4"
Input content — either a plain text string or an array of messages.
Omit when using previous_response_id to continue a prior turn.
System-level instructions, equivalent to a system message in Chat Completions.
Maximum number of tokens the model may generate in this response, including reasoning tokens.
Sampling temperature between 0 and 2, controlling output randomness.
0 <= x <= 2
Nucleus sampling threshold. Tune this or temperature, not both.
0 <= x <= 1
Whether to stream the response as Server-Sent Events.
A list of tools the model may call.
Tool calling strategy: auto, none, or required as a string,
or an object specifying a particular tool.
auto, none, required
Reasoning configuration, only effective for reasoning-capable models.
The id of a prior response. When set, the conversation continues
from that point without resending history.
Context truncation strategy. auto drops older context when the
window is exceeded; disabled returns an error instead.
auto, disabled
Response
Successful response
OpenAI Responses API response body.
Unique identifier for this response, usable as previous_response_id in the next turn.
Object type, value is response.
"response"
Unix timestamp (seconds) of creation.
Response status.
completed, failed, in_progress, incomplete
The model that actually processed the request.
List of output blocks generated by the model.
Token usage statistics for the request.
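The response fields described above could be read along these lines. The key names (`id`, `object`, `created_at`, `status`, `model`, `output`, `usage`) are assumed from the OpenAI Responses format, since the field names are not spelled out here. Treating every status other than in_progress as finished is an interpretation, not documented behavior. The helper `summarize_response` and the sample body are hypothetical and hand-written for illustration.

```python
from datetime import datetime, timezone

# Statuses assumed to mean the response will not change any further.
FINISHED_STATUSES = {"completed", "failed", "incomplete"}


def summarize_response(body):
    """Extract the top-level fields of a response body into a small summary dict."""
    if body.get("object") != "response":
        raise ValueError("not a response object")
    status = body["status"]
    return {
        "id": body["id"],  # usable as previous_response_id in the next turn
        "status": status,
        "finished": status in FINISHED_STATUSES,
        "model": body.get("model"),
        # created_at is a Unix timestamp in seconds
        "created": datetime.fromtimestamp(body["created_at"], tz=timezone.utc),
        "output_blocks": len(body.get("output", [])),
        "usage": body.get("usage"),  # left opaque; its sub-fields are not described here
    }


# Hand-written sample for illustration only, not captured from a real call.
sample = {
    "id": "resp_example",
    "object": "response",
    "created_at": 1700000000,
    "status": "completed",
    "model": "gpt-5.4",
    "output": [{}],
    "usage": {},
}
summary = summarize_response(sample)
```

The output blocks and usage statistics are passed through untouched because their internal structure is not specified on this page.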