
feat: add responses api chat format#86

Merged
bzp2010 merged 4 commits into main from bzp/feat-responses-chat-format
May 4, 2026

Conversation

Collaborator

@bzp2010 bzp2010 commented May 4, 2026

Summary by CodeRabbit

  • New Features

    • Added support for the OpenAI Responses API format in the gateway, enabling request/response bridging with chat completions.
    • Streaming support with incremental text and tool-call events, and usage propagation.
    • Maps various input/output forms and response formats for broader compatibility.
  • Tests

    • Added comprehensive unit tests covering mapping, streaming, native delegation, and validation.

@coderabbitai

coderabbitai Bot commented May 4, 2026

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info
⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 4bd41bad-037a-4f9c-b42c-9a9873af6ef9

📥 Commits

Reviewing files that changed from the base of the PR and between 827f0e1 and 5ea4081.

📒 Files selected for processing (1)
  • src/gateway/formats/openai/responses.rs
🚧 Files skipped from review as they are similar to previous changes (1)
  • src/gateway/formats/openai/responses.rs

📝 Walkthrough

Walkthrough

Adds a new ResponsesApiFormat ChatFormat that bridges OpenAI Responses API requests/responses to the internal chat completions hub, implements streaming/tool-call accumulation and native delegation, and updates native streaming state to carry usage.

Changes

Responses API Format Bridge

Layer / File(s) / Summary

Data Shape: src/gateway/traits/native.rs
OpenAIResponsesNativeStreamState changed from a unit-like struct to pub struct OpenAIResponsesNativeStreamState { pub usage: Usage }.

Core Implementation: src/gateway/formats/openai/responses.rs
Adds ResponsesApiFormat and ResponsesBridgeState; implements ChatFormat: request->hub mapping (inputs, tools, tool_choice, text.format), hub->Responses mapping (outputs, function calls, usage), streaming bridging with tool-call merging and output-index stability, native delegation hooks, helpers, and comprehensive unit tests.

Module Exports: src/gateway/formats/openai/mod.rs
Adds mod responses; and pub use responses::ResponsesApiFormat;.

Public API: src/gateway/formats/mod.rs
Re-exports ResponsesApiFormat alongside OpenAIChatFormat, guarded with #[allow(unused_imports)].
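The Data Shape row above can be sketched in miniature. Only the struct name and its pub usage: Usage field come from the summary; the fields of Usage and the record_usage helper are assumptions added for illustration:

```rust
// Illustrative Usage shape; the real crate's Usage fields are not shown in this PR summary.
#[derive(Debug, Clone, Default)]
pub struct Usage {
    pub input_tokens: u64,
    pub output_tokens: u64,
}

// Before: a unit-like marker struct.
// pub struct OpenAIResponsesNativeStreamState;

// After: the native stream state carries accumulated usage so the final
// streaming event can report token counts.
#[derive(Debug, Clone, Default)]
pub struct OpenAIResponsesNativeStreamState {
    pub usage: Usage,
}

impl OpenAIResponsesNativeStreamState {
    // Hypothetical helper: fold per-chunk usage into the running total.
    pub fn record_usage(&mut self, input: u64, output: u64) {
        self.usage.input_tokens += input;
        self.usage.output_tokens += output;
    }
}

fn main() {
    let mut state = OpenAIResponsesNativeStreamState::default();
    state.record_usage(10, 3);
    state.record_usage(0, 7);
    assert_eq!(state.usage.input_tokens, 10);
    assert_eq!(state.usage.output_tokens, 10);
}
```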

Sequence Diagram

sequenceDiagram
    participant Client as Responses API<br/>Client
    participant Format as ResponsesApiFormat
    participant Hub as Chat Completions<br/>Hub
    participant Native as Native OpenAI<br/>Handlers

    Client->>Format: ResponsesApiRequest (instructions, items, tools)
    Format->>Format: validate_request()
    Format->>Format: to_hub() -> ChatCompletionRequest
    alt Native Delegation
        Format->>Native: call_native(request)
        Native->>Native: transform_request()
        Native->>Hub: send native request
        Hub-->>Native: ChatCompletionResponse / stream
        Native->>Native: transform_response()
        Native-->>Format: transformed response / stream events
    else Direct Hub Call
        Format->>Hub: send ChatCompletionRequest
        Hub-->>Format: ChatCompletionResponse / stream
    end
    Format->>Format: from_hub() -> ResponsesApiResponse
    Format-->>Client: ResponsesApiResponse (output_text, function_calls, usage)

    alt Streaming
        Hub->>Format: ChatCompletionChunk stream
        Format->>Format: merge_streaming_tool_call() / accumulate text deltas
        Format-->>Client: ResponsesApiStreamEvent (output_item_added / content_part_added / content_block_delta)
        Format-->>Client: ResponsesApiStreamEvent (output_item_done / content_block_done)
    end
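The streaming branch of the diagram can be illustrated with a minimal, hypothetical version of the tool-call merging step. merge_streaming_tool_call is named in the walkthrough, but the types and signature here are invented for the sketch: chunks deliver a tool call's id and name once and its JSON arguments in fragments, keyed by an index, and the bridge folds them into complete calls.

```rust
use std::collections::BTreeMap;

// Accumulator for one streamed tool call (illustrative, not the crate's type).
#[derive(Debug, Default, Clone, PartialEq)]
pub struct ToolCallAccum {
    pub id: String,
    pub name: String,
    pub arguments: String,
}

// Merge one chunk's tool-call fragment into the per-index accumulator:
// id/name overwrite when present, argument deltas are appended in order.
pub fn merge_streaming_tool_call(
    state: &mut BTreeMap<u32, ToolCallAccum>,
    index: u32,
    id: Option<&str>,
    name: Option<&str>,
    arguments_delta: Option<&str>,
) {
    let entry = state.entry(index).or_default();
    if let Some(id) = id {
        entry.id = id.to_string();
    }
    if let Some(name) = name {
        entry.name = name.to_string();
    }
    if let Some(delta) = arguments_delta {
        entry.arguments.push_str(delta);
    }
}

fn main() {
    let mut calls = BTreeMap::new();
    // First chunk carries id + name, later chunks only argument deltas.
    merge_streaming_tool_call(&mut calls, 0, Some("call_1"), Some("get_weather"), None);
    merge_streaming_tool_call(&mut calls, 0, None, None, Some("{\"city\":"));
    merge_streaming_tool_call(&mut calls, 0, None, None, Some("\"Paris\"}"));
    assert_eq!(calls[&0].name, "get_weather");
    assert_eq!(calls[&0].arguments, "{\"city\":\"Paris\"}");
}
```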

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

🚥 Pre-merge checks | ✅ 5 | ❌ 1

❌ Failed checks (1 warning)

Check name: E2e Test Quality Review
Status: ⚠️ Warning
Explanation: The PR adds the ResponsesApiFormat implementation with comprehensive unit tests but lacks E2E tests covering full integration with the OpenAI Responses API bridge and gateway infrastructure.
Resolution: Add integration tests demonstrating the complete request/response flow through the ResponsesApiFormat bridge with actual gateway infrastructure, or document why unit tests suffice and request an E2E waiver.
✅ Passed checks (5 passed)
Description Check: ✅ Passed. Check skipped; CodeRabbit's high-level summary is enabled.
Title Check: ✅ Passed. The title 'feat: add responses api chat format' clearly and concisely describes the main change: introducing ResponsesApiFormat, a new ChatFormat implementation for the OpenAI Responses API.
Linked Issues Check: ✅ Passed. Check skipped because no linked issues were found for this pull request.
Out of Scope Changes Check: ✅ Passed. Check skipped because no linked issues were found for this pull request.
Security Check: ✅ Passed. The security review reveals no CRITICAL or HIGH severity vulnerabilities across seven security categories. The PR introduces a format transformation layer with no credential handling, sensitive logging, database operations, authorization decisions, or cryptographic configurations.


@coderabbitai coderabbitai Bot left a comment

Actionable comments posted: 2

🧹 Nitpick comments (1)
src/gateway/formats/openai/responses.rs (1)

27-30: ⚡ Quick win

Add /// docs to the new public types.

ResponsesApiFormat is publicly re-exported, and ResponsesBridgeState is public as well, but neither has a doc comment.

Minimal doc-comment patch
+/// Bridges OpenAI Responses API requests and streams through the hub chat format.
 pub struct ResponsesApiFormat;
 
 #[derive(Debug, Clone, Default)]
+/// Stateful data used while converting hub streaming chunks into Responses API events.
 pub struct ResponsesBridgeState {

As per coding guidelines "Use /// for doc comments on public items in Rust".

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/gateway/formats/openai/responses.rs` around lines 27 - 30, Add /// doc
comments to the new public types: place a short descriptive triple-slash comment
above the ResponsesApiFormat declaration explaining its role as the API response
format adapter, and above the ResponsesBridgeState struct explaining what state
it holds and when it is used; ensure the comments are concise, use proper Rust
doc style (///), describe public behavior or purpose, and mention any important
invariants or usage notes for ResponsesApiFormat and ResponsesBridgeState.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@src/gateway/formats/openai/responses.rs`:
- Around line 204-258: The stream assigns different numeric indices to the same
logical output as items (tools/text) are added, because tool_output_index(state,
...) recomputes index from current presence; fix by making the output index
stable and stored in the per-item state: when handling a tool call (functions:
merge_streaming_tool_call, streaming_function_call_output_item, and where you
currently call tool_output_index), check the tool_state for a stored
output_index Option, and if None compute the index once (using the same
logic/tool_output_index helper) and persist it to tool_state.output_index before
emitting any
OutputItemAdded/OutputTextDelta/FunctionCallArgumentsDelta/Output... events;
replace subsequent calls to tool_output_index with reading
tool_state.output_index so all later delta/done/completed events use the same
stable index. Apply the same pattern in the other referenced blocks (around
lines ~292-305, ~749-786, ~788-823).
- Around line 71-98: The bridge currently accepts requests with
previous_response_id but to_hub() only forwards the current input, so follow-ups
with FunctionCallOutput become orphaned tool messages; fix this by updating
ensure_request_is_bridgeable() to reject any request that has
previous_response_id (and/or any ResponsesInput item that represents a
FunctionCallOutput continuation) by returning an Err when
req.previous_response_id.is_some() (and the analogous predicate for input
items), so to_hub(), responses_input_item_to_hub_message, and BridgeContext are
never invoked for these unresolved continuations; reference
ensure_request_is_bridgeable, to_hub, BridgeContext, previous_response_id, and
responses_input_item_to_hub_message when making the change.
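The first inline comment above (make the output index stable by storing it in per-item state) reduces to a small pattern: compute the index once, persist it, and have every later event read the stored value. All names below are illustrative rather than the crate's actual API:

```rust
// Per-tool-call streaming state (illustrative miniature of the fix).
#[derive(Debug, Default)]
pub struct ToolState {
    // None until the first event for this call assigns an index.
    pub output_index: Option<u32>,
}

// Compute-once accessor: the first call fixes the index; later calls
// return the stored value even if a recomputation would now differ
// (e.g. because a text output item was added in the meantime).
pub fn stable_output_index(state: &mut ToolState, recomputed: u32) -> u32 {
    *state.output_index.get_or_insert(recomputed)
}

fn main() {
    let mut tool = ToolState::default();
    // First event: no index stored yet, so 1 is assigned and persisted.
    assert_eq!(stable_output_index(&mut tool, 1), 1);
    // The naive recomputation has shifted to 2, but the stored value
    // keeps every later delta/done/completed event on index 1.
    assert_eq!(stable_output_index(&mut tool, 2), 1);
}
```

This is why the comment asks that tool_output_index be called at most once per item, with its result written to tool_state.output_index before any events are emitted.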

---

Nitpick comments:
In `@src/gateway/formats/openai/responses.rs`:
- Around line 27-30: Add /// doc comments to the new public types: place a short
descriptive triple-slash comment above the ResponsesApiFormat declaration
explaining its role as the API response format adapter, and above the
ResponsesBridgeState struct explaining what state it holds and when it is used;
ensure the comments are concise, use proper Rust doc style (///), describe
public behavior or purpose, and mention any important invariants or usage notes
for ResponsesApiFormat and ResponsesBridgeState.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 157487af-ca4d-4937-bd88-a5868193fa8e

📥 Commits

Reviewing files that changed from the base of the PR and between ee53cd6 and 827f0e1.

📒 Files selected for processing (4)
  • src/gateway/formats/mod.rs
  • src/gateway/formats/openai/mod.rs
  • src/gateway/formats/openai/responses.rs
  • src/gateway/traits/native.rs

Comment on lines +71 to +98
fn to_hub(req: &Self::Request) -> Result<(ChatCompletionRequest, BridgeContext)> {
    ensure_request_is_bridgeable(req)?;

    let mut messages = Vec::new();
    if let Some(instructions) = req.instructions.as_ref().filter(|text| !text.is_empty()) {
        messages.push(ChatMessage {
            role: "system".into(),
            content: Some(MessageContent::Text(instructions.clone())),
            name: None,
            tool_calls: None,
            tool_call_id: None,
        });
    }

    match &req.input {
        ResponsesInput::Text(text) => messages.push(ChatMessage {
            role: "user".into(),
            content: Some(MessageContent::Text(text.clone())),
            name: None,
            tool_calls: None,
            tool_call_id: None,
        }),
        ResponsesInput::Items(items) => {
            for item in items {
                messages.push(responses_input_item_to_hub_message(item)?);
            }
        }
    }

⚠️ Potential issue | 🟠 Major | 🏗️ Heavy lift

Reject previous_response_id tool-result continuations until the bridge can reconstruct history.

to_hub() only forwards the current input, while previous_response_id is just echoed into BridgeContext. A follow-up request containing FunctionCallOutput therefore turns into a standalone role="tool" message with no preceding assistant tool_calls message in the Chat Completions payload, which will break multi-turn tool-calling flows.

Either resolve previous_response_id into prior messages before building the hub request, or mark these requests as non-bridgeable in ensure_request_is_bridgeable() for now.

Temporary guard if history reconstruction is not ready yet
 fn ensure_request_is_bridgeable(request: &ResponsesApiRequest) -> Result<()> {
+    let has_function_call_output = matches!(
+        &request.input,
+        ResponsesInput::Items(items)
+            if items.iter().any(|item| matches!(item, ResponsesInputItem::FunctionCallOutput { .. }))
+    );
+
+    if request.previous_response_id.is_some() || has_function_call_output {
+        return Err(GatewayError::Bridge(
+            "Responses API continuation/tool-result turns cannot be bridged through Chat Completions yet"
+                .into(),
+        ));
+    }
+
     if request.background.unwrap_or(false) {
         return Err(GatewayError::Bridge(
             "Responses API background mode cannot be bridged through Chat Completions".into(),

Also applies to: 393-417, 420-437, 564-610

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/gateway/formats/openai/responses.rs` around lines 71 - 98, The bridge
currently accepts requests with previous_response_id but to_hub() only forwards
the current input, so follow-ups with FunctionCallOutput become orphaned tool
messages; fix this by updating ensure_request_is_bridgeable() to reject any
request that has previous_response_id (and/or any ResponsesInput item that
represents a FunctionCallOutput continuation) by returning an Err when
req.previous_response_id.is_some() (and the analogous predicate for input
items), so to_hub(), responses_input_item_to_hub_message, and BridgeContext are
never invoked for these unresolved continuations; reference
ensure_request_is_bridgeable, to_hub, BridgeContext, previous_response_id, and
responses_input_item_to_hub_message when making the change.
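The invariant this comment protects can be shown with a tiny, self-contained checker: in the Chat Completions format, a role = "tool" message is only valid after an assistant message whose tool_calls announced the matching id. The types below are illustrative, not the crate's; they exist only to show why a bridged FunctionCallOutput with no reconstructed history is rejected.

```rust
// Minimal stand-in for a hub chat message (illustrative only).
#[derive(Debug)]
pub struct Msg {
    pub role: &'static str,
    pub tool_calls: Vec<&'static str>, // ids announced by an assistant turn
    pub tool_call_id: Option<&'static str>, // id a tool result answers
}

// Returns false if any tool message answers an id no prior assistant
// message announced, i.e. the "orphaned tool message" case.
pub fn tool_messages_resolved(messages: &[Msg]) -> bool {
    let mut announced: Vec<&str> = Vec::new();
    for m in messages {
        match m.role {
            "assistant" => announced.extend(&m.tool_calls),
            "tool" => match m.tool_call_id {
                Some(id) if announced.contains(&id) => {}
                _ => return false, // orphaned tool result
            },
            _ => {}
        }
    }
    true
}

fn main() {
    // Bridging a FunctionCallOutput without resolving previous_response_id
    // yields exactly this shape: a lone tool message, which fails the check.
    let orphaned = [Msg { role: "tool", tool_calls: vec![], tool_call_id: Some("call_1") }];
    assert!(!tool_messages_resolved(&orphaned));

    // With the prior assistant turn reconstructed, the pair is valid.
    let paired = [
        Msg { role: "assistant", tool_calls: vec!["call_1"], tool_call_id: None },
        Msg { role: "tool", tool_calls: vec![], tool_call_id: Some("call_1") },
    ];
    assert!(tool_messages_resolved(&paired));
}
```

Rejecting such requests in ensure_request_is_bridgeable(), as the suggested guard does, keeps this invalid payload from ever reaching the hub.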

Comment thread src/gateway/formats/openai/responses.rs
@bzp2010 bzp2010 merged commit 342afdd into main May 4, 2026
3 checks passed
@bzp2010 bzp2010 deleted the bzp/feat-responses-chat-format branch May 4, 2026 12:57