CLIProxyAPI

Author	SHA1	Message	Date
Luis Pater	e7a185962d	feat(api): add request body decoding with Content-Encoding support - Introduced `ReadRequestBody` helper function to support decoding request bodies based on "Content-Encoding" (e.g., `zstd`). - Replaced `c.GetRawData()` with `ReadRequestBody` across handlers to enable decoding. - Added test case to validate `zstd` decoding for compact responses.	2026-05-16 12:19:32 +08:00
Luis Pater	15ac7fb932	refactor(auth): simplify home auth session management and remove ref counting - Consolidated `homeRuntimeAuths` to store a map of session-scoped auth maps, replacing `homeRuntimeAuthSessions` and `homeRuntimeAuthRefs`. - Adjusted session cleanup logic to directly remove session-scoped auths without reference counting. - Added `GetExecutionSessionAuthByID` to retrieve auths scoped to a specific execution session. - Updated tests to reflect the new session-scoped caching behavior.	2026-05-10 15:21:33 +08:00
Luis Pater	a44e5eb1ab	Merge branch 'v7' into dev	2026-05-10 02:33:42 +08:00
Codex	c883114a4d	fix responses websocket tool output context	2026-05-08 05:12:30 +00:00
Luis Pater	e50cabac4b	chore: upgrade CLIProxyAPI dependency to v7 across the project - Updated all references from v6 to v7 for `github.com/router-for-me/CLIProxyAPI`. - Ensured consistency in imports within core libraries, tests, and integration tests. - Added missing tests for new features in Redis Protocol integration.	2026-05-08 11:46:46 +08:00
Luis Pater	fb08b92402	feat(executor): add upstream disconnect handling for Codex WebSocket sessions - Introduced `UpstreamDisconnectChan` for Codex WebSocket sessions to notify downstream connections of upstream disconnections. - Implemented `notifyUpstreamDisconnect` to signal errors and close channels on disconnect events. - Added integration tests to validate WebSocket session behavior on upstream disconnect. - Updated OpenAI WebSocket response handlers to properly close connections upon upstream disconnect notifications.	2026-05-06 22:09:33 +08:00
Luis Pater	bdc424007e	Merge pull request #2896 from edlsh/fix/oauth-tool-rename-per-request-map fix(amp): smart-mode tool name fixes + deep-mode response repair	2026-05-05 00:58:39 +08:00
Luis Pater	8e6ef3fa64	fix(websocket): ensure state consistency on auth errors in streaming - Added logic to reset `pinnedAuthID` and replay transcript on unauthorized, forbidden, or throttling errors. - Enhanced error handling in `forwardResponsesWebsocket` with detailed status inspection. - Introduced `shouldReleaseResponsesWebsocketPinnedAuth` to determine auth reset conditions. - Updated state management to preserve prior request and response data during forced replay. Fixed: #2230	2026-05-04 05:23:23 +08:00
Luis Pater	82ebe24b9e	Merge pull request #2266 from DragonFSKY/fix/ws-compact-tool-output-mismatch fix(websocket): skip stale state merge after client-side compact	2026-05-04 04:40:43 +08:00
Luis Pater	f56a19e5b8	feat: add tri-state support for `disable-image-generation` configuration - Introduced `DisableImageGenerationMode` with support for `false`, `true`, and `chat` values. - Updated payload handling to preserve `image_generation` on images endpoints when `chat` mode is enabled. - Modified OpenAI image handlers (`ImagesGenerations`, `ImagesEdits`) to respect tri-state logic. - Added unit tests for `DisableImageGenerationMode` behavior and endpoint-specific handling. - Enhanced configuration diff logging to support `DisableImageGenerationMode`.	2026-04-30 12:10:27 +08:00
Luis Pater	e3e60f914b	feat: support disabling image generation globally - Added `disable-image-generation` configuration flag to disable the `image_generation` tool globally. - Updated payload handling to remove `image_generation` tools from request payload arrays when the flag is enabled. - Modified OpenAI image handlers (`ImagesGenerations`, `ImagesEdits`) to return 404 when the feature is disabled. - Enhanced configuration diff logging to track changes for the `disable-image-generation` flag. - Added accompanying unit tests for the new feature in payload helpers and image handler logic.	2026-04-30 03:42:27 +08:00
Luis Pater	9fb6a49260	test(api): add validation for unsupported models in OpenAI image handlers - Introduced tests to ensure unsupported models are rejected in `/images/generations` and `/images/edits`. - Added `isSupportedImagesModel` and `rejectUnsupportedImagesModel` functions for consistent model validation. - Enhanced image handler logic to apply validation checks for model compatibility.	2026-04-28 17:19:12 +08:00
edlsh	80eb03709a	fix(openai): preserve multiline repaired SSE data	2026-04-25 18:12:27 -04:00
edlsh	d36e70e9dc	fix(openai): preserve unindexed response output items	2026-04-25 18:06:00 -04:00
edlsh	fd45dece7f	fix(openai): repair empty responses stream output	2026-04-25 17:46:44 -04:00
Luis Pater	0a7c6b0a4a	feat(api): enhance model assignment logic in image handlers - Updated `buildImagesResponsesRequest` to derive `model` dynamically based on `toolJSON`. - Adjusted streaming execution to handle dynamic model resolution across multiple contexts. Closes: #2965	2026-04-26 03:24:43 +08:00
Luis Pater	a7e92e2639	feat(auth): disallow free-tier Codex auth during selection process - Introduced `disallowFreeAuthFromMetadata` and `isFreeCodexAuth` to enforce skipping free-tier credentials. - Modified scheduler logic to honor `DisallowFreeAuthMetadataKey` during auth selection. - Updated `ensureImageGenerationTool` to skip tool injection for free-tier Codex auth. - Added context utility `WithDisallowFreeAuth` and integrated with image handlers. - Augmented relevant tests to cover free-tier exclusion scenarios.	2026-04-24 23:18:56 +08:00
Luis Pater	a188159632	fix(handlers): remove references to unsupported `n` parameter in OpenAI image handlers	2026-04-22 21:28:17 +08:00
Luis Pater	fd71960c3e	fix(handlers): remove handling of unsupported `n` parameter in OpenAI image handlers	2026-04-22 21:12:50 +08:00
Luis Pater	e935196df4	feat(models): add hardcoded GPT-Image-2 model support in Codex - Added `GPT-Image-2` as a built-in model to avoid dependency on remote updates for Codex. - Updated model tier functions (`CodexFree`, `CodexTeam`, etc.) to include built-in models via `WithCodexBuiltins`. - Introduced new handlers for image generation and edit operations under `OpenAIAPIHandler`. - Extended tests to validate 503 response for unsupported image model requests.	2026-04-22 20:51:13 +08:00
DragonFSKY	4ca00f7983	fix(websocket): gate compact replay by downstream support	2026-04-07 14:25:05 +08:00
DragonFSKY	d2d0e6f6a1	fix(websocket): narrow compact replay detection	2026-04-07 14:23:44 +08:00
DragonFSKY	a0fe273081	fix(websocket): skip stale state merge after client-side compact After a Codex CLI compact, the client sends a full conversation transcript (with compaction items or assistant messages) as input. Previously, normalizeResponseSubsequentRequest() unconditionally merged this with stale lastRequest/lastResponseOutput, breaking function_call/function_call_output pairings and causing 400 errors ("No tool output found for function call"). Add inputContainsFullTranscript() heuristic that detects compaction items (type=compaction/compaction_summary) or assistant messages in the input array, and bypasses the merge when a full transcript is present. Fixes #2207	2026-04-07 14:22:53 +08:00
Kai Wang	d1fd2c4ad4	fix: repair websocket custom tool calls	2026-04-03 17:11:44 +08:00
Kai Wang	b6c6379bfa	fix: repair websocket custom tool calls	2026-04-03 17:11:42 +08:00
Kai Wang	8f0e66b72e	fix: repair websocket custom tool calls	2026-04-03 17:11:41 +08:00
Luis Pater	3e78a8d500	Merge branch 'main' into dev	2026-04-02 21:21:26 +08:00
davidwushi1145	108895fc04	Harden Responses SSE framing against partial chunk boundaries Follow-up review found two real framing hazards in the handler-layer framer: it could flush a partial `data:` payload before the JSON was complete, and it could inject an extra newline before chunks that already began with `\n`/`\r\n`. This commit tightens the framer so it only emits undelimited events when the buffered `data:` payload is already valid JSON (or `[DONE]`), skips newline injection for chunks that already start with a line break, and avoids the heavier `bytes.Split` path while scanning SSE fields. The regression suite now covers split `data:` payload chunks, newline-prefixed chunks, and dropping incomplete trailing data on flush, so the original Responses fix remains intact while the review concerns are explicitly locked down. Constraint: Keep the follow-up limited to handler-layer framing and tests Rejected: Ignore the review and rely on current executor chunk shapes \| leaves partial data payload corruption possible Rejected: Build a fully generic SSE parser \| wider change than needed for the identified risks Confidence: high Scope-risk: narrow Reversibility: clean Directive: Do not emit undelimited Responses SSE events unless buffered `data:` content is already complete and valid Tested: /tmp/go1.26.1/go/bin/go test ./sdk/api/handlers/openai -count=1 Tested: /tmp/go1.26.1/go/bin/go test ./sdk/api/handlers -count=1 Tested: /tmp/go1.26.1/go/bin/go vet ./sdk/api/handlers/... Not-tested: Full repository test suite outside sdk/api/handlers packages	2026-04-02 20:39:49 +08:00
davidwushi1145	abc293c642	Prevent malformed Responses SSE frames from breaking stream clients Line-oriented upstream executors can emit `event:` and `data:` as separate chunks, but the Responses handler had started terminating each incoming chunk as a full SSE event. That split `response.created` into an empty event plus a later data block, which broke downstream clients like OpenClaw. This keeps the fix in the handler layer: a small stateful framer now buffers standalone `event:` lines until the matching `data:` arrives, preserves already-framed events, and ignores delimiter-only leftovers. The regression suite now covers split event/data framing, full-event passthrough, terminal errors, and the bootstrap path that forwards line-oriented openai-response streams from non-Codex executors too. Constraint: Keep the fix localized to Responses handler framing instead of patching every executor Rejected: Revert to v6.9.7 chunk writing \| would reintroduce data-only framing regressions Rejected: Patch each line-oriented executor separately \| duplicates fragile SSE assembly logic Confidence: high Scope-risk: narrow Reversibility: clean Directive: Do not assume incoming Responses stream chunks are already complete SSE events; preserve handler-layer reassembly for split `event:`/`data:` inputs Tested: /tmp/go1.26.1/go/bin/go test ./sdk/api/handlers/openai -count=1 Tested: /tmp/go1.26.1/go/bin/go test ./sdk/api/handlers -count=1 Tested: /tmp/go1.26.1/go test ./sdk/api/handlers/... -count=1 Tested: /tmp/go1.26.1/go/bin/go vet ./sdk/api/handlers/... Tested: Temporary patched server on 127.0.0.1:18317 -> /v1/models 200, /v1/responses non-stream 200, /v1/responses stream emitted combined `event:` + `data:` frames Not-tested: Full repository test suite outside sdk/api/handlers packages	2026-04-02 20:26:42 +08:00
hkfires	34339f61ee	Refactor websocket logging and error handling - Introduced new logging functions for websocket requests, handshakes, errors, and responses in `logging_helpers.go`. - Updated `CodexWebsocketsExecutor` to utilize the new logging functions for improved clarity and consistency in websocket operations. - Modified the handling of websocket upgrade rejections to log relevant metadata. - Changed the request body key to a timeline body key in `openai_responses_websocket.go` to better reflect its purpose. - Enhanced tests to verify the correct logging of websocket events and responses, including disconnect events and error handling scenarios.	2026-04-02 17:30:51 +08:00
Luis Pater	c422d16beb	Merge pull request #2398 from 7RPH/fix/responses-sse-framing fix: preserve SSE event boundaries for Responses streams	2026-04-02 00:46:51 +08:00
hkfires	caa529c282	fix(openai): improve client IP retrieval in websocket handler	2026-04-01 20:16:01 +08:00
hkfires	51a4379bf4	refactor(openai): remove websocket body log truncation limit	2026-04-01 18:11:43 +08:00
Luis Pater	acf98ed10e	fix(openai): add session reference counter and cache lifecycle management for websocket tools	2026-04-01 17:28:50 +08:00
Luis Pater	d1c07a091e	fix(openai): add websocket tool call repair with caching and tests to improve transcript consistency	2026-04-01 17:16:49 +08:00
Luis Pater	ca11b236a7	refactor(runtime, openai): simplify header management and remove redundant websocket logging logic	2026-04-01 11:57:31 +08:00
apparition	a3e21df814	fix(openai): avoid developer transcript resets - Narrow websocket transcript replacement detection to assistant outputs and function calls - Preserve existing merge behavior for follow-up developer messages without previous_response_id - Add a regression test covering mid-session developer message updates	2026-03-30 23:33:16 +08:00
apparition	c1d7599829	fix(openai): handle transcript replacement after websocket compaction - Add shouldReplaceWebsocketTranscript() to detect historical model output in input - Add normalizeResponseTranscriptReplacement() for full transcript reset handling - Prevent duplicate stale turn-state when clients replace local history post-compaction - Avoid orphaned function_call items from incremental append on compact transcripts - Add unit tests for transcript replacement detection and state reset behavior	2026-03-30 22:44:58 +08:00
trph	f73d55ddaa	fix: simplify responses SSE suffix handling	2026-03-29 22:19:25 +08:00
trph	0fcc02fbea	fix: tighten responses SSE review follow-up	2026-03-29 22:10:28 +08:00
trph	c03883ccf0	fix: address responses SSE review feedback	2026-03-29 22:00:46 +08:00
trph	134a9eac9d	fix: preserve SSE event boundaries for Responses streams	2026-03-29 17:23:16 +08:00
Luis Pater	2bd646ad70	refactor: replace `sjson.Set` usage with `sjson.SetBytes` to optimize mutable JSON transformations	2026-03-19 17:58:54 +08:00
Luis Pater	dc7187ca5b	fix(websocket): pin only websocket-capable auth IDs and add corresponding test	2026-03-16 09:57:38 +08:00
hkfires	d1e3195e6f	feat(codex): register models by plan tier	2026-03-10 11:20:37 +08:00
Supra4E8C	fc2f0b6983	fix: cap websocket body log growth	2026-03-09 17:48:30 +08:00
Luis Pater	ddcf1f279d	Fixed: #1901 docker-image / docker_amd64 (push) Has been cancelled Details docker-image / docker_arm64 (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details docker-image / docker_manifest (push) Has been cancelled Details test(websocket): add tests for incremental input and prewarm handling logic - Added test cases for incremental input support based on upstream capabilities. - Introduced validation for prewarm handling of `response.create` messages locally. - Enhanced test coverage for websocket executor behavior, including payload forwarding checks. - Updated websocket implementation with prewarm and incremental input logic for better testability.	2026-03-07 13:11:28 +08:00
Luis Pater	5ebc58fab4	refactor(executor): remove legacy `connCreateSent` logic and standardize `response.create` usage for all websocket events docker-image / docker_amd64 (push) Has been cancelled Details docker-image / docker_arm64 (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details docker-image / docker_manifest (push) Has been cancelled Details - Simplified connection logic by removing `connCreateSent` and related state handling. - Updated `buildCodexWebsocketRequestBody` to always use `response.create`. - Added unit tests to validate `response.create` behavior and beta header preservation. - Dropped unsupported `response.append` and outdated `response.done` event types.	2026-03-07 09:07:23 +08:00
canxin121	49c8ec69d0	fix(openai): emit valid responses stream error chunks When /v1/responses streaming fails after headers are sent, we now emit a type=error chunk instead of an HTTP-style {error:{...}} payload, preventing AI SDK chunk validation errors.	2026-02-23 12:59:50 +08:00
Luis Pater	2789396435	fix: ensure connection-scoped headers are filtered in upstream requests - Added `connectionScopedHeaders` utility to respect "Connection" header directives. - Updated `FilterUpstreamHeaders` to remove connection-scoped headers dynamically. - Refactored and tested upstream header filtering with additional validations. - Adjusted upstream header handling during retries to replace headers safely.	2026-02-19 13:19:10 +08:00

1 2

62 Commits