CLIProxyAPI

Author	SHA1	Message	Date
VooDisss	511b8a992e	fix(codex): restore prompt cache continuity for Codex requests Prompt caching on Codex was not reliably reusable through the proxy because repeated chat-completions requests could reach the upstream without the same continuity envelope. In practice this showed up most clearly with OpenCode, where cache reads worked in the reference client but not through CLIProxyAPI, although the root cause is broader than OpenCode itself. The proxy was breaking continuity in several ways: executor-layer Codex request preparation stripped prompt_cache_retention, chat-completions translation did not preserve that field, continuity headers used a different shape than the working client behavior, and OpenAI-style Codex requests could be sent without a stable prompt_cache_key. When that happened, session_id fell back to a fresh random value per request, so upstream Codex treated repeated requests as unrelated turns instead of as part of the same cacheable context. This change fixes that by preserving caller-provided prompt_cache_retention on Codex execution paths, preserving prompt_cache_retention when translating OpenAI chat-completions requests to Codex, aligning Codex continuity headers to session_id, and introducing an explicit Codex continuity policy that derives a stable continuity key from the best available signal. The resolution order prefers an explicit prompt_cache_key, then execution session metadata, then an explicit idempotency key, then stable request-affinity metadata, then a stable client-principal hash, and finally a stable auth-ID hash when no better continuity signal exists. The same continuity key is applied to both prompt_cache_key in the request body and session_id in the request headers so repeated requests reuse the same upstream cache/session identity. The auth manager also keeps auth selection sticky for repeated request sequences, preventing otherwise-equivalent Codex requests from drifting across different upstream auth contexts and accidentally breaking cache reuse. To keep the implementation maintainable, the continuity resolution and diagnostics are centralized in a dedicated Codex continuity helper instead of being scattered across executor flow code. Regression coverage now verifies retention preservation, continuity-key precedence, stable auth-ID fallback, websocket parity, translator preservation, and auth-affinity behavior. Manual validation confirmed prompt cache reads now occur through CLIProxyAPI when using Codex via OpenCode, and the fix should also benefit other clients that rely on stable repeated Codex request continuity.	2026-03-27 17:49:29 +02:00
Luis Pater	d475aaba96	Fixed: #2274 fix(translator): omit null content fields in Codex OpenAI tool call responses	2026-03-24 01:00:57 +08:00
Luis Pater	97c0487add	Merge pull request #2223 from cnrpman/fix/codex-responses-web-search-preview-compat fix: normalize web_search_preview for codex responses	2026-03-24 00:25:37 +08:00
Junyi Du	d1df70d02f	chore: add codex builtin tool normalization logging	2026-03-20 14:08:37 +08:00
Luis Pater	2bd646ad70	refactor: replace `sjson.Set` usage with `sjson.SetBytes` to optimize mutable JSON transformations	2026-03-19 17:58:54 +08:00
Junyi Du	793840cdb4	fix: cover dated and nested codex web search aliases	2026-03-19 03:41:12 +08:00
Junyi Du	8f421de532	fix: handle sjson errors in codex tool normalization	2026-03-19 03:36:06 +08:00
Junyi Du	be2dd60ee7	fix: normalize web_search_preview for codex responses	2026-03-19 03:23:14 +08:00
Muran-prog	0b94d36c4a	test: use exact match for tool name assertion Address review feedback - drop function.name fallback and strings.Contains in favor of direct == comparison.	2026-03-14 21:45:28 +02:00
Muran-prog	c8cee6a209	fix: skip empty assistant message in tool call translation (#2132 ) When assistant has tool_calls but no text content, the translator emitted an empty message into the Responses API input array before function_call items. The API then couldn't match function_call_output to its function_call by call_id, returning: No tool output found for function call ... Only emit assistant messages that have content parts. Tool-call-only messages now produce function_call items directly. Added 9 tests for tool calling translation covering single/parallel calls, multi-turn conversations, name shortening, empty content edge cases, and call_id integrity.	2026-03-14 21:01:01 +02:00
Luis Pater	2695a99623	fix(translator): conditionally remove `service_tier` from OpenAI response processing docker-image / docker_amd64 (push) Has been cancelled Details docker-image / docker_arm64 (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details docker-image / docker_manifest (push) Has been cancelled Details	2026-03-06 11:07:22 +08:00
Luis Pater	cc8dc7f62c	Merge branch 'main' into dev docker-image / docker_amd64 (push) Has been cancelled Details docker-image / docker_arm64 (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details docker-image / docker_manifest (push) Has been cancelled Details	2026-03-05 23:13:21 +08:00
Luis Pater	a3846ea513	Merge pull request #1870 from sususu98/fix/remove-instructions-restore cleanup(translator): remove leftover instructions restore in codex responses	2026-03-05 23:12:31 +08:00
Luis Pater	0e6bb076e9	fix(translator): comment out `service_tier` removal from OpenAI response processing	2026-03-05 22:49:38 +08:00
sususu98	68a6cabf8b	style: blank unused params in codex responses translator	2026-03-05 16:42:48 +08:00
sususu98	ac0e387da1	cleanup(translator): remove leftover instructions restore in codex responses The instructions restore logic was originally needed when the proxy injected custom instructions (per-model system prompts) into requests. Since `ac802a46` removed the injection system, the proxy no longer modifies instructions before forwarding. The upstream response's instructions field now matches the client's original value, making the restore a no-op. Also removes unused sjson import. Closes router-for-me/CLIProxyAPI#1868	2026-03-05 16:34:55 +08:00
Luis Pater	5850492a93	Fixed: #1548 docker-image / docker_amd64 (push) Has been cancelled Details docker-image / docker_arm64 (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details docker-image / docker_manifest (push) Has been cancelled Details test(translator): add unit tests for fallback logic in `ConvertCodexResponseToOpenAI` model assignment	2026-03-05 12:11:54 +08:00
hkfires	914db94e79	refactor(headers): streamline User-Agent handling and introduce GeminiCLI versioning docker-image / docker_amd64 (push) Has been cancelled Details docker-image / docker_arm64 (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details docker-image / docker_manifest (push) Has been cancelled Details	2026-03-02 13:04:30 +08:00
Luis Pater	d24ea4ce2a	Merge pull request #1664 from ciberponk/pr/responses-compaction-compat docker-image / docker_amd64 (push) Has been cancelled Details docker-image / docker_arm64 (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details docker-image / docker_manifest (push) Has been cancelled Details feat: add codex responses compatibility for compaction payloads	2026-02-25 01:21:59 +08:00
Luis Pater	c3e12c5e58	Merge pull request #1654 from alexey-yanchenko/feature/pass-file-inputs Pass file input from /chat/completions and /responses to codex and claude	2026-02-24 05:53:11 +08:00
fan	afc8a0f9be	refactor: simplify context_management compatibility handling	2026-02-21 22:20:48 +08:00
ciberponk	d693d7993b	feat: support responses compaction payload compatibility for codex translator	2026-02-21 12:56:10 +08:00
Alexey Yanchenko	0cbfe7f457	Pass file input from /chat/completions and /responses to codex and claude	2026-02-20 10:25:44 +07:00
Kirill Turanskiy	5fa23c7f41	fix: handle tool call argument streaming in Codex→OpenAI translator The OpenAI Chat Completions translator was silently dropping response.function_call_arguments.delta and response.function_call_arguments.done Codex SSE events, meaning tool call arguments were never streamed incrementally to clients. Add proper handling mirroring the proven Claude translator pattern: - response.output_item.added: announce tool call (id, name, empty args) - response.function_call_arguments.delta: stream argument chunks - response.function_call_arguments.done: emit full args if no deltas - response.output_item.done: defensive fallback for backward compat State tracking via HasReceivedArgumentsDelta and HasToolCallAnnounced ensures no duplicate argument emission and correct behavior for models like codex-spark that skip delta events entirely.	2026-02-18 19:09:05 +03:00
Alexey Yanchenko	63d4de5eea	Pass cache usage from codex to openai chat completions	2026-02-15 12:04:15 +07:00
xxddff	bb9fe52f1e	Update internal/translator/codex/openai/responses/codex_openai-responses_request_test.go Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-02-10 18:24:58 +09:00
xxddff	afe4c1bfb7	更新internal/translator/codex/openai/responses/codex_openai-responses_request.go Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2026-02-10 18:24:26 +09:00
xxddff	865af9f19e	Implement test for user field deletion Add test to verify deletion of user field in response	2026-02-10 17:38:49 +09:00
xxddff	2b97cb98b5	Delete 'user' field from raw JSON Remove the 'user' field from the raw JSON as requested.	2026-02-10 17:35:54 +09:00
Luis Pater	a5a25dec57	refactor(translator, executor): remove redundant `bytes.Clone` calls for improved performance docker-image / docker_amd64 (push) Has been cancelled Details docker-image / docker_arm64 (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details docker-image / docker_manifest (push) Has been cancelled Details - Replaced all instances of `bytes.Clone` with direct references to enhance efficiency. - Simplified payload handling across executors and translators by eliminating unnecessary data duplication.	2026-02-06 03:26:29 +08:00
Luis Pater	d885b81f23	Fixed: #1403 docker-image / docker_amd64 (push) Has been cancelled Details docker-image / docker_arm64 (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details docker-image / docker_manifest (push) Has been cancelled Details fix(translator): handle "input" field transformation for OpenAI responses	2026-02-03 21:49:30 +08:00
hkfires	354f6582b2	fix(codex): convert system role to developer for codex input	2026-02-01 15:37:37 +08:00
hkfires	ac802a4646	refactor(codex): remove codex instructions injection support	2026-02-01 14:33:31 +08:00
Luis Pater	65b4e1ec6c	feat(codex): enable instruction toggling and update role terminology - Added conditional logic for Codex instruction injection based on configuration. - Updated role terminology from "user" to "developer" for better alignment with context.	2026-01-17 04:12:29 +08:00
Luis Pater	6600d58ba2	feat(codex): enhance input transformation and remove unused `safety_identifier` field docker-image / docker (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details - Added logic to transform `inputResults` into structured JSON for improved processing. - Removed redundant `safety_identifier` field in executor payload to streamline requests.	2026-01-16 19:59:01 +08:00
hkfires	220ca45f74	fix(codex): only override instructions when upstream provides them	2026-01-11 15:52:21 +08:00
hkfires	70a82d80ac	fix(codex): only override instructions in responses for OpenCode UA	2026-01-11 15:19:37 +08:00
hkfires	ac626111ac	feat(codex): add OpenCode instructions based on user agent	2026-01-11 13:36:35 +08:00
Muzhen Gaming	0b834fcb54	fix(translator): preserve built-in tools across openai<->responses - Pass through non-function tool definitions like web_search - Translate tool_choice for built-in tools and function tools - Add regression tests for built-in tool passthrough	2025-12-15 21:18:54 +08:00
hkfires	d131435e25	fix(codex): raise default reasoning effort to medium	2025-12-12 18:18:48 +08:00
Luis Pater	98596c0a3f	refactor(translator): remove `service_tier` from Codex OpenAI request payload docker-image / docker (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details	2025-11-20 20:12:06 +08:00
hkfires	1ba057112a	fix: use underscore suffix in short name mapping Replace the "~<n>" suffix with "_<n>" when generating unique short names in codex translators (Claude, Gemini, OpenAI chat). This avoids using a special character in identifiers, improving compatibility with downstream APIs while preserving length constraints.	2025-11-18 16:59:25 +08:00
Luis Pater	fd2b23592e	Fixed: #193 docker-image / docker (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details fix(translator): consolidate temperature and top_p conditionals in OpenAI Claude request Fixed: #169 fix(translator): adjust instruction strings in Codex Claude and OpenAI responses	2025-11-01 15:37:51 +08:00
Luis Pater	f6cf784cd1	refactor(translator): remove unused log dependency and comment out debug logging docker-image / docker (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details docs: add GPT-5 Codex guidelines for CLI usage - Added detailed guidelines for GPT-5 Codex in Codex CLI. - Expanded instructions on sandboxing, approvals, editing constraints, and style requirements. - Included presentation and response formatting best practices. fix(codex_instructions): update comparison logic to use prefix matching - Changed system instructions comparison to use `strings.HasPrefix` for improved flexibility.	2025-10-24 12:15:15 +08:00
Luis Pater	e6d7677373	docs: add GPT-5 Codex guidelines for internal usage docker-image / docker (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details - Added comprehensive instructions for Codex CLI harness, sandboxing, approvals, and editing constraints to `internal/misc/codex_instructions/`. - Clarified `approval_policy` configurations and scenarios requiring escalated permissions. - Provided detailed style and structure guidelines for presenting results in the Codex CLI.	2025-10-23 09:14:56 +08:00
Luis Pater	b641d90287	Fixed #91 docker-image / docker (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details refactor(translator): streamline Codex response handling and remove redundant code - Updated `ConvertCodexResponseToOpenAIResponses` logic for clarity and consistency. - Simplified `ConvertCodexResponseToOpenAIResponsesNonStream` by removing unnecessary buffer setup and scanner logic. - Switched to using `sjson.SetRaw` for improved processing of raw input strings.	2025-10-15 12:58:18 +08:00
Luis Pater	b727e4e12e	Fixed: #86 docker-image / docker (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details feat(translator): add support for single input string in Codex responses parser - Modified input parsing logic to handle cases where input is a single string instead of an array. - Added functionality to convert single string inputs into structured JSON format.	2025-10-07 02:10:59 +08:00
Luis Pater	bbdd68a8b4	feat(registry/runtime): add Gemini 2.5 model and increase buffer sizes - Added new "Gemini 2.5 Flash Image Preview" model definition, with enhanced image generation capabilities. - Increased scanner buffer size to 20,971,520 bytes across executors and translators to handle larger payloads.	2025-10-06 04:44:45 +08:00
Ben Vargas	9e3b84939f	fix(translator): remove unsupported token limit fields for Codex Responses API The OpenAI Codex Responses API (chatgpt.com/backend-api/codex/responses) rejects requests containing max_output_tokens and max_completion_tokens fields, causing Factory CLI to fail with "Unsupported parameter" errors. This fix strips these incompatible fields during request translation, allowing Factory CLI to work properly with CLIProxyAPI when using ChatGPT Plus/Pro OAuth. Fixes compatibility issue where Factory sends token limit parameters that aren't supported by the Codex Responses endpoint.	2025-09-27 15:44:33 -06:00
Luis Pater	f5dc380b63	rebuild branch docker-image / docker (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details	2025-09-25 10:32:48 +08:00

1 2

70 Commits