CLIProxyAPI

Author	SHA1	Message	Date
VooDisss	511b8a992e	fix(codex): restore prompt cache continuity for Codex requests Prompt caching on Codex was not reliably reusable through the proxy because repeated chat-completions requests could reach the upstream without the same continuity envelope. In practice this showed up most clearly with OpenCode, where cache reads worked in the reference client but not through CLIProxyAPI, although the root cause is broader than OpenCode itself. The proxy was breaking continuity in several ways: executor-layer Codex request preparation stripped prompt_cache_retention, chat-completions translation did not preserve that field, continuity headers used a different shape than the working client behavior, and OpenAI-style Codex requests could be sent without a stable prompt_cache_key. When that happened, session_id fell back to a fresh random value per request, so upstream Codex treated repeated requests as unrelated turns instead of as part of the same cacheable context. This change fixes that by preserving caller-provided prompt_cache_retention on Codex execution paths, preserving prompt_cache_retention when translating OpenAI chat-completions requests to Codex, aligning Codex continuity headers to session_id, and introducing an explicit Codex continuity policy that derives a stable continuity key from the best available signal. The resolution order prefers an explicit prompt_cache_key, then execution session metadata, then an explicit idempotency key, then stable request-affinity metadata, then a stable client-principal hash, and finally a stable auth-ID hash when no better continuity signal exists. The same continuity key is applied to both prompt_cache_key in the request body and session_id in the request headers so repeated requests reuse the same upstream cache/session identity. The auth manager also keeps auth selection sticky for repeated request sequences, preventing otherwise-equivalent Codex requests from drifting across different upstream auth contexts and accidentally breaking cache reuse. To keep the implementation maintainable, the continuity resolution and diagnostics are centralized in a dedicated Codex continuity helper instead of being scattered across executor flow code. Regression coverage now verifies retention preservation, continuity-key precedence, stable auth-ID fallback, websocket parity, translator preservation, and auth-affinity behavior. Manual validation confirmed prompt cache reads now occur through CLIProxyAPI when using Codex via OpenCode, and the fix should also benefit other clients that rely on stable repeated Codex request continuity.	2026-03-27 17:49:29 +02:00
Luis Pater	d475aaba96	Fixed: #2274 fix(translator): omit null content fields in Codex OpenAI tool call responses	2026-03-24 01:00:57 +08:00
Luis Pater	2bd646ad70	refactor: replace `sjson.Set` usage with `sjson.SetBytes` to optimize mutable JSON transformations	2026-03-19 17:58:54 +08:00
Muran-prog	0b94d36c4a	test: use exact match for tool name assertion Address review feedback - drop function.name fallback and strings.Contains in favor of direct == comparison.	2026-03-14 21:45:28 +02:00
Muran-prog	c8cee6a209	fix: skip empty assistant message in tool call translation (#2132) When assistant has tool_calls but no text content, the translator emitted an empty message into the Responses API input array before function_call items. The API then couldn't match function_call_output to its function_call by call_id, returning: No tool output found for function call ... Only emit assistant messages that have content parts. Tool-call-only messages now produce function_call items directly. Added 9 tests for tool calling translation covering single/parallel calls, multi-turn conversations, name shortening, empty content edge cases, and call_id integrity.	2026-03-14 21:01:01 +02:00
Luis Pater	5850492a93	Fixed: #1548 test(translator): add unit tests for fallback logic in `ConvertCodexResponseToOpenAI` model assignment	2026-03-05 12:11:54 +08:00
Luis Pater	c3e12c5e58	Merge pull request #1654 from alexey-yanchenko/feature/pass-file-inputs Pass file input from /chat/completions and /responses to codex and claude	2026-02-24 05:53:11 +08:00
Alexey Yanchenko	0cbfe7f457	Pass file input from /chat/completions and /responses to codex and claude	2026-02-20 10:25:44 +07:00
Kirill Turanskiy	5fa23c7f41	fix: handle tool call argument streaming in Codex→OpenAI translator The OpenAI Chat Completions translator was silently dropping response.function_call_arguments.delta and response.function_call_arguments.done Codex SSE events, meaning tool call arguments were never streamed incrementally to clients. Add proper handling mirroring the proven Claude translator pattern: - response.output_item.added: announce tool call (id, name, empty args) - response.function_call_arguments.delta: stream argument chunks - response.function_call_arguments.done: emit full args if no deltas - response.output_item.done: defensive fallback for backward compat State tracking via HasReceivedArgumentsDelta and HasToolCallAnnounced ensures no duplicate argument emission and correct behavior for models like codex-spark that skip delta events entirely.	2026-02-18 19:09:05 +03:00
Alexey Yanchenko	63d4de5eea	Pass cache usage from codex to openai chat completions	2026-02-15 12:04:15 +07:00
Luis Pater	a5a25dec57	refactor(translator, executor): remove redundant `bytes.Clone` calls for improved performance - Replaced all instances of `bytes.Clone` with direct references to enhance efficiency. - Simplified payload handling across executors and translators by eliminating unnecessary data duplication.	2026-02-06 03:26:29 +08:00
hkfires	ac802a4646	refactor(codex): remove codex instructions injection support	2026-02-01 14:33:31 +08:00
Luis Pater	65b4e1ec6c	feat(codex): enable instruction toggling and update role terminology - Added conditional logic for Codex instruction injection based on configuration. - Updated role terminology from "user" to "developer" for better alignment with context.	2026-01-17 04:12:29 +08:00
hkfires	ac626111ac	feat(codex): add OpenCode instructions based on user agent	2026-01-11 13:36:35 +08:00
Muzhen Gaming	0b834fcb54	fix(translator): preserve built-in tools across openai<->responses - Pass through non-function tool definitions like web_search - Translate tool_choice for built-in tools and function tools - Add regression tests for built-in tool passthrough	2025-12-15 21:18:54 +08:00
hkfires	d131435e25	fix(codex): raise default reasoning effort to medium	2025-12-12 18:18:48 +08:00
hkfires	1ba057112a	fix: use underscore suffix in short name mapping Replace the "~<n>" suffix with "_<n>" when generating unique short names in codex translators (Claude, Gemini, OpenAI chat). This avoids using a special character in identifiers, improving compatibility with downstream APIs while preserving length constraints.	2025-11-18 16:59:25 +08:00
Luis Pater	e6d7677373	docs: add GPT-5 Codex guidelines for internal usage - Added comprehensive instructions for Codex CLI harness, sandboxing, approvals, and editing constraints to `internal/misc/codex_instructions/`. - Clarified `approval_policy` configurations and scenarios requiring escalated permissions. - Provided detailed style and structure guidelines for presenting results in the Codex CLI.	2025-10-23 09:14:56 +08:00
Luis Pater	f5dc380b63	rebuild branch	2025-09-25 10:32:48 +08:00
Luis Pater	3f69254f43	remove all	2025-09-25 10:31:02 +08:00
Luis Pater	3ade03f3b3	feat(usage): implement usage tracking infrastructure across executors - Added `LoggerPlugin` to log usage metrics for observability. - Introduced a new `Manager` to handle usage record queuing and plugin registration. - Integrated new usage reporter and detailed metrics parsing into executors, covering providers like OpenAI, Codex, Claude, and Gemini. - Improved token usage breakdown across streaming and non-streaming responses.	2025-09-24 03:49:09 +08:00
Luis Pater	f81898c906	feat: introduce custom provider example and remove redundant debug logs - Added `examples/custom-provider/main.go` showcasing custom executor and translator integration using the SDK. - Removed redundant debug logs from translator modules to enhance code cleanliness. - Updated SDK documentation with new usage and advanced examples. - Expanded the management API with new endpoints, including request logging and GPT-5 Codex features.	2025-09-22 03:37:53 +08:00
Luis Pater	d9ad65622a	refactor: standardize constant naming and improve file-based auth handling - Renamed constants from uppercase to CamelCase for consistency. - Replaced redundant file-based auth handling logic with the new `util.CountAuthFiles` helper. - Fixed various error-handling inconsistencies and enhanced robustness in file operations. - Streamlined auth client reload logic in server and watcher components. - Applied minor code readability improvements across multiple packages.	2025-09-22 02:56:45 +08:00
Luis Pater	4999fce7f4	v6 version first commit	2025-09-22 01:40:24 +08:00
Luis Pater	869a3dfbb4	feat: implement model-specific Codex instructions for GPT-5 - Added `CodexInstructions(modelName string)` function to dynamically select instructions based on the model (e.g., GPT-5 Codex). - Introduced `gpt_5_instructions.txt` and `gpt_5_codex_instructions.txt` for respective model configurations. - Updated translators to pass `modelName` and use the new instruction logic.	2025-09-19 08:47:54 +08:00
Luis Pater	9fce13fe03	Update internal module imports to use `v5` package path - Updated all `github.com/luispater/CLIProxyAPI/internal/...` imports to point to `github.com/luispater/CLIProxyAPI/v5/internal/...`. - Adjusted `go.mod` to specify `module github.com/luispater/CLIProxyAPI/v5`.	2025-09-13 23:34:32 +08:00
kaixxx	aacfcae382	Codex CLI - setting 'store = false' store = true leads to: BadRequestError("Error code: 400 - {'detail': 'Store must be set to false'}")	2025-09-12 00:59:49 +02:00
Luis Pater	54ffb52838	Add `FunctionCallIndex` to `ConvertCliToOpenAIParams` and enhance tool call handling - Introduced `FunctionCallIndex` to track and manage function call indices within `ConvertCliToOpenAIParams`. - Enhanced handling for `response.completed` and `response.output_item.done` data types to support tool call scenarios. - Improved logic for restoring original tool names and setting function arguments during response parsing.	2025-09-05 09:02:24 +08:00
Luis Pater	ad943b2d4d	Add reverse mappings for original tool names and improve error logging - Introduced reverse mapping logic for tool names in translators to restore original names when shortened. - Enhanced error handling by logging API response errors consistently across handlers. - Refactored request and response loggers to include API error details, improving debugging capabilities. - Integrated robust tool name shortening and uniqueness mechanisms for OpenAI, Gemini, and Claude requests. - Improved handler retry logic to properly capture and respond to errors.	2025-09-04 02:39:56 +08:00
Luis Pater	3acdec51bd	Add OpenAI Responses support	2025-09-03 00:15:35 +08:00
Luis Pater	e3994ace33	Refactor translator packages for OpenAI Chat Completions - Renamed `openai` packages to `chat_completions` across translator modules. - Introduced `openai_responses_handlers` with handlers for `/v1/models` and OpenAI-compatible chat completions endpoints. - Updated constants and registry identifiers for OpenAI response type. - Simplified request/response conversions and added detailed retry/error handling. - Added `golang.org/x/crypto` for additional cryptographic functions.	2025-09-01 08:18:59 +08:00

31 Commits
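
Note on 511b8a992e: the continuity-key precedence described in that commit message can be sketched roughly as below. This is only an illustration of the stated resolution order, not the code from the commit. The type `continuitySignals`, the functions `resolveContinuityKey` and `stableHash`, the field names, and the choice of a truncated SHA-256 digest are all assumptions made for the example.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"strings"
)

// continuitySignals is a hypothetical container for the inputs named in the
// commit message. The real project may model these differently.
type continuitySignals struct {
	PromptCacheKey   string // explicit prompt_cache_key from the caller
	ExecutionSession string // execution session metadata
	IdempotencyKey   string // explicit idempotency key
	RequestAffinity  string // stable request-affinity metadata
	ClientPrincipal  string // hashed before use
	AuthID           string // hashed before use, last resort
}

// stableHash returns a deterministic hex digest so the same input always
// maps to the same continuity key. Truncated SHA-256 is an assumption here.
func stableHash(prefix, value string) string {
	sum := sha256.Sum256([]byte(prefix + ":" + value))
	return hex.EncodeToString(sum[:16])
}

// resolveContinuityKey walks the signals in the order the commit message
// lists them and returns the first non-empty one together with its source.
func resolveContinuityKey(s continuitySignals) (key, source string) {
	direct := []struct{ source, value string }{
		{"prompt_cache_key", s.PromptCacheKey},
		{"execution_session", s.ExecutionSession},
		{"idempotency_key", s.IdempotencyKey},
		{"request_affinity", s.RequestAffinity},
	}
	for _, c := range direct {
		if v := strings.TrimSpace(c.value); v != "" {
			return v, c.source
		}
	}
	if v := strings.TrimSpace(s.ClientPrincipal); v != "" {
		return stableHash("client_principal", v), "client_principal"
	}
	if v := strings.TrimSpace(s.AuthID); v != "" {
		return stableHash("auth_id", v), "auth_id"
	}
	return "", "" // no signal available; the caller decides what to do
}
```

Per the commit message, the resolved key would then be written both to `prompt_cache_key` in the request body and to `session_id` in the request headers, so that repeated requests present the same identity upstream.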