CLIProxyAPI

Author	SHA1	Message	Date
Luis Pater	48a1c88115	Merge pull request #3476 from sususu98/fix/codex-context-length-stream-errors-dev fix codex context length stream errors	2026-05-21 02:53:54 +08:00
Luis Pater	f1ee883cd3	Merge pull request #3484 from yavon007/main Add reasoning_effort to usage event payloads	2026-05-20 12:34:40 +08:00
Luis Pater	de0394917a	feat(models): expand supported reasoning levels for Codex - Added new reasoning levels: `none`, `minimal`, and `unsupported` to Codex model configurations. - Introduced metadata sanitization and normalization for reasoning levels in API response. - Extended unit tests to cover reasoning levels validation and metadata sanitation logic.	2026-05-20 03:21:46 +08:00
yavon007	0de0ad0d36	Add reasoning effort to usage events	2026-05-19 22:10:48 +08:00
sususu98	ad868308c0	fix codex context length stream errors	2026-05-19 16:05:40 +08:00
Luis Pater	feebe6c7f2	feat(api): add OpenAI compatibility for image models - Introduced OpenAI-compatible image model support in the API, enabling integration through image generation and editing endpoints. - Added registry type for OpenAIImageModelType to classify and validate compatibility. - Implemented request handling for OpenAI-compatible image models, including JSON and multipart formats. - Enhanced executor methods to support OpenAI-compatible image streaming and non-streaming requests. - Included tests to validate model registration, streaming behavior, and multipart payload formatting.	2026-05-19 10:13:26 +08:00
Luis Pater	ad98c9549a	feat(runtime): track upstream response headers in logging and usage reporting - Added APIs to store, retrieve, and clone upstream response headers in context for detailed logging. - Updated `RecordAPIResponseMetadata`, `RecordAPIWebsocketHandshake`, and related methods to capture response headers. - Extended `UsageReporter` to include response headers in published usage records. - Enhanced payload tests to validate response headers' integrity and persistence. - Refactored `usage.Record` to support optional `ResponseHeaders` field.	2026-05-19 01:29:23 +08:00
Luis Pater	96754f5a33	refactor(api): move Codex client model handling to `registry` package - Relocated Codex client model JSON and related logic from `openai` package to `registry` for better modularity. - Updated references to use `registry.GetCodexClientModelsJSON()` in loading logic. - Extended test cases to cover additional field removals (`upgrade`, `availability_nux`).	2026-05-17 05:11:41 +08:00
Luis Pater	088ab33df8	feat(api): add Codex client models support for OpenAI API - Introduced Codex client models framework in `openai` package. - Added JSON-based model definitions (`codex_client_models.json`) for Codex, including metadata, reasoning levels, and configuration options. - Implemented handlers to load, clone, and build Codex client models with support for visibility overrides and metadata application. - Enabled sorting and prioritization of models based on configuration or runtime criteria. - Added utility functions for managing and validating model attributes.	2026-05-17 04:48:34 +08:00
Luis Pater	53d1fd6c5c	feat(api, xai): add xAI Grok video model support with API integration - Introduced new xAI `grok-imagine-video` model for video generation with configurable options (e.g., duration, size, resolution). - Implemented video-specific API endpoints (`/v1/videos`, `/v1/videos/generations`, `/v1/videos/edits`, `/v1/videos/extensions`), including request validation and model handling. - Enhanced model registry with `xaiBuiltinVideoModelID` and metadata for video capabilities. - Added unit tests to validate video model support, request structures, and API response handling. - Extended `XAIExecutor` to integrate video generation and retrieval via runtime requests.	2026-05-17 02:53:50 +08:00
Luis Pater	2ff9e33e26	feat(api, xai): integrate xAI Grok image models and extend API endpoints for image support - Added new xAI Grok image models (`grok-imagine-image`, `grok-imagine-image-quality`) with high-fidelity and aspect ratio configurations. - Extended `isSupportedImagesModel` logic to validate xAI models. - Implemented API request builders for image generation/editing with customizable options (e.g., resolution, aspect ratio, response format). - Enhanced `/v1/images` endpoints to handle xAI model capabilities, including response normalization and model-specific handlers. - Updated unit tests to validate xAI model validation, request structure, and API integration.	2026-05-17 01:30:23 +08:00
Luis Pater	e7a185962d	feat(api): add request body decoding with Content-Encoding support - Introduced `ReadRequestBody` helper function to support decoding request bodies based on "Content-Encoding" (e.g., `zstd`). - Replaced `c.GetRawData()` with `ReadRequestBody` across handlers to enable decoding. - Added test case to validate `zstd` decoding for compact responses.	2026-05-16 12:19:32 +08:00
Luis Pater	15ac7fb932	refactor(auth): simplify home auth session management and remove ref counting - Consolidated `homeRuntimeAuths` to store a map of session-scoped auth maps, replacing `homeRuntimeAuthSessions` and `homeRuntimeAuthRefs`. - Adjusted session cleanup logic to directly remove session-scoped auths without reference counting. - Added `GetExecutionSessionAuthByID` to retrieve auths scoped to a specific execution session. - Updated tests to reflect the new session-scoped caching behavior.	2026-05-10 15:21:33 +08:00
Luis Pater	a44e5eb1ab	Merge branch 'v7' into dev	2026-05-10 02:33:42 +08:00
Luis Pater	1721994111	feat(management): expose additional OAuth and configuration helpers - Added new helper methods for OAuth session management (`RegisterOAuthSession`, `CompleteOAuthSession`, etc.). - Introduced `WriteConfig` for persisting management configurations. - Exported `Handler` type and `NewHandler` constructors for SDK consumers.	2026-05-09 00:23:45 +08:00
Codex	c883114a4d	fix responses websocket tool output context	2026-05-08 05:12:30 +00:00
Luis Pater	e50cabac4b	chore: upgrade CLIProxyAPI dependency to v7 across the project - Updated all references from v6 to v7 for `github.com/router-for-me/CLIProxyAPI`. - Ensured consistency in imports within core libraries, tests, and integration tests. - Added missing tests for new features in Redis Protocol integration.	2026-05-08 11:46:46 +08:00
Luis Pater	fb08b92402	feat(executor): add upstream disconnect handling for Codex WebSocket sessions - Introduced `UpstreamDisconnectChan` for Codex WebSocket sessions to notify downstream connections of upstream disconnections. - Implemented `notifyUpstreamDisconnect` to signal errors and close channels on disconnect events. - Added integration tests to validate WebSocket session behavior on upstream disconnect. - Updated OpenAI WebSocket response handlers to properly close connections upon upstream disconnect notifications.	2026-05-06 22:09:33 +08:00
Luis Pater	ba5d8ca733	feat(usage): add support for requested model alias handling - Introduced methods for setting and retrieving model aliases in execution and usage contexts. - Enhanced `UsageReporter` and related structures to include client-requested aliases. - Updated tests to validate alias propagation and ensure correct usage reporting. - Adjusted metadata handling in CLIProxyAPI executors to address alias integration.	2026-05-05 01:47:53 +08:00
Luis Pater	bdc424007e	Merge pull request #2896 from edlsh/fix/oauth-tool-rename-per-request-map fix(amp): smart-mode tool name fixes + deep-mode response repair	2026-05-05 00:58:39 +08:00
Luis Pater	8e6ef3fa64	fix(websocket): ensure state consistency on auth errors in streaming - Added logic to reset `pinnedAuthID` and replay transcript on unauthorized, forbidden, or throttling errors. - Enhanced error handling in `forwardResponsesWebsocket` with detailed status inspection. - Introduced `shouldReleaseResponsesWebsocketPinnedAuth` to determine auth reset conditions. - Updated state management to preserve prior request and response data during forced replay. Fixed: #2230	2026-05-04 05:23:23 +08:00
Luis Pater	82ebe24b9e	Merge pull request #2266 from DragonFSKY/fix/ws-compact-tool-output-mismatch fix(websocket): skip stale state merge after client-side compact	2026-05-04 04:40:43 +08:00
Luis Pater	4035abc0cd	refactor(logging): replace gin-specific context handling with generic context-based request metadata utilities - Introduced reusable utilities in `requestmeta` to manage endpoint and response status in request contexts. - Refactored plugins and handlers to use context-based metadata, removing direct dependency on `gin`. - Updated tests to validate new context utilities and replaced `gin`-based context handling. Fixed: #3166	2026-04-30 23:36:07 +08:00
Luis Pater	f56a19e5b8	feat: add tri-state support for `disable-image-generation` configuration - Introduced `DisableImageGenerationMode` with support for `false`, `true`, and `chat` values. - Updated payload handling to preserve `image_generation` on images endpoints when `chat` mode is enabled. - Modified OpenAI image handlers (`ImagesGenerations`, `ImagesEdits`) to respect tri-state logic. - Added unit tests for `DisableImageGenerationMode` behavior and endpoint-specific handling. - Enhanced configuration diff logging to support `DisableImageGenerationMode`.	2026-04-30 12:10:27 +08:00
Luis Pater	e3e60f914b	feat: support disabling image generation globally - Added `disable-image-generation` configuration flag to disable the `image_generation` tool globally. - Updated payload handling to remove `image_generation` tools from request payload arrays when the flag is enabled. - Modified OpenAI image handlers (`ImagesGenerations`, `ImagesEdits`) to return 404 when the feature is disabled. - Enhanced configuration diff logging to track changes for the `disable-image-generation` flag. - Added accompanying unit tests for the new feature in payload helpers and image handler logic.	2026-04-30 03:42:27 +08:00
Luis Pater	9fb6a49260	test(api): add validation for unsupported models in OpenAI image handlers - Introduced tests to ensure unsupported models are rejected in `/images/generations` and `/images/edits`. - Added `isSupportedImagesModel` and `rejectUnsupportedImagesModel` functions for consistent model validation. - Enhanced image handler logic to apply validation checks for model compatibility.	2026-04-28 17:19:12 +08:00
Luis Pater	a325533f20	Merge pull request #2972 from XYenon/feat/amp-thread-id feat: support X-Amp-Thread-Id for session affinity	2026-04-26 23:30:12 +08:00
edlsh	80eb03709a	fix(openai): preserve multiline repaired SSE data	2026-04-25 18:12:27 -04:00
edlsh	d36e70e9dc	fix(openai): preserve unindexed response output items	2026-04-25 18:06:00 -04:00
edlsh	fd45dece7f	fix(openai): repair empty responses stream output	2026-04-25 17:46:44 -04:00
Luis Pater	0a7c6b0a4a	feat(api): enhance model assignment logic in image handlers - Updated `buildImagesResponsesRequest` to derive `model` dynamically based on `toolJSON`. - Adjusted streaming execution to handle dynamic model resolution across multiple contexts. Closes: #2965	2026-04-26 03:24:43 +08:00
Luis Pater	a7e92e2639	feat(auth): disallow free-tier Codex auth during selection process - Introduced `disallowFreeAuthFromMetadata` and `isFreeCodexAuth` to enforce skipping free-tier credentials. - Modified scheduler logic to honor `DisallowFreeAuthMetadataKey` during auth selection. - Updated `ensureImageGenerationTool` to skip tool injection for free-tier Codex auth. - Added context utility `WithDisallowFreeAuth` and integrated with image handlers. - Augmented relevant tests to cover free-tier exclusion scenarios.	2026-04-24 23:18:56 +08:00
XYenon	8e49c795f5	fix: forward HTTP headers to executor Options so session affinity can read X-Amp-Thread-Id	2026-04-23 15:34:31 +08:00
Luis Pater	a188159632	fix(handlers): remove references to unsupported `n` parameter in OpenAI image handlers	2026-04-22 21:28:17 +08:00
Luis Pater	fd71960c3e	fix(handlers): remove handling of unsupported `n` parameter in OpenAI image handlers	2026-04-22 21:12:50 +08:00
Luis Pater	e935196df4	feat(models): add hardcoded GPT-Image-2 model support in Codex - Added `GPT-Image-2` as a built-in model to avoid dependency on remote updates for Codex. - Updated model tier functions (`CodexFree`, `CodexTeam`, etc.) to include built-in models via `WithCodexBuiltins`. - Introduced new handlers for image generation and edit operations under `OpenAIAPIHandler`. - Extended tests to validate 503 response for unsupported image model requests.	2026-04-22 20:51:13 +08:00
Luis Pater	f5dc6483d5	chore: remove iFlow-related modules and dependencies - Deleted `iflow` provider implementation, including thinking configuration (`apply.go`) and authentication modules. - Removed iFlow-specific tests, executors, and helpers across SDK and internal components. - Updated all references to exclude iFlow functionality.	2026-04-17 01:07:12 +08:00
Luis Pater	7b03f04670	fix(handlers): include execution session metadata and skip idempotency key when absent - Refactored `requestExecutionMetadata` to handle empty `Idempotency-Key` gracefully. - Added test to validate metadata inclusion of execution session without idempotency key.	2026-04-16 21:44:32 +08:00
sususu98	7c24d54ca8	feat(session-affinity): add session-sticky routing for multi-account load balancing When multiple auth credentials are configured, requests from the same session are now routed to the same credential, improving upstream prompt cache hit rates and maintaining context continuity. Core components: - SessionAffinitySelector: wraps RoundRobin/FillFirst selectors with session-to-auth binding; automatic failover when bound auth is unavailable, re-binding via the fallback selector for even distribution - SessionCache: TTL-based in-memory cache with background cleanup goroutine, supporting per-session and per-auth invalidation - StoppableSelector interface: lifecycle hook for selectors holding resources, called during Manager.StopAutoRefresh() Session ID extraction priority (extractSessionIDs): 1. metadata.user_id with Claude Code session format (old user_{hash}_session_{uuid} and new JSON {session_id} format) 2. X-Session-ID header (generic client support) 3. metadata.user_id (non-Claude format, used as-is) 4. conversation_id field 5. Stable FNV hash from system prompt + first user/assistant messages (fallback for clients with no explicit session ID); returns both a full hash (primaryID) and a short hash without assistant content (fallbackID) to inherit bindings from the first turn Multi-format message hash covers OpenAI messages, Claude system array, Gemini contents/systemInstruction, and OpenAI Responses API input items (including inline messages with role but no type field). Configuration (config.yaml routing section): - session-affinity: bool (default false) - session-affinity-ttl: duration string (default "1h") - claude-code-session-affinity: bool (deprecated, alias for above) All three fields trigger selector rebuild on config hot reload. Side effect: Idempotency-Key header is no longer auto-generated with a random UUID when absent — only forwarded when explicitly provided by the client, to avoid polluting session hash extraction.	2026-04-16 00:18:47 +08:00
Luis Pater	8fac29631d	chore: remove Qwen support from SDK and internal components - Deleted `QwenAuthenticator`, internal `qwen_auth`, and `qwen_executor` implementations. - Removed all Qwen-related OAuth flows, token handling, and execution logic. - Cleaned up dependencies and references to Qwen across the codebase.	2026-04-15 12:16:08 +08:00
DragonFSKY	4ca00f7983	fix(websocket): gate compact replay by downstream support	2026-04-07 14:25:05 +08:00
DragonFSKY	d2d0e6f6a1	fix(websocket): narrow compact replay detection	2026-04-07 14:23:44 +08:00
DragonFSKY	a0fe273081	fix(websocket): skip stale state merge after client-side compact After a Codex CLI compact, the client sends a full conversation transcript (with compaction items or assistant messages) as input. Previously, normalizeResponseSubsequentRequest() unconditionally merged this with stale lastRequest/lastResponseOutput, breaking function_call/function_call_output pairings and causing 400 errors ("No tool output found for function call"). Add inputContainsFullTranscript() heuristic that detects compaction items (type=compaction/compaction_summary) or assistant messages in the input array, and bypasses the merge when a full transcript is present. Fixes #2207	2026-04-07 14:22:53 +08:00
zilianpn	0ea768011b	fix(auth): honor disable-cooling and enrich no-auth errors	2026-04-07 01:12:13 +08:00
Luis Pater	f389667ec3	Merge pull request #2513 from lonr-6/codex/fix-ws-custom-tool-repair-v2 fix: repair responses websocket custom tool call pairing	2026-04-03 23:45:38 +08:00
Luis Pater	adb580b344	feat(security): add configuration to toggle Gemini CLI endpoint access Closes: #2445	2026-04-03 21:46:49 +08:00
Luis Pater	06405f2129	fix(security): enforce stricter localhost validation for GeminiCLIAPIHandler Closes: #2445	2026-04-03 21:22:03 +08:00
Kai Wang	d1fd2c4ad4	fix: repair websocket custom tool calls	2026-04-03 17:11:44 +08:00
Kai Wang	b6c6379bfa	fix: repair websocket custom tool calls	2026-04-03 17:11:42 +08:00
Kai Wang	8f0e66b72e	fix: repair websocket custom tool calls	2026-04-03 17:11:41 +08:00

1 2 3

150 Commits