CLIProxyAPI

Author	SHA1	Message	Date
Luis Pater	8e6ef3fa64	fix(websocket): ensure state consistency on auth errors in streaming - Added logic to reset `pinnedAuthID` and replay transcript on unauthorized, forbidden, or throttling errors. - Enhanced error handling in `forwardResponsesWebsocket` with detailed status inspection. - Introduced `shouldReleaseResponsesWebsocketPinnedAuth` to determine auth reset conditions. - Updated state management to preserve prior request and response data during forced replay. Fixed: #2230	2026-05-04 05:23:23 +08:00
Luis Pater	82ebe24b9e	Merge pull request #2266 from DragonFSKY/fix/ws-compact-tool-output-mismatch fix(websocket): skip stale state merge after client-side compact	2026-05-04 04:40:43 +08:00
Luis Pater	18bb9c315f	chore: remove usage tracking and logging functionality - Deleted the `LoggerPlugin` along with associated usage tracking and in-memory statistics logic. - Removed all related tests (`logger_plugin_test.go`, `usage_tab_test.go`) and external-facing handler (`usage.go`) for usage statistics export/import. - Cleaned up TUI integration by deleting `usage_tab.go`.	2026-05-02 04:50:58 +08:00
Luis Pater	b8bba053fc	feat: add tracking for auth request success and failure counts - Introduced `Success` and `Failed` fields in auth records to track request outcomes. - Updated `/v0/management/auth-files` and `/v0/management/api-key-usage` responses to include success and failure counts. - Enhanced tests to validate tracking logic and API responses.	2026-05-02 03:40:00 +08:00
Luis Pater	b0dc9df887	feat: add API key usage endpoint with provider and key grouping - Implemented `GetAPIKeyUsage` to expose recent request data grouped by provider and API key. - Added supporting function `mergeRecentRequestBuckets` for bucket aggregation. - Registered new endpoint `/v0/management/api-key-usage` in the management API. - Included extensive unit tests for provider and key-based grouping validation. - Updated `formatRecentRequestBucketLabel` to support configurable bucket duration.	2026-05-01 23:34:18 +08:00
Luis Pater	6187919000	feat: add support for recent request tracking in auth records - Implemented `RecentRequestsSnapshot` in `Auth` to capture bucketed recent request data. - Added new fields and methods to `Auth` for tracking request success and failure counts over time. - Updated `/v0/management/auth-files` response to include recent request data for each auth record. - Introduced unit tests to validate request tracking and snapshot generation logic.	2026-05-01 22:55:22 +08:00
Luis Pater	4035abc0cd	refactor(logging): replace gin-specific context handling with generic context-based request metadata utilities - Introduced reusable utilities in `requestmeta` to manage endpoint and response status in request contexts. - Refactored plugins and handlers to use context-based metadata, removing direct dependency on `gin`. - Updated tests to validate new context utilities and replaced `gin`-based context handling. Fixed: #3166	2026-04-30 23:36:07 +08:00
Luis Pater	f56a19e5b8	feat: add tri-state support for `disable-image-generation` configuration - Introduced `DisableImageGenerationMode` with support for `false`, `true`, and `chat` values. - Updated payload handling to preserve `image_generation` on images endpoints when `chat` mode is enabled. - Modified OpenAI image handlers (`ImagesGenerations`, `ImagesEdits`) to respect tri-state logic. - Added unit tests for `DisableImageGenerationMode` behavior and endpoint-specific handling. - Enhanced configuration diff logging to support `DisableImageGenerationMode`.	2026-04-30 12:10:27 +08:00
Luis Pater	e3e60f914b	feat: support disabling image generation globally - Added `disable-image-generation` configuration flag to disable the `image_generation` tool globally. - Updated payload handling to remove `image_generation` tools from request payload arrays when the flag is enabled. - Modified OpenAI image handlers (`ImagesGenerations`, `ImagesEdits`) to return 404 when the feature is disabled. - Enhanced configuration diff logging to track changes for the `disable-image-generation` flag. - Added accompanying unit tests for the new feature in payload helpers and image handler logic.	2026-04-30 03:42:27 +08:00
Luis Pater	f95ab4cdf1	Merge pull request #3089 from XYenon/feat/session-affinity feat: support Codex/PI session headers for session affinity	2026-04-28 23:54:16 +08:00
Luis Pater	9fb6a49260	test(api): add validation for unsupported models in OpenAI image handlers - Introduced tests to ensure unsupported models are rejected in `/images/generations` and `/images/edits`. - Added `isSupportedImagesModel` and `rejectUnsupportedImagesModel` functions for consistent model validation. - Enhanced image handler logic to apply validation checks for model compatibility.	2026-04-28 17:19:12 +08:00
XYenon	3ac39dcc7d	feat: support Codex/PI session headers for session affinity Amp-Thread-ID: https://ampcode.com/threads/T-019dce25-c070-773a-ac52-11c541220b30 Co-authored-by: Amp <amp@ampcode.com>	2026-04-27 17:10:50 +08:00
Luis Pater	a325533f20	Merge pull request #2972 from XYenon/feat/amp-thread-id feat: support X-Amp-Thread-Id for session affinity	2026-04-26 23:30:12 +08:00
Luis Pater	38573050aa	feat(config): add support for disabling OpenAI compatibility providers - Introduced a `Disabled` flag to OpenAI compatibility configurations. - Updated routing, auth selection, and API handling logic to respect the `Disabled` state. - Extended relevant APIs, YAML configurations, and data structures to include the `Disabled` field. - Adjusted all relevant loops and filters to skip disabled providers. Closes: #3060 #3059 #2977	2026-04-26 21:49:36 +08:00
Luis Pater	0a7c6b0a4a	feat(api): enhance model assignment logic in image handlers - Updated `buildImagesResponsesRequest` to derive `model` dynamically based on `toolJSON`. - Adjusted streaming execution to handle dynamic model resolution across multiple contexts. Closes: #2965	2026-04-26 03:24:43 +08:00
Luis Pater	28d78273e4	feat(api): implement protocol multiplexer and Redis queue for usage integration - Added `protocol_multiplexer.go`, enabling support for both HTTP and Redis protocols on a single listener. - Introduced `redis_queue_protocol.go` to handle Redis-compatible RESP commands for queue management. - Integrated `redisqueue` package, supporting in-memory queuing with expiration pruning. - Updated server initialization to manage a shared listener and multiplex connections. - Adjusted `Handler` to adopt `AuthenticateManagementKey` for modular key validation, supporting both HTTP and Redis flows.	2026-04-25 18:52:24 +08:00
Luis Pater	a7e92e2639	feat(auth): disallow free-tier Codex auth during selection process - Introduced `disallowFreeAuthFromMetadata` and `isFreeCodexAuth` to enforce skipping free-tier credentials. - Modified scheduler logic to honor `DisallowFreeAuthMetadataKey` during auth selection. - Updated `ensureImageGenerationTool` to skip tool injection for free-tier Codex auth. - Added context utility `WithDisallowFreeAuth` and integrated with image handlers. - Augmented relevant tests to cover free-tier exclusion scenarios.	2026-04-24 23:18:56 +08:00
sususu98	5f5d5936fa	fix antigravity credits stream fallback	2026-04-24 15:47:18 +08:00
sususu98	f130846ec1	fix(auth): break credits cold-start deadlock by keeping unknown-hint auths as fallback candidates Replace antigravityCreditsAvailableForModel with inline known/unknown split. Auths whose credit hints are not yet populated are kept as lower-priority candidates instead of being rejected, breaking the chicken-and-egg deadlock at cold start.	2026-04-23 22:47:51 +08:00
sususu98	e75daa299b	fix(antigravity): respect pinned auth in credits fallback, release deferred body on success - findAllAntigravityCreditsCandidateAuths now filters by PinnedAuthMetadataKey to prevent credential isolation violations during credits fallback - Release deferredBody reference on success path to avoid holding large payloads in memory for the lifetime of the gin context	2026-04-23 17:38:02 +08:00
XYenon	8e49c795f5	fix: forward HTTP headers to executor Options so session affinity can read X-Amp-Thread-Id	2026-04-23 15:34:31 +08:00
sususu98	4de5c29f86	fix(antigravity): remove credits fallback from CountTokens, fix gofmt CountTokens upstream API does not support enabledCreditTypes, so remove the dead credits fallback path from ExecuteCount and delete the unused tryAntigravityCreditsExecuteCount method. Fix gofmt on credits test file.	2026-04-23 15:17:00 +08:00
XYenon	4d6457e6ec	feat: support extracting X-Amp-Thread-Id header as session id for session affinity	2026-04-23 13:49:00 +08:00
sususu98	14d46a0a5d	feat(antigravity): conductor-level credits fallback for Claude models Move credits handling from executor-level retry to conductor-level orchestration. When all free-tier auths are exhausted (429/503), the conductor discovers auths with available Google One AI credits and retries with enabledCreditTypes injected via context flag. Key changes: - Add AntigravityCreditsHint system for tracking per-auth credits state - Conductor tries credits fallback after all auths fail (Execute/Stream/Count) - Executor injects enabledCreditTypes only when conductor sets context flag - Credits fallback respects provider scope (requires antigravity in providers) - Add context cancellation check in credits fallback to avoid wasted requests - Remove executor-level attemptCreditsFallback and preferCredits machinery - Restructure 429 decision logic (parse details first, keyword fallback) - Expand shouldAbort to cover INVALID_ARGUMENT/FAILED_PRECONDITION/500+UNKNOWN - Support human-readable retry delay parsing (e.g. "1h43m56s")	2026-04-23 13:44:20 +08:00
Luis Pater	a188159632	fix(handlers): remove references to unsupported `n` parameter in OpenAI image handlers	2026-04-22 21:28:17 +08:00
Luis Pater	fd71960c3e	fix(handlers): remove handling of unsupported `n` parameter in OpenAI image handlers	2026-04-22 21:12:50 +08:00
Luis Pater	e935196df4	feat(models): add hardcoded GPT-Image-2 model support in Codex - Added `GPT-Image-2` as a built-in model to avoid dependency on remote updates for Codex. - Updated model tier functions (`CodexFree`, `CodexTeam`, etc.) to include built-in models via `WithCodexBuiltins`. - Introduced new handlers for image generation and edit operations under `OpenAIAPIHandler`. - Extended tests to validate 503 response for unsupported image model requests.	2026-04-22 20:51:13 +08:00
Luis Pater	e6866ff19c	feat(auth): add refresh backoff for ineffective token updates - Introduced `refreshIneffectiveBackoff` to prevent tight-looping in auto-refresh when token refresh fails to update expiry. - Adjusted refresh logic to apply backoff when `shouldRefresh` evaluates true. Closes: #2830	2026-04-20 15:40:43 +08:00
Luis Pater	f5dc6483d5	chore: remove iFlow-related modules and dependencies - Deleted `iflow` provider implementation, including thinking configuration (`apply.go`) and authentication modules. - Removed iFlow-specific tests, executors, and helpers across SDK and internal components. - Updated all references to exclude iFlow functionality.	2026-04-17 01:07:12 +08:00
Luis Pater	7b03f04670	fix(handlers): include execution session metadata and skip idempotency key when absent - Refactored `requestExecutionMetadata` to handle empty `Idempotency-Key` gracefully. - Added test to validate metadata inclusion of execution session without idempotency key.	2026-04-16 21:44:32 +08:00
sususu98	7c24d54ca8	feat(session-affinity): add session-sticky routing for multi-account load balancing When multiple auth credentials are configured, requests from the same session are now routed to the same credential, improving upstream prompt cache hit rates and maintaining context continuity. Core components: - SessionAffinitySelector: wraps RoundRobin/FillFirst selectors with session-to-auth binding; automatic failover when bound auth is unavailable, re-binding via the fallback selector for even distribution - SessionCache: TTL-based in-memory cache with background cleanup goroutine, supporting per-session and per-auth invalidation - StoppableSelector interface: lifecycle hook for selectors holding resources, called during Manager.StopAutoRefresh() Session ID extraction priority (extractSessionIDs): 1. metadata.user_id with Claude Code session format (old user_{hash}_session_{uuid} and new JSON {session_id} format) 2. X-Session-ID header (generic client support) 3. metadata.user_id (non-Claude format, used as-is) 4. conversation_id field 5. Stable FNV hash from system prompt + first user/assistant messages (fallback for clients with no explicit session ID); returns both a full hash (primaryID) and a short hash without assistant content (fallbackID) to inherit bindings from the first turn Multi-format message hash covers OpenAI messages, Claude system array, Gemini contents/systemInstruction, and OpenAI Responses API input items (including inline messages with role but no type field). Configuration (config.yaml routing section): - session-affinity: bool (default false) - session-affinity-ttl: duration string (default "1h") - claude-code-session-affinity: bool (deprecated, alias for above) All three fields trigger selector rebuild on config hot reload. Side effect: Idempotency-Key header is no longer auto-generated with a random UUID when absent — only forwarded when explicitly provided by the client, to avoid polluting session hash extraction.	2026-04-16 00:18:47 +08:00
Luis Pater	8fac29631d	chore: remove Qwen support from SDK and internal components - Deleted `QwenAuthenticator`, internal `qwen_auth`, and `qwen_executor` implementations. - Removed all Qwen-related OAuth flows, token handling, and execution logic. - Cleaned up dependencies and references to Qwen across the codebase.	2026-04-15 12:16:08 +08:00
Luis Pater	5bfaf8086b	feat(auth): add configurable worker pool size for auto-refresh loop - Introduced `auth-auto-refresh-workers` config option to override default concurrency. - Updated `authAutoRefreshLoop` to support customizable worker counts. - Enhanced token refresh scheduling flexibility by aligning worker pool with runtime configurations.	2026-04-12 13:56:05 +08:00
Luis Pater	6c0a1efd71	refactor(auth): simplify auth directory scanning and improve JSON processing logic - Replaced `filepath.Walk` with `os.ReadDir` for cleaner directory traversal. - Fixed `isAuthJSON` check to use `filepath.Dir` for directory comparison. - Updated auth hash cache generation and file synthesis to improve readability and maintainability.	2026-04-12 13:32:03 +08:00
Luis Pater	a583463d60	feat(auth): implement auto-refresh loop for managing auth token schedule - Introduced `authAutoRefreshLoop` to handle token refresh scheduling. - Replaced semaphore-based refresh logic in `Manager` with the new loop. - Added unit tests to verify refresh schedule logic and edge cases.	2026-04-12 02:06:40 +08:00
Luis Pater	730809d8ea	fix(auth): preserve and restore ready view cursors during index rebuilds	2026-04-09 20:26:16 +08:00
Luis Pater	5e81b65f2f	fix(auth, executor): normalize Qwen base URL, adjust RefreshLead duration, and add tests	2026-04-09 18:07:07 +08:00
Luis Pater	ad8e3964ff	fix(auth): add retry logic for 429 status with Retry-After and improve testing	2026-04-09 07:07:19 +08:00
Luis Pater	941334da79	fix(auth): handle OAuth model alias in retry logic and refine Qwen quota handling	2026-04-09 03:44:19 +08:00
hkfires	d390b95b76	fix(tests): update test cases	2026-04-08 08:53:50 +08:00
DragonFSKY	4ca00f7983	fix(websocket): gate compact replay by downstream support	2026-04-07 14:25:05 +08:00
DragonFSKY	d2d0e6f6a1	fix(websocket): narrow compact replay detection	2026-04-07 14:23:44 +08:00
DragonFSKY	a0fe273081	fix(websocket): skip stale state merge after client-side compact After a Codex CLI compact, the client sends a full conversation transcript (with compaction items or assistant messages) as input. Previously, normalizeResponseSubsequentRequest() unconditionally merged this with stale lastRequest/lastResponseOutput, breaking function_call/function_call_output pairings and causing 400 errors ("No tool output found for function call"). Add inputContainsFullTranscript() heuristic that detects compaction items (type=compaction/compaction_summary) or assistant messages in the input array, and bypasses the merge when a full transcript is present. Fixes #2207	2026-04-07 14:22:53 +08:00
Luis Pater	cad45ffa33	Merge pull request #2578 from LemonZuo/feat_socks5h feat: support socks5h scheme for proxy settings	2026-04-07 09:57:18 +08:00
Luis Pater	6a27bceec0	Merge pull request #2576 from zilianpn/fix/disable-cooling-auth-errors fix(auth): honor disable-cooling and enrich no-auth errors	2026-04-07 09:56:25 +08:00
Lemon	163d68318f	feat: support socks5h scheme for proxy settings	2026-04-07 07:46:11 +08:00
zilianpn	0ea768011b	fix(auth): honor disable-cooling and enrich no-auth errors	2026-04-07 01:12:13 +08:00
Luis Pater	ea43361492	Merge pull request #2121 from destinoantagonista-wq/main Reconcile registry model states on auth changes	2026-04-06 09:13:27 +08:00
Luis Pater	f389667ec3	Merge pull request #2513 from lonr-6/codex/fix-ws-custom-tool-repair-v2 fix: repair responses websocket custom tool call pairing	2026-04-03 23:45:38 +08:00
Luis Pater	adb580b344	feat(security): add configuration to toggle Gemini CLI endpoint access Closes: #2445	2026-04-03 21:46:49 +08:00

1 2 3 4 5 ...

420 Commits