CLIProxyAPI

Author	SHA1	Message	Date
sususu98	7c24d54ca8	feat(session-affinity): add session-sticky routing for multi-account load balancing When multiple auth credentials are configured, requests from the same session are now routed to the same credential, improving upstream prompt cache hit rates and maintaining context continuity. Core components: - SessionAffinitySelector: wraps RoundRobin/FillFirst selectors with session-to-auth binding; automatic failover when bound auth is unavailable, re-binding via the fallback selector for even distribution - SessionCache: TTL-based in-memory cache with background cleanup goroutine, supporting per-session and per-auth invalidation - StoppableSelector interface: lifecycle hook for selectors holding resources, called during Manager.StopAutoRefresh() Session ID extraction priority (extractSessionIDs): 1. metadata.user_id with Claude Code session format (old user_{hash}_session_{uuid} and new JSON {session_id} format) 2. X-Session-ID header (generic client support) 3. metadata.user_id (non-Claude format, used as-is) 4. conversation_id field 5. Stable FNV hash from system prompt + first user/assistant messages (fallback for clients with no explicit session ID); returns both a full hash (primaryID) and a short hash without assistant content (fallbackID) to inherit bindings from the first turn Multi-format message hash covers OpenAI messages, Claude system array, Gemini contents/systemInstruction, and OpenAI Responses API input items (including inline messages with role but no type field). Configuration (config.yaml routing section): - session-affinity: bool (default false) - session-affinity-ttl: duration string (default "1h") - claude-code-session-affinity: bool (deprecated, alias for above) All three fields trigger selector rebuild on config hot reload. Side effect: Idempotency-Key header is no longer auto-generated with a random UUID when absent — only forwarded when explicitly provided by the client, to avoid polluting session hash extraction.	2026-04-16 00:18:47 +08:00
Luis Pater	8fac29631d	chore: remove Qwen support from SDK and internal components - Deleted `QwenAuthenticator`, internal `qwen_auth`, and `qwen_executor` implementations. - Removed all Qwen-related OAuth flows, token handling, and execution logic. - Cleaned up dependencies and references to Qwen across the codebase.	2026-04-15 12:16:08 +08:00
Luis Pater	5bfaf8086b	feat(auth): add configurable worker pool size for auto-refresh loop - Introduced `auth-auto-refresh-workers` config option to override default concurrency. - Updated `authAutoRefreshLoop` to support customizable worker counts. - Enhanced token refresh scheduling flexibility by aligning worker pool with runtime configurations.	2026-04-12 13:56:05 +08:00
Luis Pater	6c0a1efd71	refactor(auth): simplify auth directory scanning and improve JSON processing logic - Replaced `filepath.Walk` with `os.ReadDir` for cleaner directory traversal. - Fixed `isAuthJSON` check to use `filepath.Dir` for directory comparison. - Updated auth hash cache generation and file synthesis to improve readability and maintainability.	2026-04-12 13:32:03 +08:00
Luis Pater	a583463d60	feat(auth): implement auto-refresh loop for managing auth token schedule - Introduced `authAutoRefreshLoop` to handle token refresh scheduling. - Replaced semaphore-based refresh logic in `Manager` with the new loop. - Added unit tests to verify refresh schedule logic and edge cases.	2026-04-12 02:06:40 +08:00
Luis Pater	730809d8ea	fix(auth): preserve and restore ready view cursors during index rebuilds	2026-04-09 20:26:16 +08:00
Luis Pater	ad8e3964ff	fix(auth): add retry logic for 429 status with Retry-After and improve testing	2026-04-09 07:07:19 +08:00
Luis Pater	941334da79	fix(auth): handle OAuth model alias in retry logic and refine Qwen quota handling	2026-04-09 03:44:19 +08:00
hkfires	d390b95b76	fix(tests): update test cases	2026-04-08 08:53:50 +08:00
Luis Pater	6a27bceec0	Merge pull request #2576 from zilianpn/fix/disable-cooling-auth-errors fix(auth): honor disable-cooling and enrich no-auth errors	2026-04-07 09:56:25 +08:00
zilianpn	0ea768011b	fix(auth): honor disable-cooling and enrich no-auth errors	2026-04-07 01:12:13 +08:00
Luis Pater	ea43361492	Merge pull request #2121 from destinoantagonista-wq/main Reconcile registry model states on auth changes	2026-04-06 09:13:27 +08:00
Luis Pater	09e480036a	feat(auth): add support for managing custom headers in auth files Closes #2457	2026-04-02 19:11:09 +08:00
Luis Pater	e783d0a62e	Merge pull request #2441 from MonsterQiu/issue-2421-alias-before-suspension fix(auth): resolve oauth aliases before suspension checks	2026-04-02 10:27:39 +08:00
Luis Pater	105a21548f	fix(codex): centralize session management with global store and add tests for executor session lifecycle	2026-04-01 13:17:10 +08:00
Luis Pater	1734aa1664	fix(codex): prioritize websocket-enabled credentials across priority tiers in scheduler logic	2026-04-01 12:51:12 +08:00
MonsterQiu	f611dd6e96	refactor(auth): dedupe route-aware model support checks	2026-03-31 15:42:25 +08:00
MonsterQiu	07b7c1a1e0	fix(auth): resolve oauth aliases before suspension checks	2026-03-31 14:27:14 +08:00
Luis Pater	17363edf25	fix(auth): skip downtime for request-scoped 404 errors in model state management	2026-03-30 22:22:42 +08:00
Luis Pater	6d8de0ade4	feat(auth): implement weighted provider rotation for improved scheduling fairness	2026-03-29 13:49:01 +08:00
Luis Pater	73c831747b	Merge pull request #2133 from DragonFSKY/fix/2061-stale-modelstates fix(auth): prevent stale runtime state inheritance from disabled auth entries	2026-03-28 20:50:57 +08:00
hkfires	fee736933b	feat(openai-compat): add per-model thinking support	2026-03-24 14:21:12 +08:00
Luis Pater	0906aeca87	Merge pull request #2254 from clcc2019/main refactor: streamline usage reporting by consolidating record publishi…	2026-03-24 00:39:31 +08:00
DragonFSKY	74b862d8b8	test(cliproxy): cover delete re-add stale state flow	2026-03-24 00:21:04 +08:00
clcc2019	c1bf298216	refactor: streamline usage reporting by consolidating record publishing logic - Introduced a new method `buildRecord` in `usageReporter` to encapsulate record creation, improving code readability and maintainability. - Added latency tracking to usage records, ensuring accurate reporting of request latencies. - Updated tests to validate the inclusion of latency in usage records and ensure proper functionality of the new reporting structure.	2026-03-20 19:44:26 +08:00
Tam Nhu Tran	ea3e0b713e	fix: harden pooled model-support fallback state	2026-03-18 13:19:20 -04:00
Tam Nhu Tran	5135c22cd6	fix: fall back on model support errors during auth rotation	2026-03-18 12:43:45 -04:00
Luis Pater	b5701f416b	Fixed: #2102 fix(auth): ensure unique auth index for shared API keys across providers and credential identities	2026-03-15 02:48:54 +08:00
DragonFSKY	5c817a9b42	fix(auth): prevent stale ModelStates inheritance from disabled auth entries When an auth file is deleted and re-created with the same path/ID, the new auth could inherit stale ModelStates (cooldown/backoff) from the previously disabled entry, preventing it from being routed. Gate runtime state inheritance (ModelStates, LastRefreshedAt, NextRefreshAfter) on both existing and incoming auth being non-disabled in Manager.Update and Service.applyCoreAuthAddOrUpdate. Closes #2061	2026-03-14 23:46:23 +08:00
destinoantagonista-wq	e08f68ed7c	chore(auth): drop reconcile test file from pr	2026-03-14 14:41:26 +00:00
destinoantagonista-wq	f09ed25fd3	fix(auth): tighten registry model reconciliation	2026-03-14 14:40:06 +00:00
hkfires	58fd9bf964	fix(codex): add 'go' plan_type in registerModelsForAuth	2026-03-14 22:09:14 +08:00
destinoantagonista-wq	e166e56249	Reconcile registry model states on auth changes Add Manager.ReconcileRegistryModelStates to clear stale per-model runtime failures for models currently registered in the global model registry. The method finds models supported for an auth, resets non-clean ModelState entries, updates aggregated availability, persists changes, and pushes a snapshot to the scheduler. Introduce modelStateIsClean helper to determine when a model state needs resetting. Call ReconcileRegistryModelStates from Service paths that register/refresh models (applyCoreAuthAddOrUpdate and refreshModelRegistrationForAuth) to keep the scheduler and global registry aligned after model re-registration.	2026-03-13 19:41:49 +00:00
hkfires	f44f0702f8	feat(service): extend model registration for team and business types	2026-03-13 14:12:19 +08:00
hkfires	c3d5dbe96f	feat(model_registry): enhance model registration and refresh mechanisms	2026-03-13 10:56:39 +08:00
hkfires	dea3e74d35	feat(antigravity): refactor model handling and remove unused code	2026-03-12 09:24:45 +08:00
Luis Pater	ddaa9d2436	Fixed: #2034 feat(proxy): centralize proxy handling with `proxyutil` package and enhance test coverage - Added `proxyutil` package to simplify proxy handling across the codebase. - Refactored various components (`executor`, `cliproxy`, `auth`, etc.) to use `proxyutil` for consistent and reusable proxy logic. - Introduced support for "direct" proxy mode to explicitly bypass all proxies. - Updated tests to validate proxy behavior (e.g., `direct`, HTTP/HTTPS, and SOCKS5). - Enhanced YAML configuration documentation for proxy options.	2026-03-11 11:08:02 +08:00
hkfires	d1e3195e6f	feat(codex): register models by plan tier	2026-03-10 11:20:37 +08:00
Luis Pater	ce53d3a287	Fixed: #1997 docker-image / docker_amd64 (push) Has been cancelled Details docker-image / docker_arm64 (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details docker-image / docker_manifest (push) Has been cancelled Details test(auth-scheduler): add benchmarks and priority-based scheduling improvements - Added `BenchmarkManagerPickNextMixedPriority500` for mixed-priority performance assessment. - Updated `pickNextMixed` to prioritize highest ready priority tiers. - Introduced `highestReadyPriorityLocked` and `pickReadyAtPriorityLocked` for better scheduling logic. - Added unit test to validate selection of highest priority tiers in mixed provider scenarios.	2026-03-09 22:27:15 +08:00
Luis Pater	f5941a411c	test(auth): cover scheduler refresh regression paths docker-image / docker_amd64 (push) Has been cancelled Details docker-image / docker_arm64 (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details docker-image / docker_manifest (push) Has been cancelled Details	2026-03-09 09:27:56 +08:00
DragonFSKY	90afb9cb73	fix(auth): new OAuth accounts invisible to scheduler after dynamic registration When new OAuth auth files are added while the service is running, `applyCoreAuthAddOrUpdate` calls `coreManager.Register()` (which upserts into the scheduler) BEFORE `registerModelsForAuth()`. At upsert time, `buildScheduledAuthMeta` snapshots `supportedModelSetForAuth` from the global model registry — but models haven't been registered yet, so the set is empty. With an empty `supportedModelSet`, `supportsModel()` always returns false and the new auth is never added to any model shard. Additionally, when all existing accounts are in cooldown, the scheduler returns `modelCooldownError`, but `shouldRetrySchedulerPick` only handles `Error` types — so the `syncScheduler` safety-net rebuild never triggers and the new accounts remain invisible. Fix: 1. Add `RefreshSchedulerEntry()` to re-upsert a single auth after its models are registered, rebuilding `supportedModelSet` from the now-populated registry. 2. Call it from `applyCoreAuthAddOrUpdate` after `registerModelsForAuth`. 3. Make `shouldRetrySchedulerPick` also match `modelCooldownError` so the full scheduler rebuild triggers when all credentials are cooling down — catching any similar stale-snapshot edge cases.	2026-03-09 03:11:47 +08:00
Luis Pater	2b134fc378	test(auth-scheduler): add unit tests and scheduler implementation - Added comprehensive unit tests for `authScheduler` and related components. - Implemented `authScheduler` with support for Round Robin, Fill First, and custom selector strategies. - Improved tracking of auth states, cooldowns, and recovery logic in scheduler.	2026-03-08 05:52:55 +08:00
chujian	a52da26b5d	fix(auth): stop draining stream pool goroutines after context cancellation	2026-03-07 18:30:33 +08:00
chujian	522a68a4ea	fix(openai-compat): retry empty bootstrap streams	2026-03-07 18:08:13 +08:00
chujian	a02eda54d0	fix(openai-compat): address review feedback	2026-03-07 17:39:42 +08:00
chujian	7c1299922e	fix(openai-compat): improve pool fallback and preserve adaptive thinking	2026-03-07 16:54:28 +08:00
hkfires	48ffc4dee7	feat(config): support excluded vertex models in config	2026-03-04 18:47:42 +08:00
Luis Pater	79009bb3d4	Fixed: #797 test(auth): add test for preserving ModelStates during auth updates	2026-03-04 02:06:24 +08:00
hkfires	532107b4fa	test(auth): add global model registry usage to conductor override tests	2026-03-03 09:18:56 +08:00
Luis Pater	cc1d8f6629	Fixed: #1747 docker-image / docker_amd64 (push) Has been cancelled Details docker-image / docker_arm64 (push) Has been cancelled Details goreleaser / goreleaser (push) Has been cancelled Details docker-image / docker_manifest (push) Has been cancelled Details feat(auth): add configurable max-retry-credentials for finer control over cross-credential retries	2026-03-01 02:42:36 +08:00

1 2 3 4 5

233 Commits