Luis Pater
564bcbaa54
Merge pull request #487 from router-for-me/amp
...
fix(amp): set status on claude stream errors
2025-12-11 21:18:19 +08:00
hkfires
88bdd25f06
fix(amp): set status on claude stream errors
2025-12-11 20:12:06 +08:00
hkfires
e79f65fd8e
refactor(thinking): use parentheses for metadata suffix
2025-12-11 18:39:07 +08:00
Luis Pater
2760989401
Merge pull request #485 from router-for-me/think
...
Think
2025-12-11 18:27:00 +08:00
hkfires
facfe7c518
refactor(thinking): use bracket tags for thinking meta
...
Align thinking suffix handling on a single bracket-style marker.
NormalizeThinkingModel strips a terminal `[value]` segment from
model identifiers and turns it into either a thinking budget (for
numeric values) or a reasoning effort hint (for strings). Emission
of `ThinkingIncludeThoughtsMetadataKey` is removed.
Executor helpers and the example config are updated so their
comments reference the new `[value]` suffix format instead of the
legacy dash variants.
BREAKING CHANGE: dash-based thinking suffixes (`-thinking`,
`-thinking-N`, `-reasoning`, `-nothinking`) are no longer parsed
for thinking metadata; only `[value]` annotations are recognized.
2025-12-11 18:17:28 +08:00
hkfires
6285459c08
fix(runtime): unify claude thinking config resolution
2025-12-11 17:20:44 +08:00
hkfires
21bbceca0c
docs(runtime): document reasoning effort precedence
2025-12-11 16:35:36 +08:00
hkfires
f6300c72b7
fix(runtime): validate thinking config in iflow and qwen
2025-12-11 16:21:50 +08:00
hkfires
007572b58e
fix(util): do not strip thinking suffix on registered models
...
NormalizeThinkingModel now checks ModelSupportsThinking before removing
"-thinking" or "-thinking-<ver>", avoiding accidental parsing of model
names where the suffix is part of the official id (e.g., kimi-k2-thinking,
qwen3-235b-a22b-thinking-2507).
The registry adds ThinkingSupport metadata for several models and
propagates it via ModelInfo (e.g., kimi-k2-thinking, deepseek-r1,
qwen3-235b-a22b-thinking-2507, minimax-m2), enabling accurate detection
of thinking-capable models and correcting base model inference.
2025-12-11 15:52:14 +08:00
hkfires
3a81ab22fd
fix(runtime): unify reasoning effort metadata overrides
2025-12-11 14:35:05 +08:00
hkfires
519da2e042
fix(runtime): validate reasoning effort levels
2025-12-11 12:36:54 +08:00
hkfires
169f4295d0
fix(util): align reasoning effort handling with registry
2025-12-11 12:20:12 +08:00
hkfires
d06d0eab2f
fix(util): centralize reasoning effort normalization
2025-12-11 12:14:51 +08:00
hkfires
3ffd120ae9
feat(runtime): add thinking config normalization
2025-12-11 11:51:33 +08:00
hkfires
a03d514095
feat(registry): add thinking metadata for models
2025-12-11 11:28:44 +08:00
Luis Pater
1da03bfe15
Merge pull request #479 from router-for-me/claude
...
fix(claude): prevent final events when no content streamed
2025-12-11 08:18:59 +08:00
Luis Pater
423ce97665
feat(util): implement dynamic thinking suffix normalization and refactor budget resolution logic
...
docker-image / docker (push) Has been cancelled
goreleaser / goreleaser (push) Has been cancelled
- Added support for parsing and normalizing dynamic thinking model suffixes.
- Centralized budget resolution across executors and payload helpers.
- Retired legacy Gemini-specific thinking handlers in favor of unified logic.
- Updated executors to use metadata-based thinking configuration.
- Added `ResolveOriginalModel` utility for resolving normalized upstream models using request metadata.
- Updated executors (Gemini, Codex, iFlow, OpenAI, Qwen) to incorporate upstream model resolution and substitute model values in payloads and request URLs.
- Ensured fallbacks handle cases with missing or malformed metadata to derive models robustly.
- Refactored upstream model resolution to dynamically incorporate metadata for selecting and normalizing models.
- Improved handling of thinking configurations and model overrides in executors.
- Removed hardcoded thinking model entries and migrated logic to metadata-based resolution.
- Updated payload mutations to always include the resolved model.
v6.6.0
2025-12-11 03:10:50 +08:00
Luis Pater
e717939edb
Fixed : #478
...
docker-image / docker (push) Has been cancelled
goreleaser / goreleaser (push) Has been cancelled
feat(antigravity): add support for inline image data in client responses
v6.5.65
2025-12-10 23:55:53 +08:00
hkfires
a89514951f
fix(claude): prevent final events when no content streamed
2025-12-10 22:19:55 +08:00
Luis Pater
94d61c7b2b
fix(logging): update response aggregation logic to include all attempts
docker-image / docker (push) Has been cancelled
goreleaser / goreleaser (push) Has been cancelled
v6.5.64
2025-12-10 16:53:48 +08:00
Luis Pater
1249b07eb8
feat(responses): add unique identifiers for responses, function calls, and tool uses
docker-image / docker (push) Has been cancelled
goreleaser / goreleaser (push) Has been cancelled
v6.5.63
2025-12-10 16:02:54 +08:00
Luis Pater
6b37f33d31
feat(antigravity): add unique identifier for tool use blocks in response
docker-image / docker (push) Has been cancelled
goreleaser / goreleaser (push) Has been cancelled
v6.5.62
2025-12-10 15:27:57 +08:00
Luis Pater
f25f419e5a
fix(antigravity): remove references to autopush endpoint and update fallback logic
docker-image / docker (push) Has been cancelled
goreleaser / goreleaser (push) Has been cancelled
v6.5.61
2025-12-10 00:13:20 +08:00
Luis Pater
b7e382008f
Merge pull request #465 from router-for-me/think
...
docker-image / docker (push) Has been cancelled
goreleaser / goreleaser (push) Has been cancelled
Move thinking budget normalization from translators to executor
v6.5.60
2025-12-09 21:10:33 +08:00
hkfires
70d6b95097
feat(amp): add /news.rss proxy route
2025-12-09 21:05:06 +08:00
hkfires
9b202b6c1c
fix(executor): centralize default thinking config
2025-12-09 21:05:06 +08:00
hkfires
6a66b6801a
feat(executor): enforce minimum thinking budget for antigravity models
2025-12-09 21:05:06 +08:00
hkfires
5b6d201408
refactor(translator): remove thinking budget normalization across all translators
2025-12-09 21:05:06 +08:00
hkfires
5ec9b5e5a9
feat(executor): normalize thinking budget across all Gemini executors
2025-12-09 21:05:06 +08:00
Luis Pater
5db3b58717
Merge pull request #470 from router-for-me/agry
...
fix(gemini): normalize model listing output
2025-12-09 21:00:29 +08:00
hkfires
347769b3e3
fix(openai-compat): use model id for auth model display
2025-12-09 18:09:14 +08:00
hkfires
3cfe7008a2
fix(registry): update gpt 5.1 model names
2025-12-09 17:55:21 +08:00
hkfires
da23ddb061
fix(gemini): normalize model listing output
2025-12-09 17:34:15 +08:00
Luis Pater
39b6b3b289
Fixed : #463
...
docker-image / docker (push) Has been cancelled
goreleaser / goreleaser (push) Has been cancelled
fix(antigravity): remove `$ref` and `$defs` from JSON during key deletion
v6.5.59
2025-12-09 17:32:17 +08:00
Luis Pater
c600519fa4
refactor(logging): replace log.Fatalf with log.Errorf and add error handling paths
2025-12-09 17:16:30 +08:00
hkfires
e5312fb5a2
feat(antigravity): support canonical names for antigravity models
2025-12-09 16:54:13 +08:00
Luis Pater
92df0cada9
Merge pull request #461 from router-for-me/aistudio
...
feat(aistudio): normalize thinking budget in request translation
2025-12-09 08:41:46 +08:00
hkfires
96b55acff8
feat(aistudio): normalize thinking budget in request translation
2025-12-09 08:27:44 +08:00
Luis Pater
bb45fee1cf
Merge remote-tracking branch 'origin/dev' into dev
docker-image / docker (push) Has been cancelled
goreleaser / goreleaser (push) Has been cancelled
v6.5.57
2025-12-08 23:28:22 +08:00
Luis Pater
af00304b0c
fix(antigravity): remove exclusiveMaximum from JSON during key deletion
2025-12-08 23:28:01 +08:00
vuonglv(Andy)
5c3a013cd1
feat(config): add configurable host binding for server ( #454 )
...
* feat(config): add configurable host binding for server
2025-12-08 23:16:39 +08:00
Luis Pater
6ad188921c
refactor(logging): remove unused variable in ensureAttempt and redundant function call
docker-image / docker (push) Has been cancelled
goreleaser / goreleaser (push) Has been cancelled
v6.5.56
2025-12-08 22:25:58 +08:00
Luis Pater
15ed98d6a9
Merge pull request #458 from router-for-me/agry
...
feat(antigravity): enforce thinking budget limits for Claude models
2025-12-08 20:55:52 +08:00
hkfires
a283545b6b
feat(antigravity): enforce thinking budget limits for Claude models
2025-12-08 20:36:17 +08:00
Luis Pater
3efbd865a8
Merge pull request #457 from router-for-me/requestlog
...
style(logging): remove redundant separator line from response section
2025-12-08 18:21:24 +08:00
hkfires
aee659fb66
style(logging): remove redundant separator line from response section
2025-12-08 18:18:33 +08:00
Luis Pater
5aa386d8b9
Merge pull request #453 from router-for-me/amp
...
add ampcode management api
2025-12-08 17:42:13 +08:00
Luis Pater
0adc0ee6aa
Merge pull request #455 from router-for-me/requestlog
...
feat(logging): add upstream API request/response capture to streaming logs
2025-12-08 17:40:10 +08:00
hkfires
92f13fc316
feat(logging): add upstream API request/response capture to streaming logs
2025-12-08 17:21:58 +08:00
hkfires
05cfa16e5f
refactor(api): simplify request body parsing in ampcode handlers
2025-12-08 14:45:35 +08:00