Conversation
@DeJeune DeJeune commented Nov 27, 2025

Checklist

| Provider | isSupported | Screenshot |
| --- | --- | --- |
| OpenAICompatible | Yes | (screenshot) |
| Anthropic | Yes | (screenshot) |
| AnthropicCompatible | Yes | (screenshot) |
| OpenAI | Yes | (screenshot) |
| Gemini | Yes | (screenshot) |
| Gemini 3 Pro | Yes | (screenshot) |
| OpenRouter | Yes | (screenshot) |
| OpenRouter-Gemini 3 Pro | Yes | (screenshot) |
| Copilot | No | — |
| Azure OpenAI | No | Plan (screenshot) |

Summary

This PR implements a Proxy API Server that exposes Cherry Studio's configured AI providers as an Anthropic-compatible API endpoint. This allows external tools and applications to use any AI provider configured in Cherry Studio through a standardized Anthropic Messages API.


Key Features

  • Provider API host formatting utilities to handle differences between Cherry Studio and AI SDK
  • Support for all AI SDK providers including OpenAI, Anthropic, Google Gemini, Azure OpenAI, Vertex AI, Amazon Bedrock, OpenRouter, DeepSeek, xAI (Grok), and more
  • AI SDK configuration utilities for converting Cherry Studio providers to AI SDK configurations
  • Streaming and non-streaming message generation using the unified AI SDK pipeline
  • Tool calling support with proper Anthropic format conversion

What this PR does

Before this PR:

  • Cherry Studio's AI providers are only accessible through the app's internal chat interface
  • Users cannot use their configured providers with external tools (like Claude Code, Cursor, or custom scripts)
  • Each provider requires different API endpoints and authentication methods

After this PR:

  • Any AI provider configured in Cherry Studio can be accessed via a local Anthropic-compatible API
  • External tools can connect to http://localhost:<port>/v1/messages and use any provider
  • Model format: <provider-id>:<model-id> (e.g., my-openai:gpt-4o, gemini:gemini-2.0-flash)
  • Supports streaming responses, tool calling, and multi-turn conversations
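For illustration, the model string might be split on the first colon only, since model IDs can themselves contain colons (this is a hypothetical helper sketching the scheme, not the PR's actual parser):

```typescript
// Hypothetical helper illustrating the `<provider-id>:<model-id>` scheme.
// Splits on the first colon only, because model IDs may contain colons too
// (e.g. an Ollama-style "llama3:8b").
function parseModelId(input: string): { providerId: string; modelId: string } {
  const sep = input.indexOf(':')
  if (sep === -1) {
    throw new Error(`Expected "<provider-id>:<model-id>", got "${input}"`)
  }
  return {
    providerId: input.slice(0, sep),
    modelId: input.slice(sep + 1)
  }
}

console.log(parseModelId('my-openai:gpt-4o')) // { providerId: 'my-openai', modelId: 'gpt-4o' }
```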

Why we need it and why it was done in this way

User Scenario:
Developers and power users who want to use their existing Cherry Studio provider configurations with external AI tools (Claude Code, Cursor, Continue, custom scripts, etc.) without re-configuring API keys in multiple places.

Feature Value:

  1. Single source of truth - Configure providers once in Cherry Studio, use everywhere
  2. Cost management - Use cheaper providers through a standard interface
  3. Privacy - Keep API keys in one secure location (Cherry Studio)
  4. Flexibility - Switch providers without changing external tool configurations

Implementation Approach:

  • Uses Vercel AI SDK for unified provider handling
  • Converts between Anthropic Messages API format and AI SDK format
  • Leverages existing Cherry Studio provider configurations from Redux store
  • Shares provider resolution logic with the renderer process for consistency

Tradeoffs:

  • Added complexity in message format conversion
  • Some provider-specific features may not be fully supported through the unified API

Alternatives Considered:

  • OpenAI-compatible API: the Anthropic format was chosen instead because it has better tool-calling support and is more expressive
  • Direct provider passthrough: Would require implementing each provider's API separately

Breaking changes

None. This is a new feature that doesn't affect existing functionality.

Special notes for your reviewer

  1. The Proxy API Server is opt-in and must be enabled in settings
  2. All enabled providers in Cherry Studio are available through the API
  3. Token counting is estimated (approximately 4 chars per token for English)

Checklist

Release note

feat: Add Proxy API Server - expose Cherry Studio providers as Anthropic-compatible API for use with external tools

- Added provider API host formatting utilities to handle differences between Cherry Studio and AI SDK.
- Introduced functions for formatting provider API hosts, including support for Azure OpenAI and Vertex AI.
- Created a simple API key rotator for managing API key rotation.
- Developed shared provider initialization and mapping utilities for resolving provider IDs.
- Implemented AI SDK configuration utilities for converting Cherry Studio providers to AI SDK configurations.
- Added support for various providers including OpenRouter, Google Vertex AI, and Amazon Bedrock.
- Enhanced error handling and logging in the unified messages service for better debugging.
- Introduced functions for streaming and generating unified messages using AI SDK.
@DeJeune DeJeune marked this pull request as draft November 27, 2025 07:31
@DeJeune DeJeune marked this pull request as ready for review November 27, 2025 13:13
@DeJeune DeJeune requested a review from 0xfullex as a code owner November 27, 2025 13:13
@DeJeune DeJeune added this to the v1.7.1 milestone Nov 27, 2025

@0xfullex 0xfullex left a comment


Note

This review was translated by Claude.

PR Review Summary

Thank you for submitting this PR. After careful review, I found some issues that need to be fixed, including several critical bugs that would break core functionality.


❓ Need Additional Explanation

Please provide a detailed description of the problem this PR solves:

  1. User Scenario: Who is the target audience for this Proxy API Server? What problems are they encountering?
  2. Feature Value: Why do we need to expose any AI provider as an Anthropic-compatible API? What are the specific use cases?
  3. Relationship with Existing Features: How does this feature differ from Cherry Studio's existing API Server? Is it a supplement or a replacement?
  4. Breaking Changes: Will this PR affect existing users' experience?

The following parts of the PR template need to be filled:

  • "Before this PR" / "After this PR"
  • "Why we need it and why it was done in this way"
  • "Fixes #" (if there are related issues)

🔴 Critical Issues (Must Fix)

1. Tool result content is cleared

Location: `src/main/apiServer/services/unified-messages.ts:187-190`

```typescript
// After the values array is correctly populated...
return {
  type: 'content',
  value: [] // ❌ Should be value: values
}
```

The `values` array is built but never used, causing all tool_result content to be lost. This would completely break the tool calling chain.
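The intended fix, as a standalone sketch (the `ContentResult` type and the helper name are illustrative stand-ins, not the PR's actual code):

```typescript
// Simplified stand-in for the tool_result conversion return type.
type ContentResult = { type: 'content'; value: unknown[] }

// Sketch of the fix: return the populated `values` array instead of `[]`.
function toContentResult(values: unknown[]): ContentResult {
  return {
    type: 'content',
    value: values // ✅ preserve the tool_result content
  }
}
```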

2. Pure tool_result messages generate empty user messages

Location: `src/main/apiServer/services/unified-messages.ts:308-327`

When messages only contain `tool_result`, the code first pushes a `role: 'tool'` message, then unconditionally continues to push a `role: 'user'` message, where both `textParts` and `imageParts` are empty arrays:

```typescript
messages.push({
  role: 'user',
  content: [...textParts, ...imageParts] // Empty array []
})
```

AI SDK/models usually reject empty messages, causing the tool chain to break.

3. Provider filtering contradicts feature declaration

Location: `src/main/apiServer/utils/index.ts:32-34`

```typescript
const supportedProviders = providers.filter(
  (p: Provider) => p.enabled && (p.type === 'openai' || p.type === 'anthropic')
)
```

The PR description claims to support "any AI provider supported by AI SDK", but the code only allows `openai` and `anthropic` types. Providers like gemini/vertexai/bedrock cannot be used, which contradicts the claimed functionality.
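A sketch of the suggested correction, with `Provider` reduced to a stand-in type: filter on `enabled` only, so gemini/vertexai/bedrock providers pass through.

```typescript
// Simplified stand-in for the real Provider type.
type Provider = { id: string; type: string; enabled: boolean }

// Sketch of the fix: drop the type restriction and keep only the
// `enabled` check, matching the claimed "any AI SDK provider" support.
function getSupportedProviders(providers: Provider[]): Provider[] {
  return providers.filter((p) => p.enabled)
}
```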


⚠️ Important Issues

4. Missing test coverage

About 1500 lines of new code in `packages/shared/` have no unit tests:

  • `AiSdkToAnthropicSSE.ts` (649 lines)
  • `sdk-config.ts` (240 lines)
  • `api/index.ts` (177 lines)

It's recommended to add at least tests for core conversion logic.

5. Duplicate code

The `/count_tokens` endpoint has two nearly identical implementations in router and providerRouter (`messages.ts:568-715`). Consider extracting shared logic.

6. Token estimation is too simple

```typescript
// ~4 characters per token for English text
const estimatedTokens = Math.ceil(totalChars / 4) + messages.length * 3
```
Estimation is inaccurate for non-English text (like Chinese), and doesn't handle tool token counting.
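For illustration only, a slightly more CJK-aware estimator could weight CJK characters at roughly one token each; this is still a heuristic, not a real tokenizer, and the constants and character ranges are assumptions:

```typescript
// Heuristic sketch (assumed constants, not a real tokenizer):
// CJK characters count ~1 token each, other text ~4 chars per token.
function estimateTokens(text: string): number {
  const cjk = (text.match(/[\u4e00-\u9fff\u3040-\u30ff\uac00-\ud7af]/g) ?? []).length
  const other = text.length - cjk
  return cjk + Math.ceil(other / 4)
}
```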


📝 Minor Suggestions

  1. Type Safety: `AiSdkToAnthropicSSE.ts:180` uses `as any`, consider improving type definitions
  2. Hardcoded Values: There are hardcoded `APP-Code` and API hosts in `aihubmix.ts`

Summary

Please:

  1. Complete the PR description, explaining the feature's purpose and use cases
  2. Fix Critical Issues, especially tool calling related bugs
  3. Add tests, at least covering core conversion logic
  4. Run `yarn build:check` to ensure everything passes

🤖 Review by Claude Code

- Fix tool result content bug: return `values` array instead of empty array
- Fix empty message bug: skip pushing user/assistant messages when content is empty
- Expand provider support: remove type restrictions to support all AI SDK providers
- Add missing alias for @cherrystudio/ai-sdk-provider in main process config

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <[email protected]>
@DeJeune DeJeune requested a review from 0xfullex November 27, 2025 14:31
DeJeune and others added 3 commits November 27, 2025 22:41
Extract duplicated token estimation code from both count_tokens endpoints
into a shared `estimateTokenCount` function to improve maintainability.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <[email protected]>
@beyondkmp

This comment was marked as resolved.

… cache service

- Deleted the old ReasoningCache class and its instance.
- Introduced CacheService for managing reasoning caches.
- Updated unified-messages service to utilize new googleReasoningCache and openRouterReasoningCache.
- Added AiSdkToAnthropicSSE adapter to handle streaming events and integrate with new cache service.
- Reorganized shared adapters to include the new AiSdkToAnthropicSSE adapter.
- Created openrouter adapter with detailed reasoning schemas for better type safety and validation.
Copilot AI review requested due to automatic review settings December 4, 2025 13:39

Copilot AI left a comment


Pull request overview

This PR implements a comprehensive Proxy API Server that exposes Cherry Studio's configured AI providers through an Anthropic-compatible API endpoint. This enables external tools like Claude Code and Cursor to use any AI provider configured in Cherry Studio via a standardized interface.

Key Changes:

  • Extracted shared provider utilities to packages/shared for code reuse between renderer and main processes
  • Implemented AiSdkToAnthropicSSE adapter to convert AI SDK responses to Anthropic SSE format
  • Created unified message processing service supporting both streaming and non-streaming modes
  • Added support for PPIO provider's Anthropic-compatible models
  • Enhanced Claude Code integration to work with any provider via the unified adapter

Reviewed changes

Copilot reviewed 64 out of 65 changed files in this pull request and generated 1 comment.

Show a summary per file
File Description
packages/shared/provider/* Shared provider utilities for detection, mapping, formatting, and SDK configuration
packages/shared/middleware/* Shared AI SDK middlewares (Gemini thought signature, OpenRouter reasoning)
packages/shared/api/* Shared API URL formatting and validation utilities
packages/shared/anthropic/* Enhanced Anthropic SDK utilities with sanitization
src/main/apiServer/adapters/* AI SDK to Anthropic SSE conversion adapter
src/main/apiServer/services/unified-messages.ts Core unified message processing service
src/main/apiServer/routes/messages.ts Enhanced message routing with unified processing
src/main/services/agents/services/claudecode/* Claude Code integration improvements
src/renderer/src/aiCore/provider/* Refactored to use shared utilities

No critical issues found. The implementation is well-structured with proper error handling and type safety. The code follows established patterns and includes comprehensive documentation.



Comment on lines +142 to +156
```typescript
new TransformStream<LanguageModelV2StreamPart, LanguageModelV2StreamPart>({
  transform(
    chunk: LanguageModelV2StreamPart,
    controller: TransformStreamDefaultController<LanguageModelV2StreamPart>
  ) {
    if (chunk.type === 'reasoning-delta' && chunk.delta.includes(REDACTED_BLOCK)) {
      controller.enqueue({
        ...chunk,
        delta: chunk.delta.replace(REDACTED_BLOCK, '')
      })
    } else {
      controller.enqueue(chunk)
    }
  }
})
```

Copilot AI Dec 4, 2025


Superfluous argument passed to function TransformStream.


EurFelux commented Dec 7, 2025

Note

This comment was translated by Claude.

If it doesn't make it for 1.7.2, we'll let #11738 into main first


Original Content

如果赶不上1.7.2,我们就先让 #11738 进main



Development

Successfully merging this pull request may close these issues.

[Bug]: Cannot add other models for agent

6 participants