Research Notes on Mainstream AI Subscription Plans

Last updated: 2026-03-14

As AI technology advances rapidly, AI subscription services are proliferating. This document surveys and compares the current mainstream AI subscription options to help individuals and teams find the right fit.

Provider categories:

Native AI providers: Companies that build their own models (OpenAI, Google, Anthropic, etc.)

Third-party AI providers: Platforms that aggregate multiple model sources (OpenRouter, Together AI, Replicate, etc.)

Provider Category Overview
Native AI Providers
Third-Party AI Providers
Comparison Summary

Provider Category Overview

Native AI Providers

Definition: Companies that develop their own AI models and offer direct API access.

Characteristics:

✅ Strongest model capabilities (cutting-edge technology)
✅ Mature ecosystem, rich documentation
✅ Official support, high stability
❌ Single vendor, risk of vendor lock-in
❌ Relatively higher prices (though OpenAI allows region-switching tricks for lower rates)
❌ Complex integration when using multiple vendors

Best for:

Projects requiring peak model performance
Enterprise applications with high stability requirements
Global products needing multilingual support
Teams that don’t want to rely on third-party proxies

Third-Party AI Providers (Aggregators)

Definition: Platforms that aggregate multiple AI model sources behind a single unified API.

Characteristics:

✅ Unified interface, lower integration complexity
✅ Rich model selection, flexible switching
✅ Smart routing and automatic failover
✅ Cost optimization, transparent pricing
❌ Extra middleware layer, may introduce additional latency
❌ Dependent on third-party platform stability
❌ Feature set may not be as complete as native providers

Best for:

Projects that need to connect to multiple models simultaneously
Teams wanting to reduce vendor lock-in risk
Cost-sensitive scenarios requiring flexible model switching
Rapid prototyping and testing

Native AI Providers

OpenAI

Official Websites

Overview

OpenAI is the pioneer and leader in large language models, offering the GPT series (GPT-4, GPT-3.5, etc.) and image generation models (DALL-E). It is currently the most mature AI API provider in the industry.

Core Models

Language Models

GPT-4 Turbo: Latest GPT-4, faster and cheaper, supports 128K context
GPT-4: Top-tier language model, supports 8K/32K/128K context
GPT-3.5 Turbo: Great value, fast responses, supports 16K context
GPT-4o: Multimodal model supporting text, images, and audio

Image Models

DALL-E 3: High-quality image generation
DALL-E 2: Previous-generation image generation

Other Models

Whisper: Speech recognition (multilingual)
Embeddings: Text embedding vectors
Text-to-Speech: Voice synthesis
Moderation: Content moderation

Subscription Plans

Free Tier

Price: $0/month
Credits: $5 free credit (new users)
Limits: Lower rate limits

API Pay-as-you-go

GPT-4 Turbo: $0.01 / 1K input tokens, $0.03 / 1K output tokens
GPT-4: $0.03 / 1K input tokens, $0.06 / 1K output tokens
GPT-3.5 Turbo: $0.0015 / 1K input tokens, $0.002 / 1K output tokens
DALL-E 3: $0.04 / image
Whisper: $0.006 / minute

ChatGPT Plus (Personal)

Price: $20/month
Includes:
- GPT-4 access
- DALL-E 3 image generation
- Advanced data analysis
- Browsing capability
- Priority access to new features

Team

Price: $25/user/month
Includes:
- Everything in ChatGPT Plus
- Admin console
- Team collaboration workspace
- Data isolation
- Higher rate limits

Enterprise

Price: Contact sales
Includes:
- Unlimited speed
- Priority support
- API access
- Data encryption
- Custom model fine-tuning
- Compliance certifications (SOC2, HIPAA)

Core Strengths

1. Model Capabilities

Industry-leading language models
Excellent multilingual support
Powerful code generation
Outstanding reasoning and comprehension

2. Ecosystem

API: REST API, Python/JS SDK
LangChain: Native support
Vercel AI SDK: Native support
VS Code plugins: Copilot and more
Rich documentation: Detailed API docs and examples

3. Advanced Features

Function Calling: Call external functions
Streaming: Stream responses
JSON Mode: Guaranteed JSON output
Vision: Image understanding
Fine-tuning: Custom model tuning
Assistants API: Build AI assistants

4. Enterprise Features

Azure OpenAI: Enterprise-grade deployment
Data privacy: Data not used for training (Enterprise tier)
Compliance: SOC2, HIPAA, GDPR
SLA: Enterprise service level agreements
Technical support: Dedicated support team

Best For

Applications that demand peak model performance
Enterprise apps with high stability requirements
Global products needing multilingual support
Teams wanting a complete ecosystem and toolchain
Cost-insensitive scenarios

Pros & Cons

Pros:

✅ Strongest models, industry benchmark
✅ Mature ecosystem, rich tooling
✅ Best documentation and community support
✅ Comprehensive enterprise features
✅ Continuous updates and improvements

Cons:

❌ Relatively higher prices
❌ Single vendor, lock-in risk
❌ Some features require Enterprise tier
❌ Data compliance concerns for users outside the US

Docs & Resources

Official docs: https://platform.openai.com/docs
API Reference: https://platform.openai.com/docs/api-reference
GitHub: https://github.com/openai
Community: OpenAI Developer Forum

Google Gemini

Official Websites

Overview

Google Gemini (formerly Bard) is Google’s multimodal large language model offering strong text, image, and audio understanding, with deep integration across the Google ecosystem.

Core Models

Gemini Series

Gemini Ultra: Most powerful model, multimodal
Gemini Pro: Mainstream model, balanced performance and cost
Gemini Pro Vision: Vision model
Gemini Flash: High-speed response model

Other Models

PaLM 2: Previous-generation language model
Imagen: Image generation
Codey: Code model

Subscription Plans

Free Tier

Price: $0/month
Includes:
- Gemini Pro access
- Daily usage limit
- Web interface access

AI Studio (API Pay-as-you-go)

Gemini Pro: $0.0005 / 1K input tokens, $0.0015 / 1K output tokens
Gemini Pro Vision: $0.0025 / 1K input tokens, $0.0075 / 1K output tokens
Imagen: $0.002 / image

Google One AI Premium (Personal)

Price: $19.99/month
Includes:
- Gemini Ultra access
- 2TB Google Cloud storage
- Google Workspace premium features

Enterprise

Price: Contact sales
Includes:
- Vertex AI platform access
- Custom model fine-tuning
- Data privacy protection
- Compliance certifications
- Technical support

Core Strengths

1. Multimodal Capabilities

Native multimodal (text, images, audio, video)
Cross-modal understanding and generation
Real-time video analysis

2. Google Ecosystem Integration

Google Workspace: Docs, Gmail, Sheets integration
Google Search: Real-time search capability
Google Maps: Geospatial information
YouTube: Video content understanding
Android: Mobile integration

3. Developer Experience

Vertex AI: Enterprise-grade AI platform
AI Studio: Free development environment
Google Cloud: Cloud-native deployment
Kaggle: Data science community

4. Performance Advantages

MLOps: Model deployment and monitoring
A/B Testing: Model comparison
AutoML: Automated machine learning
TPU optimization: Hardware acceleration

Best For

Projects needing Google ecosystem integration
Multimodal application development
Enterprise AI platforms
Teams that need MLOps capabilities
Existing Google Cloud users

Pros & Cons

Pros:

✅ Strong multimodal capabilities
✅ Deep Google ecosystem integration
✅ Relatively lower prices
✅ Mature enterprise AI platform
✅ Rich developer tools

Cons:

❌ Language model capabilities slightly behind GPT-4
❌ Documentation and community not as strong as OpenAI
❌ Some features still in Beta
❌ Access restricted in some regions

Docs & Resources

Official docs: https://ai.google.dev/docs
Vertex AI: https://cloud.google.com/vertex-ai
AI Studio: https://aistudio.google.com/

Anthropic Claude

Official Websites

Overview

Anthropic was founded by former OpenAI employees and focuses on AI safety and alignment. The Claude series is known for its safety, long context window, and natural conversational quality.

Core Models

Claude 3 Series

Claude 3 Opus: Most powerful, highest intelligence
Claude 3 Sonnet: Balanced model, good performance-to-cost ratio
Claude 3 Haiku: Fast model, low cost

Claude 2 Series

Claude 2.1: Long context (200K tokens)
Claude 2: Previous-generation model

Subscription Plans

Free Tier

Price: $0/month
Credits: Limited usage quota

API Pay-as-you-go

Claude 3 Opus: $15 / 1M input tokens, $75 / 1M output tokens
Claude 3 Sonnet: $3 / 1M input tokens, $15 / 1M output tokens
Claude 3 Haiku: $0.25 / 1M input tokens, $1.25 / 1M output tokens
Claude 2.1: $8 / 1M input tokens, $24 / 1M output tokens

Claude Pro (Personal)

Price: $20/month
Includes:
- Claude 3 Opus access
- Higher usage limits
- Priority access to new features

Team

Price: $30/user/month
Includes:
- Everything in Claude Pro
- Team management features
- Higher usage limits

Enterprise

Price: Contact sales
Includes:
- Custom model fine-tuning
- Data privacy protection
- Compliance certifications
- Dedicated support

Core Strengths

1. Safety and Alignment

Leading AI safety research
Constitutional AI methodology
Refusal of harmful content
Strong interpretability

2. Long Context

Claude 2.1 supports 200K tokens
Long document comprehension and summarization
Large codebase analysis

3. Natural Conversation

Highly fluent dialogue
Natural tone and voice
Ideal for chatbots
Creative writing

4. Coding Capabilities

Excellent performance on programming tasks
Code generation and debugging
Technical documentation understanding
Code refactoring suggestions

5. Developer Experience

API: REST API, Python/JS SDK
LangChain: Native support
Function Calling: Call external functions
Streaming: Stream responses
Tool Use: External tool integration

Best For

Scenarios with high safety requirements
Applications requiring long context
Coding assistant tools
Chatbots
Content moderation and compliance

Pros & Cons

Pros:

✅ Best safety and alignment properties
✅ Long context support (200K)
✅ Natural and fluent conversation
✅ Strong coding capabilities
✅ Reasonably priced

Cons:

❌ Model capabilities slightly behind GPT-4
❌ Ecosystem not as mature as OpenAI
❌ Smaller documentation and community
❌ Fewer tools and plugins

Docs & Resources

Official docs: https://docs.anthropic.com/
API Reference: https://docs.anthropic.com/claude/reference
GitHub: https://github.com/anthropics
Research papers: https://www.anthropic.com/research

Zhipu AI

Official Websites

Overview

Zhipu AI is a leading Chinese large model company that developed the GLM series, offering Chinese-optimized language models and multimodal capabilities.

Core Models

Language Models

GLM-4: New-generation LLM, capabilities benchmarked against GPT-4
GLM-3-Turbo: Fast response model
GLM-3-6B: Lightweight model

Multimodal Models

CogView: Image generation
CogVideo: Video generation
CogView3: Third-generation image model

Specialized Models

CodeGeeX: Code model
CharacterGLM: Role-playing
MedicalGLM: Healthcare

Subscription Plans

Free Tier

Price: ¥0/month
Includes:
- Basic model access
- Daily usage limit
- Online chat

API Pay-as-you-go

GLM-4: ¥0.1 / 1K input tokens, ¥0.1 / 1K output tokens
GLM-3-Turbo: ¥0.005 / 1K input tokens, ¥0.005 / 1K output tokens
CogView: ¥0.05 / image

Personal Plan

Price: ¥49/month
Includes:
- GLM-4 premium access
- Higher usage limits
- Priority responses

Enterprise

Price: Contact sales
Includes:
- Private deployment
- Model fine-tuning
- Data privacy protection
- Dedicated technical support
- Compliance certifications

Core Strengths

1. Chinese Language Optimization

Strong Chinese comprehension and generation
Deep understanding of Chinese culture
Excellent Chinese instruction-following

2. Domestic Compliance Support

Compliant with Chinese data regulations
Data stays onshore
Compatible with domestic hardware

3. Multimodal Capabilities

Text, images, and video
Cross-modal understanding
Wide application scenarios

4. Cost Advantage

Relatively lower prices
Optimized for the domestic market
Suitable for large-scale applications

5. Developer Experience

API: REST API, Python/Java SDK
LangChain: Native support
Web interface: Online debugging
Detailed docs: Chinese-language documentation

Best For

Domestic application development
Chinese-primary applications
Scenarios with strict data compliance requirements
Cost-sensitive projects
Teams requiring domestic-only deployment

Pros & Cons

Pros:

✅ Strong Chinese language capabilities
✅ Compliant with Chinese regulations
✅ Relatively lower cost
✅ Domestic deployment support
✅ Solid multimodal capabilities

Cons:

❌ Model capabilities slightly behind GPT-4
❌ Weaker English capabilities
❌ Smaller ecosystem and community
❌ Fewer tools and plugins

Docs & Resources

Official docs: https://open.bigmodel.cn/dev/api
GitHub: https://github.com/THUDM
Open-source projects: GLM-4, CodeGeeX, etc.

Baidu ERNIE Bot

Official Websites

Overview

Baidu’s ERNIE Bot (Wenxin Yiyan) is a large language model based on Baidu’s ERNIE series, with deep integration across the Baidu ecosystem.

Core Models

ERNIE Series

ERNIE 4.0: Latest version, multimodal
ERNIE 3.5: Mainstream version
ERNIE 4.0 Turbo: Fast version

Specialized Models

ERNIE Bot: Dialogue model
ERNIE Speed: High-speed response
ERNIE Lite: Lightweight

Subscription Plans

Free Tier

Price: ¥0/month
Includes:
- Basic dialogue
- Daily usage limit

API Pay-as-you-go

ERNIE 4.0: ¥0.12 / 1K tokens
ERNIE 3.5: ¥0.008 / 1K tokens
ERNIE Speed: ¥0.004 / 1K tokens

VIP Membership

Price: ¥49/month
Includes:
- ERNIE 4.0 access
- Higher usage limits
- Exclusive features

Enterprise

Price: Contact sales
Includes:
- Baidu Cloud integration
- Private deployment
- Model fine-tuning
- Dedicated support

Core Strengths

1. Baidu Ecosystem Integration

Baidu Search: Real-time search
Baidu Maps: Geospatial information
Baidu Baike: Knowledge base
Baidu Netdisk: Cloud storage

2. Chinese Language Optimization

Strong Chinese capabilities
Chinese cultural understanding
Rich Chinese knowledge base

3. Enterprise Features

Baidu Cloud: Cloud-native deployment
Intelligent Cloud: AI development platform
Compliance: Meets domestic requirements
Technical support: 24/7 support

4. Developer Tools

Qianfan Platform: AI development platform
ModelArts: Model training
AppBuilder: Application building

Best For

Baidu ecosystem integration
Domestic enterprise applications
Teams needing Baidu Cloud services
High compliance requirements
SMBs looking for rapid deployment

Pros & Cons

Pros:

✅ Deep Baidu ecosystem integration
✅ Strong Chinese capabilities
✅ Comprehensive enterprise features
✅ Good Baidu Cloud support
✅ Relatively lower prices

Cons:

❌ Average model capabilities
❌ Weaker English capabilities
❌ Relatively closed ecosystem
❌ Fewer developer tools

Docs & Resources

Official docs: https://cloud.baidu.com/doc/WENXINWORKSHOP/
Qianfan Platform: https://console.bce.baidu.com/qianfan/

Alibaba Cloud Qwen

Official Websites

Overview

Alibaba Cloud’s Qwen (Tongyi Qianwen) is a series of large language models ranging from lightweight to ultra-large scale, offering diverse model choices.

Core Models

Qwen Series

Qwen-Max: Most powerful model
Qwen-Plus: Mainstream model
Qwen-Turbo: Fast response
Qwen-Long: Long context

Open-Source Models

Qwen-72B: Open-source large-scale
Qwen-14B: Open-source mid-scale
Qwen-7B: Open-source small-scale

Specialized Models

Qwen-VL: Vision-language model
Qwen-Audio: Audio model
CodeQwen: Code model

Subscription Plans

Free Tier

Price: ¥0/month
Includes:
- Basic model access
- Daily usage limit

API Pay-as-you-go

Qwen-Max: ¥0.04 / 1K tokens
Qwen-Plus: ¥0.008 / 1K tokens
Qwen-Turbo: ¥0.003 / 1K tokens

Enterprise

Price: Contact sales
Includes:
- Deep Alibaba Cloud integration
- Private deployment
- Model fine-tuning
- SLA guarantees

Core Strengths

1. Alibaba Cloud Ecosystem

Alibaba Cloud ECS: Cloud servers
OSS: Object storage
RDS: Database
Function Compute: Serverless

2. Rich Model Selection

Multiple scales of models
Downloadable open-source models
Long context support

3. Chinese Language Optimization

Strong Chinese capabilities
Integration with Alibaba products
Optimized for e-commerce scenarios

4. Developer Tools

DashScope: AI development platform
ModelScope: Model community
PAI: Machine learning platform

Best For

Existing Alibaba Cloud users
E-commerce applications
Enterprise-grade applications
Teams that prefer open-source models
Cost-sensitive projects

Pros & Cons

Pros:

✅ Deep Alibaba Cloud integration
✅ Rich model selection
✅ Downloadable open-source models
✅ Strong Chinese capabilities
✅ Lower prices

Cons:

❌ Average model capabilities
❌ Weaker English capabilities
❌ Relatively closed ecosystem
❌ Toolchain not fully mature

Docs & Resources

Official docs: https://help.aliyun.com/zh/dashscope/
ModelScope: https://modelscope.cn/

ByteDance Doubao

Official Websites

Overview

ByteDance’s Doubao is an AI assistant and multimodal model platform offering dialogue, image generation, voice, and other capabilities.

Core Models

Doubao Series

Doubao-Pro: Professional version
Doubao-Lite: Lightweight version
Doubao-Character: Role-playing

Vision Models

Skylark: Image generation
Skylark-2: Second-generation images

Other Models

Voice models: Speech synthesis
Video models: Video generation

Subscription Plans

Free Tier

Price: ¥0/month
Includes:
- Basic dialogue
- Daily usage limit

API Pay-as-you-go

Doubao-Pro: ¥0.008 / 1K tokens
Doubao-Lite: ¥0.001 / 1K tokens
Skylark: ¥0.05 / image

Enterprise

Price: Contact sales
Includes:
- Volcano Engine integration
- Private deployment
- Dedicated support

Core Strengths

1. ByteDance Ecosystem

Douyin (TikTok): Short video integration
Toutiao: News and content
Feishu (Lark): Workplace collaboration
Volcano Engine: Cloud services

2. Multimodal

Text, images, audio, and video
Cross-modal understanding
Creative content generation

3. Cost Advantage

Lower prices
Suitable for large-scale applications
Generous free quota

4. Developer Tools

Volcano Engine: Development platform
Open Platform: API services
Detailed docs: Chinese-language documentation

Best For

ByteDance ecosystem integration
Multimodal applications
Content creation
Cost-sensitive projects
Consumer-facing (C-end) applications

Pros & Cons

Pros:

✅ ByteDance ecosystem integration
✅ Strong multimodal capabilities
✅ Lower prices
✅ Generous free quota
✅ Suitable for consumer apps

Cons:

❌ Average model capabilities
❌ Fewer enterprise features
❌ Toolchain not fully mature
❌ Relatively closed ecosystem

Docs & Resources

Volcano Engine: https://platform.volcengine.com/
Open Platform: https://open.volcengine.com/

Moonshot Kimi

Official Website

https://www.moonshot.cn/

Overview

Moonshot AI’s Kimi is known for its ultra-long context window — supporting up to 2 million tokens — making it ideal for long document analysis and summarization.

Core Models

Kimi Series

moonshot-v1-128k: 128K context
moonshot-v1-32k: 32K context
moonshot-v1-8k: 8K context

Subscription Plans

Free Tier

Price: ¥0/month
Includes:
- Basic dialogue
- 20 files/day

Pro

Price: ¥68/month
Includes:
- 128K context
- Unlimited file uploads
- Higher usage limits

Enterprise

Price: Contact sales
Includes:
- API access
- Private deployment
- Dedicated support

Core Strengths

1. Ultra-Long Context

2 million token context window
Long document analysis
Large codebase comprehension

2. File Processing

Supports multiple formats
PDF, Word, Excel, and more
Document summarization and analysis

3. Chinese Language Optimization

Strong Chinese capabilities
Chinese document processing
Chinese cultural understanding

4. User Experience

Clean interface
Easy to use
Suited for individual users

Best For

Long document analysis
Codebase comprehension
Research and academic work
Personal knowledge management
Document summarization

Pros & Cons

Pros:

✅ Ultra-long context (2 million tokens)
✅ Strong file processing
✅ Good Chinese language optimization
✅ Pleasant user experience
✅ Great for personal use

Cons:

❌ Average model capabilities
❌ Relatively narrow feature set
❌ Fewer enterprise features
❌ Toolchain not fully mature

Docs & Resources

Official website: https://www.moonshot.cn/
Usage docs: Available on the official website

GitHub Copilot (Microsoft)

Official Websites

Overview

GitHub Copilot is Microsoft’s AI coding assistant that aggregates multiple leading large language models — OpenAI, Anthropic Claude, Google Gemini, and others — with deep integration into VS Code, Visual Studio, JetBrains, and other IDEs. It provides code completion, chat, Agent mode, code review, and comprehensive coding assistance. For developers, Copilot Pro is one of the best-value AI coding subscriptions available today.

One important billing quirk: GitHub Copilot resets usage quotas on the 1st of each month — code completions and Premium request allowances both reset on the 1st. This means subscribing around the 15th lets you get roughly two months of quota for a single month’s payment (if you don’t subscribe continuously, you effectively get 15 full days each month). In practice, I stopped my subscription on the 7th, then re-subscribed on the 15th (same account — interestingly I was only charged $7 that time, and I haven’t fully figured out the billing logic). My Premium request quota for that month remained at whatever was left over from before the 7th. So I’d suggest alternating between two accounts — essentially getting $20 worth of value out of $10, which is now roughly on par with the Claude Pro $20 tier.

Core Models

Copilot supports multiple AI models and you can switch freely within the IDE:

Claude Sonnet 4.6 / Claude 3.7 Sonnet: Anthropic’s strong coding models
Claude Opus 4.5: Anthropic’s most powerful reasoning model
GPT-4.1 / GPT-5 mini: OpenAI’s latest models
Gemini 2.5 Pro / Gemini 3.1 Pro: Google’s high-performance models
o3-mini / o1-mini: Reasoning-enhanced models

Subscription Plans

Free

Price: $0/month
Includes:
- 2,000 code completions/month
- 50 Premium requests/month
- Basic chat functionality
- VS Code and other IDE support

Pro (Personal Professional)

Price: $10/month or $100/year
Includes:
- Unlimited code completions
- 300 Premium requests/month
- Multi-model switching (Claude, Gemini, GPT, etc.)
- Agent mode (autonomous multi-step coding tasks)
- Code Review
- Copilot CLI (command-line assistant)
- Copilot Chat (conversational coding assistance)

Business (Team)

Price: $19/user/month
Includes:
- Everything in Pro
- Organization management dashboard
- Knowledge base integration (index org codebase)
- Custom model selection policies
- IP indemnification
- SAML SSO

Enterprise

Price: $39/user/month
Includes:
- Everything in Business
- Requires GitHub Enterprise Cloud ($21/user/month)
- Custom model fine-tuning
- Pull Request summaries
- Security vulnerability fix suggestions
- Advanced security compliance features

Core Strengths

1. Deep IDE Integration

VS Code: Best experience, native integration
Visual Studio: Seamless within the Microsoft ecosystem
JetBrains suite: IntelliJ, PyCharm, and more
Neovim / Vim: Terminal-friendly
Xcode: Apple ecosystem support

2. Free Multi-Model Switching

Supports OpenAI, Anthropic, Google, and more
One-click model switching within the IDE
Choose the best model for each task
Premium requests consumed based on model complexity

3. Agent Mode

Autonomously understands requirements and executes multi-step coding tasks
Auto-reads files, edits code, runs terminal commands
End-to-end automated coding experience

4. Coding-Scenario Optimization

Code completion: Real-time context-aware completion
Code generation: Generate code from natural language
Code explanation: Understand complex code logic
Bug fixing: Intelligently locate and fix issues
Code refactoring: Optimize code structure and quality
Unit testing: Auto-generate test cases

Best For

Everyday coding development (top recommendation)
Code review and refactoring
Learning new languages and frameworks
Rapid prototyping
Team collaborative coding

Pros & Cons

Pros:

✅ Extremely competitive price ($10/month, outstanding value)
✅ Multi-model support with free switching
✅ Best-in-class IDE integration experience
✅ Agent mode for strong automation
✅ Deep Microsoft ecosystem integration
✅ Free for students

Cons:

❌ Primarily focused on coding; general conversation is limited
❌ Monthly cap on Premium requests
❌ Tied to the GitHub ecosystem
❌ Enterprise tier is pricey

Personal Take

For developers, GitHub Copilot Pro is one of the most worthwhile AI subscriptions out there. $10/month gets you unlimited code completions plus 300 multi-model Premium requests — just the right amount, not too little and nothing wasted. Compared to subscribing separately to ChatGPT Plus ($20) or Claude Pro ($20), Copilot Pro delivers clearly better value, and since you use it directly inside the IDE, your workflow stays seamless.

Docs & Resources

Official docs: https://docs.github.com/en/copilot
Model list: https://docs.github.com/copilot/reference/ai-models/supported-models
GitHub Blog: https://github.blog

Third-Party AI Providers

OpenRouter

Official Website

https://openrouter.ai/

Overview

OpenRouter is a unified AI model API gateway that provides a single interface for accessing hundreds of AI models. Its core idea is to let developers connect to multiple AI providers through one API, simplifying integration work.

Core Strengths

1. Model Ecosystem (300+ Models)

Large language models: GPT-4, Claude, DeepSeek, GLM, Llama, Mistral, and more
Image models: DALL-E, Stable Diffusion, Midjourney, and more
Multimodal: Vision models, audio, video

2. Smart Routing System

Model Fallbacks: Automatic failover
Provider Routing: Intelligent routing selection
Auto Router: Automatically select the best model (powered by NotDiamond)
Optimization by cost, performance, or reliability

3. Advanced Features

Multimodal Support

Image Inputs: Send images to vision models
Image Generation: Generate images
PDF Inputs: Process PDF documents
Audio: Voice input/output
Video Inputs: Video processing

Enhancement Features

Zero Data Retention (ZDR): No data retained
Structured Outputs: JSON Schema validation
Web Search: Real-time web search
Prompt Caching: Cache prompts to reduce costs
Response Healing: Auto-fix malformed responses
Zero Completion Insurance: No charge for failed responses

4. Model Variants

:free - Free model variant
:extended - Extended context window
:exacto - Prioritizes tool call quality
:thinking - Extended reasoning
:online - Real-time web search
:nitro - High-speed inference

5. Developer Experience

SDKs & Frameworks

Official SDKs: TypeScript, Python
Compatible with: OpenAI SDK, Anthropic Agent SDK
Frameworks: LangChain, Vercel AI SDK, PydanticAI, TanStack AI
Tools: Zapier, Infisical, LiveKit

Integration Tools

BYOK (Bring Your Own Key): Use your own API keys
Guardrails: Data policies and model access restrictions
Broadcast: Integrates with Langfuse, Datadog, Braintrust, and more

6. Management Features

Organization Management: Team collaboration and API key management
App Attribution: Application attribution and ranking
Activity Export: Usage data export
Crypto API: Cryptocurrency payment support

Pricing

Billing: Per token
Transparent pricing: Clear pricing per model
Cost optimization: Smart routing reduces costs
Free models: Some models available in :free variant

Best For

Projects connecting to multiple AI models simultaneously
Enterprise apps requiring high availability and failover
Developers looking to reduce migration costs
Scenarios requiring flexible model switching
A/B testing across different models

Pros & Cons

Pros:

✅ Unified interface, lower integration complexity
✅ Hundreds of models, rich selection
✅ Smart routing and failover
✅ Advanced features (caching, structured outputs, etc.)
✅ Compatible with major SDKs, low learning curve
✅ Active community and ecosystem

Cons:

❌ Extra middleware layer, possible additional latency
❌ Dependent on OpenRouter’s service stability
❌ Some advanced features may cost extra

Docs & Resources

Official docs: https://openrouter.ai/docs
GitHub: https://github.com/openrouter
Community projects: Awesome OpenRouter

Together AI

Official Website

https://www.together.ai/

Overview

Together AI is an AI infrastructure provider offering hosted inference for open-source models, along with custom model training and deployment services.

Core Strengths

1. Open-Source Model Hosting

Llama series: Llama 3, Llama 2, and more
Mistral series: Mistral, Mixtral, and more
Other open-source models: Falcon, Vicuna, and more
Regular updates with the latest open-source models

2. High-Performance Inference

GPU optimization: Optimized for specific GPUs
Flash Attention: Accelerated inference
Low latency: Optimized inference engine
High throughput: Supports large-scale concurrency

3. Custom Models

Model fine-tuning: Fine-tuning service
Custom training: Train on your own data
Model evaluation: Model performance benchmarking tools
Model deployment: One-click deployment

4. Developer Tools

Python SDK: Full Python client
OpenAI-compatible: Works with the OpenAI SDK
Monitoring and analytics: Usage tracking
Cost management: Detailed cost analysis

5. Enterprise Features

Private deployment: Support for private cloud
Data privacy: GDPR compliant
SLA guarantees: Enterprise service levels
Technical support: Professional team support

Pricing

Billing: Per token
Transparent pricing: Open-source model prices generally lower than proprietary
Volume discounts: Discounts for high usage
Reserved instances: Reserve capacity for long-term use

Best For

Developers who prefer open-source models
Teams needing custom model training
Cost-sensitive large-scale applications
Enterprises requiring private deployment

Pros & Cons

Pros:

✅ Rich open-source model ecosystem
✅ Well-optimized performance, fast
✅ Supports custom model training
✅ Strong openness and control
✅ Relatively lower costs

Cons:

❌ Does not include proprietary models like GPT or Claude
❌ Model capabilities may not match proprietary models
❌ Limited multimodal support

Docs & Resources

Website: https://www.together.ai/
Docs: https://docs.together.ai/
GitHub: https://github.com/togethercomputer

Replicate

Official Website

https://replicate.com/

Overview

Replicate is an AI model hosting platform that makes it easy for developers to run open-source AI models — including large language models, image generation, audio processing, and more.

Core Strengths

1. Rich Model Library

Language models: Llama, Mistral, Falcon, and more
Image generation: Stable Diffusion series
Image processing: Super-resolution, inpainting, style transfer, and more
Audio processing: Speech synthesis, recognition, and more
Video generation: Video synthesis and editing
Other models: OCR, NLP, and more

2. Ease of Use

Simple API: Clean REST API
Python SDK: Python client
Web Playground: Test models online
Rich examples: Extensive usage examples

3. Custom Models

Upload models: Upload your own models
Docker support: Docker-based model deployment
Cog API: Performance-optimized Cog API
Version control: Model versioning

4. Community Ecosystem

Model sharing: Community model library
Fork models: Build on others’ models
Open-source friendly: Large open-source model collection

5. Developer Experience

Live preview: Preview model output online
Debugging tools: Convenient debugging and optimization
Monitoring dashboard: Usage and cost monitoring
Webhooks: Async task callbacks

Pricing

Billing: By compute time
Transparent pricing: Clear hourly cost
Free credits: Free credits for new users
Pay-as-you-go: Flexible billing

Best For

Rapid prototyping
Testing different models
Small-scale applications
Teams needing diverse model types
Open-source model enthusiasts

Pros & Cons

Pros:

✅ Very rich model library
✅ Easy to use, quick to get started
✅ Supports custom models
✅ Active community
✅ Relatively affordable

Cons:

❌ Does not include proprietary models (GPT, Claude)
❌ Performance may not match dedicated services
❌ Limited enterprise features
❌ Multimodal integration requires manual handling

Docs & Resources

Website: https://replicate.com/
Docs: https://replicate.com/docs
GitHub: https://github.com/replicate

Fireworks.ai

Official Website

https://fireworks.ai/

Overview

Fireworks.ai is a high-performance AI inference platform focused on delivering fast, low-cost AI model inference.

Core Strengths

1. High-Performance Inference

Ultra-fast inference: Industry-leading inference speed
Low latency: Optimized inference engine
High throughput: Supports large-scale concurrency
GPU optimization: Deep hardware-level optimization

2. Model Ecosystem

Open-source models: Llama, Mistral, and more
Optimized models: Fireworks-optimized model variants
Custom models: Support for custom model deployment
Multimodal: Text, images, and more

3. Cost Advantage

Transparent pricing: Clear billing
Pay-as-you-go: Flexible billing model
Volume discounts: Discounts for high usage
Reserved instances: Lower costs for long-term use

4. Developer Experience

OpenAI-compatible: Works with the OpenAI SDK
Python SDK: Full Python client
REST API: Standard REST interface
Monitoring tools: Usage tracking

5. Enterprise Features

Private deployment: Private cloud support
Data security: Enterprise-grade security
SLA guarantees: Service level agreements
Technical support: Professional support team

Technical Highlights

Flash Attention: Accelerated attention computation
KV Cache: Optimized caching mechanism
Quantization: Model quantization to reduce costs
Distributed inference: Distributed deployment support

Pricing

Billing: Per token
Cost advantage: Competitive pricing compared to other providers
Flexible billing: Multiple billing modes supported

Best For

Performance-demanding applications
Cost-sensitive large-scale applications
Scenarios requiring low latency
Projects preferring open-source models

Pros & Cons

Pros:

✅ Extremely fast inference speed
✅ Clear cost advantage
✅ OpenAI-compatible, low migration cost
✅ Well-optimized performance
✅ Enterprise-grade features

Cons:

❌ Relatively fewer models
❌ Does not include proprietary models
❌ Limited multimodal support
❌ Smaller community ecosystem

Docs & Resources

Website: https://fireworks.ai/
Docs: https://fireworks.ai/docs

Hugging Face Inference

Official Website

https://huggingface.co/

Overview

Hugging Face is the largest open-source model community, offering model hosting, inference services, datasets, and more. Hugging Face Inference is its inference API service.

Core Strengths

1. Model Ecosystem (Largest)

Massive model library: Tens of thousands of models
Language models: Llama, Mistral, BERT, T5, and more
Image models: Stable Diffusion, ViT, and more
Audio models: Whisper, AudioLDM, and more
Multimodal: All kinds of multimodal models

2. Community-Driven

Open-source ecosystem: Largest open-source model community
Model sharing: Users can share their models
Collaborative development: Community-driven model improvements
Rich resources: Tutorials, docs, and examples galore

3. Inference Services

Serverless API: Serverless inference
Inference Endpoints: Dedicated inference endpoints
Private deployment: Private cloud support
GPU acceleration: GPU-accelerated inference

4. Developer Tools

Python SDK: The transformers library
JavaScript SDK: Browser support
API clients: Clients for multiple languages
Web UI: Online testing and demos

5. Enterprise Features

Inference Endpoints: Enterprise-grade inference endpoints
Data security: GDPR compliant
SLA guarantees: Service level agreements
Private repositories: Private model repositories

Pricing

Serverless: Pay per usage
Inference Endpoints: Hourly billing (monthly/annual)
Free tier: Free usage available
Enterprise pricing: Customized enterprise plans

Best For

Teams needing specific open-source models
Open-source model enthusiasts
Research and experimentation
Projects requiring diverse model choices
Open-source initiatives

Pros & Cons

Pros:

✅ Most models of any platform
✅ Richest community ecosystem
✅ Open-source friendly
✅ Rich documentation and tutorials
✅ Supports virtually all open-source models

Cons:

❌ Does not include proprietary models (GPT, Claude)
❌ Performance may not match dedicated providers
❌ Enterprise-grade features require extra payment
❌ Inference speed may be slower

Docs & Resources

Website: https://huggingface.co/
Docs: https://huggingface.co/docs/api-inference
GitHub: https://github.com/huggingface

SiliconFlow

Official Website

https://siliconflow.cn/

Overview

SiliconFlow is a Chinese company aiming to become a leading global AI capability provider. It offers multimodal model capabilities spanning language, speech, images, and video, aggregating both domestic and international model sources.

Core Strengths

1. Full-Scenario Product Matrix (Multimodal Aggregation)

Language models: DeepSeek-R1, DeepSeek-V3, QwQ-32B, GLM-4-9B-Chat, and more
Voice models: CosyVoice2-0.5B
Image models: Kolors
Video models: HunyuanVideo-HD, Wan2.1-I2V-14B-720P, Wan2.1-T2V-14B, and more

2. Performance Optimization

High-speed inference: Language model speed improved by 10x+
Low latency: Voice generation latency as low as 100ms
Deep optimization for domestic Chinese models

3. Cost Advantage

Image generation cost savings of 66%
Language model cost savings of 46%
Hosting cost reduction for customers of 52%

4. Enterprise-Grade Features

High Stability

Developer-validated high reliability
Comprehensive monitoring and fault-tolerance
Enterprise-grade professional technical support

High Security

BYOC deployment: Protect data privacy
Compute/network/storage isolation: Comprehensive security
Meets industry standards and compliance requirements
Supports domestic-only deployment

High Scalability

Dynamic scaling to support elastic workloads
One-click custom model deployment
Hybrid cloud deployment support

5. Intelligent Capabilities

Smart scaling for flexible business growth
Intelligent cost analysis for budget control
Access to multiple advanced model services

Technical Advantages

Deep optimization for domestic Chinese LLMs (DeepSeek, GLM, etc.)
Comprehensive multimodal capabilities
Enterprise deployment solutions
Compliant with Chinese data regulations
Localized service support

Pricing

Billing: Per token or per call
Cost advantage: Significant savings compared to overseas providers
Flexible plans: Multiple pricing options available

Best For

Domestic enterprises using Chinese large models
Multimodal AI application development
Scenarios with strict data security and compliance requirements
Cost-sensitive projects
Enterprise-grade deployment scenarios

Pros & Cons

Pros:

✅ Clear cost advantage
✅ Comprehensive multimodal capabilities
✅ Well-optimized for Chinese domestic models
✅ Compliant with Chinese regulations
✅ Localized service support
✅ Comprehensive enterprise features

Cons:

❌ International model coverage not as broad as OpenRouter
❌ Documentation and community relatively new
❌ Lower degree of internationalization

Docs & Resources

Website: https://siliconflow.cn/
API docs: https://docs.siliconflow.cn/

Comparison Summary

Native Providers vs Third-Party Providers

Feature	Native Providers	Third-Party Providers
Model capability	Strongest	Depends on upstream
Model variety	Single vendor	Rich selection
Unified interface	Per vendor	✅ Unified interface
Smart routing	❌	✅
Failover	❌	✅
Integration complexity	High (multi-vendor)	Low
Vendor lock-in	High	Low
Latency	Low	Slightly higher
Stability	High	Platform-dependent
Cost	Higher	More optimization room
Ecosystem	Mature but closed	Open
Enterprise features	Comprehensive	Partial support
Compliance	Needs verification	Mixed

Quick Comparison Table (All Providers)

Feature	OpenAI	Google	Claude	Zhipu	Baidu	Alibaba	Doubao	Kimi	GitHub Copilot	OpenRouter	Together	Replicate	Fireworks	HF	SiliconFlow
Type	Native	Native	Native	Native	Native	Native	Native	Native	Coding tool	Third-party	Third-party	Third-party	Third-party	Third-party	Third-party
Model capability	⭐⭐⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐	⭐⭐⭐	⭐⭐⭐	⭐⭐	⭐⭐⭐	⭐⭐⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐	⭐⭐⭐⭐
Model variety	1	1	1	1	1	1	1	1	Multi-provider	300+	50+	Thousands	20+	Tens of thousands	Multiple
Chinese	⭐⭐⭐⭐	⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐⭐⭐	⭐⭐⭐⭐⭐	⭐⭐⭐⭐⭐	⭐⭐⭐⭐⭐	⭐⭐⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐	⭐⭐⭐	⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐⭐⭐
Multimodal	✅	✅	Partial	✅	Partial	Partial	✅	❌	❌	✅	Partial	✅	Partial	✅	✅
Smart routing	❌	❌	❌	❌	❌	❌	❌	❌	❌	✅	❌	❌	❌	❌	Partial
Cost	High	Medium	Medium-high	Medium	Medium	Medium	Low	Medium	Extremely low	Medium	Low	Low	Low	Medium	Low
Enterprise features	⭐⭐⭐⭐⭐	⭐⭐⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐	⭐⭐	⭐⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐	⭐⭐	⭐⭐⭐	⭐⭐⭐	⭐⭐⭐⭐
Documentation	⭐⭐⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐	⭐⭐⭐	⭐⭐⭐	⭐⭐⭐	⭐⭐	⭐⭐⭐⭐⭐	⭐⭐⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐	⭐⭐⭐⭐⭐	⭐⭐⭐
Community	⭐⭐⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐	⭐⭐⭐	⭐⭐⭐	⭐⭐⭐	⭐⭐⭐	⭐⭐	⭐⭐⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐	⭐⭐⭐⭐	⭐⭐⭐	⭐⭐⭐⭐⭐	⭐⭐⭐
Compliance	Low	Low	Low	High	High	High	High	High	Medium	Medium	Medium	Medium	Medium	Medium	High

Recommendations

Choose a native provider if:

OpenAI

You need peak model performance
Enterprise-grade apps with high stability requirements
Global products needing multilingual support
You don’t want to rely on third parties
Cost is not a primary concern

Google Gemini

You need Google ecosystem integration
Multimodal application development
You’re on Google Cloud
You need MLOps capabilities

Anthropic Claude

High safety requirements
You need long context (200K)
Coding assistant tools
Chatbots

Zhipu AI

Domestic Chinese application development
Chinese-primary applications
Strict compliance requirements
Cost-sensitive

Baidu ERNIE Bot

Baidu ecosystem integration
Need Baidu Cloud services
SMB rapid deployment

Alibaba Cloud Qwen

Existing Alibaba Cloud users
E-commerce applications
Open-source model preference

ByteDance Doubao

ByteDance ecosystem integration
Multimodal applications
Consumer-facing apps
Cost-sensitive

Moonshot Kimi

Long document analysis
Research and academic work
Personal knowledge management

GitHub Copilot

Everyday coding development (strongly recommended)
Coding scenarios needing multi-model switching
Limited budget but need high-quality AI assistance
Seamless in-IDE use without switching between browser and editor

Choose a third-party provider if:

OpenRouter

You need to connect to multiple models at once
You want smart routing and failover
Reducing vendor lock-in risk
You need A/B testing

Together AI

You prefer open-source models
You need custom model training
Cost-sensitive large-scale applications

Replicate

Rapid prototyping
Testing different models
Small-scale applications
Open-source model enthusiasts

Fireworks.ai

Extremely high performance requirements
Cost-sensitive large-scale applications
Low latency requirements

Hugging Face

Specific open-source models
Research and experimentation
Community-driven development

SiliconFlow

Domestic enterprises
Multimodal applications
Strict compliance requirements
Cost-sensitive

Best Practices

1. Hybrid Strategy

Core features → Native provider (stability, capability)
Cost optimization → Third-party open-source models
Compliance requirements → Locally compliant provider
A/B testing → Third-party aggregation platform

2. Avoiding Vendor Lock-In

Use an abstraction layer to wrap the API
Design swappable model selection strategies
Maintain multi-provider backup plans

3. Cost Optimization

Use caching to reduce repeated requests
Choose models based on task complexity
Monitor usage and costs
Take advantage of free quotas

4. Monitoring and Observability

Track model performance metrics
Monitor usage and costs
Set up alerting mechanisms
Use platform analytics tools

Learning Resources

Native Providers

OpenAI: https://platform.openai.com/docs
Google AI: https://ai.google.dev/docs
Anthropic: https://docs.anthropic.com/
Zhipu AI: https://open.bigmodel.cn/dev/api
Baidu: https://cloud.baidu.com/doc/WENXINWORKSHOP/
Alibaba Cloud: https://help.aliyun.com/zh/dashscope/
ByteDance: https://platform.volcengine.com/
Kimi: https://www.moonshot.cn/
GitHub Copilot: https://docs.github.com/en/copilot

Third-Party Providers

OpenRouter: https://openrouter.ai/docs
Together AI: https://docs.together.ai/
Replicate: https://replicate.com/docs
Fireworks.ai: https://fireworks.ai/docs
Hugging Face: https://huggingface.co/docs/api-inference
SiliconFlow: https://docs.siliconflow.cn/

Search Keywords

AI subscription plan comparison
LLM API pricing
OpenAI vs Claude vs Google
third-party AI provider
Chinese AI model comparison
AI API aggregation platform
OpenRouter tutorial
AI inference platform

Future Updates

This document will be updated continuously to track the latest developments and pricing changes from AI providers. I recommend checking each provider’s official announcements and changelogs regularly.

Update plan:

Update pricing information
Add new models and services
Supplement with real-world use cases
Add performance benchmark data
Update compliance and privacy policies

This document is based on information as of March 2026. AI providers change rapidly — always refer to official sources for the latest information.

AI LLM Claude API Subscription OpenAI Google

Research Notes on Mainstream AI Subscription Plans

Research Notes on Mainstream AI Subscription Plans

Table of Contents

Provider Category Overview

Native AI Providers

Third-Party AI Providers (Aggregators)

Native AI Providers

OpenAI

Official Websites

Overview

Core Models

Language Models

Image Models

Other Models

Subscription Plans

Free Tier

API Pay-as-you-go

ChatGPT Plus (Personal)

Team

Enterprise

Core Strengths

1. Model Capabilities

2. Ecosystem

3. Advanced Features

4. Enterprise Features

Best For

Pros & Cons

Docs & Resources

Google Gemini

Official Websites

Overview

Core Models

Gemini Series

Other Models

Subscription Plans

Free Tier

AI Studio (API Pay-as-you-go)

Google One AI Premium (Personal)

Enterprise

Core Strengths

1. Multimodal Capabilities

2. Google Ecosystem Integration

3. Developer Experience

4. Performance Advantages

Best For

Pros & Cons

Docs & Resources

Anthropic Claude

Official Websites

Overview

Core Models

Claude 3 Series

Claude 2 Series

Subscription Plans

Free Tier

API Pay-as-you-go

Claude Pro (Personal)

Team

Enterprise

Core Strengths

1. Safety and Alignment

2. Long Context

3. Natural Conversation

4. Coding Capabilities

5. Developer Experience

Best For

Pros & Cons

Docs & Resources

Zhipu AI

Official Websites

Overview

Core Models

Language Models

Multimodal Models

Specialized Models

Subscription Plans

Free Tier

API Pay-as-you-go

Personal Plan

Enterprise