Ratio Settings

Ratio settings are a core configuration of the NewAPI billing system, allowing flexible control over the billing standards for various models and user groups by setting different ratios.

Ratio System Overview

NewAPI uses a three-tier ratio system to calculate user quota consumption:

Model Ratio - Defines the base billing multiplier for different AI models
Completion Ratio - Applies additional billing adjustments to output tokens
Group Ratio - Sets differentiated billing multipliers for different user groups

Relationship Between Quota and Ratios

In the New API system, ratios are key parameters for calculating quota consumption. Quota is an internal billing unit of the system, and all API calls are ultimately converted into quota points for deduction.

Quota Unit Conversion:

1 USD = 500,000 quota points
Quota points are the basic unit for internal system billing
User balances and consumption records are based on quota points

Quota Calculation Formulas

Pay-as-you-go Models (Based on Token Consumption)

配额消耗 = (输入token数 + 输出token数 × 补全倍率) × 模型倍率 × 分组倍率

Per-call Billing Models (Fixed Price)

配额消耗 = 模型固定价格 × 分组倍率 × 配额单位(500,000)

Audio Models (Special handling, automatically processed internally by NewAPI)

配额消耗 = (文本输入token + 文本输出token × 补全倍率 + 音频输入token × 音频倍率 + 音频输出token × 音频倍率 × 音频补全倍率) × 模型倍率 × 分组倍率

Pre-consumption and Post-consumption Mechanism

New API adopts a dual billing mechanism of pre-consumption and post-consumption:

Pre-consumption phase: Before an API call, quota consumption is calculated based on estimated token count and pre-deducted.
Post-consumption phase: After an API call is completed, quota consumption is recalculated based on the actual token count.
Difference adjustment: If the actual consumption differs from the pre-consumption, the system automatically adjusts the user's quota balance.

预消费配额 = 预估token数 × 模型倍率 × 分组倍率
实际配额 = 实际token数 × 模型倍率 × 分组倍率
配额调整 = 实际配额 - 预消费配额

Model Ratio Settings

Model ratios define the base billing multipliers for different AI models, and the system pre-sets default ratios for various models.

Common Model Ratio Examples

Model Name	Model Ratio	Completion Ratio	Official Price (Input)	Official Price (Output)
gpt-4o	1.25	4	$2.5/1M Tokens	$10/1M Tokens
gpt-3.5-turbo	0.25	1.33	$0.5/1M Tokens	$1.5/1M Tokens
gpt-4o-mini	0.075	4	$0.15/1M Tokens	$0.6/1M Tokens
o1	7.5	4	$15/1M Tokens	$60/1M Tokens

Ratio Meaning Explanation:

Model Ratio: A multiplier relative to the base billing unit, reflecting cost differences between models
Completion Ratio: The billing multiplier for output tokens relative to input tokens, reflecting output cost differences
Higher ratios consume more quota; lower ratios consume less quota

Setup Methods

JSON Format Setting: Directly edit the model ratio JSON configuration
Visual Editor: Set ratios via a graphical interface

倍率1

Completion Ratio Settings

Completion ratios are used for additional billing of output tokens, primarily to balance the input and output cost differences across various models.

Default Completion Ratios

Model Type	Official Price (Input)	Official Price (Output)	Completion Ratio	Description
gpt-4o	2.5$/1M Tokens	10$/1M Tokens	4	Output is 4x input
gpt-3.5-turbo	0.5$/1M Tokens	1$/1M Tokens	2	Output is 2x input
gpt-image-1	5$/1M Tokens	40$/1M Tokens	8	Output is 8x input
gpt-4o-mini	0.15$/1M Tokens	0.6$/1M Tokens	4	Output is 4x input
Other Models	1	1	1	Output is 1x input

Setup Instructions

Completion ratios primarily affect the billing of output tokens
Setting to 1 means output token billing is the same as input token billing
Greater than 1 means higher billing for output tokens, less than 1 means lower billing for output tokens

Group Ratio Settings

Group ratios allow setting differentiated billing multipliers for different user groups, enabling flexible pricing strategies.

Group Ratio Configuration

{
  "vip": 0.5,
  "premium": 0.8,
  "standard": 1.0,
  "trial": 2.0
}

Group Ratio Priority

User-specific Ratio: Personal ratio set for a specific user
Group Ratio: The ratio of the group the user belongs to
Default Ratio: System default ratio (usually 1.0)

倍率2

Visual Ratio Settings

The visual editor provides an intuitive ratio management interface, supporting:

Batch editing of model ratios
Real-time preview of ratio configurations
Conflict detection and prompts
One-click synchronization of upstream ratios

倍率3

Models Without Ratio Settings

For models without ratio settings, the system will:

Self-use mode: Use the default ratio of 37.5
Commercial mode: Prompt "Ratio or price not configured" error
Automatic detection: Display unconfigured models in the management interface

倍率4

Upstream Ratio Synchronization

The system supports automatic synchronization of ratio settings from upstream channels:

Automatically retrieve upstream model ratios
Batch update local ratio configurations
Stay synchronized with upstream prices
Supports manual adjustment and override

倍率5

Input tokens: 1,000
Output tokens: 500
Model Ratio: 15
Completion Ratio: 2
Group Ratio: 1.0 (Standard user)

Calculation Process:

配额消耗 = (1,000 + 500 × 2) × 15 × 1.0
         = (1,000 + 1,000) × 15
         = 2,000 × 15
         = 30,000 配额点数

Equivalent USD Cost: 30,000 ÷ 500,000 = $0.06

Example 2: GPT-3.5 VIP User Conversation

Scenario Parameters:

Input tokens: 2,000
Output tokens: 1,000
Model Ratio: 0.25
Completion Ratio: 1.33
Group Ratio: 0.5 (VIP user 50% discount)

Calculation Process:

配额消耗 = (2,000 + 1,000 × 1.33) × 0.25 × 0.5
         = (2,000 + 1,330) × 0.125
         = 3,330 × 0.125
         = 416.25 配额点数

Equivalent USD Cost: 416.25 ÷ 500,000 = $0.00083

Example 3: Per-call Billing Model (e.g., Midjourney)

Scenario Parameters:

Model Fixed Price: $0.02
Group Ratio: 1.0 (Standard user)
Quota Unit: 500,000

Calculation Process:

配额消耗 = 0.02 × 1.0 × 500,000
         = 10,000 配额点数

Equivalent USD Cost: 10,000 ÷ 500,000 = $0.02

For more billing rules, please refer to Frequently Asked Questions

Ratio System Overview

Relationship Between Quota and Ratios

Quota Unit Conversion:

Quota Calculation Formulas

Pay-as-you-go Models (Based on Token Consumption)

Per-call Billing Models (Fixed Price)

Audio Models (Special handling, automatically processed internally by NewAPI)

Pre-consumption and Post-consumption Mechanism

Model Ratio Settings

Common Model Ratio Examples

Setup Methods

Completion Ratio Settings

Default Completion Ratios

Setup Instructions

Group Ratio Settings

Group Ratio Configuration

Group Ratio Priority

Visual Ratio Settings

Models Without Ratio Settings

Upstream Ratio Synchronization

Frequently Asked Questions

Q: How to set ratios for new models?

Q: How do group ratios take effect?

Q: What is the purpose of the completion ratio?

Q: How to batch set ratios for similar models?

Quota Calculation Examples

Example 1: GPT-4 Standard User Conversation

Scenario Parameters:

Calculation Process:

Example 2: GPT-3.5 VIP User Conversation

Scenario Parameters:

Calculation Process:

Example 3: Per-call Billing Model (e.g., Midjourney)

Scenario Parameters:

Calculation Process:

On this page