
Tencent Cloud EdgeOne

Edge inference fee (postpaid)

Last updated: 2026-04-15 17:34:30
Edge Inference provides GPU inference services on EdgeOne edge nodes, allowing users to deploy custom model images or platform-preset models to edge nodes for inference. Edge Inference is available only in the Enterprise Edition plan and is billed postpaid based on instance running duration: with a single inference instance as the smallest billing unit, postpaid bills are generated from each instance's running duration.
Instance running duration: the total time, in seconds, from the startup to the termination of an inference instance. Billing is calculated per second with a minimum charge of 1 second; any fraction of a second is rounded up to 1 second.
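The rounding rule above can be sketched in a few lines of Python. This is a minimal illustration of the documented behavior, not part of any EdgeOne API; `billable_seconds` is a hypothetical helper name.

```python
import math

def billable_seconds(start_ts: float, stop_ts: float) -> int:
    """Billable duration from instance start to stop:
    per-second billing, minimum 1 second, fractions rounded up."""
    return max(1, math.ceil(stop_ts - start_ts))

# An instance that ran 3599.2 seconds is billed for 3600 seconds,
# and one that ran 0.4 seconds is billed for the 1-second minimum.
print(billable_seconds(0.0, 3599.2))
print(billable_seconds(0.0, 0.4))
```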
Note:
The Edge Inference feature is currently in beta and requires allowlist access; it is available only in the Enterprise Edition plan. If needed, please Contact Us.

Edge Inference Cost

Edge Inference is billed postpaid according to the running duration of instances with different GPU specifications and settled monthly. The postpaid billing method and pricing are as follows:
| Billable Item | GPU Specification | List Price (USD/second) | Billing Mode | Settlement Cycle |
| --- | --- | --- | --- | --- |
| Custom Inference Service | Entry-level (A-tier) | 0.000217 | Postpaid | Monthly |
| Custom Inference Service | Basic (B-tier) | 0.000220 | Postpaid | Monthly |
| Custom Inference Service | Basic Enhanced (C-tier) | 0.000250 | Postpaid | Monthly |
Note:
1. Edge Inference is billed based on the actual running duration of instances; billing stops as soon as an instance is stopped.
2. If auto scaling (AS) is enabled, each instance is metered independently, and the total cost is the sum of the running-duration costs of all instances.
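Putting the pricing and the auto-scaling rule together, the monthly cost is the sum of (price per second × rounded-up duration) over every instance. A minimal sketch using the list prices from this page; `instance_cost` and `monthly_cost` are hypothetical helpers for illustration only:

```python
import math

# List prices from the table above, in USD per second, keyed by GPU tier.
TIER_PRICE = {
    "A": 0.000217,  # Entry-level (A-tier)
    "B": 0.000220,  # Basic (B-tier)
    "C": 0.000250,  # Basic Enhanced (C-tier)
}

def instance_cost(tier: str, duration_seconds: float) -> float:
    """Cost of one instance: per-second price x duration,
    with any fraction of a second rounded up."""
    return TIER_PRICE[tier] * math.ceil(duration_seconds)

def monthly_cost(instances: list[tuple[str, float]]) -> float:
    """With auto scaling, each instance is metered independently
    and the bill is the sum over all instances."""
    return sum(instance_cost(tier, dur) for tier, dur in instances)

# Two A-tier instances (one scaled out for half the period) plus one B-tier:
# 3600*0.000217 + 1800*0.000217 + 3600*0.000220 = 1.9638 USD
total = monthly_cost([("A", 3600), ("A", 1800), ("B", 3600)])
print(round(total, 4))
```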

