The edge inference service provided by EdgeOne is a high-performance AI inference solution built on EdgeOne's distributed edge cloud nodes and a Serverless elastic architecture. Its core goal is to address the pain points of traditional cloud inference (high latency, high bandwidth costs) and of local deployment (difficult Ops, lack of elasticity). For AI businesses that require real-time response and localized data processing, it provides inference computing power with nearby scheduling, auto scaling, Ops-free management, and security and compliance.
Benefits
1. Low-latency inference: nearby response with millisecond-level feedback
Core highlights: Leverages EdgeOne's global edge nodes so that business traffic is served by the nearest node, reducing inference response latency to the millisecond level.
Customer value: Serves scenarios with strict real-time requirements, avoids the latency overhead of round trips to a central cloud, and improves business response speed and user experience.
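To make the latency benefit concrete, the sketch below sends one inference request and measures the round-trip time. It is a minimal illustration, not EdgeOne's SDK: the endpoint URL and request payload shape are hypothetical placeholders you would replace with your own deployment's values.

```python
import json
import time
from urllib import request

# Hypothetical endpoint; substitute the URL of your own edge inference deployment.
EDGE_ENDPOINT = "https://inference.example-edge.com/v1/predict"

def infer(payload: dict, endpoint: str = EDGE_ENDPOINT, send=None):
    """Send one inference request; return (result, round-trip latency in ms).

    `send` lets callers inject a custom transport (e.g. a stub for testing);
    by default a plain HTTP POST with a JSON body is used.
    """
    body = json.dumps(payload).encode("utf-8")
    if send is None:
        def send(data):
            req = request.Request(
                endpoint, data=data,
                headers={"Content-Type": "application/json"})
            with request.urlopen(req, timeout=5) as resp:
                return resp.read()
    start = time.perf_counter()          # measure the full round trip
    raw = send(body)
    latency_ms = (time.perf_counter() - start) * 1000
    return json.loads(raw), latency_ms
```

Because the request is routed to a nearby edge node rather than a distant central region, the measured `latency_ms` is what the "millisecond-level feedback" claim refers to.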
2. Auto Scaling: On-demand allocation to reduce costs and improve efficiency
Core highlights: Built on a Serverless architecture, the service automatically adjusts computing resources to match inference request volume: resources are released during idle periods and scaled out seamlessly during peaks, with no need to reserve redundant computing power.
Customer value: Pay-as-you-go billing based on actual computing resource usage duration avoids idle hardware costs associated with on-premises deployment. SMBs eliminate significant hardware procurement investments, while enterprise clients can flexibly respond to traffic peaks.
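The pay-as-you-go model above can be sketched as simple arithmetic: cost is the compute time actually consumed multiplied by a unit price. The unit price below is a made-up placeholder for illustration only; actual EdgeOne billing rates and units will differ.

```python
# Hypothetical unit price for illustration; real rates vary by platform and region.
PRICE_PER_COMPUTE_SECOND = 0.0008  # USD per second of compute actually used

def monthly_serverless_cost(requests_per_month: int,
                            seconds_per_request: float,
                            price: float = PRICE_PER_COMPUTE_SECOND) -> float:
    """Bill only the compute seconds consumed by real requests.

    Idle time costs nothing, unlike reserved on-premises hardware whose
    cost is fixed regardless of utilization.
    """
    return requests_per_month * seconds_per_request * price

# Example: 100,000 requests/month at 0.2 s of compute each
# -> 20,000 compute seconds billed, zero cost for idle capacity.
cost = monthly_serverless_cost(100_000, 0.2)
```

With these assumed numbers the monthly bill is 16.0 USD; the key point is that halving traffic halves the bill, whereas reserved hardware costs the same whether it is busy or idle.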
3. Ops-free management: Simplified deployment to focus on core business
Core highlights: Provides fully managed inference services where the platform automatically handles edge node Ops, computing power scheduling, model deployment, version updates, and self-healing from failures, enabling developers to focus on core business without needing to manage underlying resources.
Customer value: Lowers the barrier to AI business adoption, reduces Ops team investment (no dedicated Ops personnel required for edge node maintenance), and shortens product launch cycles (only 30 minutes from model upload to service activation).
4. Security protection: Full-stack safeguarding ensuring API stability
Core highlights: Delivers a full-stack security protection system tailored for inference service APIs, covering Layer 4 and Layer 7 defense capabilities. Layer 4 protection defends against DDoS attacks, while Layer 7 protection integrates WAF to precisely identify and block application-layer attacks such as SQL injection, cross-site scripting (XSS), and malicious crawlers.
Customer value: Prevents service outages, data breaches, and malicious consumption of computing resources caused by API attacks. Ensures 24/7 stable operation of inference services, reduces business operational risk, and is particularly suited to industries with stringent security requirements, such as finance and government.
Quick Start