Tencent Cloud

DescribeModelServiceGroups
Last updated: 2025-11-13 20:53:05

1. API Description

Domain name for API request: tione.intl.tencentcloudapi.com.

This API is used to list online inference service groups.

A maximum of 20 requests can be initiated per second for this API.
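
The rate limit above can be respected on the client side by spacing calls out. The following is a minimal sketch, not part of any official SDK: the `RateLimiter` class and its `clock` and `sleep` parameters are hypothetical names introduced here for illustration. It keeps successive calls at least 1/20 of a second apart, which is one simple way to stay at or below 20 requests per second.

```python
import time


class RateLimiter:
    """Hypothetical helper: keeps successive calls at least 1/max_per_second apart."""

    def __init__(self, max_per_second=20, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = 1.0 / max_per_second
        self.clock = clock          # injectable so the limiter can be tested without waiting
        self.sleep = sleep
        self.next_allowed = None    # earliest time the next call may start

    def wait(self):
        """Block until the next call is allowed, then reserve the following slot."""
        now = self.clock()
        if self.next_allowed is not None and now < self.next_allowed:
            self.sleep(self.next_allowed - now)
            now = self.next_allowed
        self.next_allowed = now + self.min_interval
```

Calling `limiter.wait()` immediately before each request is enough for a single-threaded client. A multi-threaded client would additionally need a lock around `wait()`.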

We recommend using API Explorer. It provides online calls, signature authentication, SDK code generation, and quick API search, and it lets you view the request, the response, and auto-generated examples.

2. Input Parameters

The list below covers only the request parameters of this API and some common parameters. For the complete list of common parameters, see Common Request Parameters.

| Parameter Name | Required | Type | Description |
|---|---|---|---|
| Action | Yes | String | Common parameter. The value used for this API: DescribeModelServiceGroups. |
| Version | Yes | String | Common parameter. The value used for this API: 2021-11-11. |
| Region | Yes | String | Common parameter. For more information, see the list of regions supported by the product. |
| Offset | No | Integer | Offset. Default value: 0. |
| Limit | No | Integer | Number of returned results. Default value: 20. Maximum value: 100. |
| Order | No | String | Sorting order of the output list. Valid values: ASC (ascending order) and DESC (descending order). |
| OrderField | No | String | Field to sort by. Valid values: CreateTime and UpdateTime. |
| Filters.N | No | Array of Filter | Filter conditions. Supported filter field names: ["ClusterId", "ServiceId", "ServiceGroupName", "ServiceGroupId", "Status", "CreatedBy", "ModelVersionId"]. |
| TagFilters.N | No | Array of TagFilter | Tag filter conditions. |
| ServiceCategory | No | String | Service category. |
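
To make the parameter constraints concrete, here is a minimal sketch of a request-body builder that enforces the documented limits before anything is sent. The function name `build_request_body` is hypothetical, and the shape of each `Filter` object (`Name` plus `Values`) is an assumption, because this page does not define the `Filter` type.

```python
SUPPORTED_FILTER_NAMES = {
    "ClusterId", "ServiceId", "ServiceGroupName", "ServiceGroupId",
    "Status", "CreatedBy", "ModelVersionId",
}


def build_request_body(offset=0, limit=20, order=None, order_field=None, filters=None):
    """Return a JSON-ready dict, rejecting values outside the documented ranges."""
    if limit > 100:
        raise ValueError("Limit must not exceed 100")
    body = {"Offset": offset, "Limit": limit}

    if order is not None:
        if order not in ("ASC", "DESC"):
            raise ValueError("Order must be ASC or DESC")
        body["Order"] = order

    if order_field is not None:
        if order_field not in ("CreateTime", "UpdateTime"):
            raise ValueError("OrderField must be CreateTime or UpdateTime")
        body["OrderField"] = order_field

    filter_list = []
    for name, values in (filters or {}).items():
        if name not in SUPPORTED_FILTER_NAMES:
            raise ValueError("Unsupported filter field: %s" % name)
        # Assumed Filter shape; check the Filter data type definition before relying on it.
        filter_list.append({"Name": name, "Values": list(values)})
    if filter_list:
        body["Filters"] = filter_list
    return body
```

For example, `build_request_body(limit=10, order="DESC", order_field="CreateTime", filters={"Status": ["Stopped"]})` produces a body that requests the ten most recently created groups matching the filter, under the assumed filter shape.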

3. Output Parameters

| Parameter Name | Type | Description |
|---|---|---|
| TotalCount | Integer | Number of inference service groups. Note: This field may return null, indicating that no valid values can be obtained. |
| ServiceGroups | Array of ServiceGroup | Service group information. Note: This field may return null, indicating that no valid values can be obtained. |
| RequestId | String | Unique request ID generated by the server and returned with every request. If the request never reaches the server, no RequestId is returned. Provide the RequestId when troubleshooting. |
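
Because `Offset` and `Limit` page through the result set and both `TotalCount` and `ServiceGroups` may be null, a client typically loops until it has seen `TotalCount` items. The sketch below illustrates one way to do that. `iter_service_groups` and the `call_api` callable are hypothetical: `call_api` stands in for whatever signed transport you use and is assumed to take a request body and return the decoded JSON response.

```python
def iter_service_groups(call_api, limit=20):
    """Yield every service group, requesting one page of `limit` items at a time."""
    offset = 0
    while True:
        response = call_api({"Offset": offset, "Limit": limit})["Response"]
        # Both fields may be null, so fall back to an empty page and a zero count.
        groups = response.get("ServiceGroups") or []
        total = response.get("TotalCount") or 0
        yield from groups
        offset += limit
        # Stop on an empty page or once the offset has passed the reported total.
        if not groups or offset >= total:
            break
```

Stopping on an empty page as well as on `TotalCount` keeps the loop from running forever if the count is missing or changes between calls.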

4. Example

Example 1: Listing All Service Groups

Input Example

POST / HTTP/1.1
Host: tione.intl.tencentcloudapi.com
Content-Type: application/json
X-TC-Action: DescribeModelServiceGroups
<Common request parameters>

{
    "Limit": 10,
    "Offset": 0,
    "Filters": []
}
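
As an illustration of how the input example maps onto an HTTP client, the sketch below builds (but does not send) the same request with Python's standard library. The `build_request` function is a hypothetical helper, and the `X-TC-Version` and `X-TC-Region` header names are assumptions based on the common parameters listed in section 2. The signature headers that the `<Common request parameters>` placeholder stands for are deliberately omitted, so the request would be rejected if sent as-is.

```python
import json
import urllib.request

ENDPOINT = "https://tione.intl.tencentcloudapi.com/"


def build_request(body, region):
    """Build an unsigned POST request for DescribeModelServiceGroups."""
    headers = {
        "Content-Type": "application/json",
        "X-TC-Action": "DescribeModelServiceGroups",
        "X-TC-Version": "2021-11-11",   # assumed header name for the Version parameter
        "X-TC-Region": region,          # assumed header name for the Region parameter
        # Signature-related headers are intentionally left out of this sketch.
    }
    data = json.dumps(body).encode("utf-8")
    return urllib.request.Request(ENDPOINT, data=data, headers=headers, method="POST")


request = build_request({"Limit": 10, "Offset": 0, "Filters": []}, region="ap-guangzhou")
```

In practice an SDK or API Explorer generates the signed headers for you, which is usually less error-prone than signing requests by hand.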

Output Example

{
    "Response": {
        "ServiceGroups": [
            {
                "ServiceGroupId": "ms-vzd8qdk5",
                "ServiceGroupName": "llm_test",
                "CreatedBy": "100032054859",
                "CreateTime": "2023-08-05T05:46:58Z",
                "UpdateTime": "2023-08-08T14:27:02Z",
                "Uin": "100005348929",
                "ServiceCount": 1,
                "RunningServiceCount": 0,
                "Services": [
                    {
                        "ServiceGroupId": "ms-vzd8qdk5",
                        "ServiceId": "ms-vzd8qdk5-1",
                        "ServiceGroupName": "llm_test",
                        "ServiceDescription": "",
                        "ServiceInfo": {
                            "Replicas": 0,
                            "ImageInfo": {
                                "ImageType": "PRE_SET",
                                "ImageUrl": "tione.tencentcloudcr.com/qcloud-ti-platform/chatglm-demo:latest",
                                "RegistryRegion": "",
                                "RegistryId": ""
                            },
                            "Env": [],
                            "Resources": {
                                "Cpu": 12000,
                                "Memory": 45056,
                                "Gpu": 100,
                                "RealGpu": 100,
                                "GpuType": "A10",
                                "RealGpuDetailSet": [
                                    {
                                        "Name": "A10",
                                        "Value": 100
                                    }
                                ]
                            },
                            "InstanceType": "TI.GNV4.3XLARGE44.POST",
                            "ModelInfo": {
                                "ModelId": "m-838176671237008128",
                                "ModelName": "chatglm-6b-tiacc-ft",
                                "ModelVersionId": "mv-v1-838176671237008129",
                                "ModelVersion": "v1",
                                "ModelSource": "COS",
                                "ModelType": "NORMAL",
                                "CosPathInfo": {
                                    "Bucket": "danerli-guangzhou-1256580188",
                                    "Region": "ap-guangzhou",
                                    "Paths": [
                                        "output/ai_market_algo_test_thu_tiacc_ft/train-793371624977165056/output/adgen-chatglm-6b-ft-1e-8/checkpoint-500/model_service.py"
                                    ]
                                },
                                "AlgorithmFramework": "PYTORCH",
                                "ModelFormat": "PYTORCH"
                            },
                            "InferCodeInfo": {
                                "CosPathInfo": {
                                    "Bucket": "danerli-guangzhou-1256580188",
                                    "Region": "ap-guangzhou",
                                    "Paths": [
                                        "output/ai_market_algo_test_thu_tiacc_ft/train-793371624977165056/output/adgen-chatglm-6b-ft-1e-8/checkpoint-500/model_service.py"
                                    ]
                                }
                            },
                            "VolumeMount": {
                                "VolumeSourceType": "CFS",
                                "CFSConfig": {
                                    "Id": "cfs-mpjk7vit",
                                    "Path": "/",
                                    "MountType": "",
                                    "Protocol": ""
                                }
                            },
                            "LogEnable": false,
                            "LogConfig": null,
                            "AuthorizationEnable": false,
                            "ScaleMode": "",
                            "HorizontalPodAutoscaler": null,
                            "CronScaleJobs": [],
                            "ScaleStrategy": "",
                            "ScheduledAction": null,
                            "Status": {
                                "Replicas": 0,
                                "UpdatedReplicas": 0,
                                "ReadyReplicas": 0,
                                "AvailableReplicas": 0,
                                "UnavailableReplicas": 0,
                                "Status": "Stopped",
                                "Reason": "",
                                "Conditions": [
                                    {
                                        "Message": "Deployment has minimum availability.",
                                        "Reason": "MinimumReplicasAvailable",
                                        "Status": "True",
                                        "Type": "Available",
                                        "LastTransitionTime": "2023-08-08T22:27:01+08:00",
                                        "LastUpdateTime": "2023-08-08T22:27:01+08:00"
                                    },
                                    {
                                        "Message": "ReplicaSet \"ms-vzd8qdk5-1-7456b4cc97\" has successfully progressed.",
                                        "Reason": "NewReplicaSetAvailable",
                                        "Status": "True",
                                        "Type": "Progressing",
                                        "LastTransitionTime": "2023-08-08T22:27:02+08:00",
                                        "LastUpdateTime": "2023-08-08T22:27:02+08:00"
                                    }
                                ]
                            },
                            "Weight": 0,
                            "PodList": [],
                            "Pods": null,
                            "PodInfos": [],
                            "ResourceTotal": null,
                            "OldReplicas": 1,
                            "HybridBillingPrepaidReplicas": 0,
                            "OldHybridBillingPrepaidReplicas": 0,
                            "ServiceLimit": null,
                            "ModelHotUpdateEnable": false,
                            "ModelTurboEnable": false
                        },
                        "ClusterId": "",
                        "Region": "ap-guangzhou",
                        "Namespace": "infer-100005348929",
                        "ChargeType": "POSTPAID_BY_HOUR",
                        "ResourceGroupId": "",
                        "ResourceGroupName": "",
                        "Tags": [],
                        "IngressName": "user-ingress-1",
                        "CreatedBy": "100032054859",
                        "CreateTime": "2023-08-05T05:46:58Z",
                        "UpdateTime": "2023-08-08T14:27:02Z",
                        "Uin": "100005348929",
                        "SubUin": "",
                        "AppId": 0,
                        "BusinessStatus": "CREATE_SUCCEED",
                        "CreateFailedReason": "CREATE_SUCCEED",
                        "Status": "Stopped",
                        "BillingInfo": "",
                        "Weight": 100,
                        "CreateSource": "DEFAULT",
                        "Version": "1",
                        "LatestVersion": "",
                        "ServiceLimit": null,
                        "ScheduledAction": null
                    }
                ],
                "Status": "Stopped",
                "Tags": [],
                "BusinessStatus": "",
                "WeightUpdateStatus": "",
                "CreateSource": "DEFAULT",
                "BillingInfo": "",
                "LatestVersion": ""
            }
        ],
        "TotalCount": 1,
        "RequestId": "e5daa146-abed-4d31-ab15-ac9e251fc13c"
    }
}
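
The output above is deeply nested, so callers often reduce it to the handful of fields they need. The following is a small sketch of that idea. `summarize_groups` is a hypothetical helper, and `SAMPLE` is a trimmed copy of the output example that keeps only the fields the helper reads.

```python
import json

# Trimmed copy of the output example above.
SAMPLE = """
{
    "Response": {
        "ServiceGroups": [
            {
                "ServiceGroupId": "ms-vzd8qdk5",
                "ServiceGroupName": "llm_test",
                "ServiceCount": 1,
                "RunningServiceCount": 0,
                "Services": [{"ServiceId": "ms-vzd8qdk5-1", "Status": "Stopped"}],
                "Status": "Stopped"
            }
        ],
        "TotalCount": 1,
        "RequestId": "e5daa146-abed-4d31-ab15-ac9e251fc13c"
    }
}
"""


def summarize_groups(response_text):
    """Return one small dict per service group found in a raw JSON response."""
    response = json.loads(response_text)["Response"]
    summary = []
    for group in response.get("ServiceGroups") or []:   # the field may be null
        summary.append({
            "id": group["ServiceGroupId"],
            "name": group["ServiceGroupName"],
            "status": group["Status"],
            "running": group["RunningServiceCount"],
            "total": group["ServiceCount"],
            "service_ids": [s["ServiceId"] for s in group.get("Services") or []],
        })
    return summary


summary = summarize_groups(SAMPLE)
```

Here `summary` holds a single entry for the `llm_test` group, with its status, service counts, and the IDs of its services.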

5. Developer Resources

SDK

TencentCloud API 3.0 integrates SDKs that support various programming languages to make it easier for you to call APIs.

Command Line Interface

6. Error Code

Only the error codes related to the business logic of this API are listed below. For other error codes, see Common Error Codes.

| Error Code | Description |
|---|---|
| AuthFailure | CAM signature/authentication error. |
| AuthFailure.CamException | Cloud Access Management (CAM) system exception. |
| AuthFailure.UnauthorizedOperation | Unauthorized operation. |
| FailedOperation | Operation failed. |
| FailedOperation.ApiGatewayQueryFailed | Failed to access the API Gateway. Please try again. |
| FailedOperation.BillingQueryFailed | Failed to access the billing module. |
| FailedOperation.ClusterQueryFailed | Cluster access failed. |
| FailedOperation.DuplicateName | Duplicate name. |
| FailedOperation.ExecDatabaseFail | Database execution error. |
| FailedOperation.ExecTagFail | Tag operation failed. |
| FailedOperation.NoPermission | No permissions. |
| FailedOperation.NotAllow | The operation is not allowed. |
| FailedOperation.QueryDatabaseFail | Database query error. |
| FailedOperation.QueryTagFail | Failed to query the tag service. |
| FailedOperation.StsQueryFailed | Failed to access the key service. Please try again. |
| FailedOperation.UnknownInstanceType | Unknown instance specifications. |
| FailedOperation.UnmarshalData | Data parsing failed. |
| InternalError | Internal error. |
| InvalidParameter | Parameter error. |
| InvalidParameterValue | Parameter value error. |
| InvalidParameterValue.BadName | Invalid name. |
| MissingParameter | A required parameter is missing. |
| OperationDenied | Operation denied. |
| OperationDenied.BalanceInsufficient | Insufficient balance. Failed to create or update. |
| ResourceNotFound | The resource does not exist. |
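
Two of the error codes above explicitly suggest trying again, which hints at a simple retry policy. The sketch below shows one way to turn an error response into an exception that records whether a retry is reasonable. `ApiError` and `raise_for_error` are hypothetical names, and the `Response.Error.Code` / `Response.Error.Message` layout of an error response is an assumption, since this page shows only a successful response.

```python
# Codes whose descriptions on this page say "Please try again".
RETRYABLE_CODES = {
    "FailedOperation.ApiGatewayQueryFailed",
    "FailedOperation.StsQueryFailed",
}


class ApiError(Exception):
    """Hypothetical exception carrying the error code and the RequestId."""

    def __init__(self, code, message, request_id):
        super().__init__("%s: %s (RequestId: %s)" % (code, message, request_id))
        self.code = code
        self.request_id = request_id
        self.retryable = code in RETRYABLE_CODES


def raise_for_error(payload):
    """Return the Response object, or raise ApiError if it carries an Error."""
    response = payload["Response"]
    error = response.get("Error")   # assumed location of error details
    if error:
        raise ApiError(error.get("Code"), error.get("Message"), response.get("RequestId"))
    return response
```

A caller can catch `ApiError`, retry with a short delay when `retryable` is true, and otherwise log the `request_id` for troubleshooting.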