Domain name for API request: tione.intl.tencentcloudapi.com.
This API is used to list online inference service groups.
A maximum of 20 requests can be initiated per second for this API.
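A caller that issues many requests may want to stay under the 20 requests per second limit on the client side. The following is a minimal sketch of one way to do that by spacing calls evenly; the class name `MinIntervalThrottle` and its parameters are hypothetical and not part of the API or any SDK, and how the server actually measures the quota is not described here.

```python
import time


class MinIntervalThrottle:
    """Spaces calls at least 1/max_per_second seconds apart (client-side only)."""

    def __init__(self, max_per_second=20, clock=time.monotonic, sleep=time.sleep):
        self.interval = 1.0 / max_per_second
        self._clock = clock          # injectable so the class can be tested
        self._sleep = sleep
        self._next_allowed = None    # earliest time the next call may start

    def wait(self):
        """Block until the next call is allowed; return the seconds slept."""
        now = self._clock()
        delay = 0.0
        if self._next_allowed is not None and now < self._next_allowed:
            delay = self._next_allowed - now
            self._sleep(delay)
            now = self._next_allowed
        self._next_allowed = now + self.interval
        return delay
```

Call `throttle.wait()` immediately before each request. Even spacing is stricter than the stated limit requires, which keeps the sketch simple at the cost of not allowing short bursts.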
The following request parameter list only provides API request parameters and some common parameters. For the complete common parameter list, see Common Request Parameters.
| Parameter Name | Required | Type | Description |
|---|---|---|---|
| Action | Yes | String | Common Params. The value used for this API: DescribeModelServiceGroups. |
| Version | Yes | String | Common Params. The value used for this API: 2021-11-11. |
| Region | Yes | String | Common Params. For more information, please see the list of regions supported by the product. |
| Offset | No | Integer | Offset. Default value: 0. |
| Limit | No | Integer | Number of returned results. The default value is 20, and the maximum value is 100. |
| Order | No | String | The sorting order of the output list. Valid values: ASC (ascending order) and DESC (descending order). |
| OrderField | No | String | Field to sort by. Valid values: CreateTime and UpdateTime. |
| Filters.N | No | Array of Filter | Filter parameters. Supported filterable field names: ClusterId, ServiceId, ServiceGroupName, ServiceGroupId, Status, CreatedBy, ModelVersionId. |
| TagFilters.N | No | Array of TagFilter | Tag filtering parameters. |
| ServiceCategory | No | String | Service classification. |
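The constraints in the table above can be checked before a request is sent. The sketch below builds a JSON request body and validates only what the table states (maximum `Limit`, valid `Order` and `OrderField` values, filterable field names). The helper name `build_request_body` is hypothetical, and the `Name`/`Values` shape of each filter entry is an assumption about the Filter data type, which is not defined in this section.

```python
FILTERABLE_FIELDS = {
    "ClusterId", "ServiceId", "ServiceGroupName", "ServiceGroupId",
    "Status", "CreatedBy", "ModelVersionId",
}


def build_request_body(offset=0, limit=20, order=None, order_field=None, filters=None):
    """Return a request body dict; `filters` maps a field name to a list of values."""
    if limit > 100:
        raise ValueError("Limit must not exceed 100")
    body = {"Offset": offset, "Limit": limit}
    if order is not None:
        if order not in ("ASC", "DESC"):
            raise ValueError("Order must be ASC or DESC")
        body["Order"] = order
    if order_field is not None:
        if order_field not in ("CreateTime", "UpdateTime"):
            raise ValueError("OrderField must be CreateTime or UpdateTime")
        body["OrderField"] = order_field
    if filters:
        unknown = set(filters) - FILTERABLE_FIELDS
        if unknown:
            raise ValueError(f"Unsupported filter fields: {sorted(unknown)}")
        # Assumed Filter shape: {"Name": <field>, "Values": [<value>, ...]}
        body["Filters"] = [
            {"Name": name, "Values": list(values)} for name, values in filters.items()
        ]
    return body
```

For example, `build_request_body(limit=10, filters={"Status": ["Stopped"]})` yields a body that requests the first ten stopped service groups, under the assumptions stated above.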
| Parameter Name | Type | Description |
|---|---|---|
| TotalCount | Integer | Number of inference service groups. Note: This field may return null, indicating that no valid values can be obtained. |
| ServiceGroups | Array of ServiceGroup | Service group information. Note: This field may return null, indicating that no valid values can be obtained. |
| RequestId | String | The unique request ID, generated by the server, will be returned for every request (if the request fails to reach the server for other reasons, the request will not obtain a RequestId). RequestId is required for locating a problem. |
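Because `TotalCount` and `ServiceGroups` may both be null, client code that pages through results should treat null as "nothing returned". The following sketch walks all pages with `Offset` and `Limit`; `fetch_page` is a hypothetical callable that sends one request body and returns the decoded JSON response, so signing and transport are deliberately left out.

```python
def iter_service_groups(fetch_page, limit=20):
    """Yield every service group, advancing Offset by the number of items received."""
    offset = 0
    while True:
        response = fetch_page({"Offset": offset, "Limit": limit})["Response"]
        groups = response.get("ServiceGroups") or []  # null is treated as an empty page
        total = response.get("TotalCount") or 0       # null is treated as zero
        yield from groups
        offset += len(groups)
        # Stop on an empty page or once all reported groups have been seen.
        if not groups or offset >= total:
            break
```

Stopping on an empty page as well as on `TotalCount` guards against looping forever if the count changes between requests.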
POST / HTTP/1.1
Host: tione.intl.tencentcloudapi.com
Content-Type: application/json
X-TC-Action: DescribeModelServiceGroups
<Common request parameters>
{
"Limit": 10,
"Offset": 0,
"Filters": []
}

{
"Response": {
"ServiceGroups": [
{
"ServiceGroupId": "ms-vzd8qdk5",
"ServiceGroupName": "llm_test",
"CreatedBy": "100032054859",
"CreateTime": "2023-08-05T05:46:58Z",
"UpdateTime": "2023-08-08T14:27:02Z",
"Uin": "100005348929",
"ServiceCount": 1,
"RunningServiceCount": 0,
"Services": [
{
"ServiceGroupId": "ms-vzd8qdk5",
"ServiceId": "ms-vzd8qdk5-1",
"ServiceGroupName": "llm_test",
"ServiceDescription": "",
"ServiceInfo": {
"Replicas": 0,
"ImageInfo": {
"ImageType": "PRE_SET",
"ImageUrl": "tione.tencentcloudcr.com/qcloud-ti-platform/chatglm-demo:latest",
"RegistryRegion": "",
"RegistryId": ""
},
"Env": [],
"Resources": {
"Cpu": 12000,
"Memory": 45056,
"Gpu": 100,
"RealGpu": 100,
"GpuType": "A10",
"RealGpuDetailSet": [
{
"Name": "A10",
"Value": 100
}
]
},
"InstanceType": "TI.GNV4.3XLARGE44.POST",
"ModelInfo": {
"ModelId": "m-838176671237008128",
"ModelName": "chatglm-6b-tiacc-ft",
"ModelVersionId": "mv-v1-838176671237008129",
"ModelVersion": "v1",
"ModelSource": "COS",
"ModelType": "NORMAL",
"CosPathInfo": {
"Bucket": "danerli-guangzhou-1256580188",
"Region": "ap-guangzhou",
"Paths": [
"output/ai_market_algo_test_thu_tiacc_ft/train-793371624977165056/output/adgen-chatglm-6b-ft-1e-8/checkpoint-500/model_service.py"
]
},
"AlgorithmFramework": "PYTORCH",
"ModelFormat": "PYTORCH"
},
"InferCodeInfo": {
"CosPathInfo": {
"Bucket": "danerli-guangzhou-1256580188",
"Region": "ap-guangzhou",
"Paths": [
"output/ai_market_algo_test_thu_tiacc_ft/train-793371624977165056/output/adgen-chatglm-6b-ft-1e-8/checkpoint-500/model_service.py"
]
}
},
"VolumeMount": {
"VolumeSourceType": "CFS",
"CFSConfig": {
"Id": "cfs-mpjk7vit",
"Path": "/",
"MountType": "",
"Protocol": ""
}
},
"LogEnable": false,
"LogConfig": null,
"AuthorizationEnable": false,
"ScaleMode": "",
"HorizontalPodAutoscaler": null,
"CronScaleJobs": [],
"ScaleStrategy": "",
"ScheduledAction": null,
"Status": {
"Replicas": 0,
"UpdatedReplicas": 0,
"ReadyReplicas": 0,
"AvailableReplicas": 0,
"UnavailableReplicas": 0,
"Status": "Stopped",
"Reason": "",
"Conditions": [
{
"Message": "Deployment has minimum availability.",
"Reason": "MinimumReplicasAvailable",
"Status": "True",
"Type": "Available",
"LastTransitionTime": "2023-08-08T22:27:01+08:00",
"LastUpdateTime": "2023-08-08T22:27:01+08:00"
},
{
"Message": "ReplicaSet \"ms-vzd8qdk5-1-7456b4cc97\" has successfully progressed.",
"Reason": "NewReplicaSetAvailable",
"Status": "True",
"Type": "Progressing",
"LastTransitionTime": "2023-08-08T22:27:02+08:00",
"LastUpdateTime": "2023-08-08T22:27:02+08:00"
}
]
},
"Weight": 0,
"PodList": [],
"Pods": null,
"PodInfos": [],
"ResourceTotal": null,
"OldReplicas": 1,
"HybridBillingPrepaidReplicas": 0,
"OldHybridBillingPrepaidReplicas": 0,
"ServiceLimit": null,
"ModelHotUpdateEnable": false,
"ModelTurboEnable": false
},
"ClusterId": "",
"Region": "ap-guangzhou",
"Namespace": "infer-100005348929",
"ChargeType": "POSTPAID_BY_HOUR",
"ResourceGroupId": "",
"ResourceGroupName": "",
"Tags": [],
"IngressName": "user-ingress-1",
"CreatedBy": "100032054859",
"CreateTime": "2023-08-05T05:46:58Z",
"UpdateTime": "2023-08-08T14:27:02Z",
"Uin": "100005348929",
"SubUin": "",
"AppId": 0,
"BusinessStatus": "CREATE_SUCCEED",
"CreateFailedReason": "CREATE_SUCCEED",
"Status": "Stopped",
"BillingInfo": "",
"Weight": 100,
"CreateSource": "DEFAULT",
"Version": "1",
"LatestVersion": "",
"ServiceLimit": null,
"ScheduledAction": null
}
],
"Status": "Stopped",
"Tags": [],
"BusinessStatus": "",
"WeightUpdateStatus": "",
"CreateSource": "DEFAULT",
"BillingInfo": "",
"LatestVersion": ""
}
],
"TotalCount": 1,
"RequestId": "e5daa146-abed-4d31-ab15-ac9e251fc13c"
}
}
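A response like the one above is deeply nested, so a caller often only needs a few fields per group. The sketch below reduces a decoded `Response` object to one summary row per service group. The function name `summarize_groups` and the output keys are hypothetical; the input field names are taken from the sample response, and the `sample` dictionary is a trimmed copy of it.

```python
def summarize_groups(response):
    """Return one small dict per service group in a decoded Response object."""
    rows = []
    for group in response.get("ServiceGroups") or []:
        services = group.get("Services") or []
        rows.append({
            "id": group["ServiceGroupId"],
            "name": group["ServiceGroupName"],
            "status": group.get("Status"),
            "running": group.get("RunningServiceCount", 0),
            "total": group.get("ServiceCount", 0),
            "service_ids": [service["ServiceId"] for service in services],
        })
    return rows


# Trimmed copy of the sample response, keeping only the fields used here.
sample = {
    "ServiceGroups": [{
        "ServiceGroupId": "ms-vzd8qdk5",
        "ServiceGroupName": "llm_test",
        "ServiceCount": 1,
        "RunningServiceCount": 0,
        "Services": [{"ServiceId": "ms-vzd8qdk5-1"}],
        "Status": "Stopped",
    }],
    "TotalCount": 1,
}

for row in summarize_groups(sample):
    print(row["id"], row["name"], row["status"], f'{row["running"]}/{row["total"]}')
```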
TencentCloud API 3.0 integrates SDKs that support various programming languages to make it easier for you to call APIs.
The following only lists the error codes related to the API business logic. For other error codes, see Common Error Codes.
| Error Code | Description |
|---|---|
| AuthFailure | CAM signature/authentication error. |
| AuthFailure.CamException | Cloud Access Management (CAM) system exception. |
| AuthFailure.UnauthorizedOperation | Unauthorized operation. |
| FailedOperation | Operation failed. |
| FailedOperation.ApiGatewayQueryFailed | Failed to access the API Gateway. Please try again. |
| FailedOperation.BillingQueryFailed | Failed to access the billing module. |
| FailedOperation.ClusterQueryFailed | Cluster access failed. |
| FailedOperation.DuplicateName | Duplicated name. |
| FailedOperation.ExecDatabaseFail | Database execution error. |
| FailedOperation.ExecTagFail | Tag operation failed. |
| FailedOperation.NoPermission | No permissions. |
| FailedOperation.NotAllow | The operation is not allowed. |
| FailedOperation.QueryDatabaseFail | Database query error. |
| FailedOperation.QueryTagFail | Failed to query the tag service. |
| FailedOperation.StsQueryFailed | Failed to access the key service. Please try again. |
| FailedOperation.UnknownInstanceType | Unknown instance specifications. |
| FailedOperation.UnmarshalData | Data parsing failed. |
| InternalError | Internal error. |
| InvalidParameter | Parameter error. |
| InvalidParameterValue | Parameter value error. |
| InvalidParameterValue.BadName | Invalid name. |
| MissingParameter | A required parameter is missing. |
| OperationDenied | Operation denied. |
| OperationDenied.BalanceInsufficient | Insufficient balance. Failed to create or update. |
| ResourceNotFound | The resource does not exist. |
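Client code can use these codes to decide whether a failed call is worth retrying. The sketch below marks as retryable only the two codes whose descriptions above say "Please try again"; treating every other code as non-retryable is a conservative choice of this example, not a rule stated by the API. It assumes that a failed call returns the code and message under `Response.Error` (`Code`, `Message`), which is not shown in this section, and the `ApiError` class and `raise_for_error` helper are hypothetical names.

```python
RETRYABLE_CODES = {
    "FailedOperation.ApiGatewayQueryFailed",
    "FailedOperation.StsQueryFailed",
}


class ApiError(Exception):
    """Carries the error code, message, and RequestId of a failed call."""

    def __init__(self, code, message, request_id):
        super().__init__(f"{code}: {message} (RequestId: {request_id})")
        self.code = code
        self.message = message
        self.request_id = request_id
        self.retryable = code in RETRYABLE_CODES


def raise_for_error(payload):
    """Return the Response object, or raise ApiError if it contains an Error."""
    response = payload.get("Response", {})
    error = response.get("Error")  # assumed error envelope
    if error is None:
        return response
    raise ApiError(error.get("Code", ""), error.get("Message", ""), response.get("RequestId"))
```

Keeping the `RequestId` on the exception makes it available when reporting a problem, since it is needed to locate the failed request.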