published on Thursday, May 14, 2026 by Volcengine
published on Thursday, May 14, 2026 by Volcengine
When using large language models for inference services, the inference endpoint is the key entry point for model invocation. Developers can initiate API/SDK inference requests to the large model simply by specifying the ID, enabling quick access and flexible invocation. The system provides a unified API invocation method, call monitoring, rate limiting strategies, and security mechanisms to ensure stability and security during inference
Example Usage
Example coming soon!
Example coming soon!
Example coming soon!
Example coming soon!
Example coming soon!
resources:
endpointDemo:
type: volcenginecc:ark:Endpoint
name: EndpointDemo
properties:
name: EndpointDemo
description: endpoint description
modelReference:
foundation_model:
name: doubao-1-5-thinking-***
modelVersion: '250428'
tags:
- key: env
value: test
projectName: default
rateLimit:
tpm: 5
rpm: 10
Example coming soon!
Create Endpoint Resource
Resources are created with functions called constructors. To learn more about declaring and configuring resources, see Resources.
Constructor syntax
new Endpoint(name: string, args: EndpointArgs, opts?: CustomResourceOptions);@overload
def Endpoint(resource_name: str,
args: EndpointArgs,
opts: Optional[ResourceOptions] = None)
@overload
def Endpoint(resource_name: str,
opts: Optional[ResourceOptions] = None,
model_reference: Optional[EndpointModelReferenceArgs] = None,
name: Optional[str] = None,
description: Optional[str] = None,
model_unit_id: Optional[str] = None,
moderation: Optional[EndpointModerationArgs] = None,
project_name: Optional[str] = None,
rate_limit: Optional[EndpointRateLimitArgs] = None,
tags: Optional[Sequence[EndpointTagArgs]] = None)func NewEndpoint(ctx *Context, name string, args EndpointArgs, opts ...ResourceOption) (*Endpoint, error)public Endpoint(string name, EndpointArgs args, CustomResourceOptions? opts = null)
public Endpoint(String name, EndpointArgs args)
public Endpoint(String name, EndpointArgs args, CustomResourceOptions options)
type: volcenginecc:ark:Endpoint
properties: # The arguments to resource properties.
options: # Bag of options to control resource's behavior.
resource "volcenginecc_ark_endpoint" "name" {
# resource properties
}Parameters
- name string
- The unique name of the resource.
- args EndpointArgs
- The arguments to resource properties.
- opts CustomResourceOptions
- Bag of options to control resource's behavior.
- resource_name str
- The unique name of the resource.
- args EndpointArgs
- The arguments to resource properties.
- opts ResourceOptions
- Bag of options to control resource's behavior.
- ctx Context
- Context object for the current deployment.
- name string
- The unique name of the resource.
- args EndpointArgs
- The arguments to resource properties.
- opts ResourceOption
- Bag of options to control resource's behavior.
- name string
- The unique name of the resource.
- args EndpointArgs
- The arguments to resource properties.
- opts CustomResourceOptions
- Bag of options to control resource's behavior.
- name String
- The unique name of the resource.
- args EndpointArgs
- The arguments to resource properties.
- options CustomResourceOptions
- Bag of options to control resource's behavior.
Constructor example
The following reference example uses placeholder values for all input properties.
var endpointResource = new Volcenginecc.Ark.Endpoint("endpointResource", new()
{
ModelReference = new Volcenginecc.Ark.Inputs.EndpointModelReferenceArgs
{
CustomModelId = "string",
FoundationModel = new Volcenginecc.Ark.Inputs.EndpointModelReferenceFoundationModelArgs
{
ModelVersion = "string",
Name = "string",
},
},
Name = "string",
Description = "string",
ModelUnitId = "string",
Moderation = new Volcenginecc.Ark.Inputs.EndpointModerationArgs
{
Strategy = "string",
},
ProjectName = "string",
RateLimit = new Volcenginecc.Ark.Inputs.EndpointRateLimitArgs
{
Rpm = 0,
Tpm = 0,
},
Tags = new[]
{
new Volcenginecc.Ark.Inputs.EndpointTagArgs
{
Key = "string",
Value = "string",
},
},
});
example, err := ark.NewEndpoint(ctx, "endpointResource", &ark.EndpointArgs{
ModelReference: &ark.EndpointModelReferenceArgs{
CustomModelId: pulumi.String("string"),
FoundationModel: &ark.EndpointModelReferenceFoundationModelArgs{
ModelVersion: pulumi.String("string"),
Name: pulumi.String("string"),
},
},
Name: pulumi.String("string"),
Description: pulumi.String("string"),
ModelUnitId: pulumi.String("string"),
Moderation: &ark.EndpointModerationArgs{
Strategy: pulumi.String("string"),
},
ProjectName: pulumi.String("string"),
RateLimit: &ark.EndpointRateLimitArgs{
Rpm: pulumi.Int(0),
Tpm: pulumi.Int(0),
},
Tags: ark.EndpointTagArray{
&ark.EndpointTagArgs{
Key: pulumi.String("string"),
Value: pulumi.String("string"),
},
},
})
resource "volcenginecc_ark_endpoint" "endpointResource" {
model_reference = {
custom_model_id = "string"
foundation_model = {
model_version = "string"
name = "string"
}
}
name = "string"
description = "string"
model_unit_id = "string"
moderation = {
strategy = "string"
}
project_name = "string"
rate_limit = {
rpm = 0
tpm = 0
}
tags {
key = "string"
value = "string"
}
}
var endpointResource = new com.volcengine.volcenginecc.ark.Endpoint("endpointResource", com.volcengine.volcenginecc.ark.EndpointArgs.builder()
.modelReference(EndpointModelReferenceArgs.builder()
.customModelId("string")
.foundationModel(EndpointModelReferenceFoundationModelArgs.builder()
.modelVersion("string")
.name("string")
.build())
.build())
.name("string")
.description("string")
.modelUnitId("string")
.moderation(EndpointModerationArgs.builder()
.strategy("string")
.build())
.projectName("string")
.rateLimit(EndpointRateLimitArgs.builder()
.rpm(0)
.tpm(0)
.build())
.tags(EndpointTagArgs.builder()
.key("string")
.value("string")
.build())
.build());
endpoint_resource = volcenginecc.ark.Endpoint("endpointResource",
model_reference={
"custom_model_id": "string",
"foundation_model": {
"model_version": "string",
"name": "string",
},
},
name="string",
description="string",
model_unit_id="string",
moderation={
"strategy": "string",
},
project_name="string",
rate_limit={
"rpm": 0,
"tpm": 0,
},
tags=[{
"key": "string",
"value": "string",
}])
const endpointResource = new volcenginecc.ark.Endpoint("endpointResource", {
modelReference: {
customModelId: "string",
foundationModel: {
modelVersion: "string",
name: "string",
},
},
name: "string",
description: "string",
modelUnitId: "string",
moderation: {
strategy: "string",
},
projectName: "string",
rateLimit: {
rpm: 0,
tpm: 0,
},
tags: [{
key: "string",
value: "string",
}],
});
type: volcenginecc:ark:Endpoint
properties:
description: string
modelReference:
customModelId: string
foundationModel:
modelVersion: string
name: string
modelUnitId: string
moderation:
strategy: string
name: string
projectName: string
rateLimit:
rpm: 0
tpm: 0
tags:
- key: string
value: string
Endpoint Resource Properties
To learn more about resource properties and how to use them, see Inputs and Outputs in the Architecture and Concepts docs.
Inputs
In Python, inputs that are objects can be passed either as argument classes or as dictionary literals.
The Endpoint resource accepts the following input properties:
- Model
Reference Volcengine.Endpoint Model Reference - Endpoint access model
- Name string
- Endpoint name
- Description string
- Endpoint description
- Model
Unit stringId - Model unit. If empty, indicates postpaid; otherwise, prepaid.
- Moderation
Volcengine.
Endpoint Moderation - Content guardrail scheme
- Project
Name string - Project name the resource belongs to; default value is "default"
- Rate
Limit Volcengine.Endpoint Rate Limit - Endpoint rate limiting
-
List<Volcengine.
Endpoint Tag>
- Model
Reference EndpointModel Reference Args - Endpoint access model
- Name string
- Endpoint name
- Description string
- Endpoint description
- Model
Unit stringId - Model unit. If empty, indicates postpaid; otherwise, prepaid.
- Moderation
Endpoint
Moderation Args - Content guardrail scheme
- Project
Name string - Project name the resource belongs to; default value is "default"
- Rate
Limit EndpointRate Limit Args - Endpoint rate limiting
-
[]Endpoint
Tag Args
- model_
reference object - Endpoint access model
- name string
- Endpoint name
- description string
- Endpoint description
- model_
unit_ stringid - Model unit. If empty, indicates postpaid; otherwise, prepaid.
- moderation object
- Content guardrail scheme
- project_
name string - Project name the resource belongs to; default value is "default"
- rate_
limit object - Endpoint rate limiting
- list(object)
- model
Reference EndpointModel Reference - Endpoint access model
- name String
- Endpoint name
- description String
- Endpoint description
- model
Unit StringId - Model unit. If empty, indicates postpaid; otherwise, prepaid.
- moderation
Endpoint
Moderation - Content guardrail scheme
- project
Name String - Project name the resource belongs to; default value is "default"
- rate
Limit EndpointRate Limit - Endpoint rate limiting
-
List<Endpoint
Tag>
- model
Reference EndpointModel Reference - Endpoint access model
- name string
- Endpoint name
- description string
- Endpoint description
- model
Unit stringId - Model unit. If empty, indicates postpaid; otherwise, prepaid.
- moderation
Endpoint
Moderation - Content guardrail scheme
- project
Name string - Project name the resource belongs to; default value is "default"
- rate
Limit EndpointRate Limit - Endpoint rate limiting
-
Endpoint
Tag[]
- model_
reference EndpointModel Reference Args - Endpoint access model
- name str
- Endpoint name
- description str
- Endpoint description
- model_
unit_ strid - Model unit. If empty, indicates postpaid; otherwise, prepaid.
- moderation
Endpoint
Moderation Args - Content guardrail scheme
- project_
name str - Project name the resource belongs to; default value is "default"
- rate_
limit EndpointRate Limit Args - Endpoint rate limiting
-
Sequence[Endpoint
Tag Args]
- model
Reference Property Map - Endpoint access model
- name String
- Endpoint name
- description String
- Endpoint description
- model
Unit StringId - Model unit. If empty, indicates postpaid; otherwise, prepaid.
- moderation Property Map
- Content guardrail scheme
- project
Name String - Project name the resource belongs to; default value is "default"
- rate
Limit Property Map - Endpoint rate limiting
- List<Property Map>
Outputs
All input properties are implicitly available as output properties. Additionally, the Endpoint resource produces the following output properties:
- Batch
Only bool - Whether only BatchChat is supported. If true, only batch inference interfaces can be accessed; otherwise, only online inference interfaces can be accessed
- Certificate
Volcengine.
Endpoint Certificate - Certificate information
- Create
Time string - Endpoint creation time, RFC3339 format
- Endpoint
Id string - Inference access point ID.
- Endpoint
Model stringType - Access point model type. Options: FoundationModel for base model; CustomModel for custom model.
- Id string
- The provider-assigned unique ID for this managed resource.
- Rolling
Id string - Endpoint switch ID
- Scale
Tier stringId - If a TPM protection package exists, this is the TPM protection package ID.
- Status string
- Endpoint status
- Status
Reason string - Additional status information
- Support
Rolling bool - Whether endpoint supports switching model versions
- Support
Scale boolTier - Whether TPM protection package is supported
- Update
Time string - Endpoint update time, RFC3339 format
- Batch
Only bool - Whether only BatchChat is supported. If true, only batch inference interfaces can be accessed; otherwise, only online inference interfaces can be accessed
- Certificate
Endpoint
Certificate - Certificate information
- Create
Time string - Endpoint creation time, RFC3339 format
- Endpoint
Id string - Inference access point ID.
- Endpoint
Model stringType - Access point model type. Options: FoundationModel for base model; CustomModel for custom model.
- Id string
- The provider-assigned unique ID for this managed resource.
- Rolling
Id string - Endpoint switch ID
- Scale
Tier stringId - If a TPM protection package exists, this is the TPM protection package ID.
- Status string
- Endpoint status
- Status
Reason string - Additional status information
- Support
Rolling bool - Whether endpoint supports switching model versions
- Support
Scale boolTier - Whether TPM protection package is supported
- Update
Time string - Endpoint update time, RFC3339 format
- batch_
only bool - Whether only BatchChat is supported. If true, only batch inference interfaces can be accessed; otherwise, only online inference interfaces can be accessed
- certificate object
- Certificate information
- create_
time string - Endpoint creation time, RFC3339 format
- endpoint_
id string - Inference access point ID.
- endpoint_
model_ stringtype - Access point model type. Options: FoundationModel for base model; CustomModel for custom model.
- id string
- The provider-assigned unique ID for this managed resource.
- rolling_
id string - Endpoint switch ID
- scale_
tier_ stringid - If a TPM protection package exists, this is the TPM protection package ID.
- status string
- Endpoint status
- status_
reason string - Additional status information
- support_
rolling bool - Whether endpoint supports switching model versions
- support_
scale_ booltier - Whether TPM protection package is supported
- update_
time string - Endpoint update time, RFC3339 format
- batch
Only Boolean - Whether only BatchChat is supported. If true, only batch inference interfaces can be accessed; otherwise, only online inference interfaces can be accessed
- certificate
Endpoint
Certificate - Certificate information
- create
Time String - Endpoint creation time, RFC3339 format
- endpoint
Id String - Inference access point ID.
- endpoint
Model StringType - Access point model type. Options: FoundationModel for base model; CustomModel for custom model.
- id String
- The provider-assigned unique ID for this managed resource.
- rolling
Id String - Endpoint switch ID
- scale
Tier StringId - If a TPM protection package exists, this is the TPM protection package ID.
- status String
- Endpoint status
- status
Reason String - Additional status information
- support
Rolling Boolean - Whether endpoint supports switching model versions
- support
Scale BooleanTier - Whether TPM protection package is supported
- update
Time String - Endpoint update time, RFC3339 format
- batch
Only boolean - Whether only BatchChat is supported. If true, only batch inference interfaces can be accessed; otherwise, only online inference interfaces can be accessed
- certificate
Endpoint
Certificate - Certificate information
- create
Time string - Endpoint creation time, RFC3339 format
- endpoint
Id string - Inference access point ID.
- endpoint
Model stringType - Access point model type. Options: FoundationModel for base model; CustomModel for custom model.
- id string
- The provider-assigned unique ID for this managed resource.
- rolling
Id string - Endpoint switch ID
- scale
Tier stringId - If a TPM protection package exists, this is the TPM protection package ID.
- status string
- Endpoint status
- status
Reason string - Additional status information
- support
Rolling boolean - Whether endpoint supports switching model versions
- support
Scale booleanTier - Whether TPM protection package is supported
- update
Time string - Endpoint update time, RFC3339 format
- batch_
only bool - Whether only BatchChat is supported. If true, only batch inference interfaces can be accessed; otherwise, only online inference interfaces can be accessed
- certificate
Endpoint
Certificate - Certificate information
- create_
time str - Endpoint creation time, RFC3339 format
- endpoint_
id str - Inference access point ID.
- endpoint_
model_ strtype - Access point model type. Options: FoundationModel for base model; CustomModel for custom model.
- id str
- The provider-assigned unique ID for this managed resource.
- rolling_
id str - Endpoint switch ID
- scale_
tier_ strid - If a TPM protection package exists, this is the TPM protection package ID.
- status str
- Endpoint status
- status_
reason str - Additional status information
- support_
rolling bool - Whether endpoint supports switching model versions
- support_
scale_ booltier - Whether TPM protection package is supported
- update_
time str - Endpoint update time, RFC3339 format
- batch
Only Boolean - Whether only BatchChat is supported. If true, only batch inference interfaces can be accessed; otherwise, only online inference interfaces can be accessed
- certificate Property Map
- Certificate information
- create
Time String - Endpoint creation time, RFC3339 format
- endpoint
Id String - Inference access point ID.
- endpoint
Model StringType - Access point model type. Options: FoundationModel for base model; CustomModel for custom model.
- id String
- The provider-assigned unique ID for this managed resource.
- rolling
Id String - Endpoint switch ID
- scale
Tier StringId - If a TPM protection package exists, this is the TPM protection package ID.
- status String
- Endpoint status
- status
Reason String - Additional status information
- support
Rolling Boolean - Whether endpoint supports switching model versions
- support
Scale BooleanTier - Whether TPM protection package is supported
- update
Time String - Endpoint update time, RFC3339 format
Look up Existing Endpoint Resource
Get an existing Endpoint resource’s state with the given name, ID, and optional extra properties used to qualify the lookup.
public static get(name: string, id: Input<ID>, state?: EndpointState, opts?: CustomResourceOptions): Endpoint@staticmethod
def get(resource_name: str,
id: str,
opts: Optional[ResourceOptions] = None,
batch_only: Optional[bool] = None,
certificate: Optional[EndpointCertificateArgs] = None,
create_time: Optional[str] = None,
description: Optional[str] = None,
endpoint_id: Optional[str] = None,
endpoint_model_type: Optional[str] = None,
model_reference: Optional[EndpointModelReferenceArgs] = None,
model_unit_id: Optional[str] = None,
moderation: Optional[EndpointModerationArgs] = None,
name: Optional[str] = None,
project_name: Optional[str] = None,
rate_limit: Optional[EndpointRateLimitArgs] = None,
rolling_id: Optional[str] = None,
scale_tier_id: Optional[str] = None,
status: Optional[str] = None,
status_reason: Optional[str] = None,
support_rolling: Optional[bool] = None,
support_scale_tier: Optional[bool] = None,
tags: Optional[Sequence[EndpointTagArgs]] = None,
update_time: Optional[str] = None) -> Endpointfunc GetEndpoint(ctx *Context, name string, id IDInput, state *EndpointState, opts ...ResourceOption) (*Endpoint, error)public static Endpoint Get(string name, Input<string> id, EndpointState? state, CustomResourceOptions? opts = null)public static Endpoint get(String name, Output<String> id, EndpointState state, CustomResourceOptions options)resources: _: type: volcenginecc:ark:Endpoint get: id: ${id}import {
to = volcenginecc_ark_endpoint.example
id = "${id}"
}
- name
- The unique name of the resulting resource.
- id
- The unique provider ID of the resource to lookup.
- state
- Any extra arguments used during the lookup.
- opts
- A bag of options that control this resource's behavior.
- resource_name
- The unique name of the resulting resource.
- id
- The unique provider ID of the resource to lookup.
- name
- The unique name of the resulting resource.
- id
- The unique provider ID of the resource to lookup.
- state
- Any extra arguments used during the lookup.
- opts
- A bag of options that control this resource's behavior.
- name
- The unique name of the resulting resource.
- id
- The unique provider ID of the resource to lookup.
- state
- Any extra arguments used during the lookup.
- opts
- A bag of options that control this resource's behavior.
- name
- The unique name of the resulting resource.
- id
- The unique provider ID of the resource to lookup.
- state
- Any extra arguments used during the lookup.
- opts
- A bag of options that control this resource's behavior.
- Batch
Only bool - Whether only BatchChat is supported. If true, only batch inference interfaces can be accessed; otherwise, only online inference interfaces can be accessed
- Certificate
Volcengine.
Endpoint Certificate - Certificate information
- Create
Time string - Endpoint creation time, RFC3339 format
- Description string
- Endpoint description
- Endpoint
Id string - Inference access point ID.
- Endpoint
Model stringType - Access point model type. Options: FoundationModel for base model; CustomModel for custom model.
- Model
Reference Volcengine.Endpoint Model Reference - Endpoint access model
- Model
Unit stringId - Model unit. If empty, indicates postpaid; otherwise, prepaid.
- Moderation
Volcengine.
Endpoint Moderation - Content guardrail scheme
- Name string
- Endpoint name
- Project
Name string - Project name the resource belongs to; default value is "default"
- Rate
Limit Volcengine.Endpoint Rate Limit - Endpoint rate limiting
- Rolling
Id string - Endpoint switch ID
- Scale
Tier stringId - If a TPM protection package exists, this is the TPM protection package ID.
- Status string
- Endpoint status
- Status
Reason string - Additional status information
- Support
Rolling bool - Whether endpoint supports switching model versions
- Support
Scale boolTier - Whether TPM protection package is supported
-
List<Volcengine.
Endpoint Tag> - Update
Time string - Endpoint update time, RFC3339 format
- Batch
Only bool - Whether only BatchChat is supported. If true, only batch inference interfaces can be accessed; otherwise, only online inference interfaces can be accessed
- Certificate
Endpoint
Certificate Args - Certificate information
- Create
Time string - Endpoint creation time, RFC3339 format
- Description string
- Endpoint description
- Endpoint
Id string - Inference access point ID.
- Endpoint
Model stringType - Access point model type. Options: FoundationModel for base model; CustomModel for custom model.
- Model
Reference EndpointModel Reference Args - Endpoint access model
- Model
Unit stringId - Model unit. If empty, indicates postpaid; otherwise, prepaid.
- Moderation
Endpoint
Moderation Args - Content guardrail scheme
- Name string
- Endpoint name
- Project
Name string - Project name the resource belongs to; default value is "default"
- Rate
Limit EndpointRate Limit Args - Endpoint rate limiting
- Rolling
Id string - Endpoint switch ID
- Scale
Tier stringId - If a TPM protection package exists, this is the TPM protection package ID.
- Status string
- Endpoint status
- Status
Reason string - Additional status information
- Support
Rolling bool - Whether endpoint supports switching model versions
- Support
Scale boolTier - Whether TPM protection package is supported
-
[]Endpoint
Tag Args - Update
Time string - Endpoint update time, RFC3339 format
- batch_
only bool - Whether only BatchChat is supported. If true, only batch inference interfaces can be accessed; otherwise, only online inference interfaces can be accessed
- certificate object
- Certificate information
- create_
time string - Endpoint creation time, RFC3339 format
- description string
- Endpoint description
- endpoint_
id string - Inference access point ID.
- endpoint_
model_ stringtype - Access point model type. Options: FoundationModel for base model; CustomModel for custom model.
- model_
reference object - Endpoint access model
- model_
unit_ stringid - Model unit. If empty, indicates postpaid; otherwise, prepaid.
- moderation object
- Content guardrail scheme
- name string
- Endpoint name
- project_
name string - Project name the resource belongs to; default value is "default"
- rate_
limit object - Endpoint rate limiting
- rolling_
id string - Endpoint switch ID
- scale_
tier_ stringid - If a TPM protection package exists, this is the TPM protection package ID.
- status string
- Endpoint status
- status_
reason string - Additional status information
- support_
rolling bool - Whether endpoint supports switching model versions
- support_
scale_ booltier - Whether TPM protection package is supported
- list(object)
- update_
time string - Endpoint update time, RFC3339 format
- batch
Only Boolean - Whether only BatchChat is supported. If true, only batch inference interfaces can be accessed; otherwise, only online inference interfaces can be accessed
- certificate
Endpoint
Certificate - Certificate information
- create
Time String - Endpoint creation time, RFC3339 format
- description String
- Endpoint description
- endpoint
Id String - Inference access point ID.
- endpoint
Model StringType - Access point model type. Options: FoundationModel for base model; CustomModel for custom model.
- model
Reference EndpointModel Reference - Endpoint access model
- model
Unit StringId - Model unit. If empty, indicates postpaid; otherwise, prepaid.
- moderation
Endpoint
Moderation - Content guardrail scheme
- name String
- Endpoint name
- project
Name String - Project name the resource belongs to; default value is "default"
- rate
Limit EndpointRate Limit - Endpoint rate limiting
- rolling
Id String - Endpoint switch ID
- scale
Tier StringId - If a TPM protection package exists, this is the TPM protection package ID.
- status String
- Endpoint status
- status
Reason String - Additional status information
- support
Rolling Boolean - Whether endpoint supports switching model versions
- support
Scale BooleanTier - Whether TPM protection package is supported
-
List<Endpoint
Tag> - update
Time String - Endpoint update time, RFC3339 format
- batch
Only boolean - Whether only BatchChat is supported. If true, only batch inference interfaces can be accessed; otherwise, only online inference interfaces can be accessed
- certificate
Endpoint
Certificate - Certificate information
- create
Time string - Endpoint creation time, RFC3339 format
- description string
- Endpoint description
- endpoint
Id string - Inference access point ID.
- endpoint
Model stringType - Access point model type. Options: FoundationModel for base model; CustomModel for custom model.
- model
Reference EndpointModel Reference - Endpoint access model
- model
Unit stringId - Model unit. If empty, indicates postpaid; otherwise, prepaid.
- moderation
Endpoint
Moderation - Content guardrail scheme
- name string
- Endpoint name
- project
Name string - Project name the resource belongs to; default value is "default"
- rate
Limit EndpointRate Limit - Endpoint rate limiting
- rolling
Id string - Endpoint switch ID
- scale
Tier stringId - If a TPM protection package exists, this is the TPM protection package ID.
- status string
- Endpoint status
- status
Reason string - Additional status information
- support
Rolling boolean - Whether endpoint supports switching model versions
- support
Scale booleanTier - Whether TPM protection package is supported
-
Endpoint
Tag[] - update
Time string - Endpoint update time, RFC3339 format
- batch_
only bool - Whether only BatchChat is supported. If true, only batch inference interfaces can be accessed; otherwise, only online inference interfaces can be accessed
- certificate
Endpoint
Certificate Args - Certificate information
- create_
time str - Endpoint creation time, RFC3339 format
- description str
- Endpoint description
- endpoint_
id str - Inference access point ID.
- endpoint_
model_ strtype - Access point model type. Options: FoundationModel for base model; CustomModel for custom model.
- model_
reference EndpointModel Reference Args - Endpoint access model
- model_
unit_ strid - Model unit. If empty, indicates postpaid; otherwise, prepaid.
- moderation
Endpoint
Moderation Args - Content guardrail scheme
- name str
- Endpoint name
- project_
name str - Project name the resource belongs to; default value is "default"
- rate_
limit EndpointRate Limit Args - Endpoint rate limiting
- rolling_
id str - Endpoint switch ID
- scale_
tier_ strid - If a TPM protection package exists, this is the TPM protection package ID.
- status str
- Endpoint status
- status_
reason str - Additional status information
- support_
rolling bool - Whether endpoint supports switching model versions
- support_
scale_ booltier - Whether TPM protection package is supported
-
Sequence[Endpoint
Tag Args] - update_
time str - Endpoint update time, RFC3339 format
- batch
Only Boolean - Whether only BatchChat is supported. If true, only batch inference interfaces can be accessed; otherwise, only online inference interfaces can be accessed
- certificate Property Map
- Certificate information
- create
Time String - Endpoint creation time, RFC3339 format
- description String
- Endpoint description
- endpoint
Id String - Inference access point ID.
- endpoint
Model StringType - Access point model type. Options: FoundationModel for base model; CustomModel for custom model.
- model
Reference Property Map - Endpoint access model
- model
Unit StringId - Model unit. If empty, indicates postpaid; otherwise, prepaid.
- moderation Property Map
- Content guardrail scheme
- name String
- Endpoint name
- project
Name String - Project name the resource belongs to; default value is "default"
- rate
Limit Property Map - Endpoint rate limiting
- rolling
Id String - Endpoint switch ID
- scale
Tier StringId - If a TPM protection package exists, this is the TPM protection package ID.
- status String
- Endpoint status
- status
Reason String - Additional status information
- support
Rolling Boolean - Whether endpoint supports switching model versions
- support
Scale BooleanTier - Whether TPM protection package is supported
- List<Property Map>
- update
Time String - Endpoint update time, RFC3339 format
Supporting Types
EndpointCertificate, EndpointCertificateArgs
- Not
After int - X509 private certificate expiration time, represented as a timestamp in seconds
- Not
Before int - X509 private certificate activation time, represented as a timestamp in seconds
- Pca
Host string - Certificate subject
- Pca
Instance stringCertificate - PEM format leaf certificate
- Pca
Name string - Certificate name
- Pca
Root stringCa Certificate - PEM format root certificate
- Pca
Sub stringCa Certificate - Intermediate certificate and root certificate in PEM format
- Not
After int - X509 private certificate expiration time, represented as a timestamp in seconds
- Not
Before int - X509 private certificate activation time, represented as a timestamp in seconds
- Pca
Host string - Certificate subject
- Pca
Instance stringCertificate - PEM format leaf certificate
- Pca
Name string - Certificate name
- Pca
Root stringCa Certificate - PEM format root certificate
- Pca
Sub stringCa Certificate - Intermediate certificate and root certificate in PEM format
- not_
after number - X509 private certificate expiration time, represented as a timestamp in seconds
- not_
before number - X509 private certificate activation time, represented as a timestamp in seconds
- pca_
host string - Certificate subject
- pca_
instance_ stringcertificate - PEM format leaf certificate
- pca_
name string - Certificate name
- pca_
root_ stringca_ certificate - PEM format root certificate
- pca_
sub_ stringca_ certificate - Intermediate certificate and root certificate in PEM format
- not
After Integer - X509 private certificate expiration time, represented as a timestamp in seconds
- not
Before Integer - X509 private certificate activation time, represented as a timestamp in seconds
- pca
Host String - Certificate subject
- pca
Instance StringCertificate - PEM format leaf certificate
- pca
Name String - Certificate name
- pca
Root StringCa Certificate - PEM format root certificate
- pca
Sub StringCa Certificate - Intermediate certificate and root certificate in PEM format
- not
After number - X509 private certificate expiration time, represented as a timestamp in seconds
- not
Before number - X509 private certificate activation time, represented as a timestamp in seconds
- pca
Host string - Certificate subject
- pca
Instance stringCertificate - PEM format leaf certificate
- pca
Name string - Certificate name
- pca
Root stringCa Certificate - PEM format root certificate
- pca
Sub stringCa Certificate - Intermediate certificate and root certificate in PEM format
- not_
after int - X509 private certificate expiration time, represented as a timestamp in seconds
- not_
before int - X509 private certificate activation time, represented as a timestamp in seconds
- pca_
host str - Certificate subject
- pca_
instance_ strcertificate - PEM format leaf certificate
- pca_
name str - Certificate name
- pca_
root_ strca_ certificate - PEM format root certificate
- pca_
sub_ strca_ certificate - Intermediate certificate and root certificate in PEM format
- not
After Number - X509 private certificate expiration time, represented as a timestamp in seconds
- not
Before Number - X509 private certificate activation time, represented as a timestamp in seconds
- pca
Host String - Certificate subject
- pca
Instance StringCertificate - PEM format leaf certificate
- pca
Name String - Certificate name
- pca
Root StringCa Certificate - PEM format root certificate
- pca
Sub StringCa Certificate - Intermediate certificate and root certificate in PEM format
EndpointModelReference, EndpointModelReferenceArgs
- Custom
Model stringId - Custom model ID
- Foundation
Model Volcengine.Endpoint Model Reference Foundation Model - Base model
- Custom
Model stringId - Custom model ID
- Foundation
Model EndpointModel Reference Foundation Model - Base model
- custom_
model_ stringid - Custom model ID
- foundation_
model object - Base model
- custom
Model StringId - Custom model ID
- foundation
Model EndpointModel Reference Foundation Model - Base model
- custom
Model stringId - Custom model ID
- foundation
Model EndpointModel Reference Foundation Model - Base model
- custom_
model_ strid - Custom model ID
- foundation_
model EndpointModel Reference Foundation Model - Base model
- custom
Model StringId - Custom model ID
- foundation
Model Property Map - Base model
EndpointModelReferenceFoundationModel, EndpointModelReferenceFoundationModelArgs
- Model
Version string - Base model version
- Name string
- Base model name. Note: Model names are lowercase. The model name format for doubao 1.5 generation models is "doubao-1-5-**".
- Model
Version string - Base model version
- Name string
- Base model name. Note: Model names are lowercase. The model name format for doubao 1.5 generation models is "doubao-1-5-**".
- model_
version string - Base model version
- name string
- Base model name. Note: Model names are lowercase. The model name format for doubao 1.5 generation models is "doubao-1-5-**".
- model
Version String - Base model version
- name String
- Base model name. Note: Model names are lowercase. The model name format for doubao 1.5 generation models is "doubao-1-5-**".
- model
Version string - Base model version
- name string
- Base model name. Note: Model names are lowercase. The model name format for doubao 1.5 generation models is "doubao-1-5-**".
- model_
version str - Base model version
- name str
- Base model name. Note: Model names are lowercase. The model name format for doubao 1.5 generation models is "doubao-1-5-**".
- model
Version String - Base model version
- name String
- Base model name. Note: Model names are lowercase. The model name format for doubao 1.5 generation models is "doubao-1-5-**".
EndpointModeration, EndpointModerationArgs
- Strategy string
- Content safety guardrail scheme. Values: Default: default scheme. Lower latency, broadly covers general content safety requirements; Basic: basic scheme (public beta). Supports providing triggered risk categories in interface responses, currently only includes two risk types: severe violations and aggressive behavior. Note: If using streaming interface calls, this scheme is not recommended, as it will increase the length of each output fragment and raise latency
- Strategy string
- Content safety guardrail scheme. Values: Default: default scheme. Lower latency, broadly covers general content safety requirements; Basic: basic scheme (public beta). Supports providing triggered risk categories in interface responses, currently only includes two risk types: severe violations and aggressive behavior. Note: If using streaming interface calls, this scheme is not recommended, as it will increase the length of each output fragment and raise latency
- strategy string
- Content safety guardrail scheme. Values: Default: default scheme. Lower latency, broadly covers general content safety requirements; Basic: basic scheme (public beta). Supports providing triggered risk categories in interface responses, currently only includes two risk types: severe violations and aggressive behavior. Note: If using streaming interface calls, this scheme is not recommended, as it will increase the length of each output fragment and raise latency
- strategy String
- Content safety guardrail scheme. Values: Default: default scheme. Lower latency, broadly covers general content safety requirements; Basic: basic scheme (public beta). Supports providing triggered risk categories in interface responses, currently only includes two risk types: severe violations and aggressive behavior. Note: If using streaming interface calls, this scheme is not recommended, as it will increase the length of each output fragment and raise latency
- strategy string
- Content safety guardrail scheme. Values: Default: default scheme. Lower latency, broadly covers general content safety requirements; Basic: basic scheme (public beta). Supports providing triggered risk categories in interface responses, currently only includes two risk types: severe violations and aggressive behavior. Note: If using streaming interface calls, this scheme is not recommended, as it will increase the length of each output fragment and raise latency
- strategy str
- Content safety guardrail scheme. Values: Default: default scheme. Lower latency, broadly covers general content safety requirements; Basic: basic scheme (public beta). Supports providing triggered risk categories in interface responses, currently only includes two risk types: severe violations and aggressive behavior. Note: If using streaming interface calls, this scheme is not recommended, as it will increase the length of each output fragment and raise latency
- strategy String
- Content safety guardrail scheme. Values: Default: default scheme. Lower latency, broadly covers general content safety requirements; Basic: basic scheme (public beta). Supports providing triggered risk categories in interface responses, currently only includes two risk types: severe violations and aggressive behavior. Note: If using streaming interface calls, this scheme is not recommended, as it will increase the length of each output fragment and raise latency
EndpointRateLimit, EndpointRateLimitArgs
EndpointTag, EndpointTagArgs
Import
$ pulumi import volcenginecc:ark/endpoint:Endpoint example "endpoint_id"
To learn more about importing existing cloud resources, see Importing resources.
Package Details
- Repository
- volcenginecc volcengine/pulumi-volcenginecc
- License
- MPL-2.0
- Notes
- This Pulumi package is based on the
volcengineccTerraform Provider.
published on Thursday, May 14, 2026 by Volcengine
