@cdklabs/generative-ai-cdk-constructs • Docs
@cdklabs/generative-ai-cdk-constructs / bedrock / CrossRegionInferenceProfile
Cross-region inference enables you to seamlessly manage unplanned traffic bursts by utilizing compute across different AWS Regions. With cross-region inference, you can distribute traffic across multiple AWS Regions, enabling higher throughput and enhanced resilience during periods of peak demands.
https://docs.aws.amazon.com/bedrock/latest/userguide/cross-region-inference.html
readonly
inferenceProfileArn:string
'arn:aws:bedrock:us-east-1:123456789012:inference-profile/us.anthropic.claude-3-5-sonnet-20240620-v1:0'
IInferenceProfile
.inferenceProfileArn
readonly
inferenceProfileId:string
'us.anthropic.claude-3-5-sonnet-20240620-v1:0'
IInferenceProfile
.inferenceProfileId
readonly
inferenceProfileModel:BedrockFoundationModel
The underlying model supporting cross-region inference.
readonly
invokableArn:string
This equals to the inferenceProfileArn property, useful just to implement IInvokable interface
readonly
type:InferenceProfileType
InferenceProfileType.SYSTEM_DEFINED
grantInvoke(
grantee
):Grant
Gives the appropriate policies to invoke and use the Foundation Model.
• grantee: IGrantable
Grant
grantProfileUsage(
grantee
):Grant
Grants appropriate permissions to use the cross-region inference profile. Does not grant permissions to use the model in the profile.
• grantee: IGrantable
Grant
IInferenceProfile
.grantProfileUsage
static
fromConfig(config
):CrossRegionInferenceProfile
• config: CrossRegionInferenceProfileProps