Creating a Model Deployment with Autoscaling

Learn how to create a model deployment with compute and load balancer autoscaling configured.

Consider using a custom scaling metric type for setting up autoscaling with more advanced options and metrics on the model deployment.

Ensure that you have added the necessary policy required for autoscaling to work.

You can create and run model deployments using the Console, the OCI CLI, or the Data Science API.

Was this article helpful?