Managing Dedicated AI Clusters

Dedicated AI clusters are compute resources that you can use to fine-tune custom models or to host endpoints for the pretrained base models and custom models in OCI Generative AI. The clusters are dedicated to your models and not shared with users in other tenancies.

If you have manage permissions for generative-ai-family, you can perform the following tasks for dedicated AI clusters:

Tip

When you perform the preceding tasks, for the dedicated AI cluster unit size that matches each base model, see Matching Base Models to Clusters. For rules about creating endpoints for the models hosted on clusters, see Adding Endpoints to Hosting Clusters.

Was this article helpful?