If you have models you want to use instead of the service cached models provided by Data Science, you can bring them into AI Quick Actions from Object Storage or from Hugging Face by registering the
model.
Hugging Face
is an open source model repository. You can bring in models from here to use in AI Quick
Actions. Hugging Face offers certain gated models that require the acceptance of user
agreement. To bring a gated model from Hugging Face into AI Quick Actions, sign in to Hugging
Face using the Hugging Face CLI and your Hugging Face token from a terminal inside the
Notebook. This is to verify your access to the model. See the Hugging Face guides to see how to sign in with the Hugging Face CLI.
If you don't have a Hugging Face token, see this Hugging Face article on security tokens to generate one. If you try to register a gated model which you
haven't been granted access to in Hugging Face or fail to sign in with the Hugging Face CLI,
the registration process fails.
A service verified model is one the Data Science service has
tested the configurations for deployment and fine tuning.
Note
The difference between a
service cached model and a verified model is that, for a verified model, you must register
the model in AI Quick Actions before using it.
Service Managed Inference Containers
Four inference containers are available to use with Bring Your Own Model.
For cached and verified models, Data Science has tested
which inference container works best with each model and so the inference container can't be
chosen. For unverified models, you must decide which inference container is most suitable for
each model. Four service managed inference containers are available:
for models compatible with inference engine vLLM 0.7.1
for models compatible with TGI 2.0.1
for models compatible with inference framework llama.cpp 0.3.2 (for models in GGUF format)
Before you can bring a model from Object Storage and register it in AI quick Actions, you
need to download the model artifacts, such as scoring logic, ML model, and dependent
files, from the model source and upload them to a versioned OCI
Object Storage bucket. If you don't have a
versioned Object Storage bucket, follow these steps to create one. Then you can
upload the model artifacts to Object Storage.
Register Service Verified Models 🔗
Data Science has models you can select to use that
have been tested.
Follow the steps in Prerequisites.Steps 3 and 4 show different ways of registering a model. Model registration is a
necessary process for a model to be brought into AI Quick Actions.
Select Models if it's not already shown.
Opt to import a new model from Object Storage or Hugging Face to register, or
to use a service verified model:
To import a new model, select My models.
To use a service verified model, select a model card with the
Ready-to-register tag.
If in step 2 you selected My models:
Select Import new model.
Under Model artifact, select one of:
Download from Hugging Face to download a model
from Hugging Face.
I have artifacts in Object storage to download
a model from Object Storage.
Select Register service verified model to select
a model that's been tested by Oracle Data Science for deployment and
fine-tuning.
From the Select model list, select the model
name.
From the Select compartment list, select the
compartment of the Object Storage bucket.
From the Object storage location list, select a
bucket.
If downloading from Object Storage, it's the bucket where the
model artifact is stored.
If downloading from Hugging Face, it's the bucket where the
model artifact is downloaded.
Provide a directory path for the object storage.
Select Register to register the model. When the
model registration finishes, the Model
Information screen is displayed. The model is included
in the list of models under My models.
Select Fine-tune to fine tune the model.
Select Deploy to deploy the model.
If in step 2 you selected a model card with the
Ready-to-register tag:
Select register in the upper right corner of model information
page.
Under Model artifact, select one of:
Download from Hugging Face to download a model
from Hugging Face.
I have artifacts in Object storage to download
a model from Object Storage.
From the Select compartment list, select the
compartment of the Object Storage bucket..
From the Object storage location list, select a
bucket.
If downloading from Object Storage, it's the bucket where the
model artifact is stored.
If downloading from Hugging Face, it's the bucket where the
model artifact is downloaded.
Provide a directory path for the object storage.
Select Register to register the model. When the
model registration finishes, the Model
Information screen is displayed. The model is included
in the list of models under My models.
Select Fine-tune to fine tune the model.
(Optional) Under Advanced options, enter the file
or file format to exclude in the Exclusions
list.
Select Deploy to deploy the model.
For a complete list of parameters and values for AI Quick Actions CLI commands, see
AI Quick Actions CLI.
This task can't be performed using the API.
Register Any Model 🔗
Follow these steps to use models that haven't been tested by Data Science.