When you create a model deployment resource, you can provide custom environment variables as part of the default configuration. These environment variables are available for use inside the container at runtime.
Format constraint
Key must be under 32 characters.
Key must contain only letters, digits, and underscore (_).
Key must start with a letter.
Key must have at least 2 characters.
Key must not end with underscore. For example, TEST_.
Key, if added, can't be empty. Value can be empty.
Value could be any character.
Key can't be service-provided environment variables.
If ports are passed as an environment variable, then limit them between 1024 and 65535 and also exclude ports 24224, 8446, 8447.
Size constraint
Individual values don't have specific size limits. The overall environment variables
including key value pairs is limited to 2048 bytes. If the size is more than 2048
bytes, specify the environment variables by using the model artifact or use OCI Object Storage to pull the data.
Service Provided Environment Variables 🔗
The service has reserved environment variables inside the container at runtime for logging or other functional use. These environment variables can't be overridden.
Name
Custom Container
Default Container
MODEL_DEPLOYMENT_OCID
Model deployment resource OCID.
Model deployment resource OCID.
MODEL_SERVING_PYTHON_VERSION
None.
The Python version from runtime.yaml used for model serving.
TENANCY_OCID
Tenancy OCID.
Tenancy OCID.
CONTAINER_CUSTOM_IMAGE
Container image path (with tag or digest) that's provided through the API environment configuration object.
None.
Application Environment Variables 🔗
These environment variables functionally control how the model deployment is created and can be overridden or configured by the customers by passing in the default configuration.
Name
Custom Container
Managed Container
WEB_CONCURRENCY
Based on available memory and size of model artifact, a heuristic method calculates the number of workers needed to run inside the custom container. Customers can also override the variable or pass a custom concurrency variable as required.
Based on available memory and size of model artifact, a heuristic method calculates the number of workers needed to run inside the custom container. Customers can also override the variable or pass a custom concurrency variable as required.
CONTAINER_TYPE
Type of container viz TRITON. Only TRITON is supported.