Retiring the Models

OCI Generative AI retires its large language models (LLMs) based on each model's type and serving mode. The LLMs serve user requests in either an on-demand mode or a dedicated mode. Review the following sections to learn about deprecation and retirement times and to decide which serving mode works best for you.

Retirement for On-Demand Mode

When a model is retired in the on-demand mode, it's no longer available for use in the Generative AI service playground or through the Generative AI inference API.

Retirement for Dedicated Mode

When a model is retired in the dedicated mode, you can no longer create a dedicated AI cluster for the retired model, but an active dedicated AI cluster running a retired model continues to run. A custom model that's based on a retired model also remains available for active dedicated AI clusters, and you can continue to create new dedicated AI clusters with a custom model that was created on a retired model. However, Oracle offers limited support for these scenarios, and Oracle engineering might ask you to upgrade to a supported model to resolve issues related to your model.

To keep a model available past its retirement date in the dedicated mode, create a support ticket.
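The retirement behavior described above for the two serving modes can be summarized in a small lookup. This is only an illustrative sketch: the function name `retired_model_capabilities` and the dictionary keys are hypothetical and are not part of any OCI SDK or API.

```python
def retired_model_capabilities(mode, is_custom_model=False):
    """Summarize what remains possible once a base model is retired.

    Hypothetical helper: it encodes the rules stated in this section,
    it does not query the Generative AI service.
    """
    if mode == "on-demand":
        # Retired on-demand models are unavailable in the playground
        # and through the inference API.
        return {"on_demand_inference": False}
    if mode == "dedicated":
        return {
            # New clusters are only possible for custom models that
            # were created on the retired base model.
            "create_new_cluster": is_custom_model,
            # Active dedicated AI clusters continue to run.
            "keep_running_existing_cluster": True,
        }
    raise ValueError(f"unknown serving mode: {mode!r}")
```

For example, `retired_model_capabilities("dedicated", is_custom_model=True)` reports that new clusters can still be created, which matches the custom model case above. Oracle's limited support for these scenarios still applies.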

Deprecation: When a model is deprecated, it remains available in the Generative AI service but has a defined amount of time that it can be used before it's retired. This amount of time is longer for the dedicated mode.

Important

All models that were supported for the text generation and summarization APIs (including the playground) are now retired.


Model Retirement Dates (On-Demand Mode)

The following table shows the retirement dates for models supported for the on-demand serving mode.


Model	Release Date	Retirement Date	Suggested Replacement Options
`meta.llama-3.3-70b-instruct`	2025-02-07	At least one month after the release of the first replacement model.	Tentative
`cohere.command-r-08-2024`	2024-11-14	At least one month after the release of the first replacement model.	Tentative
`cohere.command-r-plus-08-2024`	2024-11-14	At least one month after the release of the first replacement model.	Tentative
`meta.llama-3.2-90b-vision-instruct`	2024-11-14	At least one month after the release of the first replacement model.	Tentative
`meta.llama-3.1-405b-instruct`	2024-09-19	At least one month after the release of the first replacement model.	Tentative
`meta.llama-3.1-70b-instruct`	2024-09-19	`2025-03-28`	`meta.llama-3.3-70b-instruct` `meta.llama-3.1-405b-instruct`
`cohere.command-r-plus`	2024-06-18	`2025-01-16`	`cohere.command-r-plus-08-2024`
`cohere.command-r-16k`	2024-06-04	`2025-01-16`	`cohere.command-r-08-2024`
`cohere.embed-english-v3.0`	2024-02-07	At least 6 months after the release of the first replacement model.	Tentative
`cohere.embed-multilingual-v3.0`	2024-02-07	At least 6 months after the release of the first replacement model.	Tentative
`cohere.embed-english-light-v3.0`	2024-02-07	At least 6 months after the release of the first replacement model.	Tentative
`cohere.embed-multilingual-light-v3.0`	2024-02-07	At least 6 months after the release of the first replacement model.	Tentative
`meta.llama-3-70b-instruct`	2024-06-04	`2024-11-12`	`meta.llama-3.1-70b-instruct` `meta.llama-3.1-405b-instruct`
`cohere.command`	2024-02-07	`2024-10-02`	`cohere.command-r-plus` `cohere.command-r-16k`
`cohere.command-light`	2024-02-07	`2024-10-02`	`cohere.command-r-plus` `cohere.command-r-16k`
`meta.llama-2-70b-chat`	2024-01-22	`2024-10-02`	`meta.llama-3.1-70b-instruct` `meta.llama-3.1-405b-instruct`

Note

Deprecation times might change in the future.
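As a rough illustration of how the fixed dates in the on-demand table might be used in client code, the sketch below checks a model ID against those rows and returns the suggested replacements. The dictionary and the function name `check_on_demand` are hypothetical. The data is copied from the table as shown here and can go stale, and treating a model as retired starting on the listed date is an assumption.

```python
from datetime import date

# Rows of the on-demand table that have a fixed retirement date.
# Rows marked "Tentative" are intentionally left out.
ON_DEMAND_RETIRED = {
    "meta.llama-3.1-70b-instruct": (
        date(2025, 3, 28),
        ["meta.llama-3.3-70b-instruct", "meta.llama-3.1-405b-instruct"],
    ),
    "cohere.command-r-plus": (date(2025, 1, 16), ["cohere.command-r-plus-08-2024"]),
    "cohere.command-r-16k": (date(2025, 1, 16), ["cohere.command-r-08-2024"]),
    "meta.llama-3-70b-instruct": (
        date(2024, 11, 12),
        ["meta.llama-3.1-70b-instruct", "meta.llama-3.1-405b-instruct"],
    ),
    "cohere.command": (date(2024, 10, 2), ["cohere.command-r-plus", "cohere.command-r-16k"]),
    "cohere.command-light": (date(2024, 10, 2), ["cohere.command-r-plus", "cohere.command-r-16k"]),
    "meta.llama-2-70b-chat": (
        date(2024, 10, 2),
        ["meta.llama-3.1-70b-instruct", "meta.llama-3.1-405b-instruct"],
    ),
}


def check_on_demand(model_id, today):
    """Return (is_retired, suggested_replacements) for an on-demand model.

    Assumption: the model counts as retired from the listed date onward.
    Models without a fixed date in the table return (False, []).
    """
    entry = ON_DEMAND_RETIRED.get(model_id)
    if entry is None:
        return False, []
    retirement_date, replacements = entry
    return today >= retirement_date, replacements
```

A caller could run this check before sending a request and log the suggested replacements when the first element of the result is `True`.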

Model Retirement Dates (Dedicated Mode)

Important

If you need a dedicated serving mode model to stay alive longer than the retirement date, create a support ticket.

The following table shows the retirement dates for models supported for the dedicated serving mode.


Model	Release Date	Retirement Date	Suggested Replacement Options
`meta.llama-3.3-70b-instruct`	2025-02-07	At least 6 months after the release of the first replacement model.	Tentative
`cohere.command-r-08-2024`	2024-11-14	At least 6 months after the release of the first replacement model.	Tentative
`cohere.command-r-plus-08-2024`	2024-11-14	At least 6 months after the release of the first replacement model.	Tentative
`meta.llama-3.2-11b-vision-instruct`	2024-11-14	At least 6 months after the release of the first replacement model.	Tentative
`meta.llama-3.2-90b-vision-instruct`	2024-11-14	At least 6 months after the release of the first replacement model.	Tentative
`meta.llama-3.1-405b-instruct`	2024-09-19	At least 6 months after the release of the first replacement model.	Tentative
`meta.llama-3.1-70b-instruct`	2024-09-19	No sooner than `2025-08-07`	`meta.llama-3.3-70b-instruct` `meta.llama-3.1-405b-instruct`
`cohere.command-r-plus`	2024-06-18	`2025-05-14`	`cohere.command-r-plus-08-2024`
`cohere.command-r-16k`	2024-06-04	`2025-05-14`	`cohere.command-r-08-2024`
`cohere.embed-english-v3.0`	2024-02-07	At least 6 months after the release of the first replacement model.	Tentative
`cohere.embed-multilingual-v3.0`	2024-02-07	At least 6 months after the release of the first replacement model.	Tentative
`cohere.embed-english-light-v3.0`	2024-02-07	At least 6 months after the release of the first replacement model.	Tentative
`cohere.embed-multilingual-light-v3.0`	2024-02-07	At least 6 months after the release of the first replacement model.	Tentative
`meta.llama-3-70b-instruct`	2024-06-04	No sooner than `2025-03-19`	`meta.llama-3.1-70b-instruct` `meta.llama-3.1-405b-instruct`
`cohere.command`	2024-02-07	No sooner than `2025-01-18`	`cohere.command-r-plus` `cohere.command-r-16k`
`cohere.command-light`	2024-02-07	No sooner than `2025-01-04`	`cohere.command-r-plus` `cohere.command-r-16k`
`meta.llama-2-70b-chat`	2024-01-22	`2025-03-07`	`meta.llama-3.1-70b-instruct` `meta.llama-3.1-405b-instruct`

Note

Deprecation times might change in the future.
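For the rows marked "Tentative", the tables give only a minimum notice period (one or 6 months after the first replacement model is released). The sketch below shows one way to turn that rule into an earliest possible retirement date for planning purposes. The helper names are hypothetical, the replacement release date in the usage note is invented for illustration, and interpreting "month" as a calendar month with the day clamped to the end of shorter months is an assumption.

```python
import calendar
from datetime import date


def add_months(d, months):
    """Add calendar months to a date, clamping the day to the month's end."""
    index = d.month - 1 + months
    year = d.year + index // 12
    month = index % 12 + 1
    last_day = calendar.monthrange(year, month)[1]
    return date(year, month, min(d.day, last_day))


def earliest_retirement(replacement_release, notice_months):
    """Earliest date a model could retire under an "at least N months" rule.

    notice_months is the value from the model's table row (1 or 6).
    The actual retirement date can be later than the returned date.
    """
    return add_months(replacement_release, notice_months)
```

As a usage example with an invented release date, `earliest_retirement(date(2025, 8, 31), 6)` evaluates to `date(2026, 2, 28)`, because the day is clamped to the last day of February.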

