Cluster Networks with Instance Pools
Cluster networks use instance pools to manage groups of identical high performance computing (HPC), GPU, or optimized instances that are connected with a high-bandwidth, ultra low-latency network. Each node in the cluster is a bare metal machine located in close physical proximity to the other nodes. A remote direct memory access (RDMA) network between nodes provides latency as low as single-digit microseconds, comparable to on-premises HPC clusters.
Cluster networks are built on top of the instance pools feature. Most operations in the instance pool are managed directly by the cluster network, though you can resize the underlying instance pool, change the instance configuration that the pool uses to create new instances, monitor the pool, and add tags.
If you want to manage instances in the RDMA network independently of each other or use different types of instances in the network group, then use compute clusters instead.
For steps to manage cluster networks with instance pools, see the following topics:
- Creating a Cluster Network with Instance Pools
- Detaching Instances from a Cluster Network with Instance Pools
- Resizing a Cluster Network with Instance Pools
- Updating the Instance Configuration for a Cluster Network with Instance Pools
- Tagging Cluster Networks
- Renaming a Cluster Network with Instance Pools
- Deleting a Cluster Network with Instance Pools
For more information about how to access and store the data that you want to process in cluster networks, see FastConnect Overview, Overview of File Storage, Overview of Object Storage, and Overview of Block Volume.
Supported Shapes
The following shapes support cluster networks:
- BM.HPC2.36
- BM.GPU.A100-v2.8
- BM.GPU4.8
- BM.Optimized3.36
Typically, to create multiple HPC, GPU, or optimized instances that are contained in a cluster network, you must request a service limit increase.
Supported Regions and Availability Domains
Cluster networks are supported in selected regions within the Oracle Cloud Infrastructure commercial realm and Government Cloud realms.
- Australia East (Sydney)
- Australia Southeast (Melbourne)
- Brazil East (Sao Paulo)
- Brazil Southeast (Vinhedo)
- Canada Southeast (Montreal)
- Canada Southeast (Toronto)
- France Central (Paris)
- France South (Marseille)
- Germany Central (Frankfurt)
- India South (Hyderabad)
- India West (Mumbai)
- Israel Central (Jerusalem)
- Italy Northwest (Milan)
- Japan Central (Osaka)
- Japan East (Tokyo)
- Netherlands Northwest (Amsterdam)
- Saudi Arabia West (Jeddah)
- Singapore (Singapore)
- South Africa Central (Johannesburg)
- South Korea Central (Seoul)
- South Korea North (Chuncheon)
- Sweden Central (Stockholm)
- Switzerland North (Zurich)
- UAE East (Dubai)
- UK South (London)
- US East (Ashburn)
- US Midwest (Chicago)
- US West (Phoenix)
- US West (San Jose)
- UK Gov South (London)
- UK Gov West (Newport)
- US Gov East (Ashburn)
The availability domain that you create the cluster network in must have hardware that supports cluster networks.
Required IAM Policy
To use Oracle Cloud Infrastructure, you must be granted security access in a policy by an administrator. This access is required whether you're using the Console or the REST API with an SDK, CLI, or other tool. If you get a message that you don't have permission or are unauthorized, verify with your administrator what type of access you have and which compartment to work in.
For administrators: For a typical policy that gives access to cluster networks, see Let users manage Compute instance configurations, instance pools, and cluster networks.
Before You Begin
Create an instance configuration for the instance pool that is managed by the cluster network. Use the following settings:
- Image: Click Change image, and then click Oracle images. Select the Oracle Linux HPC cluster networking image.
-
Shape: Click Change shape. Select Bare metal machine. Then, select a shape that supports cluster networks.
For more information about these shapes, see Compute Shapes.