Ray Clusters Overview — Ray 2.47.1 (original) (raw)

Ray enables seamless scaling of workloads from a laptop to a large cluster. While Ray works out of the box on single machines with just a call to ray.init, to run Ray applications on multiple nodes you must first deploy a Ray cluster.

A Ray cluster is a set of worker nodes connected to a common Ray head node. Ray clusters can be fixed-size, or they may autoscale up and down according to the resources requested by applications running on the cluster.

Where can I deploy Ray clusters?#

Ray provides native cluster deployment support on the following technology stacks:

Advanced users may want to deploy Ray manuallyor onto platforms not listed here.

Note

Multi-node Ray clusters are only supported on Linux. At your own risk, you may deploy Windows and OSX clusters by setting the environment variableRAY_ENABLE_WINDOWS_OR_OSX_CLUSTER=1 during deployment.

What’s next?#

Understand the key concepts and main ways of interacting with a Ray cluster.

Deploy a Ray application to a Kubernetes cluster. You can run the tutorial on a Kubernetes cluster or on your laptop via Kind.

Take a sample application designed to run on a laptop and scale it up in the cloud. Access to an AWS or GCP account is required.

Guide to submitting applications as Jobs to existing Ray clusters.