LSFEnvironment — PyTorch Lightning 2.5.1.post0 documentation (original) (raw)
class lightning.pytorch.plugins.environments.LSFEnvironment[source]¶
Bases: ClusterEnvironment
An environment for running on clusters managed by the LSF resource manager.
It is expected that any execution using this ClusterEnvironment was executed using the Job Step Manager i.e. jsrun
.
This plugin expects the following environment variables:
LSB_JOBID
The LSF assigned job ID
LSB_DJOB_RANKFILE
The OpenMPI compatible rank file for the LSF job
JSM_NAMESPACE_LOCAL_RANK
The node local rank for the task. This environment variable is set by jsrun
JSM_NAMESPACE_SIZE
The world size for the task. This environment variable is set by jsrun
JSM_NAMESPACE_RANK
The global rank for the task. This environment variable is set by jsrun
Returns True
if the current process was launched using the jsrun
command.
Return type:
The world size is read from the environment variable JSM_NAMESPACE_RANK
.
Return type:
The local rank is read from the environment variable JSM_NAMESPACE_LOCAL_RANK.
Return type:
The node rank is determined by the position of the current hostname in the OpenMPI host rank file stored inLSB_DJOB_RANKFILE
.
Return type:
The world size is read from the environment variable JSM_NAMESPACE_SIZE
.
Return type:
property creates_processes_externally_: bool_¶
LSF creates subprocesses, i.e., PyTorch Lightning does not need to spawn them.
The main address is read from an OpenMPI host rank file in the environment variableLSB_DJOB_RANKFILE
.
The main port is calculated from the LSF job ID.