Slurm Publications - SchedMD (original) (raw)

Discover more of our presentations and articles over the past years.

*Please note that older presentations may contain outdated information.*

Presentation from Cray User Group, May 2025

Pdf icon1

Skyler Malinowski, Alan Mutschelknaus, Marlow Warnicke, Tim Wickberg, SchedMD

Download PDF

Presentation from KubeCon Europe, April 2025

Pdf icon1

Slinky: Slurm in Kubernetes, Performant AI and HPC Workload Management

Tim Wickberg, SchedMD

Download PDF

Presentations from SC24, November 24

Pdf icon1

Skyler Malinowski & Tim Wickberg, SchedMD

Download PDF

Pdf icon1

Slurm Community Birds-of-a-Feather

Tim Wickberg & Danny Auble, SchedMD

Download PDF

Presentations from Slurm User Group Meeting, September 2024

Pdf icon1

ORNL Site Report & Feature Discussion

Matt Ezell and Paul Peltz, Oak Ridge National Laboratory

Download PDF

Pdf icon1

Bringing in Robust, Memory-Driven Affinity to Slurm

Edgar A. León, Lawrence Livermore National Laboratory

Download PDF

Pdf icon1

Step Management Enhancements

Felip Moll, Oriol Vilarrubí, and Brian Christiansen, SchedMD

Download PDF

Pdf icon1

Site Report: Jump Trading

Matthieu Hautreux and Larry Pezzaglia, Jump Trading

Download PDF

Pdf icon1

The Evolution of Slurm at CSCS: From Monolithic Service to Multi-tenant vService

Gennaro Oliva, CSCS

Download PDF

Pdf icon1

Slinky – Slurm Operator

Skyler Malinowski, Alan Mutschelknaus, and Marlow Warnicke, SchedMD

Download PDF

Pdf icon1

No-Touch Administration: Managing Slurm at Scale

Dr. Urban Borštnik, ETH Zürich

Download PDF

Pdf icon1

TrailblazingTurtle: A Comprehensive Web Portal for Maximizing HPC Resource Utilization

Simon Guilbault, Université Laval

Download PDF

Pdf icon1

Field Notes 8: How to Make the Most of Slurm, and Avoid Common Issues

Alejandro Sánchez, SchedMD

Download PDF

Pdf icon1

Enabling Event-Driven Workflows With AWS and the Slurm API

Cory Lueninghoener (Sandia National Laboratory), Lowell Wofford (AWS)

Download PDF

Pdf icon1

Gaining More Control Over Node Scheduling with the Topology/Block Plugin

Vasileios Karakasis, Felix Abecassis, Craig Tierney, and Douglas Wightman, NVIDIA

Download PDF

Pdf icon1

Improving Job Throughput in HPC with Adaptive Time Limit Management

Thomas Jakobsche, University of Basel

Download PDF

Pdf icon1

Slurm on SuperMUC-NG at LRZ

Dr. Alexander Block, Leibniz Supercomputing Centre (LRZ)

Download PDF

Pdf icon1

Slinky – Slurm Bridge

Skyler Malinowski, Alan Mutschelknaus, and Marlow Warnicke, SchedMD

Download PDF

Pdf icon1

Slurm Wiki and Tools – a Niflheim site report

Dr. Ole Helm Nielsen, Technical University of Denmark (DTU)

Download PDF

Pdf icon1

Maximizing HPC Efficiency for Ansys Simulations: Addressing Critical IT Concerns with Slurm Resource Management and Scheduling

David Clifton and Morten Loderup, Ansys

Download PDF

Pdf icon1

Magic Castle: Canadian HPC as a Service

Félix-Antoine Fortin, Digital Research Alliance of Canada

Download PDF

Pdf icon1

Slurm 24.05, 24.11, and Beyond

Danny Auble, SchedMD

Download PDF

Presentations from SC23, November 23

Pdf icon1

Slurm and/or/vs Kubernetes

Tim Wickberg, SchedMD

Download PDF

Pdf icon1

Slurm 23.02, 23.11, and Beyond

Tim Wickberg, SchedMD

Download PDF

Presentations from Slurm User Group Meeting, September 2023

Pdf icon1

Keynote: Improving Quinoa Through the Development of Genetic and Genomic Resources

David Jarvis, Brigham Young University

Download PDF

Pdf icon1

Never Use Slurm HA Again: Solve All Your Problems with Kubernetes

Chris Samuel and Doug Jacobsen, NERSC

Download PDF

Pdf icon1

Build a Flexible and Powerful High Performance Computing Foundation with Google Cloud

Volker Eyrich (Google) and Joshua Fryer (Recursion)

Download PDF

Pdf icon1

Demand Driven Cluster Elasticity

Mike Fazio, Dow

Download PDF

Pdf icon1

Field Notes 7 – How to Make the Most of Slurm and Avoid Common Issues

Jason Booth, SchedMD

Download PDF

Pdf icon1

Accelerating Genomics Research Machine Learning with Slurm

Willy Markuske, San Diego Supercomputing Center (SDSC)

Download PDF

Pdf icon1

Saving Power with Slurm

Ole Nielsen, Technical University of Denmark (DTU)

Download PDF

Pdf icon1

Site Report: CINECA Experience with Slurm

Alessandro Marani, CINECA

Download PDF

Pdf icon1

Step Management Enhancements

Brian Christiansen, SchedMD

Download PDF

Pdf icon1

System and Job Scheduling Simulation for Enhancing Production HPC

Vivian Hafener, LANL

Download PDF

Pdf icon1

Site Update: Georgia Institute of Technology

Marian Zvada and Aaron Jezghani, Georgia Tech

Download PDF

Pdf icon1

Building Blocks in the Cloud: Scaling LEGO Engineering with AWS High-Performance Computing

Brian Skjerven and Matt Vaugh, AWS

Download PDF

Pdf icon1

Slurm 23.02, 23.11, and Beyond (Roadmap)

Tim Wickberg, SchedMD

Download PDF

Pdf icon1

Optimizing Diverse Workloads and Resource Usage with Slurm

Chansup Byun et al, LLSC

Download PDF

Presentations from Dell HPC Community, September 2023

Pdf icon1

Slurm and/or/vs Kubernetes

Tim Wickberg, SchedMD

Download PDF

Presentations from Cray User Group, May 2023

Pdf icon1

Slurm 23.02, 23.11, and Beyond

Tim Wickberg, SchedMD

Download PDF

Presentations from SC22, November 2022

Pdf icon1

Slurm ♥ Containers

Nate Rini & Tim Wickberg, SchedMD

Download PDF

Pdf icon1

Doing More with Slurm Advanced Capabilities

Shawn Hoopes, SchedMD

Download PDF

Pdf icon1

Slurm 22.05, 23.02, and Beyond

Tim Wickberg, SchedMD

Download PDF

Pdf icon1

Slurm and/or/vs Kubernetes

Tim Wickberg, SchedMD

Download PDF

Pdf icon1

Accelerating HPC and AI with Slurm and SchedMD

Nick Ihli, SchedMD

Download PDF

Presentations from the HPC Containers Advisory Working Group, November 2022

Presentations from CNCF Research End User Group, October 2022

Presentations from Slurm User Group Meeting, September 2022

Pdf icon1

Burst Buffer Lua Plugin for Lustre w/video

Kota Tsuyuzaki / Rikimaru Honjo / Yusuke Kaneko / Kohei Tahara, NTT Computer and Data Science Laboratory / NTT TechnoCross Corporation

Download PDF
Watch Video

Presentations from NHR Container Workshop, December 2021

Presentations from SC21, November 2021

Presentations from Slurm User Group Meeting, September 2021

Presentations from SC20, November 2020

Presentations from Slurm User Group Meeting, September 2020

Presentations from PEARC HPCSYSPROS Workshop, August 2020

Presentations from Slurm User Group Meeting, September 2019

Pdf icon1

Technical: GPU Scheduling and the cons_tres plugin

Chad Vizino and Morris Jette, SchedMD

Download PDF

Pdf icon1

Tutorial: Cgroups and pam_slurm_adopt

Marshall Garey, SchedMD

Download PDF

Pdf icon1

Site Report: Enabling and Scaling Diverse Work Loads Efficiently with Slurm

Chansup Byun et al., MIT Lincoln Laboratory

Download PDF

Pdf icon1

Tutorial: Slurm: Seamless Integration with Unprivileged Containers

Luke Yeager et al., NVIDIA

Download PDF

Pdf icon1

Technical: Job Container Plugin for Managing Node Local Namespaces

Aditi Gaur, NERSC

Download PDF

Pdf icon1

Technical: VMs and Containers for a Slurm-Based Development Cluster

François Daikhaté, CEA

Download PDF

Pdf icon1

Technical: High Throughput Computing

Broderick Gardner, SchedMD

Download PDF

Pdf icon1

Site Report: Slurm on Sherlock

Kilian Cavalotti, Stanford Research Computing Center

Download PDF

Pdf icon1

Slurm + GCP

Brian Christiansen (SchedMD) and Keith Binder (Google)

Download PDF

Pdf icon1

Technical: Monitoring Slurm with a Splunk App

Nicole Dobson, LANL

Download PDF

Pdf icon1

Tutorial: Troubleshooting

Albert Gil and Jason Booth, SchedMD

Download PDF

Pdf icon1

Technical: Slurm Account Synchronization with UNIX Groups and Users

Ole Nielsen, Technical University of Denmark (DTU)

Download PDF

Pdf icon1

Technical: A Fully Configurable HPC Web Portal for Managing Slurm Jobs

Patrice Calegari, Atos

Download PDF

Pdf icon1

Technical: Slurm 20.02 and Beyond

Tim Wickberg, SchedMD

Download PDF

Pdf icon1

Technical: Field Notes From a MadMan

Tim Wickberg, SchedMD

Download PDF

Presentations from Slurm User Group Meeting, September 2018

Pdf icon1

Tutorial: Slurm Overview

Felip Moll Marquès, SchedMD

Download PDF

Pdf icon1

Technical: Workload Management Requirements for an Interactive Computing e-Infrastructure

Sadaf Alam (CSCS) and the ICEI team (BSC, CEA, CINECA, CSCS, Jülich)

Download TRES PDF

Pdf icon1

Technical: Slurm in a Container Only World – Are We Crazy?

Paul Peltz and Lowell Wofford (LANL)

Download PDF

Pdf icon1

Technical: Kraken – A Stateful Approach to Cluster Management

Paul Peltz and Lowell Wofford (LANL)

Download PDF

Pdf icon1

Technical: A Declarative Programming Style Job Submission Filter

Douglas Jacobsen, NERSC

Download PDF

Pdf icon1

Technical: Generalized Hypercube (GHC) – A Topology Plugin

M. Clayer and A. Faure, Atos

Download PDF

Pdf icon1

Technical: Keeping Accounts Consistent Across Clusters Using LDAP and YAML

Christian Clémonçon, Ewan Roche, Ricardo Silva (EPFL)

Download PDF

Pdf icon1

Technical: Real-Time Job Monitoring Using an Extended slurmctld Generic Plugin – Introducing the Plugin Architecture SPACE

Mike Arnhold, Ulf Markwardt, and Danny Rotscher (Dresden)

Download PDF

Pdf icon1

Technical: Scheduling by Trackable Resource (cons_tres)

Morris Jette and Dominik Bartkiewicz, SchedMD

Download PDF

Pdf icon1

Technical: Slurm 18.08 Overview

Brian Christiansen, SchedMD

Download PDF

Pdf icon1

Technical: Layout for Checkpoint Restart on Specialized Blades

Bill Brophy, Martin Perry, Doug Parisek, and Steve Mehlberg (Atos)

Download PDF

Pdf icon1

Site Report: CEA Site Report

Regine Gaudin, CEA

Download PDF

Pdf icon1

Site Report: Colliding High Energy Physics with HPC, Cloud, and Parallel Filesystems

Carolina Lindqvist, Pablo Llopis, and Nils Høimyr (CERN)

Download PDF

Pdf icon1

Technical: Slurm Simulator Improvements and Evaluation

Marco D’Amico, Ana Jokanovic, Julita Corbalan (BSC)

Download PDF

Pdf icon1

Site Report: CETA-CIEMAT Site Report

Alfonso Pardo, CETA-CIEMAT

Download PDF

Pdf icon1

Site Report: Tuning Slurm the CSCS Way

Miguel Gila, CSCS

Download PDF

Pdf icon1

Technical: Workload Scheduling and Power Management

Morris Jette and Alejandro Sanchez, SchedMD

Download PDF

Pdf icon1

Site Report: LANL Site Report – One Year Post Migration

Joseph ‘Joshi’ Fullop, LANL

Download PDF

Pdf icon1

Technical: Field Notes Mark 2: Random Musings From Under a New Hat

Tim Wickberg, SchedMD

Download PDF

Presentations from Slurm Booth and Birds of a Feather, SC17, November 2017

Pdf icon1

Booth: Slurm Overview

Brian Christiansen, Marshall Garey, Isaac Hartung (SchedMD)

Download PDF

Pdf icon1

Booth: Heterogeneous Job Support

Morris Jette, Tim Wickberg (SchedMD)

Download TRES PDF

Pdf icon1

Booth: From Moab to Slurm: 12 HPC Systems in 2 Months

Paul Peltz, Los Alamos National Laboratory

Download PDF

Pdf icon1

Booth: PMIx Multi-Cluster Operations

Ralph H. Castain

Download PDF

Pdf icon1

Booth: Federated Cluster Support

Brian Christiansen, SchedMD

Download PDF

Pdf icon1

Booth: PMIx Plugin with UCX Support

Artem Polyakov, Mellanox

Download PDF

Pdf icon1

BOF: Slurm Birds of a Feather

Tim Wickberg, SchedMD

Download PDF

Presentations from Slurm User Group Meeting, September 2017

Pdf icon1

Keynote: Supernova Cosmology & Supercomputing

Alex Kim, Lawrence Berkeley National Laboratory

Download PDF

Pdf icon1

Tutorial: Introduction to Slurm

Tim Wickberg, SchedMD

Download PDF

Pdf icon1

Technical: SLURMFS – Resource Manager File System for Slurm

Steven Senator, Los Alamos National Laboratory

Download PDF

Pdf icon1

Technical: Federated Cluster Support

Brian Christiansen and Danny Auble, SchedMD

Download PDF

Pdf icon1

Technical: Utilizing Slurm and Passive Nagios Plugins for Scalable KNL Compute Node Monitoring

Tony Quan and Basil Lalli, NERSC/LBNL

Download PDF

Pdf icon1

Technical: Field Notes From the Frontlines of Slurm Support

Tim Wickberg, SchedMD

Download PDF

Pdf icon1

Technical: Towards Modular Supercomputing with Slurm

Dorian Krause et al. JSC

Download PDF

Pdf icon1

Technical: Heterogeneous Job Support

Morris Jette, SchedMD

Download PDF

Pdf icon1

Technical: cli_filter – command line filtration, manipulation, and introspection of job submissions

Douglas Jacobsen, NERSC

Download PDF

Pdf icon1

Technical: Slurm – Some Slightly Unconventional Use Cases

Chris Hill (MIT), Rajul Kumar (Northeastern), Evan Weinberg and Naved Ansari (BU), Tim Donahue

Download PDF

Pdf icon1

Technical: Managing Diversity in Complex Workloads in a Complex Environment

Nicholas Cardo, CSCS

Download PDF

Pdf icon1

Technical: SELinux policy for Slurm

Gilles Wiber and Mathieu Blanc (CEA), M’hamed Bouaziz and Liana Bozga (Atos)

Download PDF

Pdf icon1

Site Report: From Moab to Slurm: 12 HPC Systems in 2 Months

Peltz, Fullop, Jennings, Senator, Grunau (Los Alamos National Laboratory)

Download PDF

Pdf icon1

Site Report: NERSC Site Report

James Botts and Douglas Jacobsen

Download PDF

Pdf icon1

Technical: Slurm Roadmap – 17.11, 18.08, and Beyond

Danny Auble, Morris Jette, Tim Wickberg (SchedMD)

Download PDF

Pdf icon1

Technical: New Statistics Using TRES

Bill Brophy, Martin Perry, Thomas Cadeau (Atos)

Download PDF

Pdf icon1

Technical: Enabling web-based interactive notebooks on geographically distributed HPC resources

Alexandre Beche, EPFL

Download PDF

Pdf icon1

Technical: Slurm Singularity Spank Plugin

Martin Perry, Steve Mehlberg, Thomas Cadeau (Atos)

Download PDF

Pdf icon1

Site Report: A Slurm Odyssey: Slurm at Harvard FAS Research Computing

Paul Edmond

Download PDF

Pdf icon1

Site Report: LLSC Adoption of Slurm for Managing Diverse Resources and Workloads

Chansup Byun et al. MIT Lincoln Laboratory

Download PDF

Pdf icon1

Site Report: Cyfronet Site Report – Improving Slurm Usability and Monitoring

M Pawlik, J. Budzowski, L. Flis, P Lason, M. Magrys

Download PDF

Pdf icon1

Technical: When You Have a Hammer, Everything Looks Like a Nail – Checkpoint / Restart in Slurm

Manuel Rodríguez-Pascual, J.A. Moríñigo, and Rafael Mayo-García, CIEMAT

Download PDF

Presentations from Slurm Booth and Birds of a Feather, SC16, November 2016

Pdf icon1

Booth: Process Management Interface – Exascale (PMIx)

Ralph H. Castain

Download PDF

Pdf icon1

Yiannis Georgiou, Bull Atos

Download PDF
View Video

Pdf icon1

Booth: Transition Hangout (a.k.a. how we converted to Slurm)

Ryan Cox (BYU), Bruce Pfaff (NASA)

Download PDF

Pdf icon1

Booth: Expanding Serial Analysis with Slurm Arrays

Christopher Coffey, Northern Arizona University

Download PDF

Pdf icon1

Booth: Intel HPC Orchestrator

Tom Krueger, Intel

Download PDF

Pdf icon1

BOF: Slurm State of the Union; v16.05, v17.02 and Beyond

Tim Wickberg, SchedMD

Download PDF

Presentations from Slurm User Group Meeting, September 2016

Pdf icon1

Keynote: Computer-aided drug design for novel anti-cancer agents

Dr. Zoe Cournia (Biomedical Research Foundation, Academy of Athens)

Download PDF

Pdf icon1

Technical: Overview of Slurm Version 16.05

Danny Auble (SchedMD), Yiannis Georgiou (Bull)

Download PDF

Pdf icon1

Technical: MCS (Multi-Category Security) Plugin

Aline Roy, CEA

Download PDF

Pdf icon1

Technical: Slurm Burst Buffer Integration

David Paul, NERSC

Download PDF

Pdf icon1

Technical: Slurm Configuration Impact on Benchmarking

José Moríñgo, Manuel Rodríguez-Pascual, and Rafael Mayo-García, CIEMAT

Download PDF

Pdf icon1

Technical: Real-time monitoring Slurm jobs with InfluxDB

Carlos Fenoy García

Download PDF

Pdf icon1

Technical: Optimising HPC Resource Allocation Through Monitoring

Alexandre Beche, EPFL

Download PDF

Pdf icon1

Technical: Simunix, a large scale platform simulator

David Glesser and Adrien Faure, Bull Atos

Download PDF

Pdf icon1

Site Report: Swiss national Supercomputer Centre (CSCS)

Nicholas Cardo

Download PDF

Pdf icon1

Technical: Configure a Slurm cluster with Ansible

Johan Guldmyr, CSC

Download PDF

Pdf icon1

Technical: Checkpoint/restart in Slurm: current status and new developments

Manuel Rodríguez-Pascual, J.A. Moríñigo, and Rafael Mayo-García, CIEMAT

Download PDF

Pdf icon1

Technical: Intel Knights Landing (KNL)

Morris Jette and Tim Wickberg, SchedMD

Download PDF

Pdf icon1

Technical: Job Packs – A New Slurm Feature for Enhanced Support of Heterogeneous Resources

Andry Razafinjatovo, Martin Perry, and Yiannis Georgiou (Bull Atos), Matthieu Hautreux (CEA)

Download PDF

Pdf icon1

Technical: Improving system utilization under strict power budget using the layouts

Dineshkumar Rajagopal, Yiannis Georgiou, and David Glesser, Bull Atos

Download PDF

Pdf icon1

Technical: High definition power and energy monitoring support

Thomas Cadeau and Yiannis Georgiou, Bull Atos

Download PDF

Pdf icon1

Technical: Federated Cluster Scheduling

Dominik Bartkiewicz and Brian Christiansen, SchedMD

Download PDF

Pdf icon1

Technical: Slurm Roadmap – SchedMD

Danny Auble, SchedMD

Download PDF

Pdf icon1

Technical: Slurm Roadmap – Bull

Yiannis Georgiou and Andry Razafinjatovo, Bull Atos

Download PDF

Pdf icon1

Site Report: Electricité de France (EDF)

Cécile Yoshikawa

Download PDF

Pdf icon1

Site Report: Leibniz-Rechenzentrum (LRZ)

Juan Pancorbo Armada

Download PDF

Pdf icon1

Site Report: NERSC Site Report – One Year of Slurm

Douglas Jacobsen

Download PDF

Pdf icon1

Site Report: Experience Using Slurm on ARIS HPC System

Nikos Nikoloutsakos, GRNET

Download PDF

Presentations from Slurm Booth and Birds of a Feather, SC15, November 2015

Pdf icon1

Booth: PMIx – Enabling Application-Driven Execution at Exascale

Ralph H. Castain

Download PDF

Pdf icon1

Booth: Brigham Young University – Site Report

Ryan Cox, BYU

Download PDF

Pdf icon1

Booth: Slurm Overview

Brian Christiansen and Danny Auble, SchedMD

Download PDF

Pdf icon1

Booth: Never Port Your Code Again – Docker Functionality with Shifter using Slurm

Shane Canon, NERSC

Download PDF

Pdf icon1

Booth: Slurm Burst Buffer Support

Tim Wickberg, SchedMD

Download PDF

Pdf icon1

Booth: Slurm Overview and Elasticsearch Plugin

Alejandro Sanchez, SchedMD

Download PDF

Pdf icon1

Booth: All Things TRES

Brian Christiansen, SchedMD

Download PDF

Pdf icon1

BOF: Improving Backfilling by using Machine Learning to Predict Running Times in Slurm

David Glesser, Bull

Download PDF

Presentations from Slurm User Group Meeting, September 2015

Pdf icon1

Keynote: 10-Years of Computing and Atmospheric Research at NASA: 1 day per day

Bill Putnam, NASA

Download PDF

Pdf icon1

Technical: Overview of Slurm Version 15.08

Morris Jette and Danny Auble (SchedMD), Yiannis Georgiou (Bull)

Download PDF

Pdf icon1

Technical: Trackable Resources (TRES)

Brian Christiansen and Danny Auble, SchedMD

Download PDF

Pdf icon1

Technical: Message Aggregation

Danny Abule (SchedMD), Yiannis Georgiou and Martin Perry (Bull)

Download PDF

Pdf icon1

Technical: Slurm Burst Buffer Support

Morris Jette (SchedMD), Tim Wickberg (GW)

Download PDF

Pdf icon1

Technical: Slurm Power Management Support

Morris Jette, SchedMD

Download PDF

Pdf icon1

Technical: Slurm Layouts Framework

Matthieu Hautreux, CEA

Download PDF

Pdf icon1

Technical: Power Adaptive Scheduling

Yiannis Georgiou and David Glesser (Bull), Matthieu Hautreux (CEA), Denis Trystram (LIG)

Download PDF

Pdf icon1

Technical: Never Port Your Code Again – Docker Functionality with Shifter Using Slurm

Douglas Jacobsen, James Botts, and Shane Canon, NERSC

Download PDF

Pdf icon1

Technical: Increasing Cluster Thoughput with Slurm and rCUDA

Federico Silla, Technical University of Valencia Spain

Download PDF

Pdf icon1

Technical: Running Virtual Machines in a Slurm Batch System

Ulf Markwardt, Technische Universität Dresden

Download PDF

Pdf icon1

Technical: Supporting SR-IOV and IVSHMEM in MVAPICH2 on Slurm

Xiaoyi Lu, Jie Zhang, et al., The Ohio State University

Download PDF

Pdf icon1

Technical: Heterogeneous Resources and MPMD (aka Job Pack)

Rod Schultz and Martin Perry (Atos), Matthieu Hautreaux (CEA), Yiannis Georgiou (Atos)

Download PDF

Pdf icon1

Technical: Towards Multi-Objective Resource Selection

Dineshkumar Rajagopal, David Glesser, Yiannis Georgiou, Bull

Download PDF

Pdf icon1

Technical: Enhancing Startup Performance of Parallel Applications with Slurm

Sourav Chakraborty, et al., OSU/LLNL

Download PDF

Pdf icon1

Technical: Adaptable Profile-Driven TestBed (“Apt”)

Brian Haymore, The University of Utah

Download PDF

Pdf icon1

Technical: Using and Modifying the BSC Slurm Workload Simulator

Stephen Trofinoff and Massimo Benini, CSCS

Download PDF

Pdf icon1

Technical: Improving Job Scheduling by Using Machine Learning

David Glesser, Yiannis Georgiou (Bull) and Denis Trystram (LIG)

Download PDF

Pdf icon1

Technical: Federated Cluster Scheduling

Brian Christiansen and Danny Auble, SchedMD

Download PDF

Pdf icon1

Technical: Native Slurm on the XC30

Douglas Jacobsen, James Botts, NERSC

Download PDF

Pdf icon1

Technical: Slurm Roadmap – Versions 16.05 and Beyond

Morris Jette and Danny Auble (SchedMD), Yiannis Georgiou (Bull)

Download PDF

Pdf icon1

Technical: Exascale Process Management Interface

Ralph Castain (Intel), Joshua Ladd, Artem Polyakov (Mellanox), David Bigagli (SchedMD), Gary Brown (Adaptive Computing)

Download PDF

Pdf icon1

Site Report: Brigham Young Iniversity

Ryan Cox, BYU

Download PDF

Pdf icon1

Site Report: University of South Florida

John DeSantis, USF

Download PDF

Pdf icon1

Site Report: NASA Center for Climate Simulation

Bruce Pfaff, NASA

Download PDF

Pdf icon1

Site Report: Jülich Supercomputing Centre

Dorian Krause, JSC

Download PDF

Pdf icon1

Site Report: The George Washington University

Tim Wickberg, GW

Download PDF

Presentations from Slurm Booth and Birds of a Feather, SC14, November 2014

Pdf icon1

Slurm Overview

Danny Auble and Brian Christiansen, SchedMD

Download PDF

Pdf icon1

Slurm Version 15.08 Roadmap

Jacob Jenson, SchedMD

Download PDF

Pdf icon1

Fair Tree: Fairshare Algorithm for Slurm

Ryan Cox and Levi Morrison (Brigham Young University)

Download PDF

Presentations from Slurm User Group Meeting, September 2014

Pdf icon1

Welcoming Address

Colin McMurtie (Swiss National Supercomputing Centre, CSCS)

Download PDF

Pdf icon1

Overview of Slurm Versions 14.03 and 14.11

Jacob Jenson (SchedMD) and Yiannis Georgiou (Bull)

Download PDF

Pdf icon1

Warewulf Node Health Check

Jacqueline Scoggins and Michael Jennings (Lawrence Berkeley National Lab)

Download PDF

Pdf icon1

Slurm Process Isolation

Bill Brophy, Martin Perry and Yiannis Georgiou (Bull), Morris JEtte (SchedMD), Matthieu Hautreux (CEA)

Download PDF

Pdf icon1

Improving Message Forwarding Logic in Slurm

Rod Schultz, Martin Perry and Yiannis Georgiou (Bull), Matthieu Hautreux (CEA), Danny Auble and Morris Jette (SchedMD)

Download PDF

Pdf icon1

Tuning Slurm Scheduling for Optimal Responsiveness and Utilization

Morris Jette (SchedMD)

Download PDF

Pdf icon1

Improving HPC Applications Scheduling with Predictions Based on Automatically-Collected Historical Data

Carlos Fenoy García (Barcelona Supercomputing Centre)

Download PDF

Pdf icon1

OStrich: Fair Scheduler for Burst Submissions of Parallel Job

Krzysztof Rzadca (University of Warsaw) and Filip Skalski (University of Warsaw / Google)

Download PDF

Pdf icon1

Adaptive Resource and Job Management for Limited Power Consumption

Yiannis Georgiou and David Glesser (Bull), Matthieu Hautreux (CEA), Denis Trystram (University Grenoble-Alpes)

Download PDF

Pdf icon1

Introducing Energy Based Fair-Share Scheduling

Yiannis Georgiou and David Glesser (Bull), Krzysztof Rzadca (University of Warsaw), Denis Trystram (University Grenoble-Alpes)

Download PDF

Pdf icon1

High Performance Data Movement Between Lustre and Enterprise Storage Systems

Aamir Rashid (Terascala)

Download PDF

Pdf icon1

Extending Slurm with Support for Remote GPU Virtualization

Sergio Iserte, Adrián Castelló, Rafael Mayo, Enrique S. Quintana-Ortlí, Federico Silla, Jose Duato (Universitat Jaume and Universitat Politècnica de València)

Download PDF

Pdf icon1

Slurm Migration Experience

Jacqueline Scoggins (Lawrence Berkeley National Lab)

Download PDF

Pdf icon1

Budget Checking Plugin for Slurm

Huub Stoffers (SURF sara)

Download PDF

Pdf icon1

Fair Tree: Fairshare Algorithm for Slurm

Ryan Cox and Levi Morrison (Brigham Young University)

Download PDF

Pdf icon1

Integrating Layouts Framework in Slurm

Thomas Cadeau and Yiannis Georgiou (Bull), Matthieu Hautreux (CEA)

Download PDF

Pdf icon1

Topology-Aware Resource Selection

Emmanuel Jeannot, Guillaume Mercier, and Adèle Villiermet (Inria)

Download PDF

Pdf icon1

Slurm Native Workload Management on Cray Systems

Morris Jette (SchedMD)

Download PDF

Pdf icon1

Slurm Roadmap

Yiannis Georgiou (Bull), Morris Jette and Jacob Jenson (SchedMD)

Download PDF

Pdf icon1

Private / tmp for Each Job Using SPANK

Magnus Jonsson (Umeå Universitet)

Download PDF

Pdf icon1

ICM Warsaw University Site Report

Dominik Bartkiewicz and Marcin Stolarek (ICM Warsaw University)

Download PDF

Pdf icon1

Swiss National Supercomputing Centre Site Report

Massimo Benini (Swiss National Supercomputing Centre, CSCS)

Download PDF

Pdf icon1

Aalto University Site Report

Janne Blomqvist, Ivan Degtyarenko and Mikko Hakala (Aalto University)

Download PDF

Pdf icon1

The George Washington University Site Report

Tim Wickberg, George Washington University

Download PDF

Presentations from Slurm Birds of a Feather, SC13, November 2013

Pdf icon1

Slurm Workload Manager Project Report

Morris Jette and Danny Auble, SchedMD

Download PDF

Presentations from Slurm User Group Meeting, September 2013

Pdf icon1

Keynote: Future Outlook for Advanced Computing

Dona Crawford (LLNL)

Download PDF

Pdf icon1

Technical: Overview of Slurm version 2.6

Morris Jette and Danny Auble (SchedMD), Yiannis Georgiou (Bull)

Download PDF

Pdf icon1

Tutorial: Energy Accounting and External Sensor Plugins

Yiannis Georgiou, Martin Perry, Thomas Cadeau (Bull), Danny Auble (SchedMD)

Download PDF

Pdf icon1

Technical: Debugging Large Machines

Matthieu Hautreux (CEA)

Download PDF

Pdf icon1

Technical: Creating Easy to Use HPC Portals with NICE EnginFrame and Slurm

Alberto Falzone, Paolo Maggi (Nice Software)

Download PDF

Pdf icon1

Tutorial: Usage of New Profiling Functionalities

Rod Schultz, Yiannis Georgiou (Bull), Danny Auble (SchedMD)

Download PDF

Pdf icon1

Technical: Fault Tolerant Workload Management

David Bigagli, Morris Jette (SchedMD)

Download PDF

Pdf icon1

Technical: Slurm Layouts Framework

Yiannis Georgiou (Bull), Matthieu Hautreux (CEA)

Download PDF

Pdf icon1

Technical: License Management

Bill Brophy (Bull)

Download PDF

Pdf icon1

Technical: Multi-Cluster Management

Juan Pancorbo Armada (IRZ)

Download PDF

Pdf icon1

Technical: Depth Oblivious Hierarchical Fairshare Priority Factor

Francois Daikhate, Matthieu Hautreux (CEA)

Download PDF

Pdf icon1

Technical: Refactoring ALPS

Dave Wallace (Cray)

Download PDF

Pdf icon1

Site Report: CEA

Francois Diakhate, Francis Belot, Matthieu Hautreux (CEA)

Download PDF

Pdf icon1

Site Report: George Washington University

Tim Wickberg (George Washington University)

Download PDF

Pdf icon1

Site Report: Brigham Young University

Ryan Cox (BYU)

Download PDF

Pdf icon1

Site Report: Technische Universitat Dresden

Dr. Ulf Markwardt (Technische Universität Dresden)

Download PDF

Presentations from Slurm Birds of a Feather, SC12, November 2012

Pdf icon1

Slurm Workload Manager Project Report

Morris Jette and Danny Auble, SchedMD

Download PDF

Pdf icon1

Using Slurm for Data Aware Scheduling in the Cloud

Martijn de Vries, BrightComputing

Download PDF

Pdf icon1

MapReduce Support in Slurm: Releasing the Elephant

Ralph H. Castain, Wangda Tan, Jimmy Cao and Michael Lv, Greenplum/EMC

Download PDF

Pdf icon1

Slurm at Rensselaer

Tim Wickberg, Rensselaer Polytechnic Institute

Download PDF

Presentations from Slurm User Group Meeting, October 2012

Pdf icon1

Jesus Labarta, BSC

Pdf icon1

Slurm Status Report

Morris Jette and Danny Auble, SchedMD

Download PDF

Pdf icon1

Site Report: BSC/RES

Alejandro Lucero and Carles Fenoy, BSC

Download PDF

Pdf icon1

Site Report: CETA/CIEMAT

Alfonso Pardo Diaz, CIEMAT

Pdf icon1

Porting Slurm to Bluegene/Q

Don Lipari, LLNL

Pdf icon1

Tutorial: Slurm Database Use, Accounting and Limits

Danny Auble (SchedMD)

Download PDF

Pdf icon1

Tutorial: The Slurm Scheduler Design

Don Lipari, LLNL

Download PDF

Pdf icon1

Tutorial: Cgroup Support on Slurm

Martin Perry and Yiannis Georgiou (Bull), Matthieu Hautreux (CEA)

Download PDF

Pdf icon1

Tutorial: Kerberos and Slurm using Auks

Matthieu Hautreux, CEA

Download PDF

Pdf icon1

Keynote: Challenges in Evaluating Parallel Job Schedulers

Dror Feitelson, Hebrew University

Download PDF

Pdf icon1

Integration of Slurm with IBM’s Parallel Environment

Morris Jette and Danny Auble, SchedMD

Download PDF

Pdf icon1

Slurm Bank

Jimmy Tang and Paddy Doyle, Trinity College, Dublin

Download PDF

Pdf icon1

Using Slurm for Data Aware Scheduling in the Cloud

Martijn de Vries, Bright Computing

Download PDF

Pdf icon1

Enhancing Slurm with Energy Consumption Monitoring and Control Features

Yiannis Georgiou, Bull

Download PDF

Pdf icon1

MapReduce Support in Slurm: Releasing the Elephant

Ralph H. Castain, et al., Greenplum/EMC

Download PDF

Pdf icon1

High Throughput Computing with Slurm

Morris Jette and Danny Auble, SchedMD

Download PDF

Pdf icon1

Evaluating Scalability and Efficiency of the Resource and Job Management System on Large HPC Clusters

Yiannis Georgiou (Bull) and Matthieu Hautreux (CEA)

Download PDF

Pdf icon1

Integer Programming Based Herogeneous CPU-GPU Clusters

Seren Soner, Bogazici University

Download PDF

Pdf icon1

Job Resource Utilization as a Metric for Clusters Comparison and Optimization

Joseph Emeras, INRIA/LIG

Download PDF

Presentations from the Sixth Linux Collaboration Summit, April 2012

Pdf icon1

Resource Management with Linux Control Groups in HPC Clusters

Yiannis Georgiou, Bull

Download PDF

Presentations from Slurm Birds of a Feather, SC11, November 2011

Pdf icon1

Slurm Version 2.3 and Beyond

Morris Jette, SchedMD LLC

Download PDF

Pdf icon1

Cloud Bursting with Slurm and Bright Cluster Manager

Martijn de Vries, Bright Computing

Download PDF

Presentations from Slurm User Group Meeting, September 2011

Pdf icon1

Basic Configuration and Usage

Rod Schultz, Groupe Bull

Download PDF

Pdf icon1

CPU Management Allocation and Binding

Martin Perry, Groupe Bull

Download PDF

Pdf icon1

Configuring Slurm for HA

David Egolf and Bill Brophy, Groupe Bull

Download PDF

Pdf icon1

Slurm Resources Isolation Through cgroups

Yiannis Georgiou (Groupe Bull), Matthieu Hautreux (CEA)

Download PDF

Pdf icon1

Slurm Operation on Cray XT and XE

Moe Jette, SchedMD LLC

Download PDF

Pdf icon1

Challenges and Opportunities for Exascale Resource Management and How Today’s Petascale Systems are Guiding the Way

William Kramer, NCSA

Download PDF

Pdf icon1

Slurm Version 2.3 and Beyond

Moe Jette, SchedMD LLC

Download PDF

Pdf icon1

Proposed Design for Enhanced Enterprise-wide Scheduling

Don Lipari, LLNL

Download PDF

Pdf icon1

Bright Cluster Manager & Slurm

Robert Stober, Bright Computing

Download PDF

Pdf icon1

Job Step Management in User Space

Moe Jette, SchedMD LLC

Download PDF

Pdf icon1

Slurm Operation IBM BlueGene/Q

Danny Auble, SchedMD LLC

Download PDF

Presentations from Slurm Birds of a Feather, SC10, November 2010

Pdf icon1

Slurm Version 2.2: Features and Release Plans

Morris Jette, Danny Auble, and Donald Lipari, Lawrence Livermore National Laboratory

Download PDF

Presentations from Slurm User Group Meeting, October 2010

Pdf icon1

Slurm: Resource Management from the Simple to the Sophisticated

Morris Jette and Danny Auble, Lawrence Livermore National Laboratory

Download PDF

Pdf icon1

Slurm Support for Linux Control Groups

Martin Perry, Bull Information Systems

Download PDF

Pdf icon1

Slurm at BSC

Carles Fenoy and Alejandro Lucero, Barcelona Supercomputing Center

Download PDF

Pdf icon1

Porting Slurm to the Cray XT and XE

Neil Stringfellow and Gerrit Renker, Swiss National Supercomputer Centre

Download PDF

Pdf icon1

Real Scale Experimentations of Slurm Resource and Job Management System

Yiannis Georgiou, Bull Information Systems

Download PDF

Pdf icon1

Slurm Version 2.2: Features and Release Plans

Morris Jette and Danny Auble, Lawrence Livermore National Laboratory

Download PDF

Presentations from Slurm Birds of a Feather, SC09, November 2009

Pdf icon1

Slurm Community Meeting

Morris Jette, Danny Auble, and Donald Lipari, Lawrence Livermore National Laboratory

Download PDF

Presentations from Slurm Birds of a Feather, SC08, November 2008

Pdf icon1

High Scalability Resource Management with Slurm

Morris Jette, Lawrence Livermore National Laboratory

Download PDF

Pdf icon1

Slurm Status Report

Morris Jette and Danny Auble, Lawrence Livermore National Laboratory

Download PDF

Other Presentations

Pdf icon1

Slurm Version 1.3

Morris Jette and Danny Auble, Lawrence Livermore National Laboratory (May 2008)

Download PDF

Pdf icon1

Managing Clusters with Moab and Slurm

Morris Jette and Donald Lipari, Lawrence Livermore National Laboratory (May 2008)

Download PDF

Pdf icon1

Resource Management at LLNL, Slurm Version 1.2

Morris Jette, Danny Auble and Chris Morrone, Lawrence Livermore National Laboratory (April 2007)

Download PDF

Pdf icon1

Resource Management Using Slurm

Morris Jette, Lawrence Livermore National Laboratory (Tutorial, The 7th International Conference on Linux Clusters, May 2006)

Download PDF

Publications

Pdf icon1

Energy Accounting and Control with Slurm Resource and Job Management System

Yiannis Georgiou, et. al. (ICDCN 2014, January 2014)

Pdf icon1

Evaluating scalability and efficiency of the Resource and Job Management System on large HPC Clusters

Yiannis Georgiou (BULL S.A.S, France); Matthieu Hautreux (CEA-DAM, France) (16th Workshop on Job Scheduling Strategies for Parallel Processing, May 2012)

Download PDF

Pdf icon1

GreenSlot: Scheduling Energy Consumption in Green Datacenters

Inigo Goiri, et. al. (SuperComputing 2011, November 2011)

Pdf icon1

Contributions for Resource and Job Management in High Performance Computing

Yiannis Georgiou, Universite Joseph Fourier (Thesis, December 2010)

Download PDF

Pdf icon1

Caos NSA and Perceus: All-in-one Cluster Software Stack

Jeffrey B. Layton, Linux Magazine, 5 February 2009

Pdf icon1

Enhancing an Open Source Resource Manager with Multi-Core/Multi-threaded Support

S. M. Balle and D. Palermo, Job Scheduling Strategies for Parallel Processing, 2007

Pdf icon1

Slurm: Simple Linux Utility for Resource Management

M. Jette and M. Grondona, Proceedings of ClusterWorld Conference and Expo, San Jose, California, June 2003

Download PDF

Pdf icon1

Slurm: Simple Linux Utility for Resource Management

A. Yoo, M. Jette, and M. Grondona, Job Scheduling Strategies for Parallel Processing, volume 2862 of Lecture Notes in Computer Science, pages 44-60, Springer-Verlag, 2003

Interview

Pdf icon1

RCE 10: Slurm (podcast)

Brock Palen and Jeff Squyres speak with Morris Jette and Danny Auble of LLNL about Slurm

Download PDF

Other Resources

Pdf icon1

Learning Chef: Compute Cluter with Slurm

A Slurm Cookbook by Adam DeConinck

Download PDF