Because of a scheduled data center power outage on Sunday, 06/02/2024, ORC will begin its biannual maintenance window on the same day to minimize future disruptions. This is a reminder that the Hopper and Argo ORC clusters will be unavailable from 6:00AM on Sunday, 06/02/2024, through the end of the day on Tuesday, 06/04/2024, for scheduled maintenance. We plan to upgrade the compute, network, and storage infrastructure of the clusters during this maintenance window.
Maintenance Schedule:
– Start Time: June 02, 6:00AM
– End Time: June 04, End of Day
Affected services:
1. Login/Head and Compute Nodes: All login/head and compute nodes will be inaccessible. Users will not be able to log in or submit jobs during this time.
2. Open OnDemand (OOD): The Open OnDemand (OOD) interface will be inaccessible. Users are advised to plan their activities accordingly and save their work before the maintenance window begins.
3. Storage and Related Services: Most storage services, including Samba and Globus, will have very limited and intermittent availability.
4. Virtual Machines: Most virtual machines will have intermittent availability. Users with virtual machines should be prepared for potential interruptions in service.
All SLURM partitions will be drained when the maintenance window starts. Any job started between now and the maintenance period must be timed to end before 6:00AM on Sunday, June 2. When starting a job, set the SLURM time parameter to less than 6 days, and reduce it as the maintenance window gets closer, for example:
#SBATCH --time=4-00:00:00 ## Days-Hours:Mins:Secs - calculate backwards from 6/2 6:00AM
Please note that any job whose requested time limit would run into the planned downtime will not start.
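The backward calculation from the maintenance start can be sketched in a few lines of Python. This is only an illustration, not an ORC-provided tool: the function name `max_slurm_time` and the 30-minute safety margin are assumptions made for the example, and the start time is taken from this notice as cluster local time. Because the SLURM time limit counts from when a job starts running rather than when it is submitted, the margin is there to leave some room for time spent waiting in the queue.

```python
from datetime import datetime, timedelta

# Maintenance start from this notice: Sunday, June 2, 2024, 6:00AM (cluster local time).
MAINTENANCE_START = datetime(2024, 6, 2, 6, 0)


def max_slurm_time(now, margin=timedelta(minutes=30)):
    """Return the longest --time value (D-HH:MM:SS) that still ends before maintenance.

    `margin` is a hypothetical safety buffer chosen for this example,
    not an ORC requirement. Returns None when no usable time remains.
    """
    remaining = MAINTENANCE_START - now - margin
    total = int(remaining.total_seconds())
    if total <= 0:
        return None
    days, rem = divmod(total, 86400)      # whole days
    hours, rem = divmod(rem, 3600)        # whole hours left over
    minutes, seconds = divmod(rem, 60)    # minutes and seconds left over
    return f"{days}-{hours:02d}:{minutes:02d}:{seconds:02d}"


if __name__ == "__main__":
    limit = max_slurm_time(datetime.now())
    if limit is None:
        print("Too close to the maintenance window to start a job.")
    else:
        print(f"#SBATCH --time={limit}")
```

Running the script prints a directive that can be pasted into a job script. A job that needs less time than the printed value can simply request its usual, shorter limit.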
The clusters should be back online by the end of the day on Tuesday, 06/04/2024.
If you have any questions or concerns, contact the ORC at orchelp@gmu.edu.