Author Archives: Mark Miller

JHPCE unavailable April 21st – May 1st for ARCH Cooling maintenance

Dear JHPCE community, There is going to be a scheduled downtime for the JHPCE cluster from Friday, April 21 starting at 5:00 PM and going until Monday, May 1st at 5:00 PM.  The ARCH/Bayview Colocation facility will be down for … Continue reading

Posted in JHPCE Announcements | Comments Off on JHPCE unavailable April 21st – May 1st for ARCH Cooling maintenance

Globus Server update on JHPCE, Saturday Dec. 10th at 9:00 PM

Dear JHPCE community, We will be upgrading the Globus Server software on the JHPCE cluster this Saturday evening, December 10th, starting at 9:00 PM.  We expect that the upgrade will take 1 hour to complete.  During the upgrade, the Globus … Continue reading

Posted in JHPCE Announcements | Comments Off on Globus Server update on JHPCE, Saturday Dec. 10th at 9:00 PM

Please be judicious in your use of the email option in sbatch

One commonly used feature on the JHPCE cluster is the “send me an email when my job completes” option in SLURM. This option can be enabled by adding the “–mail-type=FAIL,END –mail-user=john@jhu.edu” options to your qsub command. $ sbatch –mail-type=FAIL,END –mail-user=john@jhu.edu … Continue reading

Posted in JHPCE Announcements | Comments Off on Please be judicious in your use of the email option in sbatch

Heat issue at MARCC colocation – JHPCE cluster unavailable.

Dear JHPCE community, Update: 2022-06-22 13:00 – The cooling issue has been resolved, and the cluster is once again available. There is currently an issue at the MARCC colocation facility with the cooling system.  We had a number of compute … Continue reading

Posted in Cluster Status Updates | Comments Off on Heat issue at MARCC colocation – JHPCE cluster unavailable.

JHPCE Cluster to be unavailable from April 11th – April 15th for scheduled preventative maintenance on the HVAC equipment

The JHPCE cluster will be unavailable from April 11th – April 15th in order to accommodate scheduled preventative maintenance to be done on the HVAC system at the MARCC datacenter. We are planning to take the JHPCE cluster down beginning at … Continue reading

Posted in JHPCE Announcements | Comments Off on JHPCE Cluster to be unavailable from April 11th – April 15th for scheduled preventative maintenance on the HVAC equipment

2021-08-14 JHPCE cluster unavailable due to cooling issues at datacenter

The JHPCE cluster is currently down due to cooling issues at the Bayview/MARCC datacenter. We will keep you advised as the status changes.

Posted in Cluster Status Updates | Comments Off on 2021-08-14 JHPCE cluster unavailable due to cooling issues at datacenter

Setting per-user job limit on JHPCE cluster

As of June 17th, 2021, we are imposing a limit of 10,000 submitted jobs per user. Previously, there had been no limit, and this has caused issues in the past where the cluster scheduler was overloaded when there were 100s … Continue reading

Posted in JHPCE Announcements | Comments Off on Setting per-user job limit on JHPCE cluster

2021-05-24 – JHPCE cluster unavailable due to cooling issue

The JHPCE cluster is currently unavailable due to cooling issues at the Bayview/MARCC datacenter where the JHPCE cluster is located.  We apologize for any inconvenience, and we will keep you up to date as we are made aware of any … Continue reading

Posted in Cluster Status Updates | Comments Off on 2021-05-24 – JHPCE cluster unavailable due to cooling issue

Rebooting one DCL storage system Friday, April 2, 8:00AM – 9:00AM

Dear JHPCE community, We will be rebooting one of the DCL01 storage servers this Friday morning in order to resolve an issue with one of the filesystems on that server. The following directories will be unavailable this Friday, April 2, … Continue reading

Posted in Cluster Status Updates | Comments Off on Rebooting one DCL storage system Friday, April 2, 8:00AM – 9:00AM

2020-07-05 – 7:00 PM – JHPCE Cluster currently down due to cooling/power problems at datacenter

The JHPCE cluster is currently down due to problems with the cooling and power facilities at the Bayview/MARCC Colocation facility. We are being told that the issue will not be repaired until Monday at the earliest, but may take longer. … Continue reading

Posted in Cluster Status Updates | Comments Off on 2020-07-05 – 7:00 PM – JHPCE Cluster currently down due to cooling/power problems at datacenter