Oldwiki.scinet.utoronto.ca:System Alerts


System Status

GPC: up | TCS: up | Sandy: up | File System: up
Gravity: up | P7: up | Viz: down | BGQ: up | HPSS: down

Wed Feb 3 14:35:52 EST 2016: HPSS is down for maintenance.

Jan 15, 11:20 AM: Systems are in the process of being brought online.

Jan 14, 3:30 PM: Downtime extended to noon on Friday Jan 15th (estimate).

Our sincere apologies for this extension of the downtime. Unfortunately, a problem has come to light with some of the disks in the file system. Because of the way the file system is set up, no data has been lost. However, if we put the system back into production now, a single additional failure would risk data loss or corruption, so this needs to be fixed now.

The BGQ file system is not affected by this problem and may be brought up earlier.

Updates will be posted here.

Note: Because of the downtime, we'll be deferring the scratch purging that was scheduled for January 15th to Wednesday January 20th.

Jan 13, 7:00 AM: Downtime in effect.

SCHEDULED MAINTENANCE DOWNTIME ANNOUNCEMENT

There will be a full SciNet shutdown from January 13th to January 14th, 2016 for scheduled annual maintenance.

All systems will go down at 7 AM on Wednesday January 13th; all login sessions and jobs will be killed at that time.