Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"


Revision as of 10:57, 10 October 2015

System Status

GPC: up, TCS: up, Sandy: up, ARC: up, File System: up
Gravity: up, P7: up, BGQ: up, HPSS: up

Sat Oct 10 10:52:55 EDT 2015: There was a network glitch around 9 AM; file systems were unmounted and most jobs were killed. Systems are back to normal.

Sat Oct 3, 11:55:00: HPSS is up again.

Mon Sep 28 15:18:00: A relatively small portion of nodes (still about 250) lost connection to the $SCRATCH file system. Jobs running on those nodes likely failed. The file system is back to normal.