Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"

Line 22:
  |[[File:up.png| up|link=TCS Quickstart]][[TCS Quickstart|TCS]]
  |[[File:up.png| up|link=Sandy]][[Sandy]]
- |[[File:up.png| up]]File System
+ |[[File:up25.png| trouble]]File System
  |-
  |[[File:up.png| up|link=Gravity]][[Gravity]]
Line 30:
  |[[File:up.png| up|link=HPSS]][[HPSS]]
  |}
+
+ Mon Dec 7, 16:00 EST 2015: File system trouble: many nodes are unmounting the file system. Investigating.
+
  Sun Nov 29 12:03:33 EST 2015: Minor scheduling issue on the GPC caused some queued jobs to get removed.  Running jobs were unaffected.

Revision as of 17:30, 7 December 2015

System Status

GPC: up | TCS: up | Sandy: up | File System: trouble
Gravity: up | P7: up | Viz: up | BGQ: up | HPSS: up

Mon Dec 7, 16:00 EST 2015: File system trouble: many nodes are unmounting the file system. Investigating.

Sun Nov 29 12:03:33 EST 2015: Minor scheduling issue on the GPC caused some queued jobs to get removed. Running jobs were unaffected.

Sat Oct 10 10:52:55 EDT 2015: A network glitch around 9 AM caused file systems to be unmounted and most jobs to be killed. Systems are back to normal.

Sat Oct 3 11:55:00 EDT 2015: HPSS is up again.