Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"

From oldwiki.scinet.utoronto.ca
Jump to navigation Jump to search
Line 11: Line 11:
 
|[[File:down.png|up|link=TCS Quickstart]][[TCS Quickstart|TCS]]
 
|[[File:down.png|up|link=TCS Quickstart]][[TCS Quickstart|TCS]]
 
|[[File:down.png|up|link=Sandy]][[Sandy]]
 
|[[File:down.png|up|link=Sandy]][[Sandy]]
|[[File:up.png|up|link=GPU Devel Nodes]][[GPU Devel Nodes|ARC]]
+
|[[File:down.png|up|link=GPU Devel Nodes]][[GPU Devel Nodes|ARC]]
|[[File:up.png|up]]File System
+
|[[File:down.png|up]]File System
 
|-
 
|-
|[[File:up.png|up|link=Gravity]][[Gravity]]
+
|[[File:down.png|up|link=Gravity]][[Gravity]]
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]
+
|[[File:down.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]
|[[File:up.png|up|link=BGQ]][[BGQ]]
+
|[[File:down.png|up|link=BGQ]][[BGQ]]
 
|[[File:down.png|up|link=HPSS]][[HPSS]]
 
|[[File:down.png|up|link=HPSS]][[HPSS]]
 
|
 
|

Revision as of 15:19, 30 June 2014

System Status

upGPC upTCS upSandy upARC upFile System
upGravity upP7 upBGQ upHPSS

Sun Jun 29 19:57:29: Compute systems started coming online about 730PM.

Sun Jun 29 18:20:41: filesystems restarted after some issues. Likely at least 8PM before compute systems available

Sun Jun 29 16:39:35 EDT 2014: large voltage spike tripped our main circuit breaker. We have power though it's out at sites within 2k because of lightning strike. Cooling system being restored

Sun Jun 29 15:47:11 EDT 2014: staff enroute to site. Should have update on cause within an hour

Sun Jun 29 15:40:31 EDT 2014: power lost about 3:20P today. All systems down. Investigating.


Note: As a precaution, emails by the Moab/Torque scheduler have been disabled because of a potential security vulnerability since Jan 24th 2014.

Last updated: Fri May 23 12:01:44 EDT 2014 (Previous messages)