Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"

From oldwiki.scinet.utoronto.ca
Line 12: Line 12:
 
|[[File:down.png|down|link=Sandy]][[Sandy]]
 
|[[File:down.png|down|link=Sandy]][[Sandy]]
 
|[[File:down.png|down|link=GPU Devel Nodes]][[GPU Devel Nodes|ARC]]
 
|[[File:down.png|down|link=GPU Devel Nodes]][[GPU Devel Nodes|ARC]]
|[[File:up50.png|investigating]]File System
+
|[[File:up.png|up]]File System
 
|-
 
|-
 
|[[File:down.png|down|link=Gravity]][[Gravity]]
 
|[[File:down.png|down|link=Gravity]][[Gravity]]
 
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]
 
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]
|[[File:up.png|up|link=BGQ]][[BGQ]]
+
|[[File:down.png|down|link=BGQ]][[BGQ]]
 
|[[File:up.png|up|link=HPSS]][[HPSS]]
 
|[[File:up.png|up|link=HPSS]][[HPSS]]
 
|
 
|
Line 22: Line 22:
  
 
Fri Aug  1 17:23:05 EDT 2014: Around 5pm, a power outage lasting a few seconds took down an as-yet-unknown number of nodes.  GPC, Sandy, TCS, Gravity, and ARC are certainly affected, but to what extent is not yet clear.  Updates will be posted here.
 
Fri Aug  1 17:23:05 EDT 2014: Around 5pm, a power outage lasting a few seconds took down an as-yet-unknown number of nodes.  GPC, Sandy, TCS, Gravity, and ARC are certainly affected, but to what extent is not yet clear.  Updates will be posted here.
 +
 +
Fri Aug  1 17:46:04 EDT 2014: GPC, Sandy, ARC, Gravity, TCS, and BGQ were all affected. P7, HPSS and file system are okay. We're rebooting the nodes.
 +
  
 
Note: As a precaution, emails from the Moab/Torque scheduler have been disabled since Jan 24th 2014 because of a potential security vulnerability.
 
Note: As a precaution, emails from the Moab/Torque scheduler have been disabled since Jan 24th 2014 because of a potential security vulnerability.

Revision as of 17:49, 1 August 2014

System Status

GPC: down | TCS: down | Sandy: down | ARC: down | File System: up
Gravity: down | P7: up | BGQ: down | HPSS: up

Fri Aug 1 17:23:05 EDT 2014: Around 5pm, a power outage lasting a few seconds took down an as-yet-unknown number of nodes. GPC, Sandy, TCS, Gravity, and ARC are certainly affected, but to what extent is not yet clear. Updates will be posted here.

Fri Aug 1 17:46:04 EDT 2014: GPC, Sandy, ARC, Gravity, TCS, and BGQ were all affected. P7, HPSS and file system are okay. We're rebooting the nodes.


Note: As a precaution, emails from the Moab/Torque scheduler have been disabled since Jan 24th 2014 because of a potential security vulnerability.

Last updated: Tue Jul 15 7:51:44 EDT 2014 (Previous messages)