Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"

From oldwiki.scinet.utoronto.ca
Line 8:
   -->
  {|
- |[[File:up.png| up |link=GPC Quickstart]][[GPC Quickstart|GPC]]
+ |[[File:down.png| down|link=GPC Quickstart]][[GPC Quickstart|GPC]]
- |[[File:up.png| up |link=TCS Quickstart]][[TCS Quickstart|TCS]]
+ |[[File:down.png| down|link=TCS Quickstart]][[TCS Quickstart|TCS]]
- |[[File:up.png| up |link=Sandy]][[Sandy]]
+ |[[File:down.png| down|link=Sandy]][[Sandy]]
- |[[File:up.png| up |link=GPU Devel Nodes]][[GPU Devel Nodes|ARC]]
+ |[[File:down.png| down|link=GPU Devel Nodes]][[GPU Devel Nodes|ARC]]
- |[[File:up.png| up| up ]]File System
+ |[[File:down.png| down| up ]]File System
  |-
- |[[File:up.png| up |link=Gravity]][[Gravity]]
+ |[[File:down.png| down|link=Gravity]][[Gravity]]
- |[[File:up.png| up |link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]
+ |[[File:down.png| down|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]
  |[[File:up.png| up |link=BGQ]][[BGQ]]
- |[[File:up.png| up |link=HPSS]][[HPSS]]
+ |[[File:down.png| down|link=HPSS]][[HPSS]]
  |
  |}

+ Sat Feb  7 07:00:14 EST 2015: datacenter is being shutdown automatically due to a power failure

  Fri Feb  6 08:41:08 EST 2015: <span style="color:red">The /scratch file system, which had crashed during the early morning and took almost all jobs with it, is back to normal. Practically all GPC and TCS jobs died. Please resubmit your jobs.</span>

Revision as of 08:09, 7 February 2015

System Status

down GPC | down TCS | down Sandy | down ARC | up File System
down Gravity | down P7 | up BGQ | down HPSS

Sat Feb 7 07:00:14 EST 2015: The datacenter is being shut down automatically due to a power failure.

Fri Feb 6 08:41:08 EST 2015: The /scratch file system, which had crashed during the early morning and taken almost all jobs with it, is back to normal. Practically all GPC and TCS jobs died. Please resubmit your jobs.

Fri Feb 6 05:58:00 EST 2015: /scratch is showing Stale file handle on many GPC/TCS nodes, indicating some kind of failure. We're investigating.

Thu Jan 22 13:27:44 EST 2015: BGQ now available as a single 4-rack system. bgqdev-fen1 is the single login/devel/submission node.

(Previous messages)