Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"

From oldwiki.scinet.utoronto.ca
Jump to navigation Jump to search
Line 9: Line 9:
 
{|  
 
{|  
 
|[[File:down.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]
 
|[[File:down.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]
|[[File:down.png|up|link=TCS Quickstart]][[TCS Quickstart|TCS]]
+
|[[File:up.png|up|link=TCS Quickstart]][[TCS Quickstart|TCS]]
 
|[[File:down.png|down|link=Sandy]][[Sandy]]
 
|[[File:down.png|down|link=Sandy]][[Sandy]]
 
|[[File:down.png|down|link=GPU Devel Nodes]][[GPU Devel Nodes|ARC]]
 
|[[File:down.png|down|link=GPU Devel Nodes]][[GPU Devel Nodes|ARC]]
|[[File:down.png|up]]File System
+
|[[File:up75.png|up]]File System
 
|-
 
|-
 
|[[File:down.png|down|link=Gravity]][[Gravity]]
 
|[[File:down.png|down|link=Gravity]][[Gravity]]
|[[File:down.png|down|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]
+
|[[File:up.png|down|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]
 
|[[File:down.png|down|link=BGQ]][[BGQ]]
 
|[[File:down.png|down|link=BGQ]][[BGQ]]
 
|[[File:down.png|up|link=HPSS]][[HPSS]]
 
|[[File:down.png|up|link=HPSS]][[HPSS]]

Revision as of 12:43, 28 September 2014

System Status

upGPC upTCS downSandy downARC upFile System
downGravity downP7 downBGQ upHPSS

Sun Sep 28 09:43:28 EDT 2014: Brief power outage knocked-out cooling system at about 0806 this morning. Cooling has been restored. Disk controllers and filesystems are being brought up. Systems will be unavailable until at least noon.

Fri Sep 19 15:50:54 EDT 2014: Scheduler has been stable for the past hour, and jobs are being scheduled. Please submit your jobs. Please be aware that showq is not reporting some running jobs that were running before the glitch. Use qstat instead of showq for these jobs. Most of queued jobs this morning were rejected by the scheduler when it went back online.


Fri Sep 19 12:52:20 EDT 2014: We've been experiencing intermittent problems with the scheduler. Job submission has been paused temporarily, until we can restart the scheduler. Please check this space for updates.


Mon Sep 15 11:13:10 EDT 2014: The scheduler had some issues this morning and had to be restarted to resolve them. Some queued and running jobs have been lost.


(Previous messages)