Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"

From oldwiki.scinet.utoronto.ca
Jump to navigation Jump to search
Line 8: Line 8:
 
  -->
 
  -->
 
{|  
 
{|  
|[[File:down.png| down |link=GPC Quickstart]][[GPC Quickstart|GPC]]
+
|[[File:up50.png| up |link=GPC Quickstart]][[GPC Quickstart|GPC]]
 
|[[File:down.png| down |link=TCS Quickstart]][[TCS Quickstart|TCS]]
 
|[[File:down.png| down |link=TCS Quickstart]][[TCS Quickstart|TCS]]
|[[File:down.png| down |link=Sandy]][[Sandy]]
+
|[[File:up.png| up |link=Sandy]][[Sandy]]
 
|[[File:down.png| down |link=GPU Devel Nodes]][[GPU Devel Nodes|ARC]]
 
|[[File:down.png| down |link=GPU Devel Nodes]][[GPU Devel Nodes|ARC]]
|[[File:down.png| down ]]File System
+
|[[File:up.png| up ]]File System
 
|-
 
|-
|[[File:down.png| down |link=Gravity]][[Gravity]]
+
|[[File:up.png| up |link=Gravity]][[Gravity]]
 
|[[File:down.png| down |link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]
 
|[[File:down.png| down |link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]
 
|[[File:down.png| down |link=BGQ]][[BGQ]]
 
|[[File:down.png| down |link=BGQ]][[BGQ]]
Line 21: Line 21:
 
|}
 
|}
  
 +
 +
Thu  4 Dec 2014 16:05:07 EST:  Some systems are back up, but only until tonight when the advertised shutdown will still happen, starting at around 9PM.  Only short jobs, that fit in the short time before the systems are taken down, will run.  Filesystems are up, as well as devel nodes on all platforms.  BGQ will remain down until after the water repairs.
  
 
Thu Dec  4 14:03:24 EST 2014: Systems abnormally shutdown due to loss of plumbing to secondary loop. Still investigating.
 
Thu Dec  4 14:03:24 EST 2014: Systems abnormally shutdown due to loss of plumbing to secondary loop. Still investigating.
  
 
ALL SYSTEMS TO BE SHUTDOWN ON THURSDAY- On Thursday Dec 4, at 9PM EST, all systems will need to be shutdown.  The city of Vaughan has advised us that the city water supply will be turned off in order to fully fix the problem that occured on Nov 21.  With no water supply we cannot cool the datacentre, hence the shutdown.  We expect all systems to be back up on Friday Dec 5, at around 11AM.
 
ALL SYSTEMS TO BE SHUTDOWN ON THURSDAY- On Thursday Dec 4, at 9PM EST, all systems will need to be shutdown.  The city of Vaughan has advised us that the city water supply will be turned off in order to fully fix the problem that occured on Nov 21.  With no water supply we cannot cool the datacentre, hence the shutdown.  We expect all systems to be back up on Friday Dec 5, at around 11AM.
 
Mon Dec  1 12:41:39 EST 2014:
 
GPC Moab scheduler debugging is finished and is back to normal. No jobs were cancelled.
 
 
Thu Nov 27 11:24:17 EST 2014:
 
On Monday, December 1, a Moab developer will debug a scheduler problem,
 
which cancels jobs unexpectedly upon restart on GPC. The debugging
 
process will start at 11AM, and some queued jobs will be cancelled.
 
It's advised not to submit new jobs during this period. Please checks
 
wiki for update.
 
 
Mon Nov 24 10:39:00 EST 2014:  Filesystems are back.
 
 
Mon 24 Nov 2014 10:24:55 EST:  Filesystems are experiencing huge waiter problems.  Our systems people are working to clear the issue.
 
  
  
 
([[Previous_messages:|Previous messages]])
 
([[Previous_messages:|Previous messages]])

Revision as of 17:12, 4 December 2014

System Status

upGPC downTCS upSandy downARC upFile System
upGravity downP7 downBGQ downHPSS


Thu 4 Dec 2014 16:05:07 EST: Some systems are back up, but only until tonight when the advertised shutdown will still happen, starting at around 9PM. Only short jobs, that fit in the short time before the systems are taken down, will run. Filesystems are up, as well as devel nodes on all platforms. BGQ will remain down until after the water repairs.

Thu Dec 4 14:03:24 EST 2014: Systems abnormally shutdown due to loss of plumbing to secondary loop. Still investigating.

ALL SYSTEMS TO BE SHUTDOWN ON THURSDAY- On Thursday Dec 4, at 9PM EST, all systems will need to be shutdown. The city of Vaughan has advised us that the city water supply will be turned off in order to fully fix the problem that occured on Nov 21. With no water supply we cannot cool the datacentre, hence the shutdown. We expect all systems to be back up on Friday Dec 5, at around 11AM.


(Previous messages)