Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"

From oldwiki.scinet.utoronto.ca
Line 19: Line 19:
 
|}
 
|}
  
Wed Dec 25 07:15:10 EST 2013: Some TCS jobs were killed at ~7AM today as we shut down frames 9 and 10 to help stabilize temperatures in the machine room. Please check your jobs and resubmit. The nodes are being restarted.
+
January 15, 7:00 am - January 16, 6:00 pm
  
Wed Dec 25 07:15:10 EST 2013: Cooling tower was successfully de-iced and water temperatures have returned to normal.
+
'''Scheduled Maintenance Downtime'''
  
Wed Dec 25 06:57:10 EST 2013:  Shutting down some TCS nodes to help lower room temperatures. Cooling tower has frozen over. Trying to get de-icing cycle going again.
+
Systems will be taken down around 7 am on Jan 15, and are expected to be back by 6 pm if there are no unexpected setbacks.
  
Sun Dec 22 11:08:13 EST 2013: Another power event at 0312 today knocked out the BGQ again. Unfortunately, key staff are without power, so the time to restore is unknown (more than 250,000 customers in the GTA are currently without power).
+
Check here for updates.
 
 
Sun Dec 22 00:19:23 EST 2013: BGQ up and jobs running.  Some may have been killed so check your logs.
 
 
 
Sat Dec 21 23:39:26 EST 2013: A power glitch to the site at 2240 caused the BGQ to shut down - it is being restored. A large ice storm is underway and PowerStream reports over 20,000 customers without power. There may well be more issues overnight.
 
 
 
 
 
Last updated: Wed Dec 18 15:59:09 EST 2013
 
 
 
Dear SciNet users:
 
 
 
SciNet is officially on holiday from Sat Dec 21, 2013, until Sun Jan 5, 2014.  All systems will be up, and maintained on a best-effort basis.  User support will also be on a best-effort basis, though we will try to help if we can.
 
 
 
We wish you all Happy Holidays, and the best for the New Year.
 
 
 
The SciNet team.
 
  
 +
Last updated: Wed Jan 7 16:59:09 EST 2014
  
 
([[Previous_messages:|Previous messages]])
 

Revision as of 18:04, 7 January 2014

System Status

GPC: up, TCS: up, Sandy: up, ARC: up
Gravity: up, P7: up, BGQ: up, HPSS: up

January 15, 7:00 am - January 16, 6:00 pm

Scheduled Maintenance Downtime

Systems will be taken down around 7 am on Jan 15, and are expected to be back by 6 pm if there are no unexpected setbacks.

Check here for updates.

Last updated: Wed Jan 7 16:59:09 EST 2014

(Previous messages)