Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"

From oldwiki.scinet.utoronto.ca
Jump to navigation Jump to search
Line 1: Line 1:
 
== System Status: <span style="color:#00dd44">'''UP'''</span>==
 
== System Status: <span style="color:#00dd44">'''UP'''</span>==
  
'''''Note: a glitch caused the scratch file system to get unmounted everywhere around 9 am on Wednesday. It has been remounted. However, most jobs were killed and you will have to resubmit your jobs.'''''
+
'''''Scheduled shutdown: 8 PM, Tues 22 Nov'''''
  
The recovery of /project directories for groups with storage allocation which is less than 5 TB in /project is still in progress. Until then, those directories are unaccessible (owned by root). If you can read your project directory, it means that the recovery is complete.
+
Systems expected to be back online by early evening on Wedmesday 23 Nov. During this downtime a Transient Voltage Suppression System and an
 +
Under-Voltage & Phase Regulator will be installed to help mitigate
 +
against problems from the electrical grid. The Variable Frequency Drive
 +
on the primary pump will be removed as it has often (for unknown reasons
 +
and despite replacement, fixes and reprogramming) been implicated in  
 +
most of the emergency shutdowns this calendar year. Storage system firwmare on the controllers and disk drawers will be updated. Finally, all the
 +
inlet water valves on the TCS racks will be replaced as a preventative
 +
measure (two already cracked in Aug-Sept).
  
<!-- To expedite this process, for now, no material can be retrieved from HPSS by users. -->
+
Last updated: Mon Nov 21 15:43:25 EST 2011
Note that the monthly purge of the scratch space will be delayed until Friday, 25 Nov because of the downtime.
 
 
 
Last updated: Wed Nov 16 11:10:37 EST 2011
 
  
 
([[Previous_messages:|Previous messages]])
 
([[Previous_messages:|Previous messages]])

Revision as of 16:44, 21 November 2011

System Status: UP

Scheduled shutdown: 8 PM, Tues 22 Nov

Systems expected to be back online by early evening on Wedmesday 23 Nov. During this downtime a Transient Voltage Suppression System and an Under-Voltage & Phase Regulator will be installed to help mitigate against problems from the electrical grid. The Variable Frequency Drive on the primary pump will be removed as it has often (for unknown reasons and despite replacement, fixes and reprogramming) been implicated in most of the emergency shutdowns this calendar year. Storage system firwmare on the controllers and disk drawers will be updated. Finally, all the inlet water valves on the TCS racks will be replaced as a preventative measure (two already cracked in Aug-Sept).

Last updated: Mon Nov 21 15:43:25 EST 2011

(Previous messages)