Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"

From oldwiki.scinet.utoronto.ca
 
== System Status: <span style="color:#33AA33">'''UP'''</span> but filesystems are <span style="color:#FF0000">'''DOWN'''</span> ==  
 
  
We've had another GPFS filesystem failure, possibly a remnant of the problems we've been having for the past couple of weeks. Most, if not all, jobs were lost.  Please resubmit when the filesystems are back. We'll advise here when the systems are ready again.  Apologies...
+
There was an InfiniBand switch problem on the TCS, which caused the fileservers to essentially go down. It appears to be solved, and the filesystems - and systems - are coming back up.
 +
 
 +
Please resubmit your jobs.
 +
 
 +
Mon Dec 19 18:29:02 EST 2011
  
  
 
''Note: From Dec 21, 2011 to Jan 1, 2012, the SciNet offices are officially closed, but the system will be up and running and we will keep an eye out for emergencies.''
 
  
Mon Dec 19 14:25:22 EST 2011
 
  
 
([[Previous_messages:|Previous messages]])
 

Revision as of 19:30, 19 December 2011

System Status: UP but filesystems are DOWN

There was an InfiniBand switch problem on the TCS, which caused the fileservers to essentially go down. It appears to be solved, and the filesystems - and systems - are coming back up.

Please resubmit your jobs.

Mon Dec 19 18:29:02 EST 2011


Note: From Dec 21, 2011 to Jan 1, 2012, the SciNet offices are officially closed, but the system will be up and running and we will keep an eye out for emergencies.


(Previous messages)