Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"

From oldwiki.scinet.utoronto.ca
Revision as of 13:36, 11 July 2010

System Status

Systems available. All jobs that were running at about 3 AM this morning almost certainly either failed or were killed at about 11 AM. A hardware failure has halved /home and /scratch performance, but this should be corrected tomorrow.

All queued jobs were also deleted, on both systems.

Msg last updated: Sun Jul 11 13:08:02 EDT 2010


Previous messages:

Sun Jul 11 09:56:27 EDT 2010: /scratch was inaccessible as of about 3AM

Fri Jul 9 16:18:18 EDT 2010: /scratch is accessible again

Fri Jul 9 15:38:43 EDT 2010: New trouble with the filesystems. We are working to fix things.

Fri Jul 9 11:28:00 EDT 2010: The /scratch filesystem failed at about 3 AM on Fri Jul 9, and all jobs running at the time died. Queued jobs will get scheduled.