Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"

From oldwiki.scinet.utoronto.ca
Jump to navigation Jump to search
Line 32: Line 32:
 
Mon 24 Nov 2014 10:24:55 EST:  Filesystems are experiencing huge waiter problems.  Our systems people are working to clear the issue.
 
Mon 24 Nov 2014 10:24:55 EST:  Filesystems are experiencing huge waiter problems.  Our systems people are working to clear the issue.
  
Fri Nov 21 23:53:08 EST 2014: System is ready to login. GPC and BGQ are online accepting jobs. Working the rest of systems.
 
 
Fri Nov 21 22:45:58 EST 2014: Water is restored. Working on bring up systems.
 
 
Fri 21 Nov 2014 21:22:28 EST:  Datacentre still down.  Emergency water repairs being done in the area.  Hence no cooling.  We expect systems to be back in operation sometime tomorrow morning.
 
 
Fri Nov 21 18:49:25 EST 2014:  Datacentre down. Staff enroute. Investigating
 
 
Fri Nov 21 14:30:00 EDT 2014: File system is slow. Investigating.
 
 
Fri Nov 7 15:00:00 EDT 2014: HPSS is nearing capacity: jobs can be submitted, but will only run once they are reviewed and released by SciNet staff.
 
  
 
([[Previous_messages:|Previous messages]])
 
([[Previous_messages:|Previous messages]])

Revision as of 12:31, 27 November 2014

System Status

upGPC downTCS upSandy downARC upFile System
upGravity upP7 upBGQ upHPSS

Thu Nov 27 11:24:17 EST 2014: On Monday, December 1, a Moab developer will debug a scheduler problem, which cancels jobs unexpectedly upon restart on GPC. The debugging process will start at 11AM, and some queued jobs will be cancelled. It's advised not to submit new jobs during this period. Please checks wiki for update.

Mon Nov 24 10:39:00 EST 2014: Filesystems are back.

Mon 24 Nov 2014 10:24:55 EST: Filesystems are experiencing huge waiter problems. Our systems people are working to clear the issue.


(Previous messages)