Oldwiki.scinet.utoronto.ca:System Alerts

From oldwiki.scinet.utoronto.ca
Jump to navigation Jump to search

System Status

upGPC upTCS upSandy upFile System
upGravity upP7 upViz upBGQ upHPSS

The full BGQ is reserved from 12 noon on Friday August 5, until noon on Sunday August 7, for a full-system run. Jobs may be submitted, but will remain in the queue until the full-system run is done.

Older news:

Fri Jul 29 20:21:18 EDT 2016: /scratch and /project are up.

Fri Jul 29 15:10:00 EDT 2016: /scratch and /project are down.

Thu Jul 7 10:08:45 EDT 2016: All compute systems are back. HPSS will be restarted later today.

Thu 7 Jul 2016 06:45:20 EDT: Main power breaker had tripped in response to a large voltage spike. Cooling system being restarted then filesystems and compute systems. Barring unexpected problems, users should have access by mid-late morning

Thu 7 Jul 2016 06:04:05 EDT: Power failure at datacentre at 0523 this morning. Staff enroute to assess situation. All systems are down

Fri 17 Jun 2016 14:40:28 EDT: Normal access to the BGQ has been restored.

Wed Jun 15 16:20:56 EDT 2016: The HPSS software upgrade is finished.

Mon Jun 13 13:40 File system seems better after rebooting a few troublesome nodes.

Mon Jun 13 13:00 File system slow at present.

Fri Jun 10 13:41:43 EDT 2016: HPSS is scheduled for a software upgrade on Jun/15 (next Wednesday). If you keep submitting jobs requesting 72 hours, they will run only after the upgrade.