Oldwiki.scinet.utoronto.ca:System Alerts

Revision as of 16:33, 2 September 2016 by Pinto (System Status)

System Status

GPC: up | TCS: up | Sandy: up | File System: up
Gravity: up | P7: up | Viz: up | BGQ: down | HPSS: up

Fri Sep 2 17:31:44 EDT 2016 The HPSS disk-based cache is full and is currently being drained to tape. Until enough material has been migrated and purged, the system will remain on hold.

Sun Aug 28 16:10:56 EDT BGQ racks were shut off due to an environment issue; the system is being restarted. Please resubmit your jobs.

Sun 28 Aug 2016 10:19:17 EDT Turning systems on. Expect to be up in a couple of hours.

Sun Aug 28 09:16:02 EDT 2016 Datacentre was shut down due to a facility power failure.

Tue 16 Aug 2016 21:55:20 Cooling restored, filesystem up and OK, bringing up clusters. GPC should be available to users by 11PM (perhaps as early as 10:30PM).

Tue 16 Aug 2016 20:46:37 Water service has been restored to building. Restarting cooling system.

Tue 16 Aug 2016 19:52:32 SciNet-related maintenance and modifications have been completed successfully. Work on the building water valve is expected to be done on time (by 9PM). Once water service is restored, we need to restore cooling, power up the filesystems, and then restart the clusters. It is unlikely that any systems will be available to users before 11PM, and it will take longer to get everything online. Check here for updates.

Tue 16 Aug 2016 07:07:32 Shutdown has started


Scheduled full-day maintenance shutdown begins:

7AM, Tuesday, 16 Aug

Several projects (adding new 208V circuits for storage, cooling tower maintenance, etc.) are being carried out on the same day, as the landlord needs to shut down the main building water supply (and therefore our cooling system as well) for repairs.

Expect us to start bringing systems up at about 10PM, depending on when the water work is done. Check here for further updates during the day.