Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"
Revision as of 16:45, 29 July 2016
System Status
GPC | TCS | Sandy | File System
Gravity | P7 | Viz | BGQ | HPSS
Fri Jul 29 15:10:00 EDT 2016: /scratch and /project are down.
Thu Jul 7 10:08:45 EDT 2016: All compute systems are back. HPSS will be restarted later today.
Thu 7 Jul 2016 06:45:20 EDT: The main power breaker tripped in response to a large voltage spike. The cooling system is being restarted, followed by the filesystems and compute systems. Barring unexpected problems, users should have access by mid-to-late morning.
Thu 7 Jul 2016 06:04:05 EDT: Power failure at the datacentre at 05:23 this morning. Staff are en route to assess the situation. All systems are down.
Fri 17 Jun 2016 14:40:28 EDT: Normal access to the BGQ has been restored.
Wed Jun 15 16:20:56 EDT 2016: The HPSS software upgrade is finished.
Mon Jun 13 13:40: The file system seems better after rebooting a few troublesome nodes.
Mon Jun 13 13:00: The file system is slow at present.
Fri Jun 10 13:41:43 EDT 2016: HPSS is scheduled for a software upgrade on June 15 (next Wednesday). Jobs submitted with a 72-hour request will run only after the upgrade.