Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"

From oldwiki.scinet.utoronto.ca
 
(260 intermediate revisions by 9 users not shown)
Line 17: Line 17:
 
     up.png    for 100% up
 
     up.png    for 100% up
  
  -->
+
   
 
{|  
 
{|  
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]
+
|[[File:up.png|up|link=https://docs.scinet.utoronto.ca/index.php/Main_Page]][https://docs.scinet.utoronto.ca Niagara]
|[[File:up.png|up|link=TCS Quickstart]][[TCS Quickstart|TCS]]
+
|-
|[[File:up.png|up|link=Sandy]][[Sandy]]
 
|[[File:up.png|up|link=Gravity]][[Gravity]]
 
 
|[[File:up.png|up|link=BGQ]][[BGQ]]
 
|[[File:up.png|up|link=BGQ]][[BGQ]]
|[[File:up.png|up]]File System
+
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]
 +
|[[File:up.png|up|link=P8]][[P8]]
 +
|-
 +
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]
 +
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]
 +
|[[File:down.png|up|link=HPSS]][https://docs.scinet.utoronto.ca/index.php/HPSS HPSS]
 
|-
 
|-
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]
+
|[[File:up.png|up|]]File System
|[[File:up.png|down|link=P8]][[P8]]
+
|[[File:up.png|up|]]External Network
|[[File:up.png|down|link=Knights Landing]][[Knights Landing|KNL]]
+
|
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]
 
|[[File:up.png|up|link=HPSS]][[HPSS]]
 
 
|}
 
|}
  
 +
-->
  
<b>Mon Nov 7 11:59:00 EST 2016</b> File system has been restored. Jobs are being scheduled again. Please resubmit jobs if they crashed or had issues last night or this morning.
+
System status can now be found at [https://docs.scinet.utoronto.ca docs.scinet.utoronto.ca]
 
 
<b>Mon Nov 7 10:40:00 EST 2016</b> Due to this issue, many jobs will have either crashed or not had a chance to write their output; please check any jobs you had running overnight. The scratch file system is expected to be back up soon.
 
 
 
<b>Mon Nov 7 9:40:00 EST 2016</b> Scratch file system filled up overnight. We are investigating how to mitigate this. In the meantime, the job scheduler has been stopped, so no new jobs will start (queued jobs will remain in the queue).
 
 
 
<b>Mon Nov 7 8:00:00 EST 2016</b> Apparent file system issues.
 
  
<b>Fri Oct 28 23:00:00 EDT 2016</b> The login nodes and devel nodes of the GPC, P7 and BGQ, as well as the datamover nodes, will be rebooted between 2 am and 6 am on Sat Oct 29. Running and queued jobs will not be affected, but interactive sessions will be closed.
 
  
<b>Mon Sep 26 10:33:47 EDT 2016</b> HPSS schedule is back to normal operations.
+
<b> Mon 23 Apr 2018 </b> GPC-compute is decommissioned; GPC-storage is available until <font color=red><b>30 May 2018</b></font>.
  
<b>Sun Sep 25 12:37:12 EDT 2016</b> Problems resolved. Systems have started coming online. Check the status "lights" above.
+
<b> Thu 18 Apr 2018 </b> The Niagara system will undergo an upgrade to its InfiniBand network between 9am and 12pm. This should be transparent to users, but there is a chance of network interruption.
  
<b>Sun Sep 25 10:16:37 EDT 2016</b> Power outage tripped the main breaker and other circuits. Power has been restored to the site, but there may be an issue with cooling system power that needs to be resolved before any compute systems can be restarted.
+
<b> Fri 13 Apr 2018 </b> HPSS system will be down for a few hours on <b>Mon, Apr/16, 9AM</b>, for hardware upgrades, in preparation for the eventual move to the Niagara side.
  
<b>Sun Sep 25 09:28:15 EDT 2016</b> Staff en route to site. Will give an ETA for recovery after assessing the situation.
+
<b> Tue 10 Apr 2018 </b> Niagara is open to users.
  
<b>Sun Sep 25 08:46 EDT 2016</b> Power outage at datacentre.
+
<b> Wed 4 Apr 2018 </b> We are very close to the production launch of Niagara, the new system installed at SciNet.
 +
While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users next week.
  
 +
All active GPC users will have their accounts, $HOME, and $PROJECT, transferred to the new
 +
Niagara system.  Those of you who are new to SciNet, but got RAC allocations on Niagara,
 +
will have your accounts created and ready for you to log in.
  
 +
We are planning an extended [https://support.scinet.utoronto.ca/education/go.php/370/index.php Intro to SciNet/Niagara session], available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.
  
<!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] -->
+
<!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] -->

Latest revision as of 14:23, 7 May 2018

System Status

System status can now be found at docs.scinet.utoronto.ca


Mon 23 Apr 2018 GPC-compute is decommissioned; GPC-storage is available until 30 May 2018.

Thu 18 Apr 2018 The Niagara system will undergo an upgrade to its InfiniBand network between 9am and 12pm. This should be transparent to users, but there is a chance of network interruption.

Fri 13 Apr 2018 HPSS system will be down for a few hours on Mon, Apr/16, 9AM, for hardware upgrades, in preparation for the eventual move to the Niagara side.

Tue 10 Apr 2018 Niagara is open to users.

Wed 4 Apr 2018 We are very close to the production launch of Niagara, the new system installed at SciNet. While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users next week.

All active GPC users will have their accounts, $HOME, and $PROJECT transferred to the new Niagara system. Those of you who are new to SciNet, but got RAC allocations on Niagara, will have your accounts created and ready for you to log in.

We are planning an extended Intro to SciNet/Niagara session, available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.