Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"

From oldwiki.scinet.utoronto.ca
Jump to navigation Jump to search
 
(128 intermediate revisions by 7 users not shown)
Line 17: Line 17:
 
     up.png    for 100% up
 
     up.png    for 100% up
  
  -->
+
   
 
{|  
 
{|  
|[[File:down.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]
+
|[[File:up.png|up|link=https://docs.scinet.utoronto.ca/index.php/Main_Page]][https://docs.scinet.utoronto.ca Niagara]
|[[File:down.png|up|link=TCS Quickstart]][[TCS Quickstart|TCS]]
 
|[[File:down.png|up|link=Sandy]][[Sandy]]
 
|[[File:down.png|up|link=Gravity]][[Gravity]]
 
|[[File:down.png|up|link=BGQ]][[BGQ]]
 
|[[File:down.png|up|]]File System
 
 
|-
 
|-
|[[File:down.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]
+
|[[File:up.png|up|link=BGQ]][[BGQ]]
|[[File:down.png|up|link=P8]][[P8]]
+
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]
|[[File:down.png|up|link=Knights Landing]][[Knights Landing|KNL]]
+
|[[File:up.png|up|link=P8]][[P8]]
|[[File:down.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]
+
|-
|[[File:down.png|up|link=HPSS]][[HPSS]]
+
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]
 +
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]
 +
|[[File:down.png|up|link=HPSS]][https://docs.scinet.utoronto.ca/index.php/HPSS HPSS]
 +
|-
 +
|[[File:up.png|up|]]File System
 +
|[[File:up.png|up|]]External Network
 +
|
 
|}
 
|}
  
<b> Fri Sep 29 11:47:57 EDT 2017 </b> There was a power outage at the Datacentre that caused the compute systems to go down. Power has been restored and we are working to bring systems back online.
+
-->
 
 
<b> Thu Aug 31 12:58:45 EDT 2017 </b> BGQ back online.
 
 
 
<b> Thu Aug 31 9:09:45 EDT 2017 </b> BGQ down for scheduled service.  Should be back online around 2pm.
 
 
 
<b> Fri Aug 18 17:10:24 EDT 2017 </b> Systems back to normal.
 
 
 
<b> Fri Aug 18 13:20:56 EDT 2017 </b> We need to have an emergency shutdown of all compute systems to fix a cooling issue that has arisen.  Should be back up this afternoon.  We will try and keep the login nodes and stroage online, however all the compute nodes will need to be shutdown.
 
 
 
<b>Sat Aug  5 16:48:44 EDT 2017</b> The switch is fixed. Scinet0[2-4] and datamovers are back online.
 
 
 
<b>Sat Aug  5 00:56:31 EDT 2017</b>  Most of GPC will be accessible soon. Lost a switch, scinet0[2-4] and datamovers will be down until it's fixed. Scinet01 may be login using its IP address; "ssh 142.150.188.51".
 
  
<b>Fri Aug  4 17:45:10 EDT 2017</b>  The chiller went down again, causing a full shutdown of all systems. We don't expect them back tonight, as the storm continues and power outages continue with it.
+
System status can now be found at [https://docs.scinet.utoronto.ca docs.scinet.utoronto.ca]
  
<b>Fri Aug  4 17:11:08 EDT 2017</b>  A power glitch took down all the compute nodes, including GPC, TCS, BGQ.  The filesystems are up, except for reserved1 and scratchtcs.  Systems are being restored.
 
  
<b>Tue Jun 13 11:24:15 EDT 2017</b> HPSS is back on service.
+
<b> Mon 23 Apr 2018 </b> GPC-compute is decommissioned, GPC-storage available until <font color=red><b>30 May 2018</b></font>
  
<b>Sat Jun 10 21:38:37 EDT 2017</b> The robot arm on the HPSS library developed problems. HPSS is out of service until further notice.
+
<b> Thu 18 Apr 2018 </b> Niagara system will undergo an upgrade to its Infiniband network between 9am and 12pm, should be transparent to users, however there is a chance of network interruption.
  
<b>Tue May 23 13:40:56 EDT 2017</b> HPSS is back on service.
+
<b> Fri 13 Apr 2018 </b> HPSS system will be down for a few hours on <b>Mon, Apr/16, 9AM</b>, for hardware upgrades, in preparation for the eventual move to the Niagara side.
  
<b>Mon May 22 07:37:27 EDT 2017</b> Overnight the robot arm on the library developed problems. We may not be able to have support come in and have it fixed until Tuesday. In the meantime HPSS is out of service
+
<b> Tue 10 Apr 2018 </b> Niagara is open to users.
  
<b>Thu May 18 13:00:00 EDT 2017</b> File system seem resolved. Please check your jobs and resubmit if they had issues.  
+
<b> Wed 4 Apr 2018 </b> We are very close to the production launch of Niagara, the new system installed at SciNet.
 +
While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users on next week.
  
<b>Thu May 18 12:00:00 EDT 2017</b> File system issues and some jobs may have died. Investigating.
+
All active GPC users will have their accounts, $HOME, and $PROJECT, transferred to the new
 +
Niagara system.  Those of you who are new to SciNet, but got RAC allocations on Niagara,
 +
will have your accounts created and ready for you to login.
  
 +
We are planning an extended [https://support.scinet.utoronto.ca/education/go.php/370/index.php Intro to SciNet/Niagara session], available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.
  
<!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] -->
+
<!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] -->

Latest revision as of 14:23, 7 May 2018

System Status

System status can now be found at docs.scinet.utoronto.ca


Mon 23 Apr 2018 GPC-compute is decommissioned, GPC-storage available until 30 May 2018

Thu 18 Apr 2018 Niagara system will undergo an upgrade to its Infiniband network between 9am and 12pm, should be transparent to users, however there is a chance of network interruption.

Fri 13 Apr 2018 HPSS system will be down for a few hours on Mon, Apr/16, 9AM, for hardware upgrades, in preparation for the eventual move to the Niagara side.

Tue 10 Apr 2018 Niagara is open to users.

Wed 4 Apr 2018 We are very close to the production launch of Niagara, the new system installed at SciNet. While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users on next week.

All active GPC users will have their accounts, $HOME, and $PROJECT, transferred to the new Niagara system. Those of you who are new to SciNet, but got RAC allocations on Niagara, will have your accounts created and ready for you to login.

We are planning an extended Intro to SciNet/Niagara session, available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.