Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"

From oldwiki.scinet.utoronto.ca
Jump to navigation Jump to search
 
Line 1: Line 1:
== System Status: <span style="color:#993300">'''TCS DOWN'''</span>/<span style="color:#339900">'''GPC AND OTHER SYSTEMS UP'''</span>==
+
== System Status==
Thu Apr 12 17:08:00 EST 2012: scheduled maintenance downtime of the TCS. As announced, running TCS jobs and TCS login sessions were killed. All other systems are up. The TCS is expected to be up again sometime this evening.
+
<!--
 +
  Notes for updating the system status:
  
--------------------------------
+
  - When removing system status entries, please archive them to:
  
Tue Apr 10 16:24:00 EST 2012: ''scheduled downtimes:''
+
    http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:
  
Apr 12: TCS Shutdown (Other systems will remain up).  The shutdown will start at 11 am and the system should be available again at in the evening of the same day.
+
    (yes, the trailing colon is part of the url)
  
Apr 18-19: Full SciNet shutdown. More details later.
+
  - The 'status circles' can be one of the following files:  
  
------------------------------
+
    down.png  for down
 +
    up25.png  for 25% up
 +
    up50.png  for 50% up
 +
    up75.png  for 75% up
 +
    up.png    for 100% up
  
Thu Feb 9 11:50:57 EST 2012: ''System Temporary Change for MPI ethernet jobs:''
+
   
 +
{|
 +
|[[File:up.png|up|link=https://docs.scinet.utoronto.ca/index.php/Main_Page]][https://docs.scinet.utoronto.ca Niagara]
 +
|-
 +
|[[File:up.png|up|link=BGQ]][[BGQ]]
 +
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]
 +
|[[File:up.png|up|link=P8]][[P8]]
 +
|-
 +
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]
 +
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]
 +
|[[File:down.png|up|link=HPSS]][https://docs.scinet.utoronto.ca/index.php/HPSS HPSS]
 +
|-
 +
|[[File:up.png|up|]]File System
 +
|[[File:up.png|up|]]External Network
 +
|
 +
|}
  
Due to some changes we are making to the GPC GigE nodes, if you run multinode ethernet MPI jobs (IB multinode jobs are fine), you will need to explicitly request the ethernet interface in your mpirun:
+
-->
  
For Openmpi  -> mpirun  --mca btl self,sm,tcp<br>
+
System status can now be found at [https://docs.scinet.utoronto.ca docs.scinet.utoronto.ca]
For IntelMPI  ->  mpirun -env I_MPI_FABRICS shm:tcp
 
  
There is no need to do this if you run on IB, or if you run single node mpi jobs on the ethernet (GigE) nodes.  Please check [[GPC_MPI_Versions]] for more details.
 
  
([[Previous_messages:|Previous messages]])
+
<b> Mon 23 Apr 2018 </b> GPC-compute is decommissioned, GPC-storage available until <font color=red><b>30 May 2018</b></font>
 +
 
 +
<b> Thu 18 Apr 2018 </b>  Niagara system will undergo an upgrade to its Infiniband network between 9am and 12pm, should be transparent to users, however there is a chance of network interruption. 
 +
 
 +
<b> Fri 13 Apr 2018 </b> HPSS system will be down for a few hours on <b>Mon, Apr/16, 9AM</b>, for hardware upgrades, in preparation for the eventual move to the Niagara side.
 +
 
 +
<b> Tue 10 Apr 2018 </b> Niagara is open to users.
 +
 
 +
<b> Wed 4 Apr 2018 </b> We are very close to the production launch of Niagara, the new system installed at SciNet.
 +
While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users on next week.
 +
 
 +
All active GPC users will have their accounts, $HOME, and $PROJECT, transferred to the new
 +
Niagara system.  Those of you who are new to SciNet, but got RAC allocations on Niagara,
 +
will have your accounts created and ready for you to login.
 +
 
 +
We are planning an extended [https://support.scinet.utoronto.ca/education/go.php/370/index.php Intro to SciNet/Niagara session], available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.
 +
 
 +
<!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] -->

Latest revision as of 14:23, 7 May 2018

System Status

System status can now be found at docs.scinet.utoronto.ca


Mon 23 Apr 2018 GPC-compute is decommissioned, GPC-storage available until 30 May 2018

Thu 18 Apr 2018 Niagara system will undergo an upgrade to its Infiniband network between 9am and 12pm, should be transparent to users, however there is a chance of network interruption.

Fri 13 Apr 2018 HPSS system will be down for a few hours on Mon, Apr/16, 9AM, for hardware upgrades, in preparation for the eventual move to the Niagara side.

Tue 10 Apr 2018 Niagara is open to users.

Wed 4 Apr 2018 We are very close to the production launch of Niagara, the new system installed at SciNet. While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users on next week.

All active GPC users will have their accounts, $HOME, and $PROJECT, transferred to the new Niagara system. Those of you who are new to SciNet, but got RAC allocations on Niagara, will have your accounts created and ready for you to login.

We are planning an extended Intro to SciNet/Niagara session, available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.