Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"

Line 1:
== System Status: <span style="color:#339900">'''UP'''</span> ==
- Tue Mar  6 18:30:00 EST 2012
+ Tue Mar  20 18:30:00 EST 2012
- There was an issue on some of the IB switches that may have caused IB jobs to die. The issue is now resolved.
+ There is a planned maintenance shutdown of the TCS and GPC IB on Thursday March 22, 2012 starting at 9am.  GPC ethernet jobs and the file system will remain active and unaffected. Systems should be back in the afternoon.
------------------------------

Revision as of 23:36, 20 March 2012

System Status: UP

Tue Mar 20 18:30:00 EST 2012

There is a planned maintenance shutdown of the TCS and GPC IB on Thursday March 22, 2012 starting at 9am. GPC ethernet jobs and the file system will remain active and unaffected. Systems should be back in the afternoon.


Thu Feb 9 11:50:57 EST 2012: Temporary system change for MPI ethernet jobs:

Due to changes we are making to the GPC GigE nodes, multinode ethernet MPI jobs must explicitly request the ethernet interface in the mpirun command (multinode IB jobs are not affected):

For OpenMPI -> mpirun --mca btl self,sm,tcp
For Intel MPI -> mpirun -env I_MPI_FABRICS shm:tcp

There is no need to do this if you run on IB, or if you run single-node MPI jobs on the ethernet (GigE) nodes. Please check GPC_MPI_Versions for more details.
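
As a rough illustration of the flags above, here is a minimal sketch of how a job script might pick the ethernet options for whichever MPI library is loaded. The helper name eth_mpirun_opts, the rank count of 16, and the program name ./my_mpi_program are placeholders invented for this example and are not part of the GPC setup. The last line only prints the resulting command instead of running it.

```shell
#!/bin/bash
# Hypothetical helper (not a SciNet tool): prints the mpirun options from
# this notice that select the ethernet (TCP) transport for a given MPI library.
eth_mpirun_opts() {
    case "$1" in
        openmpi)  echo "--mca btl self,sm,tcp" ;;
        intelmpi) echo "-env I_MPI_FABRICS shm:tcp" ;;
        *)        echo "unknown MPI library: $1" >&2; return 1 ;;
    esac
}

# Show the full command line for an assumed 16-rank OpenMPI job.
# Remove the leading "echo" to launch the job for real.
echo mpirun $(eth_mpirun_opts openmpi) -np 16 ./my_mpi_program
```

For Intel MPI, the same line would use eth_mpirun_opts intelmpi. Jobs on IB and single-node jobs on the GigE nodes would simply omit these options.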

(Previous messages)