Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"

From oldwiki.scinet.utoronto.ca
Jump to navigation Jump to search
Line 1: Line 1:
 
== System Status: <span style="color:#339900">'''UP'''</span>==
 
== System Status: <span style="color:#339900">'''UP'''</span>==
  
Wed 28 Mar 2012 21:45:03 EDT
 
 
Connection problem was caused by trouble with a filesystem manager. Problem solved.
 
 
Wed 28 Mar 2012 20:55:27 EDT
 
 
We're experiencing some problems connecting to the login nodes. Investigating.
 
 
------------------------------
 
 
Wed Mar 28 10:34:25 EDT 2012
 
 
There have been some GPC file system and network stability issues reported over that past few days that we believe are related to some OS configuration changes.  We are in the process of resolving them. Thanks for your patience. 
 
  
 
------------------------------
 
------------------------------

Revision as of 19:46, 31 March 2012

System Status: UP


Thu Feb 9 11:50:57 EST 2012: System Temporary Change for MPI ethernet jobs:

Due to some changes we are making to the GPC GigE nodes, if you run multinode ethernet MPI jobs (IB multinode jobs are fine), you will need to explicitly request the ethernet interface in your mpirun:

For Openmpi -> mpirun --mca btl self,sm,tcp
For IntelMPI -> mpirun -env I_MPI_FABRICS shm:tcp

There is no need to do this if you run on IB, or if you run single node mpi jobs on the ethernet (GigE) nodes. Please check GPC_MPI_Versions for more details.

(Previous messages)