Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"

From oldwiki.scinet.utoronto.ca
Jump to navigation Jump to search
Line 1: Line 1:
 
== System Status: <span style="color:#339900">'''UP'''</span>==
 
== System Status: <span style="color:#339900">'''UP'''</span>==
  
Thu Mar 22 13:23:47 EDT 2012
+
Wed Mar 28 10:34:25 EDT 2012
  
All systems are back online.
+
There have been some GPC file system and network stability issues reported over that past few days that we believe are related to some OS configuration changes. We are in the process of resolving them. Thanks for your patience. 
  
 
------------------------------
 
------------------------------

Revision as of 10:37, 28 March 2012

System Status: UP

Wed Mar 28 10:34:25 EDT 2012

There have been some GPC file system and network stability issues reported over that past few days that we believe are related to some OS configuration changes. We are in the process of resolving them. Thanks for your patience.


Thu Feb 9 11:50:57 EST 2012: System Temporary Change for MPI ethernet jobs:

Due to some changes we are making to the GPC GigE nodes, if you run multinode ethernet MPI jobs (IB multinode jobs are fine), you will need to explicitly request the ethernet interface in your mpirun:

For Openmpi -> mpirun --mca btl self,sm,tcp
For IntelMPI -> mpirun -env I_MPI_FABRICS shm:tcp

There is no need to do this if you run on IB, or if you run single node mpi jobs on the ethernet (GigE) nodes. Please check GPC_MPI_Versions for more details.

(Previous messages)