Difference between revisions of "Oldwiki.scinet.utoronto.ca:System Alerts"

From oldwiki.scinet.utoronto.ca
Jump to navigation Jump to search
Line 4: Line 4:
 
Thu Apr 19 16:54:42 EDT 2012:  Shutdown Status
 
Thu Apr 19 16:54:42 EDT 2012:  Shutdown Status
  
All work associated with the machine room expansion project have been completed as well as cooling tower maintenance and hardware changes related to the GPC networking upgrade.  The systems are currently being tested and we expect to allow users back later this evening.
+
All work associated with the machine room expansion project has been completed as well as cooling tower maintenance and hardware changes related to the GPC networking upgrade.  The systems are currently being tested and we expect to allow users back later this evening.
  
 
------------------------------
 
------------------------------

Revision as of 16:57, 19 April 2012

System Status: DOWN


Thu Apr 19 16:54:42 EDT 2012: Shutdown Status

All work associated with the machine room expansion project has been completed as well as cooling tower maintenance and hardware changes related to the GPC networking upgrade. The systems are currently being tested and we expect to allow users back later this evening.


Wed Apr 18 9:05:00 EST 2012

Apr 18-19: Full SciNet shutdown. All logins and jobs will be killed at 9AM on 18 April. Expect systems to come back online in the evening of the following day (19 April).


Thu Feb 9 11:50:57 EST 2012: System Temporary Change for MPI ethernet jobs:

Due to some changes we are making to the GPC GigE nodes, if you run multinode ethernet MPI jobs (IB multinode jobs are fine), you will need to explicitly request the ethernet interface in your mpirun:

For Openmpi -> mpirun --mca btl self,sm,tcp
For IntelMPI -> mpirun -env I_MPI_FABRICS shm:tcp

There is no need to do this if you run on IB, or if you run single node mpi jobs on the ethernet (GigE) nodes. Please check GPC_MPI_Versions for more details.

(Previous messages)