<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en-GB">
	<id>https://oldwiki.scinet.utoronto.ca/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Rzon</id>
	<title>oldwiki.scinet.utoronto.ca - User contributions [en-gb]</title>
	<link rel="self" type="application/atom+xml" href="https://oldwiki.scinet.utoronto.ca/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Rzon"/>
	<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php/Special:Contributions/Rzon"/>
	<updated>2026-05-08T02:53:14Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.35.12</generator>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Using_Paraview&amp;diff=9421</id>
		<title>Using Paraview</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Using_Paraview&amp;diff=9421"/>
		<updated>2018-09-11T18:38:42Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
[http://www.paraview.org/ ParaView] is a powerful, parallel, client-server visualization system that lets you render data on SciNet's GPC nodes and manipulate the results interactively on your own desktop.   Using the ParaView server on SciNet is much like using it locally, but with one additional step: setting up a connection directly between your desktop and the compute nodes.&lt;br /&gt;
&lt;br /&gt;
[[Image:Paraview.png|thumb|right|320px|The ParaView Client GUI]]&lt;br /&gt;
&lt;br /&gt;
===Installing ParaView===&lt;br /&gt;
&lt;br /&gt;
To use ParaView, you first need the client software installed on your own system; download it from [http://www.paraview.org/paraview/resources/software.html the ParaView website].  Binaries exist for Linux, Mac, and Windows systems.   The client version must exactly match the version installed on the server, currently 3.12 or 3.14.1.   The client has all the functionality of the server, and can also analyze data locally.&lt;br /&gt;
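&lt;br /&gt;
To see which server versions are currently installed (so that you can match your client to one of them), you can query the modules system used below; a quick check from a SciNet login node:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module avail paraview&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;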
&lt;br /&gt;
===SSH Forwarding For ParaView===&lt;br /&gt;
&lt;br /&gt;
To interactively use the ParaView server on the GPC, you will have to work some ssh magic to allow the client on your desktop to connect to the server through the SciNet login nodes.  The steps required are:&lt;br /&gt;
&lt;br /&gt;
* Have an SSH key that you can use to log into SciNet&lt;br /&gt;
* Submit an interactive job, with a shell on the head node that you'll be running the server on&lt;br /&gt;
* Start ssh forwarding&lt;br /&gt;
* Start the ParaView server&lt;br /&gt;
* Connect the client and server&lt;br /&gt;
&lt;br /&gt;
====SSH Keys====&lt;br /&gt;
&lt;br /&gt;
To be able to log into the compute nodes where ParaView will be running, you'll have to have an [[Ssh_keys | SSH key]] set up, as password authentication won't work.    Our [[Ssh_keys | SSH Keys and SciNet]] page describes how to do this.&lt;br /&gt;
&lt;br /&gt;
====Log into node====&lt;br /&gt;
&lt;br /&gt;
The first thing to do is to go to the node from which you'll start the ParaView server.   This is typically done by starting an interactive job on the GPC, perhaps on the [[Moab#debug | debug ]] queue or sandybridge [[GPC_Quickstart#Memory_Configuration | large memory]] nodes.   ParaView can in principle make use of as many nodes as you throw at it, so one might begin a job as below:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qsub -l nodes=1:m128g:ppn=16,walltime=1:00:00 -q sandy -I&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
or&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qsub -l nodes=2:ppn=8,walltime=1:00:00 -q debug -I&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Once this job has started, you'll be placed in a shell on the head node of the job; typing &amp;lt;tt&amp;gt;hostname&amp;lt;/tt&amp;gt; will tell you the name of the host, e.g.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ hostname&lt;br /&gt;
gpc-f148n089-ib0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
or &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ hostname&lt;br /&gt;
gpc-f107n045-ib0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
You will need this hostname in the following steps.&lt;br /&gt;
&lt;br /&gt;
====Start SSH port forwarding====&lt;br /&gt;
&lt;br /&gt;
Once the ssh keys are set up, the port forwarding can be started with the following command, run in a terminal window on your local machine, using the node hostname from above; here we'll take the example of gpc-f148n089-ib0:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ export gpcnode=&amp;quot;gpc-f148n089-ib0&amp;quot;&lt;br /&gt;
$ ssh -N -L 20080:${gpcnode}:22 -L 20090:${gpcnode}:11111 login.scinet.utoronto.ca&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
This command will not return anything until the forwarding is terminated; it will just look like it's sitting there.  It doesn't start a remote shell or command (-N), but it connects to login.scinet.utoronto.ca, and from there it redirects your local (-L) port 20080 to &amp;lt;tt&amp;gt;${gpcnode}&amp;lt;/tt&amp;gt; port 22, and similarly local port 20090 to &amp;lt;tt&amp;gt;${gpcnode}&amp;lt;/tt&amp;gt; port 11111.  We'll use the first for ssh'ing to the remote node (mainly for testing), and the second to connect the local ParaView client to the remote ParaView server.&lt;br /&gt;
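&lt;br /&gt;
If you do this regularly, the forwarding can also live in your &amp;lt;tt&amp;gt;~/.ssh/config&amp;lt;/tt&amp;gt;.  A minimal sketch (the &amp;lt;tt&amp;gt;Host&amp;lt;/tt&amp;gt; alias is arbitrary, and the node name must be updated for every new job):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
Host gpctunnel&lt;br /&gt;
    HostName login.scinet.utoronto.ca&lt;br /&gt;
    LocalForward 20080 gpc-f148n089-ib0:22&lt;br /&gt;
    LocalForward 20090 gpc-f148n089-ib0:11111&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
after which &amp;lt;tt&amp;gt;ssh -N gpctunnel&amp;lt;/tt&amp;gt; starts the same forwarding.&lt;br /&gt;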
&lt;br /&gt;
To make sure the port forwarding is working correctly, in another window try sshing directly to the compute node from your desktop:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ ssh -p 20080 [your-scinet-username]@localhost&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
and this should land you directly on the compute node.   If it does not, then something is wrong with the ssh forwarding.&lt;br /&gt;
&lt;br /&gt;
====Start Server====&lt;br /&gt;
&lt;br /&gt;
Now that the tunnel is set up, you can start the ParaView server on the compute node.    To do this, you need the following modules loaded:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load Xlibraries intel gcc python openmpi paraview&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
(You can replace openmpi with intelmpi, and of course any module that is already loaded does not have to be loaded again.)&lt;br /&gt;
&lt;br /&gt;
Then start the ParaView server with mpirun, as with any MPI job:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ mpirun -np [NP] pvserver --use-offscreen-rendering&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
where NP is the number of processors; 16 processors per node on the largemem nodes, or 8 per node otherwise.    &lt;br /&gt;
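&lt;br /&gt;
For the two-node debug job above, for instance, that works out to 2 x 8 = 16 processes:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ mpirun -np 16 pvserver --use-offscreen-rendering&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;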
&lt;br /&gt;
Once running, the ParaView server should output&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
Listen on port: 11111&lt;br /&gt;
Waiting for client...&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Connect Client and Server====&lt;br /&gt;
&lt;br /&gt;
[[Image:Configure.png|thumb|right|320px|Configuring the client]]&lt;br /&gt;
&lt;br /&gt;
Once the server is running, you can connect the client.   Start the ParaView client on your desktop, and choose File-&amp;gt;Connect.   Click `Add Server', give the server a name (say, GPC), and give the port number 20090.   The other values should be correct by default; host is &amp;lt;tt&amp;gt;localhost&amp;lt;/tt&amp;gt;, and the server type is Client/Server.  Click `Configure'.&lt;br /&gt;
&lt;br /&gt;
On the next window, you'll be asked for a command to start up the server; select `Manual', and click OK.&lt;br /&gt;
&lt;br /&gt;
Once the server is selected, click `Connect'.  On the compute node, the server should respond `Client connected'.   In the client window, when you (for instance) select File-&amp;gt;Open, you will see the files on the GPC rather than those on your local machine.&lt;br /&gt;
&lt;br /&gt;
From here, the [http://paraview.org/Wiki/ParaView ParaView Wiki] can give you instructions as to how to plot your data.&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Ssh_keys&amp;diff=9420</id>
		<title>Ssh keys</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Ssh_keys&amp;diff=9420"/>
		<updated>2018-08-31T23:42:14Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
[[Ssh | SSH]] has an alternative to passwords for authenticating your login; you can generate a key file on a trusted machine and tell a remote machine to trust logins from a machine that presents that key.   This can be both convenient and secure, and may be necessary for some tasks (such as connecting directly to compute nodes to use [[Using_Paraview | some visualization packages]]).  Here we describe how to set up keys for logging into SciNet.&lt;br /&gt;
&lt;br /&gt;
==SSH Keys and SciNet==&lt;br /&gt;
&lt;br /&gt;
[[Ssh | SSH]] is a secure protocol for logging into or copying data to/from remote machines.  In addition to using passwords to [http://en.wikipedia.org/wiki/Authentication authenticate] users, one can use cryptographically secure keys to guarantee that a login request is coming from a trusted account on a remote machine, and automatically allow such requests.   Done properly, this is as secure as requiring a password, but can be more convenient, and is necessary for some operations.&lt;br /&gt;
&lt;br /&gt;
On this page, we will assume you are using Linux, Mac OS X, or a similar environment such as [http://www.cygwin.com/ Cygwin] under Windows.  If not, the steps will be the same, but how they are done (for instance, generating keys) may differ; look up the documentation for your ssh package for details.&lt;br /&gt;
&lt;br /&gt;
==Using SSH keys==&lt;br /&gt;
===How SSH keys work===&lt;br /&gt;
&lt;br /&gt;
SSH relies on [http://en.wikipedia.org/wiki/Public-key_cryptography public key cryptography] for its encryption.  These cryptosystems have a private key, which must be kept secret, and a public key, which may be disseminated freely.   In these systems, anyone may use the public key to encode a message, but only the owner of the private key can decode it.  This can also be used to verify identities: if someone claims to be Alice, the owner of some private key, Bob can send that person a message encoded with Alice's well-known public key.  If the person claiming to be Alice can then tell Bob what the message really was, then that person must at the very least have access to Alice's private key.&lt;br /&gt;
&lt;br /&gt;
To use keys for authentication, we:&lt;br /&gt;
* Generate a key pair (Private and Public)&lt;br /&gt;
* Copy the public key to each remote site we wish to be able to log into, and mark it as an authorized key for that system&lt;br /&gt;
* Ensure permissions are set properly&lt;br /&gt;
* Test.&lt;br /&gt;
&lt;br /&gt;
===Generating an SSH key pair===&lt;br /&gt;
&lt;br /&gt;
{|border=&amp;quot;1&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
Note: This describes creating ssh key pairs on '''your''' machine, not on SciNet.  On SciNet, you already have key pairs generated, sitting in &amp;lt;tt&amp;gt;${HOME}/.ssh/&amp;lt;/tt&amp;gt;, and modifying them is likely to cause problems.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The first stage is to create an SSH key pair.   On most systems, this is done using the command&lt;br /&gt;
&lt;br /&gt;
 ssh-keygen&lt;br /&gt;
&lt;br /&gt;
This will prompt you for two pieces of information: where to save the key, and a passphrase for the key.  The passphrase is like a password, but rather than letting you in to some particular account, it allows you to use the key you've generated to log into other systems.  &lt;br /&gt;
&lt;br /&gt;
There are a number of options to &amp;lt;tt&amp;gt;ssh-keygen&amp;lt;/tt&amp;gt; which produce more cryptographically secure keys, either by increasing the number of bits used in the key or by choosing a different encryption system.  The defaults are fine, and we won't discuss the other options here.&lt;br /&gt;
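&lt;br /&gt;
For reference only, a longer (and thus stronger) RSA key can be requested explicitly with the standard flags:&lt;br /&gt;
&lt;br /&gt;
 ssh-keygen -t rsa -b 4096&lt;br /&gt;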
&lt;br /&gt;
The default location to save the private key is in &amp;lt;tt&amp;gt;${HOME}/.ssh/id_rsa&amp;lt;/tt&amp;gt; (for an RSA key); unless you have some specific reason for placing it elsewhere, use this option.  The public key will be &amp;lt;tt&amp;gt;id_rsa.pub&amp;lt;/tt&amp;gt; in the same directory.&lt;br /&gt;
&lt;br /&gt;
Your passphrase can be any string, and of any length.   It is best not to make it the same as any of your passwords.&lt;br /&gt;
&lt;br /&gt;
A sample session of generating a key would go like this:&lt;br /&gt;
&lt;br /&gt;
 $ ssh-keygen&lt;br /&gt;
 Generating public/private rsa key pair.&lt;br /&gt;
 Enter file in which to save the key (${HOME}/.ssh/id_rsa): &lt;br /&gt;
 Enter passphrase (empty for no passphrase): &lt;br /&gt;
 Enter same passphrase again: &lt;br /&gt;
 Your identification has been saved in ${HOME}/.ssh/id_rsa.&lt;br /&gt;
 Your public key has been saved in ${HOME}/.ssh/id_rsa.pub.&lt;br /&gt;
 The key fingerprint is:&lt;br /&gt;
 79:8e:36:6a:78:7d:cf:80:94:90:92:0e:74:0b:f1:b7 USERNAME@YOURMACHINE&lt;br /&gt;
&lt;br /&gt;
====Don't Use Passphraseless Keys!====&lt;br /&gt;
&lt;br /&gt;
If you do not specify a passphrase, you will have a completely &amp;quot;exposed&amp;quot; private key.  '''This is a terrible idea.'''   If you then use this key for anything it means that anyone who sits down at your desk, or anyone who borrows or steals your laptop, can login to anywhere you use that key (good guesses could come from just looking at your history) without needing any password, and could do anything they wanted with your account or data.  Don't use passphraseless keys.&lt;br /&gt;
&lt;br /&gt;
We should note that we do, in fact, have one necessary and reasonable exception here -- the keys used within SciNet itself.  The SciNet key used for within-SciNet operations (you already have one in your account in &amp;lt;tt&amp;gt;~/.ssh/id_rsa&amp;lt;/tt&amp;gt;) is passphraseless, for two good reasons.  One is that, once you are on one SciNet machine (like the login node), you already have read/write access to all your data; all the nodes mount the same file systems.  So there is little to be gained in protecting the SciNet nodes from each other.   The second is practical: ssh is used to log into compute nodes to start your compute jobs, and you obviously can't be asked to type in a passphrase every time one of your jobs starts; you may not be at your computer at that moment.  So passphraseless keys are OK ''within'' a controlled environment; but don't use them for remote access.&lt;br /&gt;
&lt;br /&gt;
===Copying the Public Key to SciNet (and elsewhere)===&lt;br /&gt;
&lt;br /&gt;
Now that you have this SSH &amp;quot;identity&amp;quot;, you use the public (''not'' the private) key for access to remote machines.  The public key must be put as one line in the file &amp;lt;tt&amp;gt;/home/USERNAME/.ssh/authorized_keys&amp;lt;/tt&amp;gt;.  Do not delete the lines already there, or you may end up with strange problems using SciNet machines.&lt;br /&gt;
&lt;br /&gt;
You can copy your new public key to the SciNet systems:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
scp /home/LOCAL_USERNAME/.ssh/id_rsa.pub SCINET_USERNAME@login.scinet.utoronto.ca:newkey&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
Then log in to SciNet and append the contents of &amp;lt;tt&amp;gt;~/newkey&amp;lt;/tt&amp;gt; to &amp;lt;tt&amp;gt;~/.ssh/authorized_keys&amp;lt;/tt&amp;gt;:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
cat ~/newkey &amp;gt;&amp;gt; ~/.ssh/authorized_keys&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
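&lt;br /&gt;
On many systems, the &amp;lt;tt&amp;gt;ssh-copy-id&amp;lt;/tt&amp;gt; utility automates both steps (assuming it is installed on your local machine):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ssh-copy-id -i ~/.ssh/id_rsa.pub SCINET_USERNAME@login.scinet.utoronto.ca&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;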
&lt;br /&gt;
===&amp;lt;tt&amp;gt;.ssh&amp;lt;/tt&amp;gt; Permissions===&lt;br /&gt;
&lt;br /&gt;
Note that &amp;lt;tt&amp;gt;SSH&amp;lt;/tt&amp;gt; is very fussy about file permissions; your &amp;lt;tt&amp;gt;~/.ssh&amp;lt;/tt&amp;gt; directory must only be accessible by you, and your various key files must not be writable (or in some cases, readable) by anyone else.  Sometimes users accidentally reset file permissions while editing these files, and logins break as a result.   If you look at the &amp;lt;tt&amp;gt;~/.ssh&amp;lt;/tt&amp;gt; directory itself, it should not be readable at all by anyone else:&lt;br /&gt;
&lt;br /&gt;
 ls -ld ~/.ssh&lt;br /&gt;
 drwx------ 2 USERNAME GROUPNAME 7 Aug  9 15:43 /home/USERNAME/.ssh&lt;br /&gt;
&lt;br /&gt;
and &amp;lt;tt&amp;gt;authorized_keys&amp;lt;/tt&amp;gt; must not be writable by anyone else:&lt;br /&gt;
&lt;br /&gt;
 $ ls -l ~/.ssh/authorized_keys &lt;br /&gt;
 -rw-r--r-- 1 USERNAME GROUPNAME 1213 May 29  2009 /home/USERNAME/.ssh/authorized_keys&lt;br /&gt;
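&lt;br /&gt;
If the permissions have drifted, they can be reset as follows (a minimal sketch; adjust the file names if your keys live elsewhere):&lt;br /&gt;
&lt;br /&gt;
 chmod 700 ~/.ssh&lt;br /&gt;
 chmod 600 ~/.ssh/id_rsa&lt;br /&gt;
 chmod 644 ~/.ssh/id_rsa.pub ~/.ssh/authorized_keys&lt;br /&gt;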
&lt;br /&gt;
===Testing Your Key===&lt;br /&gt;
&lt;br /&gt;
Now you should be able to login to the remote system (say, SciNet):&lt;br /&gt;
&lt;br /&gt;
 $ ssh USERNAME@login.scinet.utoronto.ca&lt;br /&gt;
 Enter passphrase for key '/home/USERNAME/.ssh/id_rsa': &lt;br /&gt;
 Last login: Tue Aug 17 11:24:48 2010 from HOMEMACHINE&lt;br /&gt;
 &lt;br /&gt;
 ===================================================&lt;br /&gt;
 &lt;br /&gt;
 This SciNet login node is to be used only as a&lt;br /&gt;
 gateway to the GPC and TCS.&lt;br /&gt;
 &lt;br /&gt;
 [...]&lt;br /&gt;
 scinet04-$&lt;br /&gt;
&lt;br /&gt;
If this doesn't work, you should still be able to log in using your password, and investigate the problem.  For example, if during a login session you get a message similar to the one below, just follow the instructions and delete the offending key on the indicated line, here line 3 (in vi, you can jump to that line by pressing ESC and typing &amp;lt;tt&amp;gt;:3&amp;lt;/tt&amp;gt;).  This usually just means that you have logged in to SciNet from your home computer in the past, and that the stored host key is now obsolete.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ ssh USERNAME@login.scinet.utoronto.ca&lt;br /&gt;
&lt;br /&gt;
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@&lt;br /&gt;
@    WARNING: REMOTE HOST IDENTIFICATION HAS CHANGED!     @&lt;br /&gt;
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@&lt;br /&gt;
IT IS POSSIBLE THAT SOMEONE IS DOING SOMETHING NASTY!&lt;br /&gt;
Someone could be eavesdropping on you right now (man-in-the-middle&lt;br /&gt;
attack)!&lt;br /&gt;
It is also possible that the RSA host key has just been changed.&lt;br /&gt;
The fingerprint for the RSA key sent by the remote host is&lt;br /&gt;
53:f9:60:71:a8:0b:5d:74:83:52:fe:ea:1a:9e:cc:d3.&lt;br /&gt;
Please contact your system administrator.&lt;br /&gt;
Add correct host key in /home/&amp;lt;user&amp;gt;/.ssh/known_hosts to get rid of&lt;br /&gt;
this message.&lt;br /&gt;
Offending key in /home/&amp;lt;user&amp;gt;/.ssh/known_hosts:3&lt;br /&gt;
RSA host key for login.scinet.utoronto.ca has changed and you have&lt;br /&gt;
requested strict checking.&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
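&lt;br /&gt;
Rather than editing &amp;lt;tt&amp;gt;known_hosts&amp;lt;/tt&amp;gt; by hand, recent versions of OpenSSH can remove the stale entry for you:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ssh-keygen -R login.scinet.utoronto.ca&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;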
&lt;br /&gt;
* If you get the message below, you may need to log out of your GNOME session and log back in, since the ssh-agent needs to be restarted to pick up the new passphrase-protected ssh key.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ ssh USERNAME@login.scinet.utoronto.ca&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Agent admitted failure to sign using the key.&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===(Optional) Using &amp;lt;tt&amp;gt;ssh-agent&amp;lt;/tt&amp;gt; to Remember Your Passphrase===&lt;br /&gt;
&lt;br /&gt;
But now you've just replaced having to type a password for login with having to type a passphrase for your key; what have you gained?  &lt;br /&gt;
&lt;br /&gt;
It turns out that there's an automated way to manage ssh &amp;quot;identities&amp;quot;, using the &amp;lt;tt&amp;gt;ssh-agent&amp;lt;/tt&amp;gt; command, which should automatically be running on newer Linux or Mac&amp;amp;nbsp;OS&amp;amp;nbsp;X machines.   You can add keys to this agent for the duration of your login using the &amp;lt;tt&amp;gt;ssh-add&amp;lt;/tt&amp;gt; command:&lt;br /&gt;
&lt;br /&gt;
 $ ssh-add&lt;br /&gt;
 Enter passphrase for /home/USERNAME/.ssh/id_rsa: &lt;br /&gt;
 Identity added: /home/USERNAME/.ssh/id_rsa (/home/USERNAME/.ssh/id_rsa)&lt;br /&gt;
&lt;br /&gt;
and then logins will not require the passphrase, as &amp;lt;tt&amp;gt;ssh-agent&amp;lt;/tt&amp;gt; will provide access to the key.&lt;br /&gt;
&lt;br /&gt;
When you log out of your home computer, the ssh agent will close, and next time you log in, you will have to &amp;lt;tt&amp;gt;ssh-add&amp;lt;/tt&amp;gt; your key.  You can also set a timeout of (say) an hour by using &amp;lt;tt&amp;gt;ssh-add -t 3600&amp;lt;/tt&amp;gt;.  This minimizes the number of times you have to type your passphrase, while still maintaining some degree of key security.&lt;br /&gt;
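&lt;br /&gt;
You can check which identities the agent currently holds with:&lt;br /&gt;
&lt;br /&gt;
 $ ssh-add -l&lt;br /&gt;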
&lt;br /&gt;
&lt;br /&gt;
=== Multiple ssh private keys ===&lt;br /&gt;
In quite a few situations it is preferable to have ssh keys dedicated to each service, specific role, or domain.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ssh-keygen -t rsa -f ~/.ssh/id_rsa.SciNet   -C &amp;quot;Key for SciNet&amp;quot;&lt;br /&gt;
ssh-keygen -t rsa -f ~/.ssh/id_rsa.SHARCNET -C &amp;quot;Key for SHARCNET&amp;quot;&lt;br /&gt;
ssh-keygen -t rsa -f ~/.ssh/id_rsa.DCS      -C &amp;quot;Key for Dept. Of Computer Science&amp;quot;&lt;br /&gt;
ssh-keygen -t rsa -f ~/.ssh/id_rsa.CITA     -C &amp;quot;Key for CITA&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Use a different file name for each key.  Let's assume that there are two keys, ~/.ssh/id_rsa.SciNet and ~/.ssh/id_rsa.SHARCNET.  The simplest way of making sure each of the keys works all the time is to create a config file for ssh:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
touch ~/.ssh/config&lt;br /&gt;
chmod 600 ~/.ssh/config&lt;br /&gt;
echo &amp;quot;IdentityFile ~/.ssh/id_rsa.SciNet&amp;quot;   &amp;gt;&amp;gt; ~/.ssh/config&lt;br /&gt;
echo &amp;quot;IdentityFile ~/.ssh/id_rsa.SHARCNET&amp;quot; &amp;gt;&amp;gt; ~/.ssh/config&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This makes sure that both keys are tried whenever ssh makes a connection.  However, the ssh config file allows much finer control over keys and other per-connection settings.  The recommendation is to select the key based on the host name, with a ~/.ssh/config that looks like this:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
Host SciNet&lt;br /&gt;
  Hostname login.scinet.utoronto.ca&lt;br /&gt;
  IdentityFile ~/.ssh/id_rsa.SciNet&lt;br /&gt;
  User pinto&lt;br /&gt;
&lt;br /&gt;
Host SHARCNET&lt;br /&gt;
  Hostname sharcnet.ca&lt;br /&gt;
  IdentityFile ~/.ssh/id_rsa.SHARCNET&lt;br /&gt;
  User jchong&lt;br /&gt;
  Port 44787&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
And just login with the shortcut:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ ssh SciNet&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
= SSH tunnel =&lt;br /&gt;
A more obscure use of ssh is to create a communication tunnel.  As an example, assume you want to access a website running on a remote host from your local machine, but there is a firewall between the two systems blocking every port except incoming ssh.&lt;br /&gt;
&lt;br /&gt;
The basic syntax of the ssh command for such a purpose is: &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ssh -f -N -L localport:localhost:remoteport user@remotehost&lt;br /&gt;
# -f puts ssh in background&lt;br /&gt;
# -N makes it not execute a remote command&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
If the remote website is served on the default port 80, you could do the following:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ssh -f -N -L 8080:localhost:80 tunneluser@remotehost&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
... and point your local browser to http://localhost:8080&lt;br /&gt;
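&lt;br /&gt;
You can also test the tunnel from the command line (assuming &amp;lt;tt&amp;gt;curl&amp;lt;/tt&amp;gt; is installed locally):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
curl http://localhost:8080/&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;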
&lt;br /&gt;
If you don't want to remember the above sequence of flags all the time, you can add an entry to your ~/.ssh/config:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
Host tunnel&lt;br /&gt;
    HostName remotehost&lt;br /&gt;
    IdentityFile ~/.ssh/id_rsa.tunnel&lt;br /&gt;
    LocalForward 8080 127.0.0.1:80&lt;br /&gt;
    User tunneluser&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To open the tunnel just issue the command:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ ssh -f -N tunnel&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Gamess&amp;diff=9419</id>
		<title>Gamess</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Gamess&amp;diff=9419"/>
		<updated>2018-08-31T23:41:43Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===GAMESS(US)===&lt;br /&gt;
&lt;br /&gt;
====GAMESS version August 18, 2011, R1, on CentOS6====&lt;br /&gt;
&lt;br /&gt;
The current version of GAMESS(US), August 18, 2011 R1, was built on CentOS6, using the Intel v12.1 compilers and IntelMPI v4.0.2.003 MPI library.  The build was done according to the documentation that came with the package, available on the SciNet systems in the /scinet/gpc/src/gamess-Aug1811R1 directory.  After running &amp;quot;./config&amp;quot;, with the intel and intelmpi modules loaded, the following configuration file (install.info) was obtained:&lt;br /&gt;
&lt;br /&gt;
   [dgruner@gpc-f103n058 gamess-Aug1811R1]$ cat install.info&lt;br /&gt;
   #!/bin/csh&lt;br /&gt;
   #   compilation configuration for GAMESS&lt;br /&gt;
   #   generated on gpc-f103n058&lt;br /&gt;
   #   generated at Wed Nov 30 11:18:29 EST 2011&lt;br /&gt;
   setenv GMS_PATH            /home/scinet/gpc/src/gamess-Aug1811R1&lt;br /&gt;
   #         machine type&lt;br /&gt;
   setenv GMS_TARGET          linux64&lt;br /&gt;
   #         FORTRAN compiler setup&lt;br /&gt;
   setenv GMS_FORTRAN         ifort&lt;br /&gt;
   setenv GMS_IFORT_VERNO     12&lt;br /&gt;
   #         mathematical library setup&lt;br /&gt;
   setenv GMS_MATHLIB         mkl&lt;br /&gt;
   setenv GMS_MATHLIB_PATH    /scinet/gpc/intel/ics/composer_xe_2011_sp1.6.233/mkl/lib/intel64&lt;br /&gt;
   setenv GMS_MKL_VERNO       &amp;quot;12-so&amp;quot;&lt;br /&gt;
   #         parallel message passing model setup&lt;br /&gt;
   setenv GMS_DDI_COMM        mpi&lt;br /&gt;
   setenv GMS_MPI_LIB         impi&lt;br /&gt;
   setenv GMS_MPI_PATH        /scinet/gpc/intel/impi/4.0.2.003&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The code was built with the &amp;quot;compall&amp;quot; script, and then linked using the &amp;quot;lked&amp;quot; script.  A new module, gamess/Aug1811R1, was created for CentOS6:&lt;br /&gt;
&lt;br /&gt;
   [dgruner@gpc-f103n058 gamess-Aug1811R1]$ module show gamess&lt;br /&gt;
   -------------------------------------------------------------------&lt;br /&gt;
   /scinet/gpc/Modules6/Modules/modulefiles/gamess/Aug1811R1:&lt;br /&gt;
   &lt;br /&gt;
   module-whatis	 adds GAMESS (US), Aug 18/11 R1, environment variables &lt;br /&gt;
   prereq    intel &lt;br /&gt;
   prereq    intelmpi &lt;br /&gt;
   setenv    SCINET_GAMESS_HOME /scinet/gpc/Applications/gamess-Aug1811R1 &lt;br /&gt;
   setenv    SCINET_RUNGMS /scinet/gpc/Applications/gamess-Aug1811R1/rungms &lt;br /&gt;
   setenv    SCINET_GAMESS_SCRATCH /scratch/s/scinet/dgruner/gamess-scratch &lt;br /&gt;
   -------------------------------------------------------------------&lt;br /&gt;
&lt;br /&gt;
(note that the $SCINET_GAMESS_SCRATCH directory is defined for each user).&lt;br /&gt;
&lt;br /&gt;
The $SCINET_RUNGMS run script will work with both ethernet- and infiniband-connected nodes, so unlike the older version there is no need for specialized scripts depending on the node interconnect.  Here are some sample scripts showing how to run GAMESS on the GPC:&lt;br /&gt;
&lt;br /&gt;
====Running GAMESS====&lt;br /&gt;
- Make sure the directory $SCRATCH/gamess-scratch exists (the $SCINET_RUNGMS script will create it if it does not exist)&lt;br /&gt;
&lt;br /&gt;
- Make sure the intel, intelmpi and gamess modules are loaded (in your .bashrc: &amp;quot;module load intel intelmpi gamess&amp;quot;).&lt;br /&gt;
&lt;br /&gt;
- Create a torque script to run GAMESS, as in the examples that follow&lt;br /&gt;
&lt;br /&gt;
- The GAMESS executable is in $SCINET_GAMESS_HOME/gamess.00.x&lt;br /&gt;
- The rungms script is in $SCINET_GAMESS_HOME/rungms (also defined as $SCINET_RUNGMS)&lt;br /&gt;
&lt;br /&gt;
- The rungms script takes three arguments: the input file, the executable number, and the number of compute processes.  It always assumes there are 8 processors per node (ppn).  Note that you can copy the $SCINET_RUNGMS script to your own directory, and - carefully - modify it to suit your needs (at your own risk!).&lt;br /&gt;
&lt;br /&gt;
For example, in order to run with the input file /scratch/$USER/gamesstest01, on 8 cpus, and the default version (00)&lt;br /&gt;
of the executable on a machine with 8 cores:&lt;br /&gt;
&lt;br /&gt;
   # load the gamess and other required modules in .bashrc&lt;br /&gt;
   module load intel intelmpi gamess  &lt;br /&gt;
&lt;br /&gt;
   # run the program&lt;br /&gt;
   $SCINET_RUNGMS $SCRATCH/gamesstest01 00 8&lt;br /&gt;
&lt;br /&gt;
Here is a sample torque script for running a GAMESS calculation, on a single 8-core node.  Note the comments regarding the use of the [[User_Ramdisk | ramdisk]], a technique that may speed up calculations significantly.  It is restricted to runs on a single node, and there is book-keeping required before the job ends, namely copying any useful output from the [[User_Ramdisk | ramdisk]] (/dev/shm) to the user's directory on the /scratch filesystem.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#PBS -l nodes=1:ppn=8,walltime=48:00:00,os=centos6computeA&lt;br /&gt;
#PBS -N gamessjob&lt;br /&gt;
&lt;br /&gt;
# If not an interactive job (i.e. -I), then cd into the directory where&lt;br /&gt;
# I typed qsub.&lt;br /&gt;
if [ &amp;quot;$PBS_ENVIRONMENT&amp;quot; != &amp;quot;PBS_INTERACTIVE&amp;quot; ]; then&lt;br /&gt;
   if [ -n &amp;quot;$PBS_O_WORKDIR&amp;quot; ]; then&lt;br /&gt;
     cd $PBS_O_WORKDIR&lt;br /&gt;
   fi&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
# the input file is typically named something like &amp;quot;gamessjob.inp&amp;quot;&lt;br /&gt;
# so the script will be run like &amp;quot;$SCINET_RUNGMS gamessjob 00 8&amp;quot;&lt;br /&gt;
&lt;br /&gt;
# load the gamess module if not in .bashrc already&lt;br /&gt;
# module load intel intelmpi gamess&lt;br /&gt;
&lt;br /&gt;
# If - and only if - the run is on a single node, and does not require too much memory,&lt;br /&gt;
# then the runtime scratch directory can be put on ramdisk on the &lt;br /&gt;
# compute node.  This helps speed up the code, and avoids contention for&lt;br /&gt;
# disk resources. Note that the maximum size of the ramdisk is 8 GB,&lt;br /&gt;
# and the maximum total memory available to jobs is about 14 GB per node&lt;br /&gt;
# (including program memory and ramdisk).&lt;br /&gt;
#&lt;br /&gt;
# In order to use the ramdisk uncomment the following definition:&lt;br /&gt;
#export SCINET_GAMESS_SCRATCH=/dev/shm&lt;br /&gt;
&lt;br /&gt;
# run the program&lt;br /&gt;
&lt;br /&gt;
$SCINET_RUNGMS gamessjob 00 8&lt;br /&gt;
&lt;br /&gt;
# copy any files of interest from the $SCINET_GAMESS_SCRATCH directory&lt;br /&gt;
# to a &amp;quot;real&amp;quot; directory on disk, e.g. /scratch/$USER/gamessjob&lt;br /&gt;
# cp $SCINET_GAMESS_SCRATCH/* /scratch/$USER/gamessjob&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Here is a similar script, but using 2 nodes with infiniband interconnect:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#PBS -l nodes=2:ib:ppn=8,walltime=48:00:00,os=centos6computeA&lt;br /&gt;
#PBS -N gamessjob&lt;br /&gt;
&lt;br /&gt;
## To submit type: qsub gmsib.sh&lt;br /&gt;
&lt;br /&gt;
# If not an interactive job (i.e. -I), then cd into the directory where&lt;br /&gt;
# I typed qsub.&lt;br /&gt;
if [ &amp;quot;$PBS_ENVIRONMENT&amp;quot; != &amp;quot;PBS_INTERACTIVE&amp;quot; ]; then&lt;br /&gt;
   if [ -n &amp;quot;$PBS_O_WORKDIR&amp;quot; ]; then&lt;br /&gt;
     cd $PBS_O_WORKDIR&lt;br /&gt;
   fi&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
# the input file is typically named something like &amp;quot;gamessjob.inp&amp;quot;&lt;br /&gt;
# so the script will be run like &amp;quot;$SCINET_RUNGMS gamessjob 00 16&amp;quot;&lt;br /&gt;
&lt;br /&gt;
# load the gamess module if not in .bashrc already&lt;br /&gt;
module load intel intelmpi gamess&lt;br /&gt;
&lt;br /&gt;
# This script requests InfiniBand-connected nodes (:ib above).  The IntelMPI should run with the best transport it can find (InfiniBand), and will default back to ethernet if running on ethernet-only nodes.&lt;br /&gt;
&lt;br /&gt;
$SCINET_RUNGMS gamessjob 00 16&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Notes on older version, May 22, 2009, built on CentOS5====&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The GAMESS version January 12, 2009 R3 was built using the Intel v11.1 compilers and the IntelMPI v3.2.2 library, according to the instructions in http://software.intel.com/en-us/articles/building-gamess-with-intel-compilers-intel-mkl-and-intel-mpi-on-linux/&lt;br /&gt;
&lt;br /&gt;
The required build scripts - comp, compall, lked - and run script - rungms - were modified to account for our own installation.  In order to build GAMESS one first must ensure that the intel and intelmpi modules are loaded (&amp;quot;module load intel intelmpi&amp;quot;).  This applies to running GAMESS as well.&lt;br /&gt;
The module &amp;quot;gamess&amp;quot; must also be loaded in order to run GAMESS (&amp;quot;module load gamess&amp;quot;).&lt;br /&gt;
&lt;br /&gt;
The modified scripts are in the file /scinet/gpc/src/gamess-on-scinet.tar.gz&lt;br /&gt;
&lt;br /&gt;
====Running GAMESS====&lt;br /&gt;
- Make sure the directory /scratch/$USER/gamess-scratch exists (the $SCINET_RUNGMS script will create it if it does not exist)&lt;br /&gt;
&lt;br /&gt;
- Make sure the intel, intelmpi and gamess modules are loaded (in your .bashrc: &amp;quot;module load intel intelmpi gamess&amp;quot;).&lt;br /&gt;
&lt;br /&gt;
- Create a torque script to run GAMESS, as in the examples below&lt;br /&gt;
&lt;br /&gt;
- The GAMESS executable is in $SCINET_GAMESS_HOME/gamess.00.x&lt;br /&gt;
- The rungms script is in $SCINET_GAMESS_HOME/rungms (also defined as $SCINET_RUNGMS)&lt;br /&gt;
&lt;br /&gt;
- For multinode runs, use the $SCINET_RUNGMS_IB script to run on the InfiniBand portion of the GPC&lt;br /&gt;
&lt;br /&gt;
- The rungms script takes 4 arguments:  input file, executable number, number of compute processes, processors per node&lt;br /&gt;
&lt;br /&gt;
For example, in order to run with the input file /scratch/$USER/gamesstest01, on 8 cpus, and the default version (00)&lt;br /&gt;
of the executable on a machine with 8 cores:&lt;br /&gt;
&lt;br /&gt;
   # load the gamess and other required modules in .bashrc&lt;br /&gt;
   module load gcc intel intelmpi gamess  &lt;br /&gt;
&lt;br /&gt;
   # run the program&lt;br /&gt;
   $SCINET_RUNGMS /scratch/$USER/gamesstest01 00 8 8&lt;br /&gt;
&lt;br /&gt;
Here is a sample torque script for running a GAMESS calculation, on a single 8-core node.  Note the comments regarding the use of the [[User_Ramdisk | ramdisk]], a technique that may speed up calculations significantly.  It is restricted to runs on a single node, and there is book-keeping required before the job ends, namely copying any useful output from the [[User_Ramdisk | ramdisk]] (/dev/shm) to the user's directory on the /scratch filesystem.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#PBS -l nodes=1:ppn=8,walltime=48:00:00,os=centos53computeA&lt;br /&gt;
#PBS -N gamessjob&lt;br /&gt;
&lt;br /&gt;
## To submit type: qsub gms.sh&lt;br /&gt;
&lt;br /&gt;
# If not an interactive job (i.e. -I), then cd into the directory where&lt;br /&gt;
# I typed qsub.&lt;br /&gt;
if [ &amp;quot;$PBS_ENVIRONMENT&amp;quot; != &amp;quot;PBS_INTERACTIVE&amp;quot; ]; then&lt;br /&gt;
   if [ -n &amp;quot;$PBS_O_WORKDIR&amp;quot; ]; then&lt;br /&gt;
     cd $PBS_O_WORKDIR&lt;br /&gt;
   fi&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
# the input file is typically named something like &amp;quot;gamessjob.inp&amp;quot;&lt;br /&gt;
# so the script will be run like &amp;quot;$SCINET_RUNGMS gamessjob 00 8 8&amp;quot;&lt;br /&gt;
&lt;br /&gt;
# load the gamess module if not in .bashrc already&lt;br /&gt;
# module load gcc intel intelmpi gamess&lt;br /&gt;
&lt;br /&gt;
# If - and only if - the run is on a single node, and does not require too much memory,&lt;br /&gt;
# then the runtime scratch directory can be put on ramdisk on the &lt;br /&gt;
# compute node.  This helps speed up the code, and avoids contention for&lt;br /&gt;
# disk resources. Note that the maximum size of the ramdisk is 8 GB,&lt;br /&gt;
# and the maximum total memory available to jobs is about 14 GB per node&lt;br /&gt;
# (including program memory and ramdisk).&lt;br /&gt;
#&lt;br /&gt;
# In order to use the ramdisk uncomment the following definition:&lt;br /&gt;
#export SCINET_GAMESS_SCRATCH=/dev/shm&lt;br /&gt;
&lt;br /&gt;
# run the program&lt;br /&gt;
&lt;br /&gt;
$SCINET_RUNGMS gamessjob 00 8 8&lt;br /&gt;
&lt;br /&gt;
# copy any files of interest from the $SCINET_GAMESS_SCRATCH directory&lt;br /&gt;
# to a &amp;quot;real&amp;quot; directory on disk, e.g. /scratch/$USER/gamessjob&lt;br /&gt;
# cp $SCINET_GAMESS_SCRATCH/* /scratch/$USER/gamessjob&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is a similar script, but this one uses 2 InfiniBand-connected nodes,&lt;br /&gt;
and runs the appropriate $SCINET_RUNGMS_IB script to actually run the job:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#PBS -l nodes=2:ib:ppn=8,walltime=48:00:00,os=centos53computeA&lt;br /&gt;
#PBS -N gamessjob&lt;br /&gt;
&lt;br /&gt;
## To submit type: qsub gmsib.sh&lt;br /&gt;
&lt;br /&gt;
# If not an interactive job (i.e. -I), then cd into the directory where&lt;br /&gt;
# I typed qsub.&lt;br /&gt;
if [ &amp;quot;$PBS_ENVIRONMENT&amp;quot; != &amp;quot;PBS_INTERACTIVE&amp;quot; ]; then&lt;br /&gt;
   if [ -n &amp;quot;$PBS_O_WORKDIR&amp;quot; ]; then&lt;br /&gt;
     cd $PBS_O_WORKDIR&lt;br /&gt;
   fi&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
# the input file is typically named something like &amp;quot;gamessjob.inp&amp;quot;&lt;br /&gt;
# so the script will be run like &amp;quot;$SCINET_RUNGMS gamessjob 00 8 8&amp;quot;&lt;br /&gt;
&lt;br /&gt;
# load the gamess module if not in .bashrc already&lt;br /&gt;
# actually, it MUST be in .bashrc&lt;br /&gt;
# module load gamess&lt;br /&gt;
&lt;br /&gt;
# This script requests InfiniBand-connected nodes (:ib above)&lt;br /&gt;
# so it must run with the IB version of the rungms script,&lt;br /&gt;
# $SCINET_RUNGMS_IB&lt;br /&gt;
&lt;br /&gt;
$SCINET_RUNGMS_IB gamessjob 00 16 8&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
   &lt;br /&gt;
-- [[User:Dgruner|dgruner]]  5 October 2009&lt;br /&gt;
-- [[User:Dgruner|dgruner]]  Updated on 14 September 2010&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_Usage_Reports&amp;diff=9418</id>
		<title>SciNet Usage Reports</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_Usage_Reports&amp;diff=9418"/>
		<updated>2018-08-31T23:40:44Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
SciNet usage reports are now available on the CCDB website,&lt;br /&gt;
https://ccdb.computecanada.ca/.  Log in and then click on &amp;quot;View Group Usage&amp;quot;&lt;br /&gt;
under the &amp;quot;My Account&amp;quot; menu.&lt;br /&gt;
Please let us know if you have comments/questions about these reports.&lt;br /&gt;
 &lt;br /&gt;
PIs and all of the people they sponsor are able to view the reports&lt;br /&gt;
for their own group only.&lt;br /&gt;
 &lt;br /&gt;
There is a summary report, which gives a high-level&lt;br /&gt;
overview for those groups with RAC allocations, as well as more&lt;br /&gt;
detailed cumulative reports which break down usage by allocation, system,&lt;br /&gt;
and user.  These are all updated nightly.&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=HPSS_compared_to_HSM-TSM&amp;diff=9417</id>
		<title>HPSS compared to HSM-TSM</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=HPSS_compared_to_HSM-TSM&amp;diff=9417"/>
		<updated>2018-08-31T23:39:53Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
'''Understanding HPSS by comparing it with HSM-TSM at SciNet'''&lt;br /&gt;
&lt;br /&gt;
The [[HSM#Basic_Concepts|Basic Concepts]] and [[HSM#Definitions|Definitions]] described for HSM-TSM are equally applicable to HPSS.&lt;br /&gt;
&lt;br /&gt;
For the most part, both systems are very similar in topology and functionality. They both offer a way to offload/archive data from the most active file systems (scratch and project) without necessarily having to deal directly with the tape library or &amp;quot;tape commands&amp;quot;. The main differences are in the [https://support.scinet.utoronto.ca/wiki/index.php/HPSS_terminology terminology] and the [https://support.scinet.utoronto.ca/wiki/index.php?title=HPSS_Servers&amp;amp;action=edit&amp;amp;redlink=1 server scalability] of HPSS.&lt;br /&gt;
&lt;br /&gt;
== machines used in the setup ==&lt;br /&gt;
* HSM-TSM at SciNet requires 2 nodes: 1 tapenode (which also runs TSM and HSM servers), 1 datamover (running the HSM client). This setup is not easily scalable, and performance is limited.&lt;br /&gt;
* HPSS requires a minimum of 4 nodes: 1 core, 1 mover (we currently have 2), 1 HSI/HTAR server and 1 gateway (archive01), which functions much like a datamover under HSM, i.e., the node through which users transfer files. The setup is designed to be scalable, simply by adding more movers, gateways, cache LUNs on the SAN, a larger tape library, more tape drives, and larger/more DB volumes. Currently, the HSI/HTAR server and the core are configured on the same node.&lt;br /&gt;
&lt;br /&gt;
== Both have a component of disk cache and another of tapes (inaccessible to end-users): ==&lt;br /&gt;
* the cache under HSM-TSM is 15TB, formatted as gpfs, and mounted as /repository on datamover2&lt;br /&gt;
* the cache under HPSS is 233TB, SAN attached to the 2 movers, proprietary format, but it can be accessed as /archive through the HSI prompt on gpc-archive01, provided that you submit your queries via the queue system&lt;br /&gt;
&lt;br /&gt;
== migration between cache and tapes: ==&lt;br /&gt;
* under HSM-TSM, tapenode is the system directly connected to a [http://www-03.ibm.com/systems/storage/tape/ts3310/ TS3310 tape library], it has the function of moving data back and forth between cache (/repository) and tapes. This migration can be done automatically (by the HSM daemons running on datamover2) or manually from a shell on datamover2 by the users, with dsmmigrate or dsmrecall commands. &lt;br /&gt;
* under HPSS the tapenode equivalent is known as a mover, and we have 2 of them. They are the ones connected to the [http://www-03.ibm.com/systems/storage/tape/ts3500/ TS3500 tape library]. For the most part there is no need for direct user intervention in this migration, which is done automatically between the cache and the tapes, and coordinated by the core node. However, the user has the option to manually migrate and recall files and directories. The term migrate is used predominantly in the context of copying files down the hierarchy to tapes. After migration, files may be deleted from the cache via purge operations. Files are copied up the hierarchy via staging operations, usually when accessed after being purged from the top level of the hierarchy (HSI 'migrate', 'stage' and 'purge'). At SciNet we have all cache&amp;lt;=&amp;gt;tapes transfers set to be done automatically.&lt;br /&gt;
&lt;br /&gt;
== relocation of data from /scratch and /project to the cache. Access to the cache by users. ==&lt;br /&gt;
* HSM-TSM has a system called datamover2 specially for this relocation purpose. Users can login to it using ssh, and rely on unix commands such as cp, mv, rsync or tar to transfer data from /scratch and /project to the cache (/repository)&lt;br /&gt;
* under HPSS all access to the system is done through the [[Moab|GPC queue system]]. Job scripts interact with the cache through a node known as the gateway machine, named gpc-archive01. All tools available in HSM-TSM for transferring files to/from repository on datamover2 (cp, mv, tar, rsync, rcp, sftp, gridFTP, etc) have [https://support.scinet.utoronto.ca/wiki/index.php/HSI_help an equivalent] on gpc-archive01 through the HSI interface. In addition there is also a tar equivalent application called HTAR.&lt;br /&gt;
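&lt;br /&gt;
As an illustration only (the paths and the job setup here are assumptions, not SciNet's exact recipe), an HSI transfer inside a job script might look like this:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# hypothetical archiving job submitted through the GPC queue&lt;br /&gt;
# copy a tarball from /scratch into the HPSS cache, renaming it on the way in&lt;br /&gt;
hsi put $SCRATCH/results.tar : /archive/results.tar&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;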
&lt;br /&gt;
== exporting and accessing the cache ==&lt;br /&gt;
* under HSM-TSM the cache is a gpfs, and in theory it could be mounted across all 4000 nodes in the cluster. In practice we only mount it on 2 nodes, datamover2 and tapenode, as /repository. In this scenario, although possible, there is no real need for remote scp, sftp or rsync; users can access the cache directly with an ssh session to datamover2.&lt;br /&gt;
* under HPSS the cache is not a parallel file system. It is a proprietary file system (hpssfs) and can only be accessed in its native raw format on the 2 movers, the core, and the HSI server systems. On the gateway machine we could mount it locally in the traditional fashion as a virtual file system (VFS), but this capability is currently disabled at SciNet. However, the HSI prompt on gpc-archive01 allows access to the name space on the cache as /archive.&lt;br /&gt;
&lt;br /&gt;
== Metadata and database: ==&lt;br /&gt;
* In TSM-HSM there is a DB2 database, running on tapenode, that keeps track of all material on tapes. The metadata for files/stubs in the cache is kept within the cache gpfs itself, in a .SpaceMan directory, and is maintained by datamover2.&lt;br /&gt;
* HPSS has a set of drives (2x 144TB LUNs, mirrored) directly attached to the core node, just to store all information related to the naming space and ownership inside the cache. The core node also runs a DB2 holding information on all material on tapes as well as info on the metadata.&lt;br /&gt;
&lt;br /&gt;
== Configuration and logistics definitions ==&lt;br /&gt;
* TSM-HSM uses a set of policies to determine which files/stubs should be migrated, and when. Files can be grouped into tape pools primarily based on their paths on the file systems, but it's nearly impossible to segregate contents in the tape pools based on the individual ownership/groups of the files.&lt;br /&gt;
* The HPSS equivalent to policies is a combination of Classes of Service (COS) and Storage Classes, and the performance of migrations between the original location of the files, through the cache and onto tapes, is very dependent on how the cache is physically striped, the configuration parameters of the Storage Classes, as well as the number of small files involved. The equivalent to tape pools in HPSS is known as Families. There is also provision to have files migrated to a &amp;quot;mirrored tape&amp;quot; COS, which is the default at SciNet.&lt;br /&gt;
&lt;br /&gt;
== Handling small files in the cache ==&lt;br /&gt;
* Under TSM-HSM, it's impossible to prevent the placement of small files in the cache (/repository is gpfs). All we can do is rely on users' education and outreach, so that they won't transfer all the small files they have on the active file systems (scratch and project) to /repository. As a more efficient approach, we recommend and encourage users to generate tar-balls larger than 10GB, and prevent the proliferation of small files inside the cache that way.&lt;br /&gt;
* Under the proprietary HPSS file system we face the same issue of not being able to prevent or enforce the non-placement of small files in cache. One effective workaround utility is HTAR, used for aggregating a set of files from the local file system directly into the HPSS cache, creating a file that conforms to the POSIX TAR specification. When HTAR creates the TAR file, it also builds an index file &amp;quot;.idx&amp;quot;, which is stored in the cache along with the associated TAR file. HPSS also supports its own small file tape aggregation. By aggregating many, many small files in the cache into a single object, tape drives are kept moving for longer periods of time (streaming) -- greatly improving tape drive performance!&lt;br /&gt;
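&lt;br /&gt;
A minimal HTAR sketch (the file names and paths are placeholders; the -c, -x and -f flags follow the usual tar convention):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# bundle a directory of small files straight into the HPSS cache&lt;br /&gt;
htar -cf /archive/smallfiles.tar smallfiles/&lt;br /&gt;
# later, extract it back out&lt;br /&gt;
htar -xf /archive/smallfiles.tar&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;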
&lt;br /&gt;
== files and stubs in cache ==&lt;br /&gt;
* Under TSM-HSM, when files are migrated to tapes and the original files are purged from the cache, what is left behind is the directory tree with the HSM stub files and the metadata associated with them. It's possible to distinguish between resident and purged files by using the 'dsmls' command.&lt;br /&gt;
* Under HPSS when a file or directory is migrated and subsequently purged the cache stays empty, unlike HSM, where a zero-byte stub remains. In fact, this would be the main difference between TSM-HSM and HPSS. The HPSS Core Server maintains the HPSS name space in system metadata. However, from the HSI prompt we can still query the metadata/DB2 server indirectly with standard unix command such as ls, pwd, du, etc, and get a response from the system as if the files were really there.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Data Management|BACK TO Data Management]]&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Cpmd&amp;diff=9416</id>
		<title>Cpmd</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Cpmd&amp;diff=9416"/>
		<updated>2018-08-31T23:39:36Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
====CPMD====&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The CPMD version 3.13.2 package was built using the Intel v11.1 compilers, and the OpenMPI v1.4.1 library.&lt;br /&gt;
&lt;br /&gt;
Patched with the latest available patch.  In the SOURCE directory:&lt;br /&gt;
&lt;br /&gt;
   patch -p2 &amp;lt; ../../cpmd-3.13.2_01.patch&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Basic configuration used:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#INFO#&lt;br /&gt;
#INFO# Configuration to build a parallel cpmd executable for a linux machine&lt;br /&gt;
#INFO# with an AMD64/EM64T cpu (Opteron/AthlonFX/Athlon64/Xeon-EM64T) using &lt;br /&gt;
#INFO# the Intel Fortran Compiler with EM64T extensions.&lt;br /&gt;
#INFO#&lt;br /&gt;
#INFO# For optimal performance you should use a specifically tuned BLAS/LAPACK&lt;br /&gt;
#INFO# library. This example uses the Intel MKL library.&lt;br /&gt;
#INFO#&lt;br /&gt;
#INFO# see http://www.theochem.ruhr-uni-bochum.de/~axel.kohlmeyer/cpmd-linux.html&lt;br /&gt;
#INFO# for more information on compiling and running CPMD on linux machines.&lt;br /&gt;
#INFO#&lt;br /&gt;
#INFO# NOTE: CPMD cannot be compiled with the GNU Fortran compiler.&lt;br /&gt;
#INFO#&lt;br /&gt;
     IRAT=2&lt;br /&gt;
     CFLAGS='-O2 -Wall -m64'&lt;br /&gt;
     CPP='/lib/cpp -P -C -traditional'&lt;br /&gt;
     CPPFLAGS='-D__Linux -D__PGI -DFFT_DEFAULT -DPOINTER8 -DLINUX_IFC \&lt;br /&gt;
        -DPARALLEL -DMYRINET'&lt;br /&gt;
     FFLAGS='-pc64 -O2 -unroll'&lt;br /&gt;
     LFLAGS=' -L. -L${MKLPATH} ${MKLPATH}/libmkl_solver_lp64_sequential.a -Wl,--start-group \&lt;br /&gt;
        -lmkl_intel_lp64 -lmkl_sequential -lmkl_core -Wl,--end-group -lpthread'&lt;br /&gt;
     FFLAGS_GROMOS='-Dgood_luck $(FFLAGS)'&lt;br /&gt;
     if [ $debug ]; then&lt;br /&gt;
       FC='mpif77 -c -g'&lt;br /&gt;
       CC='mpicc -g -Wall -m64'&lt;br /&gt;
       LD='mpif77 -g'&lt;br /&gt;
     else&lt;br /&gt;
       FC='mpif77 -c '&lt;br /&gt;
       CC='mpicc'&lt;br /&gt;
       LD='mpif77 -static-intel '&lt;br /&gt;
     fi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
In the SOURCE directory do:&lt;br /&gt;
&lt;br /&gt;
   ./config.sh LINUX_INTEL64_INTEL_MPI &amp;gt; Makefile&lt;br /&gt;
   make &amp;gt;&amp;amp; make.out&lt;br /&gt;
&lt;br /&gt;
Created cpmd/3.13.2 module:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;tcl&amp;quot;&amp;gt;&lt;br /&gt;
#%Module -*- tcl -*-&lt;br /&gt;
&lt;br /&gt;
# CPMD 3.13.2&lt;br /&gt;
&lt;br /&gt;
proc ModulesHelp { } {&lt;br /&gt;
  puts stderr &amp;quot;\tThis module adds CPMD environment variables&amp;quot;&lt;br /&gt;
}&lt;br /&gt;
&lt;br /&gt;
module-whatis &amp;quot;adds CPMD environment variables&amp;quot;&lt;br /&gt;
&lt;br /&gt;
# CPMD was compiled with Intel compilers and OpenMPI&lt;br /&gt;
prereq intel &lt;br /&gt;
prereq openmpi&lt;br /&gt;
&lt;br /&gt;
setenv SCINET_CPMD_HOME /scinet/gpc/Applications/cpmd/3.13.2&lt;br /&gt;
setenv SCINET_CPMD_BIN /scinet/gpc/Applications/cpmd/3.13.2/bin&lt;br /&gt;
append-path PATH /scinet/gpc/Applications/cpmd/3.13.2/bin&lt;br /&gt;
setenv CPMD_PP_LIBRARY_PATH /scinet/gpc/Applications/cpmd/3.13.2/PPLIBNEW&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Running CPMD====&lt;br /&gt;
&lt;br /&gt;
- Load the necessary modules: intel, openmpi, cpmd (best done in your .bashrc)&lt;br /&gt;
&lt;br /&gt;
   # module load intel openmpi cpmd&lt;br /&gt;
&lt;br /&gt;
- The CPMD executable is in $SCINET_CPMD_BIN/cpmd.x&lt;br /&gt;
&lt;br /&gt;
- For multinode runs, use the sample IB script further below&lt;br /&gt;
&lt;br /&gt;
- The standard CPMD pseudopotentials are in the directory $CPMD_PP_LIBRARY_PATH.  The following sample scripts use a user-defined pseudopotential.&lt;br /&gt;
&lt;br /&gt;
- Create a Torque script to run CPMD.  Here is an example for a calculation on a single 8-core node:&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#MOAB/Torque submission script for Scinet GPC (ethernet)&lt;br /&gt;
#PBS -l nodes=1:ppn=8,walltime=00:30:00&lt;br /&gt;
#PBS -N cpmdjob&lt;br /&gt;
# load the cpmd and other required modules if not in .bashrc already&lt;br /&gt;
module load intel openmpi cpmd&lt;br /&gt;
&lt;br /&gt;
# If not an interactive job (i.e. -I), then cd into the directory where&lt;br /&gt;
# I typed qsub.&lt;br /&gt;
if [ &amp;quot;$PBS_ENVIRONMENT&amp;quot; != &amp;quot;PBS_INTERACTIVE&amp;quot; ]; then&lt;br /&gt;
   if [ -n &amp;quot;$PBS_O_WORKDIR&amp;quot; ]; then&lt;br /&gt;
     cd $PBS_O_WORKDIR&lt;br /&gt;
   fi&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
mpirun -np 8 -hostfile $PBS_NODEFILE $SCINET_CPMD_BIN/cpmd.x inp1 /home/mgalib/uspp/uspp-736/Pot &amp;gt;out1&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is a similar script, but this one uses 2 InfiniBand-connected nodes:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#PBS -l nodes=2:ib:ppn=8,walltime=48:00:00&lt;br /&gt;
#PBS -N cpmdjob&lt;br /&gt;
&lt;br /&gt;
## To submit type: qsub cpmd.sh (where cpmd.sh is the name of this script)&lt;br /&gt;
&lt;br /&gt;
# If not an interactive job (i.e. -I), then cd into the directory where&lt;br /&gt;
# I typed qsub.&lt;br /&gt;
if [ &amp;quot;$PBS_ENVIRONMENT&amp;quot; != &amp;quot;PBS_INTERACTIVE&amp;quot; ]; then&lt;br /&gt;
   if [ -n &amp;quot;$PBS_O_WORKDIR&amp;quot; ]; then&lt;br /&gt;
     cd $PBS_O_WORKDIR&lt;br /&gt;
   fi&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
mpirun -np 16 -hostfile $PBS_NODEFILE $SCINET_CPMD_BIN/cpmd.x inp1 /home/mgalib/uspp/uspp-736/Pot &amp;gt;out1&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
   &lt;br /&gt;
-- [[User:Dgruner|dgruner]]  3 September 2010&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Performance_And_Debugging_Tools:_TCS&amp;diff=9415</id>
		<title>Performance And Debugging Tools: TCS</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Performance_And_Debugging_Tools:_TCS&amp;diff=9415"/>
		<updated>2018-08-31T23:38:37Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Memory Profiling ==&lt;br /&gt;
&lt;br /&gt;
===PurifyPlus===&lt;br /&gt;
This is IBM/Rational's set of tools for memory profiling and code testing.  Specifically, the tools are:  purify, purecov and quantify.&lt;br /&gt;
&lt;br /&gt;
In order to gain access to these tools, you need to source the script that defines them to your shell:&lt;br /&gt;
&lt;br /&gt;
  &amp;lt;tt&amp;gt; source /scinet/tcs/Rational/purifyplus_setup.sh &amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The documentation for getting started with these tools is in the directory:&lt;br /&gt;
&lt;br /&gt;
 &amp;lt;tt&amp;gt; /scinet/tcs/Rational/releases/PurifyPlus.7.0.1/docs/pdf &amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
or here: [http://support.scinet.utoronto.ca/Manuals/PurifyPlusGettingStarted_en.pdf PurifyPlus Getting Started].&lt;br /&gt;
&lt;br /&gt;
==Debugging==&lt;br /&gt;
&lt;br /&gt;
===dbx===&lt;br /&gt;
&lt;br /&gt;
[http://publib.boulder.ibm.com/infocenter/systems/index.jsp?topic=/com.ibm.aix.genprogc/doc/genprogc/dbx_symbolic_debug.htm dbx] is a source-language-level serial debugger for AIX; it works more or less like gdb for stepping through code.  For it to be useful, you must have compiled with symbol information in the executable, e.g. with &amp;lt;tt&amp;gt;-g&amp;lt;/tt&amp;gt;.&lt;br /&gt;
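&lt;br /&gt;
For example, a minimal session might look like the following (the program name &amp;lt;tt&amp;gt;myprog&amp;lt;/tt&amp;gt; is just a stand-in for your own code):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
xlc -g myprog.c -o myprog&lt;br /&gt;
dbx ./myprog&lt;br /&gt;
(dbx) stop in main&lt;br /&gt;
(dbx) run&lt;br /&gt;
(dbx) step&lt;br /&gt;
(dbx) quit&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;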
&lt;br /&gt;
==Performance Profiling==&lt;br /&gt;
&lt;br /&gt;
===gprof===&lt;br /&gt;
&lt;br /&gt;
[http://sourceware.org/binutils/docs-2.19/gprof/index.html gprof] is a very useful tool for finding out where a program is spending its time; its use is described in our [[Introduction_To_Performance#gprof_.28profiling:_everywhere.29 | Intro To Performance]].  You compile with &amp;lt;tt&amp;gt;-g -pg&amp;lt;/tt&amp;gt; and run the program as usual; this creates a file named &amp;lt;tt&amp;gt;gmon.out&amp;lt;/tt&amp;gt;, which you can then examine with the &amp;lt;tt&amp;gt;gprof&amp;lt;/tt&amp;gt; program to see a profile of your program's execution time.&lt;br /&gt;
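&lt;br /&gt;
For example (a minimal sketch; &amp;lt;tt&amp;gt;myprog&amp;lt;/tt&amp;gt; stands in for your own program):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
xlf -g -pg myprog.f -o myprog&lt;br /&gt;
./myprog                          # writes gmon.out on exit&lt;br /&gt;
gprof ./myprog gmon.out &amp;gt; profile.txt&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;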
&lt;br /&gt;
===Xprofiler===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;tt&amp;gt;Xprof&amp;lt;/tt&amp;gt; is an AIX utility for graphically displaying the output of profiling information; it's equivalent to gprof but gives a better `bird's-eye view' of large and complex programs.   You compile your program with &amp;lt;tt&amp;gt;-pg -g&amp;lt;/tt&amp;gt; as before but run &amp;lt;tt&amp;gt;Xprof&amp;lt;/tt&amp;gt; on the output:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
Xprof program_name gmon.out&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Performance counters==&lt;br /&gt;
&lt;br /&gt;
===hpmcount===&lt;br /&gt;
&lt;br /&gt;
On the TCS, hpmcount allows the querying of the performance counter values in the CPUs themselves over the course of a run. Since here we are simply asking the CPU to report values it obtains during the run of a program, the code does not need to be instrumented; simply typing&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
hpmcount hpmcount_args program_name program_args&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
runs the job as normal and reports the counter results at the end of the run. &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
hpmcount -d -s 5,12 program_name program_args&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
reports counter sets 5 and 12, which are very useful for showing memory performance (L1 and L2 cache misses); set 6 is especially useful for shared-memory profiling, giving statistics about how often off-processor memory had to be accessed.   More details are available on our [[ Introduction_To_Performance#hpmcount_.28performance_counters:_TCS.29 | Introduction to Performance ]] page or from the [http://publib.boulder.ibm.com/infocenter/pseries/v5r3/index.jsp?topic=/com.ibm.aix.cmds/doc/aixcmds2/hpmcount.htm IBM website].&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Performance logging==&lt;br /&gt;
&lt;br /&gt;
===PeekPerf===&lt;br /&gt;
&lt;br /&gt;
[[Image:PeekPerf.png|thumb|An example of using PeekPerf]]&lt;br /&gt;
&lt;br /&gt;
[http://domino.research.ibm.com/comm/research_projects.nsf/pages/hpct.index.html Peekperf] is IBM's single graphical `dashboard' providing access to many performance measurement tools for examining Hardware Counter data, threads, message passing, IO, and memory access, several of which are available separately as command-line tools.   Its use is described in our [[Introduction_To_Performance#PeekPerf_.28profiling.2C_TCS.29 | Intro to Performance]].&lt;br /&gt;
&lt;br /&gt;
===MPE/Jumpshot===&lt;br /&gt;
&lt;br /&gt;
MPI defines a profiling layer which can be used to intercept MPI calls and log information about them.   This is used by the peekperf package above to instrument code by inserting function calls.  The same library and tools can be used manually: eg&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ mpCC -pg a.c -o a.out -L/usr/lpp/ppe.hpct/lib64 -lmpitrace&lt;br /&gt;
$ mpiexec -n 2 ./a.out -hostfile HOSTFILE&lt;br /&gt;
$ peekview result.viz&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
(where HOSTFILE is a file containing the TCS devel node hostname; see [[FAQ#MPI_development_and_interactive_testing | our FAQ entry for interactive running on the TCS nodes]].)&lt;br /&gt;
The above will use peekview to show the results for up to three of the MPI tasks (those with the minimum, median, and maximum MPI time, respectively); to get more output, one can set the environment variable&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
export OUTPUT_ALL_RANKS=yes&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
More information is available [https://wiki.alcf.anl.gov/index.php/HPCT_MPITRACE online].&lt;br /&gt;
&lt;br /&gt;
[[Image:Jumpshotoutput.png|thumb|jumpshot, part of the MPE development package, can give graphical overviews of the MPI behaviour of programs]]&lt;br /&gt;
&lt;br /&gt;
Another tool which performs the same task but generates more detailed output is the [http://www.mcs.anl.gov/research/projects/perfvis/ MPE tools] put out by Argonne National Lab.  On the TCS, these can be accessed by typing&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load mpe&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
and then using the mpe wrappers to compile your program: eg&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load mpe&lt;br /&gt;
mpefc -o a.out a.F -mpilog   &lt;br /&gt;
mpecc -o a.out a.c -mpilog   &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
Running your program as above will generate a log file named something like &amp;lt;tt&amp;gt;Unknown.clog2&amp;lt;/tt&amp;gt;; you can convert this to a format suitable for viewing and then use the &amp;lt;tt&amp;gt;jumpshot&amp;lt;/tt&amp;gt; interactive trace viewing tool,&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
clog2TOslog2 Unknown.clog2&lt;br /&gt;
jumpshot Unknown.slog2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
Note that this tries to open xwindows, so you will have had to login with &amp;lt;tt&amp;gt;ssh -X&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;ssh -Y&amp;lt;/tt&amp;gt; into both &amp;lt;tt&amp;gt;login.scinet&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;tcs01&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;tcs02&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
To use MPE logging on a batch job, you must specify the MPE path and library path in your batch script:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#&lt;br /&gt;
# LoadLeveler submission script for SciNet TCS: MPI job&lt;br /&gt;
#&lt;br /&gt;
#@ job_name        = testmpe&lt;br /&gt;
#@ initialdir      = /scratch/YOUR/DIRECTORY&lt;br /&gt;
#&lt;br /&gt;
#@ tasks_per_node  = 64&lt;br /&gt;
#@ node            = 2&lt;br /&gt;
#@ wall_clock_limit= 0:10:00&lt;br /&gt;
#@ output          = $(job_name).$(jobid).out&lt;br /&gt;
#@ error           = $(job_name).$(jobid).err&lt;br /&gt;
#&lt;br /&gt;
#@ notification = complete&lt;br /&gt;
#@ notify_user  = user@example.com&lt;br /&gt;
#&lt;br /&gt;
# Don't change anything below here unless you know exactly&lt;br /&gt;
# why you are changing it.&lt;br /&gt;
#&lt;br /&gt;
#@ job_type        = parallel&lt;br /&gt;
#@ class           = verylong&lt;br /&gt;
#@ node_usage      = not_shared&lt;br /&gt;
#@ rset = rset_mcm_affinity&lt;br /&gt;
#@ mcm_affinity_options = mcm_distribute mcm_mem_req mcm_sni_none&lt;br /&gt;
#@ cpus_per_core=2&lt;br /&gt;
#@ task_affinity=cpu(1)&lt;br /&gt;
#@ environment = COPY_ALL; MEMORY_AFFINITY=MCM; MP_SYNC_QP=YES; \&lt;br /&gt;
#                MP_RFIFO_SIZE=16777216; MP_SHM_ATTACH_THRESH=500000; \&lt;br /&gt;
#                MP_EUIDEVELOP=min; MP_USE_BULK_XFER=yes; \&lt;br /&gt;
#                MP_RDMA_MTU=4K; MP_BULK_MIN_MSG_SIZE=64k; MP_RC_MAX_QP=8192; \&lt;br /&gt;
#                PSALLOC=early; NODISCLAIM=true&lt;br /&gt;
#&lt;br /&gt;
&lt;br /&gt;
MPEDIR=/scinet/tcs/mpi/mpe2-1.0.6&lt;br /&gt;
export PATH=${MPEDIR}/bin:${MPEDIR}/sbin:${PATH}&lt;br /&gt;
export LD_LIBRARY_PATH=${MPEDIR}/lib:${LD_LIBRARY_PATH}&lt;br /&gt;
&lt;br /&gt;
YOUR-PROGRAM&lt;br /&gt;
&lt;br /&gt;
#@ queue&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Scalasca===&lt;br /&gt;
&lt;br /&gt;
[http://www.fz-juelich.de/jsc/scalasca/ Scalasca] is a sophisticated tool for analyzing performance and finding common performance problems, installed on both TCS and GPC.  We describe it in our [[Introduction_To_Performance#Scalasca_.28profiling.2C_tracing:_TCS.2C_GPC.29 | Intro to Performance]].&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_User_Support&amp;diff=9414</id>
		<title>SciNet User Support</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_User_Support&amp;diff=9414"/>
		<updated>2018-08-31T23:38:13Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=== Support team ===&lt;br /&gt;
SciNet's [http://www.scinethpc.ca/staff support team of system administrators and technical analysts] can help you with a wide range of problems, such as&lt;br /&gt;
* how to use SciNet's systems,&lt;br /&gt;
* how to set up your runs most efficiently,&lt;br /&gt;
* how to parallelize or optimize your code,&lt;br /&gt;
* using and installing libraries.&lt;br /&gt;
SciNet support staff is located at [http://www.scinethpc.ca/2010/08/contact-us/ 256 McCaul St] on the St. George Campus of the University of Toronto.&lt;br /&gt;
&lt;br /&gt;
=== Courses ===&lt;br /&gt;
SciNet organizes a number of classes for its users:&lt;br /&gt;
* Intro to SciNet (1 hour)&lt;br /&gt;
* SciNet User Group meetings, including 30-minute TechTalks, every second Wednesday of the month at noon&lt;br /&gt;
* Full-day courses on e.g. parallel I/O, (parallel) programming, etc.&lt;br /&gt;
* 5-day course on parallel programming (OpenMP, MPI)&lt;br /&gt;
Read more on these on the [https://support.scinet.utoronto.ca/education Training and Education web site], where you can also register (&amp;quot;enroll&amp;quot;).&lt;br /&gt;
&lt;br /&gt;
Many past lecture slides and some videos can be found on the [[Knowledge Base: Tutorials and Manuals]] page.&lt;br /&gt;
&lt;br /&gt;
=== How to request help ===&lt;br /&gt;
If you have problems, questions, or requests, and you couldn't find the answer in this wiki, send an e-mail '''with all relevant information''' to:&lt;br /&gt;
&lt;br /&gt;
:[mailto:support@scinet.utoronto.ca &amp;lt;support@scinet.utoronto.ca&amp;gt;] &lt;br /&gt;
&lt;br /&gt;
'''''For connection problems, include:'''''&lt;br /&gt;
* Where you are connecting from,&lt;br /&gt;
* Where you are trying to connect to&lt;br /&gt;
* The type of computer you are connecting from&lt;br /&gt;
&lt;br /&gt;
'''''For failed jobs, include:'''''&lt;br /&gt;
* What cluster you were using&lt;br /&gt;
* A description of what the run is supposed to do&lt;br /&gt;
* Job ID&lt;br /&gt;
* Error messages you got&lt;br /&gt;
* Full path to the job script&lt;br /&gt;
* Full path to your code&lt;br /&gt;
* Location of input/error files (please do not delete these!)&lt;br /&gt;
&lt;br /&gt;
'''''For failed compilations, include:'''''&lt;br /&gt;
* What cluster you were using&lt;br /&gt;
* Full path to the code and Makefiles&lt;br /&gt;
* A description of what the code is supposed to do&lt;br /&gt;
* Error messages you got&lt;br /&gt;
&lt;br /&gt;
'''''To make an appointment with one of our analysts:'''''&lt;br /&gt;
* Use the same e-mail, [mailto:support@scinet.utoronto.ca &amp;lt;support@scinet.utoronto.ca&amp;gt;] &lt;br /&gt;
* There is no formal ticketing system.&lt;br /&gt;
* Avoid sending email only to specific individuals at SciNet. Your chances of a quick reply increase significantly if you email our team.&lt;br /&gt;
&lt;br /&gt;
'''Note that your password should never, never, never be sent to us, even if your question is about your account.'''&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=MARS&amp;diff=9413</id>
		<title>MARS</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=MARS&amp;diff=9413"/>
		<updated>2018-08-31T23:37:37Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
MARS is no more. For its replacement, HPSS, [https://support.scinet.utoronto.ca/wiki/index.php/HPSS follow this link].&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=HPSS_Servers&amp;diff=9412</id>
		<title>HPSS Servers</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=HPSS_Servers&amp;diff=9412"/>
		<updated>2018-08-31T23:36:46Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
HPSS servers include the Core Server, Migration/Purge Server, Gatekeeper, Location Server, Log Client, Log Daemon, Physical Volume Library, Physical Volume Repository, Mover, Storage System Management System Manager, and Startup Daemon. Each of these major server components, and the control communication among them, is described below; infrastructure components are what glue together the distributed servers.&lt;br /&gt;
&lt;br /&gt;
'''''Core Server.''''' The Core Server provides several key sets of functionality.&lt;br /&gt;
&lt;br /&gt;
First, the Core Server provides translation between human-oriented names and HPSS object identifiers. Name space objects managed by the Core Server are filesets, junctions, directories, files, hard links, and symbolic links. The Core Server provides access verification to objects and mechanisms for manipulating access to these objects via a Portable Operating System Interface (POSIX) view of the name space. This name space is a hierarchical structure consisting of directories, files, and links. These name space objects may exist within filesets that are connected via junctions.&lt;br /&gt;
&lt;br /&gt;
Second, the Core Server provides the abstraction of logical bitfiles to its clients. A bitfile is identified by a Core Server generated name called a bitfile ID. Clients may reference portions of a bitfile by specifying the bitfile ID and a starting address and length. The Core Server supports random access to files and sparsely written files. It supports parallel reading and writing of data to bitfiles and performs the mapping of logical portions of bitfiles onto physical storage devices. The Core Server supports the migration, purging, and staging of data in a storage hierarchy (though the migration/purge policies are implemented through the Migration/Purge Server, a client to the Core Server).&lt;br /&gt;
&lt;br /&gt;
Third, the Core Server provides a hierarchy of storage objects: storage segments, virtual volumes, and physical volumes. The Core Server translates storage segment references into virtual volume references and then into physical volume references, handles the mapping of physical resources into striped virtual volumes to allow parallel I/O to that set of resources, and schedules the mounting and dismounting of removable media through the Physical Volume Library (see below).&lt;br /&gt;
&lt;br /&gt;
'''''Migration/Purge Server (MPS).''''' The MPS allows a site to implement its storage management policies by managing the placement of data on HPSS storage media using site-defined migration and purge policies. By making appropriate calls to its Core Server, an MPS copies data to lower levels in the hierarchy (migration), removes data from the current level once copies have been made (purge), or moves data between volumes at the same level (lateral move). Based on the hierarchy configuration, MPS can be directed to create duplicate copies of data when it is being migrated from disk or tape. This is done by copying the data to multiple lower levels in the storage hierarchy.&lt;br /&gt;
&lt;br /&gt;
There are three types of migration: disk migration, tape file migration, and tape volume migration. The designation disk or tape refers to the type of storage class that migration is running against. See Section 3.7.2: Migration/Purge Server on page 72 for a more complete discussion of the different types of migration.&lt;br /&gt;
&lt;br /&gt;
MPS runs migration on each storage class periodically using the time interval specified in the migration policy for that class. See Section 2.3.7: HPSS Policy Modules on page 34 for details on migration and purge policies. Migration runs can be started automatically when the warning or critical space thresholds for the storage class are exceeded. In addition, migration runs can be started manually by an administrator.&lt;br /&gt;
&lt;br /&gt;
Purge runs are started automatically on each storage class when the free space in that class falls below the percentage specified in the purge policy. Purge runs may also be started manually.&lt;br /&gt;
&lt;br /&gt;
'''''Disk Migration/Purge:'''''&lt;br /&gt;
The purpose of disk migration is to make one or more copies of disk files to lower levels in the hierarchy. The number of copies depends on the configuration of the hierarchy. For disk, migration and purge are separate operations. It is common for disk storage classes which have been configured for migration to also be configured for purge as well. Once a file has been migrated (copied) downwards in the hierarchy, it becomes eligible for purge, which subsequently removes the file from the higher level and allows the disk space to be reused.&lt;br /&gt;
&lt;br /&gt;
'''''Tape File Migration:'''''&lt;br /&gt;
The purpose of tape file migration is to make an additional copy (or multiple additional copies) of a file, in a tape storage class, to a lower level in the hierarchy. It is also possible to move files downwards instead of copying them. In this case there is no duplicate copy maintained. There is no separate purge component to tape file migration. Empty volumes must be reclaimed using the reclaim utility.&lt;br /&gt;
&lt;br /&gt;
'''''Tape Volume Migration:'''''&lt;br /&gt;
The purpose of tape volume migration is to free tape volumes for reuse. Tape volumes are selected based on being in the EOM map state and containing the most unused space (caused by users overwriting or deleting files). The remaining segments on these volumes are either migrated downwards to the next level in the hierarchy, or are moved laterally to another tape volume at the same level. This results in empty tape volumes which may then be reclaimed. Note that there is no purge component to tape volume migration. All of the operations use a move instead of a copy semantic.&lt;br /&gt;
&lt;br /&gt;
'''''Gatekeeper (GK).''''' The Gatekeeper provides two main services:&lt;br /&gt;
* It provides sites with the ability to schedule the use of HPSS resources using the Gatekeeping Service.&lt;br /&gt;
* It provides sites with the ability to validate user accounts using the Account Validation Service.&lt;br /&gt;
&lt;br /&gt;
Both of these services allow sites to implement their own policy.&lt;br /&gt;
&lt;br /&gt;
The default Gatekeeping Service policy is to not do any gatekeeping. Sites may choose to implement a policy for monitoring authorized callers, creates, opens, and stages. The Core Server will call the appropriate GK API depending on the requests that the site-implemented policy is monitoring.&lt;br /&gt;
&lt;br /&gt;
The Account Validation Service performs authorizations of user storage charges. A site may perform no authorization, default authorization, or site-customized authorization depending on how the Accounting Policy is set up and whether or not a site has written site-specific account validation code. Clients call this service when creating files, changing file ownership, or changing accounting information. If Account Validation is enabled, the Account Validation Service determines if the user is allowed to use a specific account or gives the user an account to use, if needed. The Core Server also calls this service to perform an authorization check just before account-sensitive operations take place.&lt;br /&gt;
&lt;br /&gt;
'''''Location Server (LS).''''' The Location Server acts as an information clearinghouse to its clients through the HPSS Client API to enable them to locate servers and gather information from both local and remote HPSS systems. Its primary function is to allow a client to determine a server's location and, by knowing other information about the server such as its object UUID, determine its server type or its subsystem id. This allows a client to contact the appropriate server. Usually the Location Server is only used by the Core Server or the Gatekeeper.&lt;br /&gt;
&lt;br /&gt;
'''''Physical Volume Library (PVL).''''' The PVL manages all HPSS physical volumes. It is in charge of mounting and dismounting sets of physical volumes, allocating drive and cartridge resources to satisfy mount and dismount requests, providing a mapping of physical volume to cartridge and of cartridge to Physical Volume Repository (PVR), and issuing commands to PVRs to perform physical mount and dismount actions. A primary function of the PVL is the support for atomic mounts of sets of cartridges for parallel access to data. Atomic mounts are implemented by the PVL, which waits until all necessary cartridge resources for a request are available before issuing mount commands to the PVRs.&lt;br /&gt;
&lt;br /&gt;
'''Physical Volume Repository (PVR).''' PVRs manage HPSS cartridges. Though an HPSS system may contain multiple PVRs, each cartridge is managed by only one. PVRs provide APIs for clients to request cartridge mounts and dismounts and query the status of cartridges. For convenience, PVRs are often configured in one-to-one correspondence to tape libraries.&lt;br /&gt;
&lt;br /&gt;
For information on the types of tape libraries supported by HPSS PVRs, see Section 3.4.2:Robotically Mounted Tape on page 49.&lt;br /&gt;
&lt;br /&gt;
An Operator PVR is provided for cartridges not under control of a robotic library. These cartridges are mounted on a set of drives by operators.&lt;br /&gt;
&lt;br /&gt;
'''''Mover (MVR).''''' The purpose of the Mover is to transfer data from a source device to a sink device. A device can be a standard I/O device with geometry (e.g., tape, disk) or a device without geometry (e.g., network, memory). The Mover’s client (typically the Core Server) describes the data to be moved and where the data is to be sent. It is the Mover’s responsibility to actually transfer the data, retrying failed requests and attempting to optimize transfers. The Mover supports transfers for disk devices, tape devices and a mover protocol that can be used as a lightweight coordination and flow control mechanism for large transfers.&lt;br /&gt;
&lt;br /&gt;
'''''Log Client (LOGC).''''' The Log Client receives log messages from each HPSS server running on its node and filters those messages based on the configured log policies. Log messages which pass the filter are written to a local log and sent on to the Log Daemon.&lt;br /&gt;
&lt;br /&gt;
'''''Log Daemon (LOGD).''''' The Log Daemon enters log messages into the central log file and sends those messages which are to be displayed in the Alarms and Events SSM window to the SSM System Manager.&lt;br /&gt;
&lt;br /&gt;
'''''Storage System Management System Manager (SSMSM).''''' SSM, the Storage System Management subsystem, is the tool used by the system administrator to manage HPSS. SSM has three components, one of which is an HPSS server and two of which are user interface client programs. The server is:&lt;br /&gt;
* SSM System Manager (SSMSM, or hpss_ssmsm) - Communicates with all other HPSS components requiring monitoring or control.&lt;br /&gt;
&lt;br /&gt;
The user interface clients are:&lt;br /&gt;
* SSM GUI (hpssgui) - Provides the HPSS administrator or operator the ability to configure or monitor the HPSS System through a graphical user interface.&lt;br /&gt;
* SSM Command Line Interface (hpssadm) - Provides the HPSS administrator or operator the ability to configure or monitor a subset of the HPSS system through a set of interactive or batch commands.&lt;br /&gt;
SSM enables the administrator to configure, monitor and control the resources (servers, devices, tape libraries, and media) of HPSS in ways that conform to the management policies of a given customer site.&lt;br /&gt;
&lt;br /&gt;
Monitoring capabilities include the ability to query the values of important management attributes of storage system resources and the ability to receive notifications of alarms and other significant system events. Controlling capabilities include the ability to start up and shut down servers and the ability to set the values of management attributes of storage system resources and storage system policy parameters. Additionally, SSM can request that specific operations be performed on resources within the storage system, such as adding and deleting logical or physical resources.&lt;br /&gt;
&lt;br /&gt;
'''''Startup Daemon.''''' The Startup Daemon is called by the SSMSM to start each HPSS server except the SSMSM, the Startup Daemon itself, and the remote portion of the Mover. A Startup Daemon is required on each node where any HPSS Server executes, with the exception that no Startup Daemon is required on nodes where the only HPSS Server is the remote portion of the Mover.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[https://support.scinet.utoronto.ca/wiki/index.php/HPSS BACK TO HPSS]&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Systems_Overview&amp;diff=9411</id>
		<title>Systems Overview</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Systems_Overview&amp;diff=9411"/>
		<updated>2018-08-31T23:35:48Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==SciNet Systems==&lt;br /&gt;
SciNet is a consortium for High-Performance Computing consisting of researchers at the University of Toronto&lt;br /&gt;
and its associated hospitals. It is one of seven such consortia in Canada, which together form [https://computecanada.org/ Compute/Calcul Canada].&lt;br /&gt;
&lt;br /&gt;
SciNet has two main clusters:&lt;br /&gt;
{|-&lt;br /&gt;
|&lt;br /&gt;
===General Purpose Cluster (GPC)===&lt;br /&gt;
[[Image:University_of_Tor_79284gm-a.jpg|left]]&lt;br /&gt;
&lt;br /&gt;
The [[GPC_Quickstart|General Purpose Cluster]] is an extremely large cluster (ranked 16th in the world at its inception in June 2009, and the fastest in Canada) and is where most simulations are to be done at SciNet. It is an IBM iDataPlex cluster based on Intel's Nehalem architecture (one of the first in the world to make use of the new chips). The GPC consists of 3,780 nodes with a total of 30,240 2.5GHz cores, with 16GB RAM per node (2GB per core). Approximately one quarter of the cluster is interconnected with non-blocking DDR InfiniBand, while the rest of the nodes are connected with 5:1 blocked QDR InfiniBand. The compute nodes are accessed through a queuing system that allows jobs with a maximum wall time of 48 hours.&lt;br /&gt;
|-&lt;br /&gt;
|&lt;br /&gt;
&lt;br /&gt;
===Tightly Coupled System  (TCS)===&lt;br /&gt;
&lt;br /&gt;
[[Image:TCS-1.jpg|left|160px]]&lt;br /&gt;
&lt;br /&gt;
The [[TCS_Quickstart|TCS]] is a cluster of `fat' (high-memory, many-core) POWER6 nodes on a very fast Infiniband connection, which was installed in January of 2009. It has a relatively small number of cores (~3000), arranged in nodes of 32 cores with 128 GB of RAM on each node. It is dedicated for jobs that require such a large memory / low latency configuration. Jobs need to use multiples of 32 cores (a node), and are submitted to the LoadLeveler queuing system which allows jobs with a maximum wall time of 48 hours.&lt;br /&gt;
|-&lt;br /&gt;
|&lt;br /&gt;
===Power 7 Linux Cluster (P7)===&lt;br /&gt;
&lt;br /&gt;
[[Image:IBM755.jpg|right|160px]]&lt;br /&gt;
&lt;br /&gt;
The [[P7_Linux_Cluster|P7]] is a cluster of 5 (soon to be at least 8) `fat' (high-memory, many-core) POWER7 nodes on a very fast Infiniband connection, which was installed in May of 2011. It has a total of 160 cores, arranged in nodes of 32 cores with 128 GB of RAM on each node. It is dedicated for jobs that require such a large memory / low latency configuration. Jobs need to use multiples of 32 cores or more, and are submitted to the LoadLeveler queuing system which allows jobs with a maximum wall time of 48 hours.&lt;br /&gt;
&lt;br /&gt;
|-&lt;br /&gt;
|&lt;br /&gt;
===Accelerator Research Cluster (ARC)===&lt;br /&gt;
The [[Accelerator_Research_Cluster|ARC]] is a technology evaluation cluster with a combination of 14 IBM PowerXCell 8i &amp;quot;Cell&amp;quot; nodes and 8 Intel x86_64 &amp;quot;Nehalem&amp;quot; nodes containing 16 NVIDIA M2070 GPUs. The QS22s each have two 3.2GHz IBM PowerXCell 8i CPUs, where each CPU has 1 Power Processing Unit (PPU) and 8 Synergistic Processing Units (SPUs), and 32GB of RAM per node. The Intel nodes each have two 2.53GHz 4-core Xeon X5550 CPUs and 48GB of RAM, and contain two NVIDIA M2070 (Fermi) GPUs, each with 6GB of RAM.&lt;br /&gt;
&lt;br /&gt;
Please note that this cluster is not a production cluster and is only accessible to selected users. &lt;br /&gt;
&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Namd_on_BGQ&amp;diff=9410</id>
		<title>Namd on BGQ</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Namd_on_BGQ&amp;diff=9410"/>
		<updated>2018-08-31T23:35:02Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
A parameter study was undertaken to test simulation performance and efficiency of NAMD on the Blue Gene/Q cluster, [[BGQ]], with attention to NAMD performance tuning documentation. Determining optimal parameters for a NAMD simulation on this system is more difficult as there are only certain simulation sizes that have optimal topologies (32, 64, 128, etc.). The system of study is a 246,000 atom membrane protein simulation ([http://www.rcsb.org/pdb/explore.do?structureId=1m56 Cytochrome c Oxidase] embedded in a TIP3P solvated DPPC bilayer) using the CHARMM36 forcefield (protein and lipids). The unit cell is cubic with box dimensions 144 x 144 x 117 Angstroms and the simulation time-step was 2fs. The following non-bonded frequency parameters were also used: nonbondedFreq=1, fullElectFrequency=2, stepspercycle=10. &lt;br /&gt;
&lt;br /&gt;
The following benchmarks were performed with the non-SMP NAMD build, with the exception of the last section. All simulations are started using a restart file from a pre-equilibrated snapshot. Performance in nanoseconds per day is based on the geometric mean of the three &amp;quot;'''Benchmark time'''&amp;quot; lines at the beginning of the simulation's standard output, and may not represent long-time averages. The &amp;quot;Cores&amp;quot; column in the following results indicates the number of physical cores specified by the '''bg_size''' parameter in the submission script.&lt;br /&gt;
&lt;br /&gt;
== Performance Tuning Benchmarks ==&lt;br /&gt;
&lt;br /&gt;
Efficiency is measured with respect to the 16 ranks-per-node, 32-core simulation. In this section, the PME patch grid was manually doubled in the X, Y, or Z directions. The default PME patch doubling in NAMD 2.9 is generally recommended (the twoAway parameters need not be specified in the configuration file). &lt;br /&gt;
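&lt;br /&gt;
For reference, the manual doublings in the table below correspond to configuration lines of the following form (a sketch in NAMD 2.9 syntax; here the twoAwayXY case):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
twoAwayX yes&lt;br /&gt;
twoAwayY yes&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;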
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|Ranks&lt;br /&gt;
|Cores&lt;br /&gt;
|NAMD Config Options&lt;br /&gt;
|ns/day&lt;br /&gt;
|Efficiency&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|32&lt;br /&gt;
|&lt;br /&gt;
|2.79&lt;br /&gt;
|1.00&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|64&lt;br /&gt;
|&lt;br /&gt;
|5.05&lt;br /&gt;
|0.91&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|64&lt;br /&gt;
|twoAwayX (default)&lt;br /&gt;
|5.62&lt;br /&gt;
|1.01&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|128&lt;br /&gt;
|twoAwayX (default)&lt;br /&gt;
|10.07&lt;br /&gt;
|0.90&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|128&lt;br /&gt;
|twoAwayXY&lt;br /&gt;
|10.59&lt;br /&gt;
|0.95&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|256&lt;br /&gt;
|twoAwayX&lt;br /&gt;
|14.32&lt;br /&gt;
|0.64&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|256&lt;br /&gt;
|twoAwayXY (default)&lt;br /&gt;
|17.63&lt;br /&gt;
|0.79&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|256&lt;br /&gt;
|twoAwayXYZ&lt;br /&gt;
|16.79&lt;br /&gt;
|0.75&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|512&lt;br /&gt;
|twoAwayX&lt;br /&gt;
|23.52&lt;br /&gt;
|0.53&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|512&lt;br /&gt;
|twoAwayXY (default)&lt;br /&gt;
|25.00&lt;br /&gt;
|0.56&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|1024&lt;br /&gt;
|twoAwayX&lt;br /&gt;
|23.67&lt;br /&gt;
|0.27&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|1024&lt;br /&gt;
|twoAwayXY&lt;br /&gt;
|28.31&lt;br /&gt;
|0.32&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|1024&lt;br /&gt;
|twoAwayXYZ (default)&lt;br /&gt;
|27.98&lt;br /&gt;
|0.31&lt;br /&gt;
|----&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== PME Pencils ==&lt;br /&gt;
A &amp;quot;pencil-based&amp;quot; PME decomposition may be more efficient than the default &amp;quot;slab-based decomposition&amp;quot;. In this study PME pencil grids are created for both dedicated PME nodes (lblUnload=yes) and non-dedicated PME nodes. Fine-tuning of PMEPencils resulted in insignificant performance gains for this study.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|Ranks&lt;br /&gt;
|Cores&lt;br /&gt;
|NAMD Config Options&lt;br /&gt;
|ns/day&lt;br /&gt;
|Efficiency&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|256&lt;br /&gt;
|twoAwayXY, PMEPencils=8, lblUnload=yes&lt;br /&gt;
|12.93&lt;br /&gt;
|0.58&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|256&lt;br /&gt;
|twoAwayXY, PMEPencils=12, lblUnload=yes&lt;br /&gt;
|17.27&lt;br /&gt;
|0.77&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|256&lt;br /&gt;
|twoAwayXY, PMEPencils=16, lblUnload=yes&lt;br /&gt;
|16.02&lt;br /&gt;
|0.72&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|256&lt;br /&gt;
|twoAwayXY, PMEPencils=20, lblUnload=yes&lt;br /&gt;
|15.41&lt;br /&gt;
|0.69&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|256&lt;br /&gt;
|twoAwayXY, PMEPencils=12&lt;br /&gt;
|16.21&lt;br /&gt;
|0.73&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|256&lt;br /&gt;
|twoAwayXY, PMEPencils=16&lt;br /&gt;
|17.92&lt;br /&gt;
|0.80&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|256&lt;br /&gt;
|twoAwayXY, PMEPencils=20&lt;br /&gt;
|17.99&lt;br /&gt;
|0.81&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|256&lt;br /&gt;
|twoAwayXY, PMEPencils=24&lt;br /&gt;
|17.83&lt;br /&gt;
|0.80&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|256&lt;br /&gt;
|twoAwayXY, PMEPencils=36&lt;br /&gt;
|16.97&lt;br /&gt;
|0.76&lt;br /&gt;
|----&lt;br /&gt;
|8&lt;br /&gt;
|256&lt;br /&gt;
|twoAwayXY, PMEPencils=20&lt;br /&gt;
|18.24&lt;br /&gt;
|0.82&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|256&lt;br /&gt;
|twoAwayXY, PMEPencils=20&lt;br /&gt;
|17.99&lt;br /&gt;
|0.81&lt;br /&gt;
|----&lt;br /&gt;
|32&lt;br /&gt;
|256&lt;br /&gt;
|twoAwayXY, PMEPencils=20&lt;br /&gt;
|13.94&lt;br /&gt;
|0.63&lt;br /&gt;
|----&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Ranks-Per-Node Study ==&lt;br /&gt;
The &amp;quot;ranks-per-node&amp;quot; setting, simply the number of processes per compute node, is a parameter of the Blue Gene/Q runjob command. The following efficiency estimates are measured with respect to the 16-ranks-per-node results for the same number of nodes (with the default twoAway choices and no PME pencils). These simulations offered the best performance and were used in production simulations.&lt;br /&gt;
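&lt;br /&gt;
For example, a 16-ranks-per-node launch might look like the following (a hypothetical invocation using standard Blue Gene/Q runjob flags; the configuration file name is an illustration):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
runjob --np 512 --ranks-per-node 16 : namd2 myrun.conf&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;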
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|Ranks&lt;br /&gt;
|Cores&lt;br /&gt;
|NAMD Config Options&lt;br /&gt;
|ns/day&lt;br /&gt;
|Efficiency&lt;br /&gt;
|----&lt;br /&gt;
|32&lt;br /&gt;
|32&lt;br /&gt;
|&lt;br /&gt;
|4.46&lt;br /&gt;
|1.6&lt;br /&gt;
|----&lt;br /&gt;
|32&lt;br /&gt;
|64&lt;br /&gt;
|&lt;br /&gt;
|8.01&lt;br /&gt;
|1.43&lt;br /&gt;
|----&lt;br /&gt;
|32&lt;br /&gt;
|128&lt;br /&gt;
|&lt;br /&gt;
|13.74&lt;br /&gt;
|1.3&lt;br /&gt;
|----&lt;br /&gt;
|32&lt;br /&gt;
|256&lt;br /&gt;
|&lt;br /&gt;
|19.81&lt;br /&gt;
|1.12&lt;br /&gt;
|----&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Incorrect Particle-Mesh Ewald Grid ==&lt;br /&gt;
&lt;br /&gt;
Long-range electrostatics are computed using PME for all of the simulations above, with the PME grid generated automatically via the &amp;quot;'''pmeGridSpacing 1.0'''&amp;quot; setting. A poor choice of PME grid dimensions (i.e. dimensions that are not products of the small factors 2, 3, and 5) can result in increasingly large performance degradation, due to the matrix size requirements of the FFT algorithm. Below is an example of the type of degradation one may expect when the Z dimension (111 = 3 x 37) is not such a product. Compare with the corresponding runs in the Performance Tuning Benchmarks above.&lt;br /&gt;
&lt;br /&gt;
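For reference, automatic versus explicit grid selection looks like the following in the NAMD configuration file (a sketch; the explicit sizes reproduce the poorly factored grid used below):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# automatic: NAMD picks well-factored grid dimensions&lt;br /&gt;
PMEGridSpacing 1.0&lt;br /&gt;
&lt;br /&gt;
# explicit: forces the poorly factored Z dimension (111 = 3 x 37)&lt;br /&gt;
PMEGridSizeX 144&lt;br /&gt;
PMEGridSizeY 144&lt;br /&gt;
PMEGridSizeZ 111&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;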
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|Ranks&lt;br /&gt;
|Cores&lt;br /&gt;
|NAMD Config Options&lt;br /&gt;
|ns/day&lt;br /&gt;
|Efficiency&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|32&lt;br /&gt;
|Poor PME Multiple (144x144x111)&lt;br /&gt;
|2.70&lt;br /&gt;
|0.97&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|64&lt;br /&gt;
|Poor PME Multiple (144x144x111)&lt;br /&gt;
|5.13&lt;br /&gt;
|0.92&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|128&lt;br /&gt;
|Poor PME Multiple (144x144x111)&lt;br /&gt;
|8.61&lt;br /&gt;
|0.77&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|256&lt;br /&gt;
|Poor PME Multiple (144x144x111)&lt;br /&gt;
|13.93&lt;br /&gt;
|0.62&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|512&lt;br /&gt;
|Poor PME Multiple (144x144x111)&lt;br /&gt;
|17.08&lt;br /&gt;
|0.38&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|1024&lt;br /&gt;
|Poor PME Multiple (144x144x111)&lt;br /&gt;
|17.64&lt;br /&gt;
|0.20&lt;br /&gt;
|----&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Symmetric multiprocessing Study ==&lt;br /&gt;
All of the prior benchmarks were performed with a non-SMP NAMD binary, so true multithreading was not possible. The following results use the SMP build and offer performance increases of up to 40% compared to the non-SMP benchmarks. Processors per node (ppn) was varied from 1 (not multithreaded) to 4; values greater than 4 resulted in a crash. Adding the Charm++ flag &amp;quot;+CmiNoProcForComThread&amp;quot; to the NAMD command line gave negligible improvements in all cases. Efficiency is measured with respect to the 512/ppn1 simulation.&lt;br /&gt;
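&lt;br /&gt;
For the SMP runs, the thread count is passed to the NAMD binary itself; a hypothetical launch with 4 worker threads per rank might look like:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
runjob --ranks-per-node 16 : namd2 +ppn 4 myrun.conf&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;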
&lt;br /&gt;
Comparing 32/ppn4 with 64/ppn4, we observed 85% scaling, which is within acceptable criteria for production runs. The 128/ppn4 simulations showed poor scaling, so larger systems were not tested.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|Ranks&lt;br /&gt;
|Cores&lt;br /&gt;
|NAMD Config Options&lt;br /&gt;
|ns/day&lt;br /&gt;
|Efficiency&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|32&lt;br /&gt;
|ppn 1&lt;br /&gt;
|2.75&lt;br /&gt;
|1.00&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|32&lt;br /&gt;
|ppn 2&lt;br /&gt;
|4.60&lt;br /&gt;
|1.67&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|32&lt;br /&gt;
|ppn 3&lt;br /&gt;
|5.59&lt;br /&gt;
|2.03&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|32&lt;br /&gt;
|ppn 4&lt;br /&gt;
|6.31&lt;br /&gt;
|2.29&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|32&lt;br /&gt;
|ppn 1, +CmiNoProcForComThread&lt;br /&gt;
|2.74&lt;br /&gt;
|1.00&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|32&lt;br /&gt;
|ppn 2, +CmiNoProcForComThread&lt;br /&gt;
|4.62&lt;br /&gt;
|1.68&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|32&lt;br /&gt;
|ppn 3, +CmiNoProcForComThread&lt;br /&gt;
|5.50&lt;br /&gt;
|2.00&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|32&lt;br /&gt;
|ppn 4, +CmiNoProcForComThread&lt;br /&gt;
|6.31&lt;br /&gt;
|2.30&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|64&lt;br /&gt;
|ppn 1&lt;br /&gt;
|5.54&lt;br /&gt;
|1.01&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|64&lt;br /&gt;
|ppn 2&lt;br /&gt;
|8.31&lt;br /&gt;
|1.51&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|64&lt;br /&gt;
|ppn 3&lt;br /&gt;
|8.56&lt;br /&gt;
|1.56&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|64&lt;br /&gt;
|ppn 4&lt;br /&gt;
|10.60&lt;br /&gt;
|1.92&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|64&lt;br /&gt;
|ppn 4, +CmiNoProcForComThread&lt;br /&gt;
|10.58&lt;br /&gt;
|1.92&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|128&lt;br /&gt;
|ppn 1&lt;br /&gt;
|9.93&lt;br /&gt;
|0.90&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|128&lt;br /&gt;
|ppn 2&lt;br /&gt;
|13.86&lt;br /&gt;
|1.26&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|128&lt;br /&gt;
|ppn 3&lt;br /&gt;
|14.72&lt;br /&gt;
|1.34&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|128&lt;br /&gt;
|ppn 4&lt;br /&gt;
|15.10&lt;br /&gt;
|1.37&lt;br /&gt;
|----&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
In an attempt to maximize the number of processes per node (64), one may also use fewer ranks per node, such as 2, 4 or 8, while requesting a larger number of threads. The performance of 8 ranks per node on 64 physical cores with ppn 8 is approximately equal to the performance of the 16-ranks-per-node runs above. &lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|Ranks&lt;br /&gt;
|Cores&lt;br /&gt;
|NAMD Config Options&lt;br /&gt;
|ns/day&lt;br /&gt;
|Efficiency&lt;br /&gt;
|----&lt;br /&gt;
|8&lt;br /&gt;
|32&lt;br /&gt;
|ppn 1&lt;br /&gt;
|1.53&lt;br /&gt;
|0.56&lt;br /&gt;
|----&lt;br /&gt;
|8&lt;br /&gt;
|32&lt;br /&gt;
|ppn 2&lt;br /&gt;
|2.77&lt;br /&gt;
|1.01&lt;br /&gt;
|----&lt;br /&gt;
|8&lt;br /&gt;
|32&lt;br /&gt;
|ppn 4&lt;br /&gt;
|4.64&lt;br /&gt;
|1.69&lt;br /&gt;
|----&lt;br /&gt;
|8&lt;br /&gt;
|32&lt;br /&gt;
|ppn 8&lt;br /&gt;
|6.32&lt;br /&gt;
|2.30&lt;br /&gt;
|----&lt;br /&gt;
|8&lt;br /&gt;
|64&lt;br /&gt;
|ppn 1&lt;br /&gt;
|2.79&lt;br /&gt;
|0.51&lt;br /&gt;
|----&lt;br /&gt;
|8&lt;br /&gt;
|64&lt;br /&gt;
|ppn 2&lt;br /&gt;
|5.59&lt;br /&gt;
|1.02&lt;br /&gt;
|----&lt;br /&gt;
|8&lt;br /&gt;
|64&lt;br /&gt;
|ppn 4&lt;br /&gt;
|8.25&lt;br /&gt;
|1.50&lt;br /&gt;
|----&lt;br /&gt;
|8&lt;br /&gt;
|64&lt;br /&gt;
|ppn 8&lt;br /&gt;
|10.71&lt;br /&gt;
|1.95&lt;br /&gt;
|----&lt;br /&gt;
&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Simulations with both 2 and 4 ranks per node offered efficiencies comparable to the other full 64-process jobs in this section. The combination of 32 ranks per node and multithreading was not as effective as the usual 16 ranks per node with ppn 4, as above. A simulation with ppn=64 and 1 rank per node was not successful, and the ppn=32 job required the use of PMEPencils.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|Ranks&lt;br /&gt;
|Cores&lt;br /&gt;
|NAMD Config Options&lt;br /&gt;
|ns/day&lt;br /&gt;
|Efficiency&lt;br /&gt;
|----&lt;br /&gt;
|2&lt;br /&gt;
|32&lt;br /&gt;
|ppn 32&lt;br /&gt;
|6.35&lt;br /&gt;
|2.29&lt;br /&gt;
|----&lt;br /&gt;
|2&lt;br /&gt;
|32&lt;br /&gt;
|ppn 32 (only with PMEPencils 8)&lt;br /&gt;
|7.80&lt;br /&gt;
|1.95&lt;br /&gt;
|----&lt;br /&gt;
|4&lt;br /&gt;
|32&lt;br /&gt;
|ppn 16&lt;br /&gt;
|6.30&lt;br /&gt;
|2.31&lt;br /&gt;
|----&lt;br /&gt;
|4&lt;br /&gt;
|32&lt;br /&gt;
|ppn 16&lt;br /&gt;
|10.73&lt;br /&gt;
|1.42&lt;br /&gt;
|----&lt;br /&gt;
|32&lt;br /&gt;
|32&lt;br /&gt;
|ppn 1, +CmiNoProcForComThread&lt;br /&gt;
|4.45&lt;br /&gt;
|1.62&lt;br /&gt;
|----&lt;br /&gt;
|32&lt;br /&gt;
|32&lt;br /&gt;
|ppn 2, +CmiNoProcForComThread&lt;br /&gt;
|6.26&lt;br /&gt;
|2.27&lt;br /&gt;
|----&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
PME pencils offered minimal improvements when selected appropriately. &lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|Ranks&lt;br /&gt;
|Cores&lt;br /&gt;
|NAMD Config Options&lt;br /&gt;
|ns/day&lt;br /&gt;
|Efficiency&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|32&lt;br /&gt;
|ppn 4, pmePencils 4&lt;br /&gt;
|2.88&lt;br /&gt;
|1.05&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|32&lt;br /&gt;
|ppn 4, pmePencils 16&lt;br /&gt;
|6.34&lt;br /&gt;
|2.31&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|32&lt;br /&gt;
|ppn 4, pmePencils 20&lt;br /&gt;
|6.35&lt;br /&gt;
|2.31&lt;br /&gt;
|----&lt;br /&gt;
|16&lt;br /&gt;
|32&lt;br /&gt;
|ppn 4, pmePencils 24&lt;br /&gt;
|6.24&lt;br /&gt;
|2.27&lt;br /&gt;
|----&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Documentation ==&lt;br /&gt;
&lt;br /&gt;
# [http://www.ks.uiuc.edu/Research/namd/2.9/ug/node39.html NAMD 2.9 User Guide]&lt;br /&gt;
# [http://www.ks.uiuc.edu/Research/namd/wiki/?NamdPerformanceTuning NAMD Performance Tuning Wiki]&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=User_Python_virtualenv&amp;diff=9409</id>
		<title>User Python virtualenv</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=User_Python_virtualenv&amp;diff=9409"/>
		<updated>2018-08-31T23:32:38Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Using Python Virtualenv ==&lt;br /&gt;
&lt;br /&gt;
Virtual environments (&amp;quot;virtualenvs&amp;quot; for short) are the standard way in Python to create isolated Python environments. This is useful when certain modules, or certain versions of modules, are not available in the default Python environment. First load a python module:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
module load intel/13.1.1 python/2.7.5&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Then install virtualenv:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
pip install virtualenv --user&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Then create a directory for the virtual environments. One can put a virtual environment anywhere, but this directory structure is recommended:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
mkdir ~/.virtualenvs&lt;br /&gt;
cd ~/.virtualenvs&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Now we create our first virtualenv, called ''myenv'' (choose any name you like):&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
~/.local/bin/virtualenv myenv&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
After that we can activate that virtual environment:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
source ~/.virtualenvs/myenv/bin/activate&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To go back to the normal Python installation, simply type &amp;quot;deactivate&amp;quot;.&lt;br /&gt;
While the virtualenv is active, you can simply type ''pip install &amp;lt;required module&amp;gt;'' to install any module into your virtual environment.&lt;br /&gt;
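&lt;br /&gt;
For example (a minimal sketch; the package name is just an illustration, and any pip-installable package works the same way):&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
source ~/.virtualenvs/myenv/bin/activate&lt;br /&gt;
pip install networkx               # hypothetical example package&lt;br /&gt;
python -c &amp;quot;import networkx&amp;quot;        # quick check that it imports&lt;br /&gt;
deactivate&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;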
&lt;br /&gt;
===Installing the scientific python suite===&lt;br /&gt;
&lt;br /&gt;
For many scientific codes the packages ''numpy'', ''scipy'', ''matplotlib'', ''pandas'' and ''ipython'' are used. Except for ''scipy'', all of these install very simply in a virtualenv using ''pip install &amp;lt;package name&amp;gt;''. &lt;br /&gt;
&lt;br /&gt;
For installing ''scipy'' we first need to install ''numpy'' and then copy SciNet's ''site.cfg'' (which contains the paths to the libraries required by scipy). &lt;br /&gt;
So before doing &amp;quot;pip install scipy&amp;quot;, do &lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
cp /scinet/gpc/tools/Python/Python275-shared-intel/lib/python2.7/site-packages/numpy/distutils/site.cfg ~/.virtualenvs/myenv/lib/python2.7/site-packages/numpy/distutils/&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
and then just do ''pip install scipy''.&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=HPSS-by-pomes&amp;diff=9408</id>
		<title>HPSS-by-pomes</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=HPSS-by-pomes&amp;diff=9408"/>
		<updated>2018-08-31T23:32:18Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
= '''Packing up large data sets and putting them on HPSS''' =&lt;br /&gt;
(Pomés group recommendations)&lt;br /&gt;
&lt;br /&gt;
HPSS has the following limitations:&lt;br /&gt;
&lt;br /&gt;
* Hundreds of thousands of small files can be offloaded rapidly, but take weeks or months to recall&lt;br /&gt;
* No individual file can exceed 1 TB&lt;br /&gt;
* You must verify the integrity of your data throughout the data preparation and offload process&lt;br /&gt;
With these limitations in mind, we have developed the following protocol for efficiently offloading data from scratch to HPSS.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== 1. Identify the subdirectories that contain &amp;gt; 1,000 files. ==&lt;br /&gt;
a. Create a directory called DU/ and place the following script in that directory:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# du.sh&lt;br /&gt;
for i in $(ls ../); do&lt;br /&gt;
    n=$(find ../$i |wc -l)&lt;br /&gt;
    s=$(du -hs ../$i  | awk '{print $1}')&lt;br /&gt;
    echo &amp;quot;$i $n $s&amp;quot;&lt;br /&gt;
done &amp;gt; my.du.dirs&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
b. chmod +x du.sh&lt;br /&gt;
&lt;br /&gt;
c. nohup ./du.sh &amp;amp;&lt;br /&gt;
(This step may require hours or days to complete)&lt;br /&gt;
&lt;br /&gt;
d. Now my.du.dirs will contain a listing of the number of files and the total size of each directory.&lt;br /&gt;
&lt;br /&gt;
e. Identify the directories with many files and copy the DU/ directory there and then run du.sh again. Continue this process until you have a good understanding of which directories actually contain large numbers of files.&lt;br /&gt;
&lt;br /&gt;
== 2. Create tar files for these directories. ==&lt;br /&gt;
a. This should be scripted to ensure that your tarballs are completely written.&lt;br /&gt;
&lt;br /&gt;
b. Never script the removal of the original files.&lt;br /&gt;
&lt;br /&gt;
c. Here is an example script:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
for i in dir1 dir2 dir3; do&lt;br /&gt;
    tar -cf ${i}.tar ${i}&lt;br /&gt;
    echo &amp;quot;tar $i returned $?&amp;quot;&lt;br /&gt;
done &amp;gt; my.tar.results&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
d. Note that the evaluation of $? must be done on the very next command after tar. Even inserting an additional echo statement will break the test.&lt;br /&gt;
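&lt;br /&gt;
If other commands must run between the tar and the report, capture the return code immediately (a minimal sketch):&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
tar -cf ${i}.tar ${i}&lt;br /&gt;
rc=$?   # capture right away; any intervening command overwrites $?&lt;br /&gt;
echo &amp;quot;finished packing ${i}&amp;quot;&lt;br /&gt;
echo &amp;quot;tar $i returned $rc&amp;quot;&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;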
&lt;br /&gt;
e. Once you are sure that the tar command was successful (return code equals zero), you should delete the originals. You should not script the deletion process, because a typo in an rm -rf command can be very costly. If you must script a removal, it is best to do it like this:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
mkdir TRASH; for i in $list; do mv $i TRASH; done&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
then inspect TRASH and remove it manually. Note that even this approach can be costly if you make a mistake, since files with the same name will overwrite each other in TRASH. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== 3. Now upload to HPSS using HSI. ==&lt;br /&gt;
You should have less than 10,000 files per TB of uploaded data. If that is not the case, then go back and pack up your data some more before proceeding.&lt;br /&gt;
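&lt;br /&gt;
A rough way to check this ratio for a directory (a sketch using GNU find and du; run it from a datamover or devel node):&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
nfiles=$(find $SCRATCH/mydata | wc -l)&lt;br /&gt;
ntb=$(du -sB1T $SCRATCH/mydata | awk '{print $1}')   # size rounded up to whole TB&lt;br /&gt;
echo &amp;quot;$nfiles files in ${ntb} TB&amp;quot;&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;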
&lt;br /&gt;
a. It is recommended that you have a new directory structure in HPSS for your fully packed up data. This is because you may have put other things on HPSS in the past and the creation of a clean directory structure is a good way to denote that this data is the final copy. Here, we use FULL_DATA/&lt;br /&gt;
&lt;br /&gt;
b. An example of an HPSS offload script follows:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH -t 72:00:00&lt;br /&gt;
#SBATCH -p archivelong &lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J offload&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
&lt;br /&gt;
## scratch files:  $SCRATCH/mydata&lt;br /&gt;
## HPSS files:     $ARCHIVE/FULL_DATA/mydata&lt;br /&gt;
&lt;br /&gt;
trap &amp;quot;echo 'Job script not completed';exit 129&amp;quot; TERM INT&lt;br /&gt;
&lt;br /&gt;
/usr/local/bin/hsi  -v &amp;lt;&amp;lt;EOF1&lt;br /&gt;
mkdir -p FULL_DATA&lt;br /&gt;
cd FULL_DATA&lt;br /&gt;
cput -Rpuh $SCRATCH/mydata&lt;br /&gt;
end&lt;br /&gt;
EOF1&lt;br /&gt;
&lt;br /&gt;
status=$?&lt;br /&gt;
if [ $status -ne 0 ]; then&lt;br /&gt;
    echo 'HSI returned non-zero code.'&lt;br /&gt;
    /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
    exit $status&lt;br /&gt;
else&lt;br /&gt;
    echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
trap - TERM INT&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
c. After the above script has completed, check the output to ensure that your transfer was successful. If you had errors, or if it timed out, simply run the script again. If you continue to get the same errors, contact support@scinet.utoronto.ca&lt;br /&gt;
&lt;br /&gt;
== 4. Check HPSS data against the original ==&lt;br /&gt;
Now you must retrieve your data back to scratch so that you can check it against the original copy on scratch, which we have not yet deleted.&lt;br /&gt;
&lt;br /&gt;
a. Run diskUsage to ensure that you have space in your allocation to recall the data to scratch. If the recall will bring you close to your limit, advise your other group members how much space you will be recalling in case another user is also planning a large data recall.&lt;br /&gt;
&lt;br /&gt;
b. An example of an HPSS recall script follows:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH -t 72:00:00&lt;br /&gt;
#SBATCH -p archivelong &lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J recall&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
&lt;br /&gt;
## original scratch files:  $SCRATCH/mydata&lt;br /&gt;
## HPSS files:     $ARCHIVE/FULL_DATA/mydata&lt;br /&gt;
## new copy scratch files:  $SCRATCH/RETRIEVED_MODULES/FULL_DATA/mydata&lt;br /&gt;
&lt;br /&gt;
mkdir -p $SCRATCH/RETRIEVED_MODULES/FULL_DATA&lt;br /&gt;
&lt;br /&gt;
trap &amp;quot;echo 'Job script not completed';exit 129&amp;quot; TERM INT&lt;br /&gt;
&lt;br /&gt;
/usr/local/bin/hsi  -v &amp;lt;&amp;lt;EOF1&lt;br /&gt;
lcd $SCRATCH/RETRIEVED_MODULES/FULL_DATA/&lt;br /&gt;
cget -Rpuh $ARCHIVE/FULL_DATA/mydata&lt;br /&gt;
end&lt;br /&gt;
EOF1&lt;br /&gt;
&lt;br /&gt;
status=$?&lt;br /&gt;
if [ $status -ne 0 ]; then&lt;br /&gt;
    echo 'HSI returned non-zero code.'&lt;br /&gt;
    /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
    exit $status&lt;br /&gt;
else&lt;br /&gt;
    echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
trap - TERM INT&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== 5. Run an md5sum over the entire directory ==&lt;br /&gt;
Now that you have the original and the cput/cget copy back from HPSS, run an md5sum over the entire directory. An example of such a check follows. Run this script under nohup from one of the datamover nodes:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
&lt;br /&gt;
COMPUTE_DIFFERENCES=1&lt;br /&gt;
dir=$(pwd)&lt;br /&gt;
WAS=$SCRATCH/mydata&lt;br /&gt;
IS=$SCRATCH/RETRIEVED_MODULES/FULL_DATA/mydata&lt;br /&gt;
if ((COMPUTE_DIFFERENCES)); then&lt;br /&gt;
    cd $WAS&lt;br /&gt;
    find . &amp;gt; ${dir}/tmp.was&lt;br /&gt;
    echo &amp;quot;find on was returned $?&amp;quot;&lt;br /&gt;
&lt;br /&gt;
    cd $IS&lt;br /&gt;
    find . &amp;gt; ${dir}/tmp.is&lt;br /&gt;
    echo &amp;quot;find on is returned $?&amp;quot;&lt;br /&gt;
&lt;br /&gt;
    cd $dir&lt;br /&gt;
&lt;br /&gt;
    sort tmp.was &amp;gt;a ; sort tmp.is &amp;gt;b&lt;br /&gt;
    changes=$(diff a b)&lt;br /&gt;
    # diff produces text, not a number, so test whether its output is non-empty&lt;br /&gt;
    if [ -n &amp;quot;$changes&amp;quot; ]; then&lt;br /&gt;
        echo &amp;quot;FILES DIFFER! diff output follows:&amp;quot;&lt;br /&gt;
        echo &amp;quot;$changes&amp;quot;&lt;br /&gt;
        exit 1&lt;br /&gt;
    fi&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
for i in $(cat tmp.was); do&lt;br /&gt;
    if [ -f ${WAS}/${i} ]; then&lt;br /&gt;
        was=$(md5sum ${WAS}/$i |awk '{print $1}')&lt;br /&gt;
        is=$(md5sum ${IS}/$i |awk '{print $1}')&lt;br /&gt;
        same=$(echo $was $is | awk '{if($1==$2) print 1; else print 0}')&lt;br /&gt;
        if ((same==0)); then&lt;br /&gt;
            echo &amp;quot;FILES DIFFER -- $i $was $is&amp;quot;&lt;br /&gt;
        else&lt;br /&gt;
            echo &amp;quot;OK for $i&amp;quot;&lt;br /&gt;
        fi&lt;br /&gt;
    fi&lt;br /&gt;
done&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
a. When that is done, grep for DIFFER in the output (in nohup.out, since you ran this script under nohup), as shown below. Any matching line means there is a problem; contact SciNet support.&lt;br /&gt;
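&lt;br /&gt;
For example:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
grep DIFFER nohup.out&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;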
&lt;br /&gt;
b. If everything was a success, you can delete all of the copies that you recalled from HPSS to scratch. You can also delete your original copy in scratch if you would like, as you now have a complete copy on SciNet HPSS.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Data Management|BACK TO Data Management]]&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=R_Statistical_Package&amp;diff=9407</id>
		<title>R Statistical Package</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=R_Statistical_Package&amp;diff=9407"/>
		<updated>2018-08-31T23:30:55Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Running R on the GPC==&lt;br /&gt;
[http://www.r-project.org/ R] is powerful statistical and plotting software available on the [[GPC_Quickstart|GPC]] in the [[Software_and_Libraries|module]] R.  In fact, there are currently six R modules installed, 2.13.1, 2.14.1, 2.15.1, 3.0.0, 3.0.1 and 3.1.1.  While 2.15.1 is the default, we do recommend making the transition to a newer version, which you load by specifying the version number explicitly:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load intel R/3.0.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
(The intel module is a prerequisite for the R module.)  If you will be using Rmpi, you will need to load the openmpi module as well.&lt;br /&gt;
&lt;br /&gt;
Many optional packages are available for R which add functionality for specific domains; they are available through the [http://cran.r-project.org/mirrors.html Comprehensive R Archive Network (CRAN)]. &lt;br /&gt;
&lt;br /&gt;
R provides an easy way for users to install the packages they need in their home directories rather than having them installed system-wide; since there are so many optional packages that R users could want, we recommend that users who need additional packages proceed this way.  This is almost certainly the easiest way to deal with the wide range of packages, ensure they're up to date, and ensure that users' package choices don't conflict. &lt;br /&gt;
&lt;br /&gt;
In general, you can install those that you need yourself in your home directory; eg, &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ R &lt;br /&gt;
&amp;gt; install.packages(&amp;quot;package-name&amp;quot;, dependencies = TRUE)&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
will download and compile the source for the packages you need in your home directory under &amp;lt;tt&amp;gt;${HOME}/R/x86_64-unknown-linux-gnu-library/2.11/&amp;lt;/tt&amp;gt; (you can specify another directory with a lib= option).  Then take a look at help(&amp;quot;.libPaths&amp;quot;) to make sure that R knows where to look for the packages you've compiled. Note that you must install packages while logged into a development node, as write access to the library folder is not available from the standard compute nodes. &lt;br /&gt;
&lt;br /&gt;
Note that during the installation you may get warnings that the packages cannot be installed in e.g. /scinet/gpc/Applications/R/3.0.1/lib64/R/bin/. But after those messages, R should have succeeded in installing the package into your home directory.&lt;br /&gt;
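&lt;br /&gt;
If R does not find your personal library automatically, you can point it there explicitly before starting R (a minimal sketch; the version subdirectory depends on which R module you loaded):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ export R_LIBS=${HOME}/R/x86_64-unknown-linux-gnu-library/3.0&lt;br /&gt;
$ R&lt;br /&gt;
&amp;gt; .libPaths()   # your personal library should now be listed&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;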
&lt;br /&gt;
=== Running serial R jobs ===&lt;br /&gt;
&lt;br /&gt;
As with all serial jobs, if your R computations do not use multiple cores, you should bundle them up so that all 8 cores of a node are performing work.  Examples of this can be found on the [[User_Serial]] page; a minimal sketch is shown below.&lt;br /&gt;
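&lt;br /&gt;
The sketch below assumes eight independent scripts, job1.R through job8.R (hypothetical names):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#PBS -l nodes=1:ppn=8,walltime=1:00:00&lt;br /&gt;
#PBS -N serialR&lt;br /&gt;
cd $PBS_O_WORKDIR&lt;br /&gt;
module load intel R/3.0.1&lt;br /&gt;
# run eight serial R jobs at once, one per core&lt;br /&gt;
for i in $(seq 1 8); do&lt;br /&gt;
    ${SCINET_R_BIN}/Rscript job${i}.R &amp;gt; job${i}.out 2&amp;gt;&amp;amp;1 &amp;amp;&lt;br /&gt;
done&lt;br /&gt;
wait   # do not let the job script exit until all runs have finished&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;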
&lt;br /&gt;
== Saving images from R in compute jobs ==&lt;br /&gt;
&lt;br /&gt;
To make use of the graphics capability of R, R insists on having an X server running, even if you're just writing images to a file.  There is no X server on the compute nodes, so you'd get a message like&lt;br /&gt;
&lt;br /&gt;
 unable to open connection to X11 display ''&lt;br /&gt;
&lt;br /&gt;
To get around this issue, you can run a 'virtual' X server on the compute nodes by adding the following commands at the start of your job script:&lt;br /&gt;
&lt;br /&gt;
 # Make virtual X server command called Xvfb available:&lt;br /&gt;
 module load Xlibraries&lt;br /&gt;
  &lt;br /&gt;
 # Select a unique display number:&lt;br /&gt;
 let DISPLAYNUM=$UID%65274&lt;br /&gt;
 export DISPLAY=&amp;quot;:$DISPLAYNUM&amp;quot;&lt;br /&gt;
  &lt;br /&gt;
 # Start the virtual X server&lt;br /&gt;
 Xvfb $DISPLAY -fp $SCINET_FONTPATH -ac 2&amp;gt;/dev/null &amp;amp;&lt;br /&gt;
&lt;br /&gt;
After this, run R or Rscript as usual. The virtual X server will be running in the background and will get killed when your job is done. Alternatively, you may want to kill it explicitly at the end of your job script using &lt;br /&gt;
&lt;br /&gt;
 # Kill any remaining Xvfb server&lt;br /&gt;
 pkill -u $UID Xvfb&lt;br /&gt;
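&lt;br /&gt;
Putting these pieces together, a complete job script might look as follows (a minimal sketch; ''myplots.R'' is a placeholder for your own R script):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#PBS -l nodes=1:ppn=8,walltime=1:00:00&lt;br /&gt;
#PBS -N Rplots&lt;br /&gt;
cd $PBS_O_WORKDIR&lt;br /&gt;
module load intel R/3.0.1 Xlibraries&lt;br /&gt;
let DISPLAYNUM=$UID%65274&lt;br /&gt;
export DISPLAY=&amp;quot;:$DISPLAYNUM&amp;quot;&lt;br /&gt;
Xvfb $DISPLAY -fp $SCINET_FONTPATH -ac 2&amp;gt;/dev/null &amp;amp;&lt;br /&gt;
${SCINET_R_BIN}/Rscript --no-restore myplots.R&lt;br /&gt;
pkill -u $UID Xvfb&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;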
&lt;br /&gt;
== Rmpi (R with MPI) ==&lt;br /&gt;
&lt;br /&gt;
All the newer R installations on the GPC have Rmpi installed by default using OpenMPI.  Be sure to load the OpenMPI module if you wish to use Rmpi.&lt;br /&gt;
&lt;br /&gt;
=== Installing Rmpi, version 2.13.1 ===&lt;br /&gt;
&lt;br /&gt;
Version 2.13.1 does not have the Rmpi library as a standard package, which means you have to install it yourself if you are using that version.  The same is true if you want to use IntelMPI instead of OpenMPI.  &lt;br /&gt;
&lt;br /&gt;
Installing the Rmpi package can be a bit challenging, since some additional parameters need to be given to the installation, which contain the path to various header files and libraries. These paths differ depending on what [[GPC_MPI_Versions|MPI version]] you are using.   &lt;br /&gt;
&lt;br /&gt;
The various MPI versions on the GPC are loaded with the [[Software_and_Libraries|module]] command. So the first thing to do is to decide what mpi version to use (openmpi or intelmpi), and to type the corresponding &amp;quot;module load&amp;quot; command on the command-line (as well as in your jobs scripts).&lt;br /&gt;
&lt;br /&gt;
Because the MPI modules define all the paths in environment variables, the following command seems to work for installations with all openmpi versions.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;gt; install.packages(&amp;quot;Rmpi&amp;quot;,&lt;br /&gt;
                   configure.args =&lt;br /&gt;
                   c(paste(&amp;quot;--with-Rmpi-include=&amp;quot;,Sys.getenv(&amp;quot;SCINET_MPI_INC&amp;quot;),sep=&amp;quot;&amp;quot;),&lt;br /&gt;
                     paste(&amp;quot;--with-Rmpi-libpath=&amp;quot;,Sys.getenv(&amp;quot;SCINET_MPI_LIB&amp;quot;),sep=&amp;quot;&amp;quot;),&lt;br /&gt;
                     &amp;quot;--with-Rmpi-type=OPENMPI&amp;quot;))&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For intelmpi, you only need to change &amp;lt;tt&amp;gt;OPENMPI&amp;lt;/tt&amp;gt; to &amp;lt;tt&amp;gt;MPICH2&amp;lt;/tt&amp;gt; in the last line.&lt;br /&gt;
&lt;br /&gt;
=== Running Rmpi ===&lt;br /&gt;
&lt;br /&gt;
To start using R with Rmpi, make sure you have all required modules loaded (e.g. &amp;lt;tt&amp;gt;module load intel openmpi R/2.14.1&amp;lt;/tt&amp;gt;), then launch it with&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ mpirun -np 1 R --no-save&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
which starts one master mpi process, but starts up the infrastructure to be able to spawn additional processes.&lt;br /&gt;
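&lt;br /&gt;
The same invocation works inside a batch job (a minimal sketch; ''myRmpiScript.R'' is a placeholder for your own Rmpi script):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#PBS -l nodes=2:ppn=8,walltime=1:00:00&lt;br /&gt;
#PBS -N Rmpi&lt;br /&gt;
cd $PBS_O_WORKDIR&lt;br /&gt;
module load intel openmpi R/2.14.1&lt;br /&gt;
mpirun -np 1 R --no-save &amp;lt; myRmpiScript.R&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;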
&lt;br /&gt;
== Creating an R cluster ==&lt;br /&gt;
&lt;br /&gt;
The 'parallel' package allows you to use R to launch individual serial subjobs across multiple nodes.  This section describes how this is accomplished.&lt;br /&gt;
&lt;br /&gt;
=== Creating your Rscript wrapper ===&lt;br /&gt;
&lt;br /&gt;
The first thing to do is create a wrapper for Rscript.  This needs to be done because the R module needs to be loaded on all nodes, but the submission script only loads modules on the head node of the job.  The wrapper script, let's call it MyRscript.sh, is short:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
&lt;br /&gt;
module load intel/13.1.1 R/3.0.1&lt;br /&gt;
${SCINET_R_BIN}/Rscript --no-restore &amp;quot;$@&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
The &amp;quot;--no-restore&amp;quot; flag prevents Rscript from loading your &amp;quot;workspace image&amp;quot;, if you have one saved.  Loading the image causes problems for the cluster.&lt;br /&gt;
&lt;br /&gt;
Once you've created your wrapper, make it executable:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ chmod u+x MyRscript.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
Your wrapper is now ready to be used.&lt;br /&gt;
&lt;br /&gt;
=== The cluster R code ===&lt;br /&gt;
The R code which we will run consists of two parts, the code which launches the cluster, and does pre- and post-analysis, and the code which is run on the individual cluster &amp;quot;nodes&amp;quot;.  Here is some code which demonstrates this functionality.  Let's call it MyClusterCode.R.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
######################################################&lt;br /&gt;
#&lt;br /&gt;
#  worker code&lt;br /&gt;
#&lt;br /&gt;
&lt;br /&gt;
# first define the function which will be run on all the cluster nodes.  This is just a test function.  &lt;br /&gt;
# Put your real worker code here.&lt;br /&gt;
testfunc &amp;lt;- function(a) {&lt;br /&gt;
&lt;br /&gt;
  # this part is just to waste time&lt;br /&gt;
  b &amp;lt;- 0&lt;br /&gt;
  for (i in 1:10000) {&lt;br /&gt;
      b &amp;lt;- b + 1&lt;br /&gt;
  }&lt;br /&gt;
&lt;br /&gt;
  s &amp;lt;- Sys.info()['nodename']&lt;br /&gt;
  return(paste0(s, &amp;quot; &amp;quot;, a[1], &amp;quot; &amp;quot;, a[2]))&lt;br /&gt;
&lt;br /&gt;
}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
######################################################&lt;br /&gt;
#&lt;br /&gt;
#  head node code&lt;br /&gt;
#&lt;br /&gt;
&lt;br /&gt;
# Create a bunch of index pairs to feed to the worker function.  These could be parameters,&lt;br /&gt;
# or whatever your code needs to vary across jobs.  Note that the worker function only &lt;br /&gt;
# takes a single argument; each entry in the list must contain all the information &lt;br /&gt;
# that the function needs to run.  In this example, each entry contains a list which&lt;br /&gt;
# contains two pieces of information, a pair of indices.&lt;br /&gt;
indexlist &amp;lt;- list()&lt;br /&gt;
index &amp;lt;- 1&lt;br /&gt;
for (i in 1:10) {&lt;br /&gt;
  for (j in 1:10) {&lt;br /&gt;
     indexlist[index] &amp;lt;- list(c(i,j))&lt;br /&gt;
     index &amp;lt;- index +1&lt;br /&gt;
   }&lt;br /&gt;
}&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
# Now set up the cluster.&lt;br /&gt;
&lt;br /&gt;
# First load the parallel library.&lt;br /&gt;
library(parallel)&lt;br /&gt;
&lt;br /&gt;
# Next find all the nodes which the scheduler has given to us.&lt;br /&gt;
# These are listed in the file which is indicated by the PBS_NODEFILE&lt;br /&gt;
# environment variable.&lt;br /&gt;
nodefile &amp;lt;- Sys.getenv(&amp;quot;PBS_NODEFILE&amp;quot;)&lt;br /&gt;
hostnames &amp;lt;- readLines(nodefile)&lt;br /&gt;
&lt;br /&gt;
# Now launch the cluster, using the list of nodes and our Rscript&lt;br /&gt;
# wrapper.&lt;br /&gt;
cl &amp;lt;- makePSOCKcluster(names = hostnames, rscript = &amp;quot;/path/to/your/MyRscript.sh&amp;quot;)&lt;br /&gt;
&lt;br /&gt;
# Now run the worker code, using the parameter list we created above.&lt;br /&gt;
result &amp;lt;- clusterApplyLB(cl, indexlist, testfunc)&lt;br /&gt;
&lt;br /&gt;
# The results of all the jobs will now be put in the 'result' variable,&lt;br /&gt;
# in the order they were specified in the 'indexlist' variable.&lt;br /&gt;
&lt;br /&gt;
# Don't forget to stop the cluster when you're finished.&lt;br /&gt;
stopCluster(cl)&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
You can, of course, add any post-processing code you need to the above code.&lt;br /&gt;
&lt;br /&gt;
=== Submitting an R cluster job ===&lt;br /&gt;
You are now ready to submit your job to the GPC queue.  The submission script is like most others:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#PBS -l nodes=3:ppn=8                                                         &lt;br /&gt;
#PBS -l walltime=5:00:00&lt;br /&gt;
#PBS -N MyRCluster&lt;br /&gt;
&lt;br /&gt;
cd $PBS_O_WORKDIR&lt;br /&gt;
module load intel/13.1.1 R/3.0.1&lt;br /&gt;
${SCINET_R_BIN}/Rscript --no-restore MyClusterCode.R&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
Be sure to use whatever number of nodes, length of time, etc., is appropriate for your job.&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Phi&amp;diff=9406</id>
		<title>Phi</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Phi&amp;diff=9406"/>
		<updated>2018-08-31T23:29:26Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
{{Infobox Computer&lt;br /&gt;
|image=[[Image:Xeon_phi.jpg|center|250px|thumb]][[Image:NVIDIA-Tesla-K20X.jpg|center|250px|thumb]]&lt;br /&gt;
|name=Intel Xeon Phi / NVIDIA Tesla K20 &lt;br /&gt;
|installed=April 2013&lt;br /&gt;
|operatingsystem= Linux Centos 6.4&lt;br /&gt;
|loginnode= gravity01 &lt;br /&gt;
|nnodes=1&lt;br /&gt;
|rampernode=32 GB&lt;br /&gt;
|corespernode=8 with Xeon Phi &amp;amp; K20&lt;br /&gt;
|interconnect=DDR Infiniband&lt;br /&gt;
|vendorcompilers=nvcc,pgcc,icc,gcc&lt;br /&gt;
|queuetype=none&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
This is a single test node, for investigating new accelerator technologies. It consists of a single x86_64 node with one 8-core Intel Sandybridge Xeon  &lt;br /&gt;
E5-2650 2.0GHz CPU with 32GB of RAM. It has a single NVIDIA Tesla K20 GPU with CUDA Capability 3.0 (Kepler) with 2496 CUDA Cores and 5 GB of RAM as well as a single Intel Xeon Phi 3120A with 57 &lt;br /&gt;
1.1 GHz cores and 6GB of RAM. The node is interconnected to the rest of the clusters with DDR Infiniband and mounts the regular SciNet GPFS filesystems.  &lt;br /&gt;
&lt;br /&gt;
=== Login ===&lt;br /&gt;
&lt;br /&gt;
First login via ssh with your scinet account at '''&amp;lt;tt&amp;gt;login.scinet.utoronto.ca&amp;lt;/tt&amp;gt;''', and from there you can proceed to '''&amp;lt;tt&amp;gt;gravity01&amp;lt;/tt&amp;gt;'''.&lt;br /&gt;
&lt;br /&gt;
=== Queue ===&lt;br /&gt;
&lt;br /&gt;
As this is a single node, users are expected to use it in a &amp;quot;friendly&amp;quot; manner. The system is not set up for production&lt;br /&gt;
usage but primarily for investigating new technologies, so run times are limited to under 4 hours.&lt;br /&gt;
To access the node you need to use the queue, similar to the standard ARC and GPC compute nodes,&lt;br /&gt;
but with a maximum walltime of 4 hours.&lt;br /&gt;
&lt;br /&gt;
For an interactive job use&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qsub -l nodes=1:ppn=8,walltime=1:00:00 -q arcX -I&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Software ==&lt;br /&gt;
&lt;br /&gt;
The same software installed on the GPC is available on '''&amp;lt;tt&amp;gt;arcX&amp;lt;/tt&amp;gt;''' using the modules framework. &lt;br /&gt;
See '''[[GPC_Quickstart#Modules_and_Environment_Variables | here]]''' for full details.&lt;br /&gt;
&lt;br /&gt;
== NVIDIA Tesla K20 ==&lt;br /&gt;
&lt;br /&gt;
See the '''[[ Gravity | Gravity ]]''' wiki page for full details of the available CUDA and OpenCL compilers and modules. To&lt;br /&gt;
use all the K20 (Kepler) features a minimum of CUDA 5.0 is required. Cuda/6.5 is recommended for the K20.&lt;br /&gt;
&lt;br /&gt;
=== CUDA ===&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/4.8.1 cuda/6.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
Here, gcc is loaded because it is a prerequisite of the cuda module.&lt;br /&gt;
&lt;br /&gt;
You will have to let the cuda compiler know about the capabilities of the Kepler graphics card by supplying the flag &lt;br /&gt;
&amp;lt;tt&amp;gt;-arch=sm_30&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;-arch=sm_35&amp;lt;/tt&amp;gt;.&lt;br /&gt;
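&lt;br /&gt;
For example (a sketch; ''saxpy.cu'' stands in for your own CUDA source file):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
nvcc -arch=sm_35 -O3 -o saxpy saxpy.cu&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;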
&lt;br /&gt;
=== Driver Version ===&lt;br /&gt;
&lt;br /&gt;
The current NVIDIA driver version for the K20 is 340.32&lt;br /&gt;
&lt;br /&gt;
== Xeon Phi ==&lt;br /&gt;
&lt;br /&gt;
=== Compilers ===&lt;br /&gt;
The Xeon Phi uses the standard Intel compilers, but requires at least version 13.1:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load intel/14.0.0 &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== MPI ===&lt;br /&gt;
&lt;br /&gt;
IntelMPI also has Xeon Phi support:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load intelmpi/4.1.1.036&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''NOTE''': Be sure to use '''mpiifort''' for compiling native MIC Fortran code as the '''mpif77,mpif90''' scripts ignore the -mmic flags and will produce host only code.&lt;br /&gt;
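&lt;br /&gt;
For example, to build a native MIC binary from Fortran MPI code (a sketch; ''mycode.f90'' is a placeholder):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
mpiifort -mmic -O2 -o mycode.mic mycode.f90&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;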
&lt;br /&gt;
=== Tools ===&lt;br /&gt;
&lt;br /&gt;
The Intel Cluster Tools, such as VTune Amplifier and Inspector, are available for the Xeon Phi by&lt;br /&gt;
loading the following module:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load inteltools&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== OpenCL ===&lt;br /&gt;
&lt;br /&gt;
OpenCL version 1.2 is available for the Xeon Phi on '''&amp;lt;tt&amp;gt;arcX&amp;lt;/tt&amp;gt;''' under&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
/opt/intel/opencl&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Direct Access ===&lt;br /&gt;
&lt;br /&gt;
The Xeon Phi can be accessed directly from the host node by &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ssh mic0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Shared Filesystem ===&lt;br /&gt;
&lt;br /&gt;
The host node '''arc09''' mounts the standard SciNet filesystems, i.e. $HOME and $SCRATCH; however, to share&lt;br /&gt;
files between the host and the Xeon Phi, use /localscratch/$HOME, which shows up as $HOME on &amp;quot;mic0&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
=== Useful Links ===&lt;br /&gt;
&lt;br /&gt;
[http://software.intel.com/en-us/articles/building-a-native-application-for-intel-xeon-phi-coprocessors Building Native for MIC ] &lt;br /&gt;
&lt;br /&gt;
[http://www.tacc.utexas.edu/user-services/user-guides/stampede-user-guide#mic TACC Stampede MIC Info ]&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Using_Signals&amp;diff=9405</id>
		<title>Using Signals</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Using_Signals&amp;diff=9405"/>
		<updated>2018-08-31T23:28:56Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==General==&lt;br /&gt;
In unix/linux one can send ''signals'' to a program or a script. Some&lt;br /&gt;
common signals are:&lt;br /&gt;
{|&lt;br /&gt;
 ! {{Hl2}}|Signal !! !! !! !! {{Hl2}}| Meaning !! !! !! !! {{Hl2}}| Can be trapped?&lt;br /&gt;
 |-&lt;br /&gt;
 | HUP  || || || || Terminal log-out || || || ||Yes&lt;br /&gt;
 |-&lt;br /&gt;
 | INT  || || || || Interrupt signal (Ctrl-C is pressed)  || || || ||Yes&lt;br /&gt;
 |- &lt;br /&gt;
 | TERM || || || || Termination of the process (as if 'kill' was called) || || || || Yes&lt;br /&gt;
 |- &lt;br /&gt;
 | KILL || || || || Kill the process (as if 'kill -9' was called) || || || || No&lt;br /&gt;
 |- &lt;br /&gt;
 | STOP || || || || Suspend the process || || || ||No&lt;br /&gt;
 |- &lt;br /&gt;
 | CONT || || || || Resume the process || || || ||Yes&lt;br /&gt;
 |- &lt;br /&gt;
 | USR1 || || || || Unspecified user signal (can be given with 'kill -USR1') || || || ||Yes&lt;br /&gt;
 |-&lt;br /&gt;
 | USR2 || || || || Unspecified user signal (can be given with 'kill -USR2') || || || ||Yes&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
From the node on which a process is running, signals can be given using &lt;br /&gt;
&amp;lt;pre&amp;gt; $ kill -[SIGNAL] [PID]&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
On the GPC, you can pass a signal to a running job using&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
 $ qsig -s [SIGNAL] [JOBID]&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
So for example, you could suspend the process running under job id 314159 by &amp;lt;tt&amp;gt;qsig -s STOP 314159&amp;lt;/tt&amp;gt;, and resume it using &amp;lt;tt&amp;gt;qsig -s CONT 314159&amp;lt;/tt&amp;gt; (note that the wall clock keeps ticking!).&lt;br /&gt;
&lt;br /&gt;
For qsig to work for signals whose default action is to kill your job (such as HUP, INT, TERM, and, perhaps surprisingly, USR1 and USR2), your job script should either trap these explicitly (see below), or contain a line &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
 trap : [LIST-OF-SIGNALS-TO-BE-PASSED] &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
(e.g. &amp;lt;tt&amp;gt;trap : USR1 USR2&amp;lt;/tt&amp;gt;) before the application that traps the signal is started.&lt;br /&gt;
Note that the spaces surrounding the colon are essential here, because the colon is actually a bash command (one that does nothing).&lt;br /&gt;
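&lt;br /&gt;
A minimal job-script sketch (''myapp'' stands in for an application that installs its own USR1/USR2 handlers):&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#PBS -l nodes=1:ppn=8,walltime=1:00:00&lt;br /&gt;
cd $PBS_O_WORKDIR&lt;br /&gt;
trap : USR1 USR2   # keep the job script alive; the signal still reaches the application&lt;br /&gt;
./myapp&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;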
&lt;br /&gt;
Signals can also be given by the system. One important instance of system signals is that '''on SciNet, a TERM signal is sent to a job whose requested time is over,''' after which the job has about 2 minutes to clean up. &lt;br /&gt;
&lt;br /&gt;
With the techniques explained below, a program or script can be set up to listen for a particular kind of signal, except for KILL and STOP (which always kill and suspend a process, respectively). When it receives a signal of that kind, its execution is interrupted and a call is made to a function specified&lt;br /&gt;
earlier in the program. It is up to the program or script what this signal-handling function does, but it is a good idea to make the action appropriate for the event that triggers the signal. For instance, a TERM signal should be handled as a request to terminate the application. &lt;br /&gt;
The user signals USR1 and USR2 do not have a pre-designated meaning and can be used for application-specific actions such as checkpointing.&lt;br /&gt;
&lt;br /&gt;
==Trapping signals in bash scripts==&lt;br /&gt;
&lt;br /&gt;
To trap signals in a bash script, one has to bind a specific&lt;br /&gt;
signal to a command or function with the &amp;lt;tt&amp;gt;trap&amp;lt;/tt&amp;gt;&lt;br /&gt;
command. For example,&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;  &lt;br /&gt;
#!/bin/bash&lt;br /&gt;
trap &amp;quot;echo Term was trapped.; exit&amp;quot; TERM&lt;br /&gt;
for ((i=0;i&amp;lt;60;i++)) do&lt;br /&gt;
  echo $i&lt;br /&gt;
  sleep 1&lt;br /&gt;
done&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
Running this script in the background and sending it a TERM command ('&amp;lt;tt&amp;gt;kill -TERM [pid]&amp;lt;/tt&amp;gt;') will cause the message '&amp;lt;tt&amp;gt;Term was trapped.&amp;lt;/tt&amp;gt;' to be printed. Notes:&lt;br /&gt;
* The trapped command has to use '&amp;lt;tt&amp;gt;exit&amp;lt;/tt&amp;gt;' explicitly to stop the script's execution.  &lt;br /&gt;
* The '&amp;lt;tt&amp;gt;sleep&amp;lt;/tt&amp;gt;' command in bash cannot be interrupted, which is why the script contains a succession of 1 second '&amp;lt;tt&amp;gt;sleep&amp;lt;/tt&amp;gt;'s.&lt;br /&gt;
&lt;br /&gt;
Another useful example of signal trapping in a bash script is given on the wiki page about &lt;br /&gt;
[[User_Ramdisk | using ramdisk]].&lt;br /&gt;
&lt;br /&gt;
==Trapping signals in C==&lt;br /&gt;
&lt;br /&gt;
To trap signals in a c program, one has to include the&lt;br /&gt;
&amp;lt;tt&amp;gt;signal.h&amp;lt;/tt&amp;gt; header file:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;c&amp;quot;&amp;gt;&lt;br /&gt;
#include &amp;lt;signal.h&amp;gt;&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
and provide a signal handler function&lt;br /&gt;
&amp;lt;source lang=&amp;quot;c&amp;quot;&amp;gt;&lt;br /&gt;
void term_trap(int sig) &lt;br /&gt;
{&lt;br /&gt;
   /* do something */&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
which is linked to the specific signal somewhere as follows:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;c&amp;quot;&amp;gt;&lt;br /&gt;
signal(SIGTERM, term_trap);&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
Note that the names of signals are prepended with &amp;quot;&amp;lt;tt&amp;gt;SIG&amp;lt;/tt&amp;gt;&amp;quot; in &lt;br /&gt;
&amp;lt;tt&amp;gt;signal.h&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
A minimal example:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;c&amp;quot;&amp;gt;&lt;br /&gt;
#include &amp;lt;stdio.h&amp;gt;&lt;br /&gt;
#include &amp;lt;signal.h&amp;gt;&lt;br /&gt;
#include &amp;lt;unistd.h&amp;gt;&lt;br /&gt;
&lt;br /&gt;
void term_trap(int sig) &lt;br /&gt;
{&lt;br /&gt;
    printf(&amp;quot;Term was trapped.\n&amp;quot;);&lt;br /&gt;
}&lt;br /&gt;
&lt;br /&gt;
int main() &lt;br /&gt;
{&lt;br /&gt;
    signal(SIGTERM, term_trap);&lt;br /&gt;
    sleep(60);&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
Note that the names of the signals are prepended with &amp;lt;tt&amp;gt;SIG&amp;lt;/tt&amp;gt; in C.&lt;br /&gt;
To test this program:&lt;br /&gt;
* save the above code as &amp;lt;tt&amp;gt;sigex.c&amp;lt;/tt&amp;gt;&lt;br /&gt;
* compile: &amp;lt;tt&amp;gt;icc -O3 -xHost sigex.c -o sigex&amp;lt;/tt&amp;gt;&lt;br /&gt;
* run: &amp;lt;tt&amp;gt;sigex&amp;amp;&amp;lt;/tt&amp;gt; &lt;br /&gt;
* note the pid (process identifier)&lt;br /&gt;
* you can then give the process the term signal with &amp;lt;tt&amp;gt;kill -TERM [pid]&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Trapping signals in C++==&lt;br /&gt;
&lt;br /&gt;
The same method as for a C program works in C++, except the header file is '&amp;lt;tt&amp;gt;csignal&amp;lt;/tt&amp;gt;'. For example:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;c&amp;quot;&amp;gt;&lt;br /&gt;
#include &amp;lt;iostream&amp;gt;&lt;br /&gt;
#include &amp;lt;csignal&amp;gt;&lt;br /&gt;
#include &amp;lt;unistd.h&amp;gt;&lt;br /&gt;
&lt;br /&gt;
void term_trap(int sig) &lt;br /&gt;
{&lt;br /&gt;
    std::cout &amp;lt;&amp;lt; &amp;quot;Term was trapped.\n&amp;quot;;&lt;br /&gt;
}&lt;br /&gt;
&lt;br /&gt;
int main() &lt;br /&gt;
{&lt;br /&gt;
    signal(SIGTERM, term_trap);&lt;br /&gt;
    sleep(60);&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Trapping signals in Fortran==&lt;br /&gt;
&lt;br /&gt;
Unfortunately, signal handling in fortran is not standard, and different compilers have different ways of doing this. &lt;br /&gt;
Below, examples are given for the three fortran compilers available on SciNet: ifort, gfortran, and xlf. These examples were designed such that even though the ways to register the handler differ, the signal handlers themselves are always the same.&lt;br /&gt;
&lt;br /&gt;
'''''ifort'''''&lt;br /&gt;
&amp;lt;source lang=&amp;quot;fortran&amp;quot;&amp;gt;&lt;br /&gt;
C IFORT SIGNAL TRAPPING EXAMPLE&lt;br /&gt;
      USE IFPORT&lt;br /&gt;
      EXTERNAL TRAP_TERM&lt;br /&gt;
      INTEGER  TRAP_TERM&lt;br /&gt;
      INTEGER  ERR&lt;br /&gt;
          ERR = SIGNAL (SIGTERM, TRAP_TERM, -1)&lt;br /&gt;
          CALL SLEEP (60)&lt;br /&gt;
      END&lt;br /&gt;
C SIGNAL HANDLER FUNCTION&lt;br /&gt;
      FUNCTION TRAP_TERM (SIG_NUM)&lt;br /&gt;
      INTEGER TRAP_TERM&lt;br /&gt;
      INTEGER SIG_NUM&lt;br /&gt;
          PRINT *, &amp;quot;Term was trapped.&amp;quot;&lt;br /&gt;
          TRAP_TERM = 1&lt;br /&gt;
      END&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''''gfortran'''''&lt;br /&gt;
&amp;lt;source lang=&amp;quot;fortran&amp;quot;&amp;gt;&lt;br /&gt;
C GFORTRAN SIGNAL TRAPPING EXAMPLE&lt;br /&gt;
      INTRINSIC SIGNAL&lt;br /&gt;
      INTEGER SIGTERM&lt;br /&gt;
      PARAMETER (SIGTERM = 15)&lt;br /&gt;
      EXTERNAL TRAP_TERM&lt;br /&gt;
          CALL SIGNAL (SIGTERM, TRAP_TERM)&lt;br /&gt;
          CALL SLEEP (60)&lt;br /&gt;
      END&lt;br /&gt;
C SIGNAL HANDLER FUNCTION&lt;br /&gt;
      FUNCTION TRAP_TERM (SIG_NUM)&lt;br /&gt;
      INTEGER TRAP_TERM&lt;br /&gt;
      INTEGER SIG_NUM&lt;br /&gt;
          PRINT *, &amp;quot;Term was trapped.&amp;quot;&lt;br /&gt;
          TRAP_TERM = 1&lt;br /&gt;
      END&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
Note: the SIGNAL function is broken in the official gfortran versions 4.6.0 and 4.6.1, but this bug has been fixed in SciNet's gcc modules.&lt;br /&gt;
&lt;br /&gt;
'''''xlf'''''&lt;br /&gt;
&amp;lt;source lang=&amp;quot;fortran&amp;quot;&amp;gt;&lt;br /&gt;
C XLF SIGNAL TRAPPING EXAMPLE&lt;br /&gt;
      INCLUDE 'fexcp.h'&lt;br /&gt;
      INTEGER SIGTERM&lt;br /&gt;
      PARAMETER (SIGTERM = 15)&lt;br /&gt;
      EXTERNAL TRAP_TERM&lt;br /&gt;
          CALL SIGNAL (SIGTERM, TRAP_TERM)&lt;br /&gt;
          CALL SLEEP (60)&lt;br /&gt;
      END&lt;br /&gt;
C SIGNAL HANDLER FUNCTION&lt;br /&gt;
      FUNCTION TRAP_TERM (SIG_NUM)&lt;br /&gt;
      INTEGER TRAP_TERM&lt;br /&gt;
      INTEGER SIG_NUM&lt;br /&gt;
          PRINT *, &amp;quot;Term was trapped.&amp;quot;&lt;br /&gt;
          TRAP_TERM = 1&lt;br /&gt;
      END&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
--Rzon 30 June 2010&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Performance_And_Debugging_Tools:_GPC&amp;diff=9404</id>
		<title>Performance And Debugging Tools: GPC</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Performance_And_Debugging_Tools:_GPC&amp;diff=9404"/>
		<updated>2018-08-31T23:28:19Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Memory Profiling ==&lt;br /&gt;
&lt;br /&gt;
===PurifyPlus===&lt;br /&gt;
This is IBM/Rational's set of tools for memory profiling and code testing.  Specifically, the tools are:  purify, purecov and quantify.&lt;br /&gt;
&lt;br /&gt;
In order to gain access to these tools, you need to source the script that defines them to your shell:&lt;br /&gt;
&lt;br /&gt;
  &amp;lt;tt&amp;gt; source /scinet/gpc/Rational/purifyplus_setup.sh &amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The documentation for getting started with these tools is in the directory:&lt;br /&gt;
&lt;br /&gt;
 &amp;lt;tt&amp;gt; /scinet/gpc/Rational/releases/PurifyPlus.7.0.1/docs/pdf &amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
or here: [http://support.scinet.utoronto.ca/Manuals/PurifyPlusGettingStarted_en.pdf PurifyPlus Getting Started].&lt;br /&gt;
&lt;br /&gt;
===Valgrind===&lt;br /&gt;
&lt;br /&gt;
Valgrind (http://valgrind.org/) is a suite of tools for debugging and profiling programs, especially&lt;br /&gt;
useful for finding memory problems such as memory leaks and segfaults. To use it on the GPC&lt;br /&gt;
you must first load the valgrind module:&lt;br /&gt;
&lt;br /&gt;
 &amp;lt;tt&amp;gt; module load valgrind &amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
In serial, valgrind can simply be run as follows, with no need to recompile your binary:&lt;br /&gt;
&lt;br /&gt;
 &amp;lt;tt&amp;gt;valgrind --tool=memcheck ./a.out  &amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
There are many useful flags, such as --leak-check=yes and --show-reachable=yes, which&lt;br /&gt;
can be found by running valgrind --help or by consulting the man pages.&lt;br /&gt;
&lt;br /&gt;
Valgrind can also be used in parallel with MPI (see http://valgrind.org/docs/manual/mc-manual.html#mc-manual.mpiwrap)&lt;br /&gt;
in a similar fashion; however, a wrapper library needs to be preloaded first.&lt;br /&gt;
&lt;br /&gt;
 &amp;lt;tt&amp;gt; LD_PRELOAD=${SCINET_VALGRIND_LIB}/valgrind/libmpiwrap-amd64-linux.so mpirun -np 2 valgrind --tool=memcheck ./a.out &amp;lt;/tt&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Besides being extremely good at [[Performance_And_Debugging_Tools:_GPC#Memory_Profiling | finding memory problems]], [http://valgrind.org/ valgrind] comes with a tool called [http://valgrind.org/docs/manual/cg-manual.html cachegrind] which can find cache use problems in your code; its use is described in our [[Introduction_To_Performance#cachegrind_.28Memory_use_analysis:_GPC.29 | Intro To Performance]].&lt;br /&gt;
&lt;br /&gt;
==Debugging==&lt;br /&gt;
&lt;br /&gt;
===gdb===&lt;br /&gt;
&lt;br /&gt;
[http://www.gnu.org/software/gdb/ GDB] is a solid source-level serial debugger.  [https://support.scinet.utoronto.ca/wiki/images/2/27/Snugdebug.pdf This SNUG TechTalk from Nov 2010] introduces debugging with gdb.    gdb will debug code compiled with both the intel and gcc compilers.   Use &amp;lt;tt&amp;gt;module load gdb&amp;lt;/tt&amp;gt; to ensure you are using the most recent version of gdb.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;tt&amp;gt;gdb -tui&amp;lt;/tt&amp;gt; launches a text-based interface that functions a bit like a gui for debugging code, and can be handy.&lt;br /&gt;
&lt;br /&gt;
===idbc===&lt;br /&gt;
&lt;br /&gt;
idb is the Intel debugger; it will work with code compiled with both the intel and gcc compilers.   Use &amp;lt;tt&amp;gt;module load intel&amp;lt;/tt&amp;gt; to ensure the debugger is in your path.   The command-line debugger is launched by invoking &amp;lt;tt&amp;gt;idbc&amp;lt;/tt&amp;gt;; launching &amp;lt;tt&amp;gt;idb&amp;lt;/tt&amp;gt; will start a graphical, [http://www.eclipse.org/ Eclipse]-based environment, which can be quite slow over a network connection.   If you nonetheless want to use the graphical interface, ensure that you have Java in your environment; use &amp;lt;tt&amp;gt;module load java&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
===ddd===&lt;br /&gt;
&lt;br /&gt;
[http://www.gnu.org/software/ddd/ Data Display Debugger (ddd)] is a graphical wrapper to the gdb or idb debuggers.  It is quite handy for debugging scientific programs as it has fairly sophisticated data-plotting features where array variables can be plotted dynamically as the program progresses.  As with &amp;lt;tt&amp;gt;idb&amp;lt;/tt&amp;gt;, however, using the graphical interface over a network connection can be quite laggy and slow.   Use &amp;lt;tt&amp;gt;module load ddd&amp;lt;/tt&amp;gt; to load ddd into your environment.&lt;br /&gt;
&lt;br /&gt;
===ddt===&lt;br /&gt;
&lt;br /&gt;
ddt is Allinea's graphical parallel debugger, in the ddt module. Highly recommended! Use &amp;lt;tt&amp;gt;module load ddt&amp;lt;/tt&amp;gt; to load ddt into your environment.&lt;br /&gt;
&lt;br /&gt;
==Performance Profiling==&lt;br /&gt;
&lt;br /&gt;
===gprof===&lt;br /&gt;
&lt;br /&gt;
[http://sourceware.org/binutils/docs-2.19/gprof/index.html gprof] is a very useful tool for finding out where a program is spending its time; its use is described in our [[Introduction_To_Performance#gprof_.28profiling:_everywhere.29 | Intro To Performance]].&lt;br /&gt;
&lt;br /&gt;
===Open|SpeedShop===&lt;br /&gt;
&lt;br /&gt;
[http://www.openspeedshop.org OpenSpeedshop] is a tool for performing sampling and MPI tracing of a program on the GPC.  Its use is outlined in our [[Introduction_To_Performance#OpenSpeedShop_.28profiling.2C_MPI_tracing:_GPC.29 | Intro to Performance]].   It is currently compiled for use only with openmpi and gcc-4.4; to use it,&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc openmpi openspeedshop&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To run a sampling experiment with the program, run the program through openspeedshop then use the same program to view the results:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ openss -f &amp;quot;./a.out&amp;quot; pcsamp&lt;br /&gt;
[program runs as usual, outputs extra performance data at the end]&lt;br /&gt;
$ openss -f [experimentname].openss&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
It can also trace MPI calls and give detailed statistics about the time spent in MPI routines by process number; to test this on a run done on one of the devel nodes,&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load gcc openmpi/1.3.2-gcc-v4.4.0-ofed openspeedshop&lt;br /&gt;
$ openss -f &amp;quot;mpirun -np 6 ./a.out&amp;quot; mpit&lt;br /&gt;
[program runs as usual, outputs extra performance data at the end]&lt;br /&gt;
$ openss -f [experimentname].openss&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This can also be used to perform tracing on batch jobs by ensuring &amp;lt;tt&amp;gt;module load gcc openmpi/1.3.2-gcc-v4.4.0-ofed openspeedshop&amp;lt;/tt&amp;gt; is in your &amp;lt;tt&amp;gt;.bashrc&amp;lt;/tt&amp;gt; and mpirun-ing the program as shown above.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Scalasca===&lt;br /&gt;
&lt;br /&gt;
[http://www.scalasca.org/ Scalasca] is a sophisticated tool for analyzing performance and finding common performance problems.  We describe it in our [[Introduction_To_Performance#Scalasca_.28profiling.2C_tracing:_TCS.2C_GPC.29 | Intro to Performance]].&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Ansys&amp;diff=9403</id>
		<title>Ansys</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Ansys&amp;diff=9403"/>
		<updated>2018-08-31T23:27:31Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
The [http://www.ansys.com/ Ansys] engineering simulation tools are installed on SciNet.&lt;br /&gt;
&lt;br /&gt;
=Getting a license=&lt;br /&gt;
&lt;br /&gt;
Licenses are provided by [http://www.cmc.ca/ CMC Microsystems]. Canadian students and faculty&lt;br /&gt;
can register at https://www.cmc.ca/en/MyAccount/GetAccount.aspx&lt;br /&gt;
&lt;br /&gt;
Once you have an account, you have to contact CMC and tell them you want to use the Ansys tools on SciNet,&lt;br /&gt;
and give them your SciNet username.&lt;br /&gt;
&lt;br /&gt;
=Ansys 18.1=&lt;br /&gt;
&lt;br /&gt;
 module load ANSYS/18.1&lt;br /&gt;
&lt;br /&gt;
Programs available:&lt;br /&gt;
&lt;br /&gt;
* fluent&lt;br /&gt;
* ansysedt&lt;br /&gt;
* mapdl&lt;br /&gt;
* ...&lt;br /&gt;
&lt;br /&gt;
==Running Fluent==&lt;br /&gt;
&lt;br /&gt;
Example submission script for a job running on 4 nodes, with max walltime of 24 hours:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#PBS -l nodes=4:ppn=8,walltime=24:00:00&lt;br /&gt;
&lt;br /&gt;
module load ANSYS/18.1 extras&lt;br /&gt;
&lt;br /&gt;
# DIRECTORY TO RUN - $PBS_O_WORKDIR is directory job was submitted from&lt;br /&gt;
cd $PBS_O_WORKDIR&lt;br /&gt;
&lt;br /&gt;
INPUTFILE=input.jou&lt;br /&gt;
fluent 2ddp -t &amp;quot;$PBS_NP&amp;quot; -cnf=&amp;quot;$PBS_NODEFILE&amp;quot; -mpi=intel -pib -pcheck -g -i &amp;quot;$INPUTFILE&amp;quot;&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Niagara&amp;diff=9402</id>
		<title>Niagara</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Niagara&amp;diff=9402"/>
		<updated>2018-08-31T23:27:13Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Information on the Niagara supercomputer can be found on&lt;br /&gt;
https://docs.computecanada.ca/wiki/Niagara/en.&lt;br /&gt;
&lt;br /&gt;
Les informations sur le superordinateur Niagara se trouvent sur&lt;br /&gt;
https://docs.computecanada.ca/wiki/Niagara/fr.&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=GPU_Devel_Nodes&amp;diff=9401</id>
		<title>GPU Devel Nodes</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=GPU_Devel_Nodes&amp;diff=9401"/>
		<updated>2018-08-31T23:26:46Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;span style=&amp;quot;color:#772222&amp;quot;&amp;gt;The ARC GPU nodes have been decommissioned. The head node, arc01, is still up; however, for GPU computations, users are encouraged to move to the [[Gravity]] cluster.  For visualization, new [[Visualization Nodes]] are being set up.&amp;lt;/span&amp;gt;&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Using_MySQL_on_the_GPC&amp;diff=9400</id>
		<title>Using MySQL on the GPC</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Using_MySQL_on_the_GPC&amp;diff=9400"/>
		<updated>2018-08-31T23:26:09Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
''TechTalk by Ramses van Zon (SciNet)''&lt;br /&gt;
&lt;br /&gt;
''Oct 16, 2013''&lt;br /&gt;
&lt;br /&gt;
MySQL is a relational database management system (RDBMS) that runs as a server providing multi-user access to a number of databases. It is often used by free-software/open-source projects requiring an RDBMS. Running a single relational database server for all of SciNet is not a good fit, as the needs of our many users could easily conflict and uptime of the server could not always be guaranteed.&lt;br /&gt;
&lt;br /&gt;
It is, however, possible to set up MySQL in user space, using a few homebrewed scripts to help resolve some of the common issues with running multiple database servers. In this TechTalk, we will show how you can use MySQL in user space on the General Purpose Cluster for short tests on the devel node, within a single-node compute job, or even spanning multiple nodes.&lt;br /&gt;
&lt;br /&gt;
''Note: this is the content of the slides used in the TechTalk. This is also available in [[Media:Ttmysql.pdf|pdf format]].''&lt;br /&gt;
&lt;br /&gt;
=== Data storage types ===&lt;br /&gt;
&lt;br /&gt;
Data in all shapes and forms:&amp;lt;br /&amp;gt; &amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Standard binary data (hdf5, netcdf, grib2, jpg, pdb, …)&lt;br /&gt;
* Standard/semi-structured text data (xml, json, csv, fasta)&lt;br /&gt;
* Proprietary binary data format&lt;br /&gt;
* Unstructured text data (logs, output and error files)&lt;br /&gt;
* '''Databases'''&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
=== Databases ===&lt;br /&gt;
&lt;br /&gt;
'''Database:''' an organized collection of data for storing, managing and retrieving information&lt;br /&gt;
&lt;br /&gt;
* Often back-end to a website or application.&lt;br /&gt;
* Typically structured as tables.&lt;br /&gt;
* Each row expresses a ''relation'' between columns.&lt;br /&gt;
* SQL: Structured Query Language.&lt;br /&gt;
* Implementations often have access control on databases and tables.&lt;br /&gt;
&lt;br /&gt;
Exception to this structure: ''NoSQL''. Not in this TechTalk.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Why use a relational database? ===&lt;br /&gt;
&lt;br /&gt;
* You like/are familiar with SQL.&lt;br /&gt;
* Application relies on a RDBMS.&lt;br /&gt;
* You want to be able express complex queries, such as:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote&amp;gt;List the directories of stored simulations in which the pressure was between 1 and 2 atm and the number of water molecules was between 4,000 and 50,000.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;SELECT B.directory FROM simulations_attributes A &lt;br /&gt;
INNER JOIN simulation_paths B WHERE&lt;br /&gt;
A.simulation_id=B.simulation_id AND &lt;br /&gt;
A.pressure&amp;gt;=1 AND A.pressure&amp;lt;=2 AND &lt;br /&gt;
A.molecules&amp;gt;=4000 AND A.molecules&amp;lt;=50000;&amp;lt;/source&amp;gt;&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
Note that you rarely store your binary data in the DB,&amp;lt;br /&amp;gt;but you could store paths and filenames with metadata.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Some Relational Database Management Systems ===&lt;br /&gt;
&lt;br /&gt;
* Commercial:&lt;br /&gt;
** Oracle Database&lt;br /&gt;
** IBM DB2&lt;br /&gt;
** Microsoft SQL Server&lt;br /&gt;
** SAP Sybase&lt;br /&gt;
* Open Source:&lt;br /&gt;
** postgreSQL&lt;br /&gt;
** SQLite&lt;br /&gt;
** '''MySQL'''&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== What is MySQL? ===&lt;br /&gt;
&lt;br /&gt;
* Open-source RDBMS&lt;br /&gt;
* Was owned by MySQL AB, now by Oracle&lt;br /&gt;
* Often used by Free-software/open-software projects requiring RDBMS&lt;br /&gt;
* Command-line interface (&amp;lt;code&amp;gt;mysql&amp;lt;/code&amp;gt;)&lt;br /&gt;
* Interfaces for C, python, php, perl, R, …&lt;br /&gt;
* Third party graphical interfaces&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Typical MySQL Deployment ===&lt;br /&gt;
&lt;br /&gt;
* Database server runs as root on some server&lt;br /&gt;
* A ’root’ user can set up privileges on the databases and tables&lt;br /&gt;
* Application/web service runs on a client (sometimes server=client) and contacts the server as a non-root user through a port or socket.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Why This Setup Doesn’t Work on SciNet: ===&lt;br /&gt;
&lt;br /&gt;
* Don’t want to maintain a database server accessible to 1000+ users:&lt;br /&gt;
** '''Scalability:''' Many nodes may try to access DB&lt;br /&gt;
** '''Conflicts:''' Users may run same application, access the same databases/tables.&lt;br /&gt;
** '''Security:''' What if privileges are not set correctly?&lt;br /&gt;
** '''Stability:''' If DB goes down, or becomes slow, all jobs go with it.&lt;br /&gt;
** '''File systems:''' RDBMS may not play nice with file system (iops-heavy).&lt;br /&gt;
* Users cannot start root processes or set up privileges.&lt;br /&gt;
&lt;br /&gt;
'''Luckily it is possible to install MySQL in user space.'''&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== User space MySQL Deployment ===&lt;br /&gt;
&lt;br /&gt;
* Database server runs as user on some machine.&lt;br /&gt;
* Creates its own ’root’ user, sets up privileges on databases and tables&lt;br /&gt;
* ''This is a database user called ’root’, which does not have, and should not have, anything at all to do with the Linux root user.''&lt;br /&gt;
* Application runs on another or the same machine&lt;br /&gt;
* Port numbers and sockets must be assigned properly to avoid conflicts with other users’ databases.&lt;br /&gt;
* Application contacts this server as a non-’root’ user through the properly assigned port or socket.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== SciNet’s MySQL Module on the GPC ===&lt;br /&gt;
&lt;br /&gt;
* MySQL user deployment is a bit of a pain, and would mean a lot of duplication of effort.&lt;br /&gt;
* So we’ve done that already, and have written a few scripts to help you.&lt;br /&gt;
* They are in the ’mysql’ module.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br /&amp;gt;'''Let’s demonstrate how this works…'''&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Basic Example ===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;$ module load mysql&lt;br /&gt;
$ mysqlsetup&lt;br /&gt;
Creating directory (tree) $SCRATCH/.mysql-5.6.12/default for configuration files&lt;br /&gt;
Creating directory $SCRATCH/mysqldata/default for databases&lt;br /&gt;
Creating local configuration&lt;br /&gt;
Initializing local mysql database&lt;br /&gt;
Enter mysql root password:&lt;br /&gt;
Confirm mysql root password: &lt;br /&gt;
Starting mysql service&lt;br /&gt;
131011 15:05:31 mysqld_safe Logging to '$SCRATCH/mysqldata/default/gpc-f103n084.err'.&lt;br /&gt;
131011 15:05:31 mysqld_safe Starting mysqld daemon with databases from $SCRATCH/mysqldata/default&lt;br /&gt;
Setting password&lt;br /&gt;
Cleaning up&lt;br /&gt;
Stopping mysql service&lt;br /&gt;
131011 15:05:36 mysqld_safe mysqld from pid file $SCRATCH/mysqldata/default/gpc-f103n084.pid ended&lt;br /&gt;
$&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Next, starting the server:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;$ module load mysql&lt;br /&gt;
$ mysqlsetup&lt;br /&gt;
...&lt;br /&gt;
$ mysqlstart&lt;br /&gt;
131011 15:09:39 mysqld_safe Logging to '$SCRATCH/mysqldata/default/gpc-f103n084.err'.&lt;br /&gt;
131011 15:09:39 mysqld_safe Starting mysqld daemon with databases from $SCRATCH/mysqldata/default&lt;br /&gt;
$ mysql -u root -p&lt;br /&gt;
Enter password:&lt;br /&gt;
Welcome to the MySQL monitor.  Commands end with ; or \g.&lt;br /&gt;
...&lt;br /&gt;
mysql&amp;gt; \q&lt;br /&gt;
$ mysqlstop&lt;br /&gt;
131011 15:18:34 mysqld_safe mysqld from pid file $SCRATCH/mysqldata/default/gpc-f103n084.pid ended&lt;br /&gt;
$ &amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''What happened?'''&lt;br /&gt;
&lt;br /&gt;
* Stuff was set up on scratch, so we can use this on compute nodes too.&lt;br /&gt;
* Set up the root password&lt;br /&gt;
* Directories created:&lt;br /&gt;
** &amp;lt;code&amp;gt;$SCRATCH/.mysql-5.6.12/default&amp;lt;/code&amp;gt;: contains configuration files&lt;br /&gt;
** &amp;lt;code&amp;gt;$SCRATCH/mysqldata/default&amp;lt;/code&amp;gt;: contains database files&lt;br /&gt;
* Configuration is such that non-default sockets and ports are used: multiple users can use this module simultaneously on devel nodes.&lt;br /&gt;
* Tip: do not use the root user; create a new user, as in the sketch below.&lt;br /&gt;
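&lt;br /&gt;
A minimal sketch of creating and using a non-root user (the &amp;lt;code&amp;gt;simuser&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;simdb&amp;lt;/code&amp;gt; and password names here are hypothetical, not part of the module):&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;$ mysql -u root -p&lt;br /&gt;
mysql&amp;gt; CREATE DATABASE simdb;&lt;br /&gt;
mysql&amp;gt; CREATE USER 'simuser'@'%' IDENTIFIED BY 'some_password';&lt;br /&gt;
mysql&amp;gt; GRANT ALL PRIVILEGES ON simdb.* TO 'simuser'@'%';&lt;br /&gt;
mysql&amp;gt; FLUSH PRIVILEGES;&lt;br /&gt;
mysql&amp;gt; \q&lt;br /&gt;
$ mysql -u simuser -p simdb&amp;lt;/source&amp;gt;&lt;br /&gt;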
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Using Profiles ===&lt;br /&gt;
&lt;br /&gt;
'''Noticed the &amp;lt;code&amp;gt;default&amp;lt;/code&amp;gt; in the above?'''&lt;br /&gt;
&lt;br /&gt;
* It’s possible to have other configurations, using different directories. This is required if you want to run a database server for each job.&lt;br /&gt;
* Each setup is called a ’profile’.&lt;br /&gt;
* Each profile has its own configuration and data directory.&lt;br /&gt;
* Each profile has its own root password.&lt;br /&gt;
* Data directory can even be on ramdisk!&amp;lt;br /&amp;gt;(but you’ll have to copy it if you want to save your results).&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Profile Example ===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;$ module load mysql&lt;br /&gt;
$ mysqlsetup -P compute&lt;br /&gt;
$ mysqlstart -P compute&lt;br /&gt;
$ export MYSQL_HOME=$MYSQL_HOMES/compute&lt;br /&gt;
$ mysql -u root -p&lt;br /&gt;
mysql&amp;gt; CREATE DATABASE computedb;&lt;br /&gt;
mysql&amp;gt; GRANT ALL PRIVILEGES ON computedb.* TO computeuser;&lt;br /&gt;
mysql&amp;gt; FLUSH PRIVILEGES;&lt;br /&gt;
mysql&amp;gt; \q&lt;br /&gt;
$ mysqlstop -P compute&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Ramdisk Example ===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;$ mkdir -p /dev/shm/rzon/mysqldata&lt;br /&gt;
$ mysqlsetup -P ramdisk -D /dev/shm/rzon/mysqldata&lt;br /&gt;
$ mysqlstart -P ramdisk&lt;br /&gt;
$ mysqlstart -P default&lt;br /&gt;
$ mysqlstatus&lt;br /&gt;
PROFILE        PID&lt;br /&gt;
ramdisk        5243&lt;br /&gt;
default        5740&lt;br /&gt;
$ export MYSQL_HOME=$MYSQL_HOMES/ramdisk&lt;br /&gt;
$ mysql -u root -p&lt;br /&gt;
$ export MYSQL_HOME=$MYSQL_HOMES/default&lt;br /&gt;
$ mysql -u root -p&lt;br /&gt;
$ mysqlstop -P ramdisk&lt;br /&gt;
$ cd /dev/shm/rzon/mysqldata&lt;br /&gt;
$ tar cf $SCRATCH/ramdiskdb.tar .&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Next run can just do:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;$ module load mysql&lt;br /&gt;
$ mkdir -p /dev/shm/rzon/mysqldata&lt;br /&gt;
$ cd /dev/shm/rzon/mysqldata&lt;br /&gt;
$ tar xpf $SCRATCH/ramdiskdb.tar &lt;br /&gt;
$ mysqlstart -P ramdisk&lt;br /&gt;
$ export MYSQL_HOME=$MYSQL_HOMES/ramdisk&lt;br /&gt;
$ ...&lt;br /&gt;
$ mysqlstop -P ramdisk&lt;br /&gt;
$ tar cf $SCRATCH/ramdiskdb.tar -C /dev/shm/rzon/mysqldata .&amp;lt;/source&amp;gt;&lt;br /&gt;
Note that if there are several runs at the same time, you’ll have to use different tar files on the file system, and may have to figure out how to merge the databases afterwards.&lt;br /&gt;
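&lt;br /&gt;
One way to consolidate the results afterwards, sketched here under the assumption that each run wrote to its own database (say &amp;lt;code&amp;gt;run1db&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;run2db&amp;lt;/code&amp;gt;, …), is to dump each run’s database to plain SQL with mysqldump and load the dumps into a single server:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;# at the end of each run, with MYSQL_HOME pointing at that run's profile&lt;br /&gt;
$ export MYSQL_HOME=$MYSQL_HOMES/ramdisk&lt;br /&gt;
$ mysqldump -u root -p --databases run1db &amp;gt; $SCRATCH/run1db.sql&lt;br /&gt;
# later, on one server that collects all the results&lt;br /&gt;
$ mysqlstart -P default&lt;br /&gt;
$ export MYSQL_HOME=$MYSQL_HOMES/default&lt;br /&gt;
$ mysql -u root -p &amp;lt; $SCRATCH/run1db.sql&lt;br /&gt;
$ mysql -u root -p &amp;lt; $SCRATCH/run2db.sql&amp;lt;/source&amp;gt;&lt;br /&gt;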
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Multi-node Example ===&lt;br /&gt;
&lt;br /&gt;
A script that uses the mysql database but does not start it:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;#!/bin/bash&lt;br /&gt;
# file: compute.sh&lt;br /&gt;
module load mysql&lt;br /&gt;
export MYSQL_HOME=$MYSQL_HOMES/compute&lt;br /&gt;
export mysqlhost=$1&lt;br /&gt;
export thiscase=$2&lt;br /&gt;
mysql -h $mysqlhost -u computeuser computedb ... $thiscase ...&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
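&lt;br /&gt;
The worker script has to be executable, since the job script invokes it directly:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;$ chmod +x compute.sh&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The job script below then starts the server on the head node of the job and farms the cases out over both nodes with GNU parallel:&lt;br /&gt;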
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;#!/bin/bash&lt;br /&gt;
#PBS -l nodes=2:ppn=8,walltime=24:00:00&lt;br /&gt;
cd $PBS_O_WORKDIR&lt;br /&gt;
module load gnu-parallel/20130422 mysql&lt;br /&gt;
mysqlstart -P compute&lt;br /&gt;
export mysqlhost=$(hostname)&lt;br /&gt;
parallel -j2 --slf $PBS_NODEFILE --workdir $PWD &amp;lt;&amp;lt;EOF&lt;br /&gt;
./compute.sh $mysqlhost 1&lt;br /&gt;
./compute.sh $mysqlhost 2&lt;br /&gt;
./compute.sh $mysqlhost 3&lt;br /&gt;
...&lt;br /&gt;
EOF&lt;br /&gt;
mysqlstop -P compute&lt;br /&gt;
#end of job script&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;$ qsub jobscript&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Summary ===&lt;br /&gt;
&lt;br /&gt;
* The &amp;lt;code&amp;gt;mysql&amp;lt;/code&amp;gt; module allows you to run a mysql server as a user, without bothering other users.&lt;br /&gt;
* Can run multiple mysql servers using profiles (must use different directories).&lt;br /&gt;
* Can run on ramdisk.&lt;br /&gt;
* Can serve a multi-node job.&lt;br /&gt;
* Scripts from the mysql module:&lt;br /&gt;
** &amp;lt;code&amp;gt;mysqlsetup&amp;lt;/code&amp;gt;&lt;br /&gt;
** &amp;lt;code&amp;gt;mysqlstart&amp;lt;/code&amp;gt;&lt;br /&gt;
** &amp;lt;code&amp;gt;mysqlstatus&amp;lt;/code&amp;gt;&lt;br /&gt;
** &amp;lt;code&amp;gt;mysqlstop&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
----&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Accelerator_Research_Cluster&amp;diff=9399</id>
		<title>Accelerator Research Cluster</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Accelerator_Research_Cluster&amp;diff=9399"/>
		<updated>2018-08-31T23:25:00Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
{{Infobox Computer&lt;br /&gt;
|image=[[Image:Tesla S2070 3qtr.gif|center|200px|thumb]]&lt;br /&gt;
|name=Accelerator Research Cluster (ARC)&lt;br /&gt;
|installed=June 2010, April 2011&lt;br /&gt;
|operatingsystem= Linux (Centos 6.2)&lt;br /&gt;
|loginnode= arc01 (from login.scinet)&lt;br /&gt;
|nnodes=8(x86)+4x4(GPU)+14(Cell)&lt;br /&gt;
|interconnect=DDR Infiniband&lt;br /&gt;
}}&lt;br /&gt;
The Accelerator Research Cluster (ARC) is a technology evaluation cluster with a combination of 14 IBM PowerXCell 8i &amp;quot;Cell&amp;quot; nodes and 8 Intel x86_64 &amp;quot;Nehalem&amp;quot; nodes containing 16 NVIDIA M2070 GPUs.&lt;br /&gt;
The QS22s each have two 3.2GHz &amp;quot;IBM PowerXCell 8i&amp;quot; CPUs, where each CPU has 1 Power Processing Unit (PPU) and 8 Synergistic Processing Units (SPUs), and&lt;br /&gt;
32GB of RAM per node. The Intel nodes each have two 2.53GHz quad-core Xeon X5550 CPUs and 48GB of RAM, and each contains two NVIDIA M2070 (Fermi) GPUs with 6GB of RAM apiece.&lt;br /&gt;
&lt;br /&gt;
Please note that this cluster is not a production cluster and is only accessible to selected users. &lt;br /&gt;
&lt;br /&gt;
===Login===&lt;br /&gt;
&lt;br /&gt;
First log in via ssh with your SciNet account at '''&amp;lt;tt&amp;gt;login.scinet.utoronto.ca&amp;lt;/tt&amp;gt;''', and from there you can proceed to '''&amp;lt;tt&amp;gt;arc01&amp;lt;/tt&amp;gt;''', which &lt;br /&gt;
is currently the gateway/devel node for this cluster.&lt;br /&gt;
&lt;br /&gt;
==Compile/Devel/Compute Nodes==&lt;br /&gt;
&lt;br /&gt;
=== [[ Cell_Devel_Nodes | Cell ]] ===&lt;br /&gt;
You can log into any of the 12 nodes '''&amp;lt;tt&amp;gt;blade[03-14]&amp;lt;/tt&amp;gt;''' directly to compile/test/run Cell&lt;br /&gt;
specific or OpenCL codes.&lt;br /&gt;
&lt;br /&gt;
See the [[ Cell_Devel_Nodes | Cell Devel Info ]] page for Cell specific details.&lt;br /&gt;
&lt;br /&gt;
=== [[ GPU_Devel_Nodes |  GPU ]]===&lt;br /&gt;
You can log into the devel node '''&amp;lt;tt&amp;gt;arc01&amp;lt;/tt&amp;gt;''' directly to compile and&lt;br /&gt;
interactively test, and from there submit jobs to the other 7 x86_64/GPU nodes.&lt;br /&gt;
&lt;br /&gt;
See the [[ GPU_Devel_Nodes | GPU Devel Info ]] page for GPU specific details.&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=MemP&amp;diff=9398</id>
		<title>MemP</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=MemP&amp;diff=9398"/>
		<updated>2018-08-31T23:24:28Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
memP is a lightweight parallel heap profiling library developed at Lawrence Livermore National Laboratory (LLNL). It is primarily designed to identify the heap allocation that causes an MPI task to reach its memory-in-use high-water mark (HWM).&lt;br /&gt;
&lt;br /&gt;
== memP Reports ==&lt;br /&gt;
&lt;br /&gt;
'''Summary Report:''' Generated from within MPI_Finalize, this report describes the memory HWM of each task over the run of the application. This can be used to determine which task allocates the most memory and how this compares to the memory of other tasks.&lt;br /&gt;
&lt;br /&gt;
'''Task Report:''' Based on specific criteria, a report can be generated for each task that provides a snapshot of the heap memory currently in use, including the amount allocated at specific call sites.&lt;br /&gt;
&lt;br /&gt;
==Using memP==&lt;br /&gt;
&lt;br /&gt;
Load the memP module:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load memP&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Compile with the recommended BG/Q flags and link your application with the required libraries:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
-Wl,-zmuldefs ${SCINET_LIB_MEMP}&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
mpixlc -g -Wl,-zmuldefs -o myprog myprog.c -L/usr/local/tools/memP/lib -lmemP&lt;br /&gt;
mpixlf77 -g -Wl,-zmuldefs -o myprog myprog.f -L/usr/local/tools/memP/lib -lmemP &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Then run your MPI application as usual; you will see the memP header and trailer it writes to stdout, as well as the output file generated at the end of the run.&lt;br /&gt;
&lt;br /&gt;
==Output Options==&lt;br /&gt;
&lt;br /&gt;
See http://memp.sourceforge.net/ for full details.&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=User_Codes&amp;diff=9397</id>
		<title>User Codes</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=User_Codes&amp;diff=9397"/>
		<updated>2018-08-31T23:23:58Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
__FORCETOC__&lt;br /&gt;
&lt;br /&gt;
==Astrophysics==&lt;br /&gt;
&lt;br /&gt;
===Athena (explicit, uniform grid MHD code)===&lt;br /&gt;
&lt;br /&gt;
[[Image:StrongScalingAthenaGPC.png|thumb|right|320px|Athena scaling on GPC with OpenMPI and MVAPICH2 on GigE, and OpenMPI on InfiniBand]]&lt;br /&gt;
&lt;br /&gt;
[http://www.astro.princeton.edu/~jstone/athena.html Athena] is a straightforward C code which doesn't use many libraries, so it is easy to build and compile on new machines.&lt;br /&gt;
&lt;br /&gt;
It encapsulates its compiler flags, etc. in a &amp;lt;tt&amp;gt;Makeoptions.in&amp;lt;/tt&amp;gt; file, which is then processed by &amp;lt;tt&amp;gt;configure&amp;lt;/tt&amp;gt;.  I've used the following additions to &amp;lt;tt&amp;gt;Makeoptions.in&amp;lt;/tt&amp;gt; on the TCS and GPC:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;make&amp;quot;&amp;gt;&lt;br /&gt;
ifeq ($(MACHINE),scinettcs)&lt;br /&gt;
  CC = mpcc_r&lt;br /&gt;
  LDR = mpcc_r&lt;br /&gt;
  OPT = -O5 -q64 -qarch=pwr6 -qtune=pwr6 -qcache=auto -qlargepage -qstrict&lt;br /&gt;
  MPIINC =&lt;br /&gt;
  MPILIB =&lt;br /&gt;
  CFLAGS = $(OPT)&lt;br /&gt;
  LIB = -ldl -lm&lt;br /&gt;
else&lt;br /&gt;
ifeq ($(MACHINE),scinetgpc)&lt;br /&gt;
  CC = mpicc&lt;br /&gt;
  LDR = mpicc&lt;br /&gt;
  OPT = -O3&lt;br /&gt;
  MPIINC =&lt;br /&gt;
  MPILIB =&lt;br /&gt;
  CFLAGS = $(OPT)&lt;br /&gt;
  LIB = -lm&lt;br /&gt;
else&lt;br /&gt;
...&lt;br /&gt;
endif&lt;br /&gt;
endif&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
It performs quite well on the GPC, scaling extremely well even on a strong scaling test out to about 256 cores (32 nodes) on Gigabit ethernet, and performing beautifully on InfiniBand out to 512 cores (64 nodes). &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
-- [[User:Ljdursi|ljdursi]]  19:20, 13 August 2009 (UTC)&lt;br /&gt;
&lt;br /&gt;
===FLASH3 (Adaptive Mesh reactive hydrodynamics; explict hydro/MHD)===&lt;br /&gt;
&lt;br /&gt;
[[Image:weak-scaling-example.png|thumb|right|320px|Weak scaling test of the 2d sod problem on both the GPC and TCS.  The results are actually somewhat faster on the GPC; in both cases (weak) scaling is very good out at least to 256 cores]]&lt;br /&gt;
&lt;br /&gt;
[http://flash.uchicago.edu FLASH] encapsulates its machine-dependent information in the &amp;lt;tt&amp;gt;FLASH3/sites&amp;lt;/tt&amp;gt; directory.  For the GPC, you'll have to&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load intel&lt;br /&gt;
module load openmpi&lt;br /&gt;
module load hdf5/184-p1-v16-openmpi&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
and with that, the following file (&amp;lt;tt&amp;gt;sites/scinetgpc/Makefile.h&amp;lt;/tt&amp;gt;) works for me:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;make&amp;quot;&amp;gt;&lt;br /&gt;
## Must do module load hdf5/183-v16-openmpi&lt;br /&gt;
HDF5_PATH = ${SCINET_HDF5_BASE}&lt;br /&gt;
ZLIB_PATH = /usr/local&lt;br /&gt;
&lt;br /&gt;
#----------------------------------------------------------------------------&lt;br /&gt;
# Compiler and linker commands&lt;br /&gt;
#&lt;br /&gt;
#  We use the f90 compiler as the linker, so some C libraries may explicitly&lt;br /&gt;
#  need to be added into the link line.&lt;br /&gt;
#----------------------------------------------------------------------------&lt;br /&gt;
&lt;br /&gt;
## modules will put the right mpi in our path&lt;br /&gt;
FCOMP   = mpif77&lt;br /&gt;
CCOMP   = mpicc&lt;br /&gt;
CPPCOMP = mpiCC&lt;br /&gt;
LINK    = mpif77&lt;br /&gt;
&lt;br /&gt;
#----------------------------------------------------------------------------&lt;br /&gt;
# Compilation flags&lt;br /&gt;
#&lt;br /&gt;
#  Three sets of compilation/linking flags are defined: one for optimized&lt;br /&gt;
#  code, one for testing, and one for debugging.  The default is to use the &lt;br /&gt;
#  _OPT version.  Specifying -debug to setup will pick the _DEBUG version,&lt;br /&gt;
#  these should enable bounds checking.  Specifying -test is used for &lt;br /&gt;
#  flash_test, and is set for quick code generation, and (sometimes) &lt;br /&gt;
#  profiling.  The Makefile generated by setup will assign the generic token &lt;br /&gt;
#  (ex. FFLAGS) to the proper set of flags (ex. FFLAGS_OPT).&lt;br /&gt;
#----------------------------------------------------------------------------&lt;br /&gt;
&lt;br /&gt;
FFLAGS_OPT   =  -c -r8 -i4 -O3 -xSSE4.2&lt;br /&gt;
FFLAGS_DEBUG =  -c -g -r8 -i4 -O0&lt;br /&gt;
FFLAGS_TEST  =  -c -r8 -i4&lt;br /&gt;
&lt;br /&gt;
LIB_HDF5 = -L${HDF5_PATH}/lib -lhdf5 -L${SCINET_ZLIB_LIB} -lz -lgpfs&lt;br /&gt;
&lt;br /&gt;
# if we are using HDF5, we need to specify the path to the include files&lt;br /&gt;
CFLAGS_HDF5  = -I${HDF5_PATH}/include&lt;br /&gt;
&lt;br /&gt;
CFLAGS_OPT   = -c -O3 -xSSE4.2&lt;br /&gt;
CFLAGS_TEST  = -c -O2 &lt;br /&gt;
CFLAGS_DEBUG = -c -g  &lt;br /&gt;
&lt;br /&gt;
MDEFS = &lt;br /&gt;
&lt;br /&gt;
.SUFFIXES: .o .c .f .F .h .fh .F90 .f90&lt;br /&gt;
&lt;br /&gt;
#----------------------------------------------------------------------------&lt;br /&gt;
# Linker flags&lt;br /&gt;
#&lt;br /&gt;
#  There is a separate version of the linker flags for each of the _OPT, &lt;br /&gt;
#  _DEBUG, and _TEST cases.&lt;br /&gt;
#----------------------------------------------------------------------------&lt;br /&gt;
&lt;br /&gt;
LFLAGS_OPT   = -o&lt;br /&gt;
LFLAGS_TEST  = -o&lt;br /&gt;
LFLAGS_DEBUG = -g -o&lt;br /&gt;
&lt;br /&gt;
MACHOBJ = &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
MV = mv -f&lt;br /&gt;
AR = ar -r&lt;br /&gt;
RM = rm -f&lt;br /&gt;
CD = cd&lt;br /&gt;
RL = ranlib&lt;br /&gt;
ECHO = echo&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
-- [[User:Ljdursi|ljdursi]] 22:11, 13 August 2009 (UTC)&lt;br /&gt;
&lt;br /&gt;
==Aeronautics==&lt;br /&gt;
&lt;br /&gt;
==Chemistry==&lt;br /&gt;
&lt;br /&gt;
===CPMD===&lt;br /&gt;
&lt;br /&gt;
Please refer to the [[Cpmd | CPMD]] page.&lt;br /&gt;
&lt;br /&gt;
===NWChem===&lt;br /&gt;
&lt;br /&gt;
Please refer to the [[Nwchem | NWChem]] page.&lt;br /&gt;
&lt;br /&gt;
===GAMESS (US)===&lt;br /&gt;
&lt;br /&gt;
Please refer to the [[gamess|GAMESS (US)]] page.&lt;br /&gt;
&lt;br /&gt;
User supplied content below.&lt;br /&gt;
&lt;br /&gt;
====Tips from the Fekl Lab====&lt;br /&gt;
&lt;br /&gt;
Through trial and error, we have found a few useful things that we would like to share:&lt;br /&gt;
&lt;br /&gt;
1. Two very useful, open-source programs for visualization of output files from GAMESS(US) and for generation of input files are [http://www.scl.ameslab.gov/MacMolPlt/ MacMolPlt] and [http://avogadro.openmolecules.net/wiki/Main_Page Avogadro].  They are available for UNIX/LINUX, Windows and Mac-based machines, HOWEVER:  any input files that we have generated with these programs on a Windows-based machine do not run on Mac-based machines.  We don't know why.&lt;br /&gt;
&lt;br /&gt;
2. [http://winscp.net/eng/index.php WinSCP] is a very useful tool that has a graphical user interface for moving files from a local machine to SCINET and vice versa.  It also has text editing capabilities.&lt;br /&gt;
&lt;br /&gt;
3. The [https://bse.pnl.gov/bse/portal EMSL Basis Set Exchange] is an excellent source for custom basis set or effective core potential parameters.  Make sure that you specify &amp;quot;Gamess-US&amp;quot; in the format drop-down box.&lt;br /&gt;
&lt;br /&gt;
4.  The commercial program [http://www.chemcraftprog.com/ ChemCraft] is a highly useful visualization tool that can edit molecules in a very similar fashion to GaussView.  It can also be customized to build GAMESS(US) input files.&lt;br /&gt;
&lt;br /&gt;
====Anatomy of a GAMESS(US) Input File with Basis Set Info in an External File====&lt;br /&gt;
&lt;br /&gt;
  $CONTRL SCFTYP=RHF RUNTYP=OPTIMIZE DFTTYP=M06-L MAXIT=199 MULT=1 NOSYM=1&lt;br /&gt;
   ECP=READ $END&lt;br /&gt;
  $SYSTEM TIMLIM=525600 MWORDS=1750 PARALL=.TRUE. $END&lt;br /&gt;
  $BASIS GBASIS=CUSTOMNI EXTFIL=.t. $END&lt;br /&gt;
  $SCF DIRSCF=.TRUE. FDIFF=.f. $END&lt;br /&gt;
  $STATPT OPTTOL=0.0001 NSTEP=500 HSSEND=.t. $END&lt;br /&gt;
  $DATA&lt;br /&gt;
  Mo_BDT3&lt;br /&gt;
 C1&lt;br /&gt;
 MOLYBDENUM 42.0      5.7556500000      4.4039600000     16.5808400000&lt;br /&gt;
 SULFUR     16.0      7.4169700000      3.1956300000     15.2089300000&lt;br /&gt;
 SULFUR     16.0      4.0966800000      3.2258300000     15.1761100000&lt;br /&gt;
 SULFUR     16.0      3.9677300000      4.4940500000     18.3266100000&lt;br /&gt;
 SULFUR     16.0      7.1776900000      3.5815000000     18.4485200000&lt;br /&gt;
 SULFUR     16.0      4.3776600000      6.2447400000     15.6786900000&lt;br /&gt;
 SULFUR     16.0      7.5478700000      6.0679800000     16.2223700000&lt;br /&gt;
 CARBON      6.0      6.4716900000      2.1004800000     14.1902300000&lt;br /&gt;
 CARBON      6.0      5.0690300000      2.1781400000     14.1080700000&lt;br /&gt;
 CARBON      6.0      4.8421800000      4.2701300000     19.8855500000&lt;br /&gt;
 CARBON      6.0      6.1969000000      3.9249600000     19.9397400000&lt;br /&gt;
 CARBON      6.0      6.8280600000      3.7834200000     21.1913200000&lt;br /&gt;
 CARBON      6.0      5.7697600000      7.6933500000     17.4241800000&lt;br /&gt;
 CARBON      6.0      7.2043100000      7.9413600000     17.8281100000&lt;br /&gt;
 CARBON      6.0      5.5051400000      7.0409700000     14.5903800000&lt;br /&gt;
 CARBON      6.0      6.8905200000      6.9194700000     14.7626200000&lt;br /&gt;
 CARBON      6.0      7.7396400000      7.5379800000     13.8285700000&lt;br /&gt;
 HYDROGEN    1.0      8.8190700000      7.4520600000     13.9252200000&lt;br /&gt;
 CARBON      6.0      7.2169400000      8.2960300000     12.7704100000&lt;br /&gt;
 HYDROGEN    1.0      7.8667000000      8.7825100000     12.0575600000&lt;br /&gt;
 CARBON      6.0      5.8260300000      8.4502300000     12.6467800000&lt;br /&gt;
 HYDROGEN    1.0      5.4143000000      9.0544300000     11.8493100000&lt;br /&gt;
 CARBON      6.0      4.9881500000      7.8192300000     13.5528400000&lt;br /&gt;
 HYDROGEN    1.0      3.9090500000      7.9420000000     13.4583700000&lt;br /&gt;
 CARBON      6.0      7.1538500000      1.1569600000     13.4143900000&lt;br /&gt;
 CARBON      6.0      4.4018100000      1.3603900000     13.1919900000&lt;br /&gt;
 CARBON      6.0      6.4791600000      0.3185500000     12.5353300000&lt;br /&gt;
 CARBON      6.0      5.0837400000      0.4369500000     12.4084900000&lt;br /&gt;
 HYDROGEN    1.0      7.0116000000     -0.4099400000     11.9434600000&lt;br /&gt;
 HYDROGEN    1.0      8.2399000000      1.0702400000     13.4937600000&lt;br /&gt;
 HYDROGEN    1.0      3.3185600000      1.4368700000     13.0953100000&lt;br /&gt;
 HYDROGEN    1.0      4.5549800000     -0.1997300000     11.7165200000&lt;br /&gt;
 CARBON      6.0      6.1105700000      3.9639000000     22.3866100000&lt;br /&gt;
 CARBON      6.0      4.1216300000      4.4424400000     21.1020100000&lt;br /&gt;
 HYDROGEN    1.0      7.8732900000      3.5217100000     21.2520500000&lt;br /&gt;
 CARBON      6.0      4.7606000000      4.2868500000     22.3363800000&lt;br /&gt;
 HYDROGEN    1.0      6.6064200000      3.8406000000     23.3428500000&lt;br /&gt;
 HYDROGEN    1.0      4.2065000000      4.4170700000     23.2667100000&lt;br /&gt;
 HYDROGEN    1.0      3.0674000000      4.6893500000     21.0889000000&lt;br /&gt;
 HYDROGEN    1.0      7.4249200000      7.7545300000     18.8583200000&lt;br /&gt;
 HYDROGEN    1.0      7.6651700000      8.9049700000     17.7652100000&lt;br /&gt;
 HYDROGEN    1.0      5.3324000000      8.6487800000     17.2222700000&lt;br /&gt;
 HYDROGEN    1.0      5.5015000000      7.1039000000     18.2759400000&lt;br /&gt;
  $END&lt;br /&gt;
  $ECP&lt;br /&gt;
 MO-ECP GEN     28     3&lt;br /&gt;
  5      ----- f potential     -----&lt;br /&gt;
     -0.0469492        0    537.9667807        &lt;br /&gt;
    -20.2080084        1    147.8982938        &lt;br /&gt;
   -106.2116302        2     45.7358898        &lt;br /&gt;
    -41.8107368        2     13.2911467        &lt;br /&gt;
     -4.2054103        2      4.7059961        &lt;br /&gt;
  3      ----- s-f potential     -----&lt;br /&gt;
      2.8063717        0    110.2991760        &lt;br /&gt;
     44.5162012        1     23.2014645        &lt;br /&gt;
     82.7785227        2      5.3530131        &lt;br /&gt;
  4      ----- p-f potential     -----&lt;br /&gt;
      4.9420876        0     63.2901397        &lt;br /&gt;
     25.8604976        1     23.3315302        &lt;br /&gt;
    132.4708742        2     24.6759423        &lt;br /&gt;
     57.3149794        2      4.6493040        &lt;br /&gt;
  5      ----- d-f potential     -----&lt;br /&gt;
      3.0054591        0    104.4839977        &lt;br /&gt;
     26.3637851        1     66.2307245        &lt;br /&gt;
    183.3849199        2     39.1283176        &lt;br /&gt;
     98.4453068        2     13.1164437        &lt;br /&gt;
     22.4901377        2      3.6280263 &lt;br /&gt;
 S NONE&lt;br /&gt;
 S NONE&lt;br /&gt;
 S NONE&lt;br /&gt;
 S NONE&lt;br /&gt;
 S NONE&lt;br /&gt;
 S NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
  $END&lt;br /&gt;
&lt;br /&gt;
=====The Input Deck=====&lt;br /&gt;
&lt;br /&gt;
Below is the input deck.  It is where you tell GAMESS(US) what job type to execute and where all of your individual parameters for that job type are entered.  The example input deck below is for a geometry optimization and frequency calculation; it is equivalent to a Gaussian job with &amp;quot;opt&amp;quot; and &amp;quot;freq&amp;quot; in the route section.&lt;br /&gt;
&lt;br /&gt;
  $CONTRL SCFTYP=RHF RUNTYP=OPTIMIZE DFTTYP=M06-L MAXIT=199 MULT=1 NOSYM=1&lt;br /&gt;
   ECP=READ $END&lt;br /&gt;
  $SYSTEM TIMLIM=2850 MWORDS=1750 MEMDDI=20 PARALL=.TRUE. $END&lt;br /&gt;
  $BASIS GBASIS=CUSTOMNI EXTFIL=.t. $END&lt;br /&gt;
  $SCF DIRSCF=.TRUE. FDIFF=.f. $END&lt;br /&gt;
  $STATPT OPTTOL=0.0001 NSTEP=500 HSSEND=.t. $END&lt;br /&gt;
  $DATA&lt;br /&gt;
&lt;br /&gt;
An important thing to note is the spacing.  In the input deck, there must be 1 space at the beginning of each line of the input deck.  If not, the job will fail.  Most builders will insert this space anyway, but it helps to double check.&lt;br /&gt;
&lt;br /&gt;
The end of the input deck is marked by the &amp;quot;$DATA&amp;quot; line.&lt;br /&gt;
&lt;br /&gt;
=====Job Title Line=====&lt;br /&gt;
&lt;br /&gt;
The next line of the file is the job title.  It can be anything you wish; however, we have found that, to be on the safe side, it is best to avoid using symbols or spaces.&lt;br /&gt;
&lt;br /&gt;
  Mo_BDT3&lt;br /&gt;
&lt;br /&gt;
=====Symmetry Point Group=====&lt;br /&gt;
&lt;br /&gt;
The next line of the file is the symmetry point group of your molecule.  Note that there is no leading space before the point group.&lt;br /&gt;
&lt;br /&gt;
 C1&lt;br /&gt;
&lt;br /&gt;
=====Coordinates=====&lt;br /&gt;
&lt;br /&gt;
The next block of text is set aside for the coordinates of the molecule.  This can be in internal (or z-matrix) format or cartesian coordinates.  Note that there is no leading space before the coordinates.  One may use the chemical symbol or the full name of each atom in the molecule.  Note that the end of the coordinates is signified by an &amp;quot;$END&amp;quot;, which MUST have one space preceding it.  The coordinates below do NOT have any basis set information inserted.  It is possible to insert basis set information directly into the input file.  This is accomplished by obtaining the desired basis set parameters from the EMSL and then inserting them below each relevant atom.  An example input file with inserted basis set information will be shown later.&lt;br /&gt;
&lt;br /&gt;
 MOLYBDENUM 42.0      5.7556500000      4.4039600000     16.5808400000&lt;br /&gt;
 SULFUR     16.0      7.4169700000      3.1956300000     15.2089300000&lt;br /&gt;
 SULFUR     16.0      4.0966800000      3.2258300000     15.1761100000&lt;br /&gt;
 SULFUR     16.0      3.9677300000      4.4940500000     18.3266100000&lt;br /&gt;
 SULFUR     16.0      7.1776900000      3.5815000000     18.4485200000&lt;br /&gt;
 SULFUR     16.0      4.3776600000      6.2447400000     15.6786900000&lt;br /&gt;
 SULFUR     16.0      7.5478700000      6.0679800000     16.2223700000&lt;br /&gt;
 CARBON      6.0      6.4716900000      2.1004800000     14.1902300000&lt;br /&gt;
 CARBON      6.0      5.0690300000      2.1781400000     14.1080700000&lt;br /&gt;
 CARBON      6.0      4.8421800000      4.2701300000     19.8855500000&lt;br /&gt;
 CARBON      6.0      6.1969000000      3.9249600000     19.9397400000&lt;br /&gt;
 CARBON      6.0      6.8280600000      3.7834200000     21.1913200000&lt;br /&gt;
 CARBON      6.0      5.7697600000      7.6933500000     17.4241800000&lt;br /&gt;
 CARBON      6.0      7.2043100000      7.9413600000     17.8281100000&lt;br /&gt;
 CARBON      6.0      5.5051400000      7.0409700000     14.5903800000&lt;br /&gt;
 CARBON      6.0      6.8905200000      6.9194700000     14.7626200000&lt;br /&gt;
 CARBON      6.0      7.7396400000      7.5379800000     13.8285700000&lt;br /&gt;
 HYDROGEN    1.0      8.8190700000      7.4520600000     13.9252200000&lt;br /&gt;
 CARBON      6.0      7.2169400000      8.2960300000     12.7704100000&lt;br /&gt;
 HYDROGEN    1.0      7.8667000000      8.7825100000     12.0575600000&lt;br /&gt;
 CARBON      6.0      5.8260300000      8.4502300000     12.6467800000&lt;br /&gt;
 HYDROGEN    1.0      5.4143000000      9.0544300000     11.8493100000&lt;br /&gt;
 CARBON      6.0      4.9881500000      7.8192300000     13.5528400000&lt;br /&gt;
 HYDROGEN    1.0      3.9090500000      7.9420000000     13.4583700000&lt;br /&gt;
 CARBON      6.0      7.1538500000      1.1569600000     13.4143900000&lt;br /&gt;
 CARBON      6.0      4.4018100000      1.3603900000     13.1919900000&lt;br /&gt;
 CARBON      6.0      6.4791600000      0.3185500000     12.5353300000&lt;br /&gt;
 CARBON      6.0      5.0837400000      0.4369500000     12.4084900000&lt;br /&gt;
 HYDROGEN    1.0      7.0116000000     -0.4099400000     11.9434600000&lt;br /&gt;
 HYDROGEN    1.0      8.2399000000      1.0702400000     13.4937600000&lt;br /&gt;
 HYDROGEN    1.0      3.3185600000      1.4368700000     13.0953100000&lt;br /&gt;
 HYDROGEN    1.0      4.5549800000     -0.1997300000     11.7165200000&lt;br /&gt;
 CARBON      6.0      6.1105700000      3.9639000000     22.3866100000&lt;br /&gt;
 CARBON      6.0      4.1216300000      4.4424400000     21.1020100000&lt;br /&gt;
 HYDROGEN    1.0      7.8732900000      3.5217100000     21.2520500000&lt;br /&gt;
 CARBON      6.0      4.7606000000      4.2868500000     22.3363800000&lt;br /&gt;
 HYDROGEN    1.0      6.6064200000      3.8406000000     23.3428500000&lt;br /&gt;
 HYDROGEN    1.0      4.2065000000      4.4170700000     23.2667100000&lt;br /&gt;
 HYDROGEN    1.0      3.0674000000      4.6893500000     21.0889000000&lt;br /&gt;
 HYDROGEN    1.0      7.4249200000      7.7545300000     18.8583200000&lt;br /&gt;
 HYDROGEN    1.0      7.6651700000      8.9049700000     17.7652100000&lt;br /&gt;
 HYDROGEN    1.0      5.3324000000      8.6487800000     17.2222700000&lt;br /&gt;
 HYDROGEN    1.0      5.5015000000      7.1039000000     18.2759400000&lt;br /&gt;
  $END&lt;br /&gt;
&lt;br /&gt;
=====Effective Core Potential Data=====&lt;br /&gt;
&lt;br /&gt;
The effective core potential (ECP) data is entered after the coordinates.  It starts with &amp;quot;$ECP&amp;quot;, which must be preceded by a space.   The atoms of the molecule are listed in the same order as in the coordinates section, and the parameters for the ECP are listed after each atom.  Note that any atom that does NOT have an ECP must be given &amp;quot;ECP-NONE&amp;quot; or &amp;quot;NONE&amp;quot; on its line.&lt;br /&gt;
&lt;br /&gt;
 $ECP&lt;br /&gt;
 MO-ECP GEN     28     3&lt;br /&gt;
  5      ----- f potential     -----&lt;br /&gt;
     -0.0469492        0    537.9667807        &lt;br /&gt;
    -20.2080084        1    147.8982938        &lt;br /&gt;
   -106.2116302        2     45.7358898        &lt;br /&gt;
    -41.8107368        2     13.2911467        &lt;br /&gt;
     -4.2054103        2      4.7059961        &lt;br /&gt;
  3      ----- s-f potential     -----&lt;br /&gt;
      2.8063717        0    110.2991760        &lt;br /&gt;
     44.5162012        1     23.2014645        &lt;br /&gt;
     82.7785227        2      5.3530131        &lt;br /&gt;
  4      ----- p-f potential     -----&lt;br /&gt;
      4.9420876        0     63.2901397        &lt;br /&gt;
     25.8604976        1     23.3315302        &lt;br /&gt;
    132.4708742        2     24.6759423        &lt;br /&gt;
     57.3149794        2      4.6493040        &lt;br /&gt;
  5      ----- d-f potential     -----&lt;br /&gt;
      3.0054591        0    104.4839977        &lt;br /&gt;
     26.3637851        1     66.2307245        &lt;br /&gt;
    183.3849199        2     39.1283176        &lt;br /&gt;
     98.4453068        2     13.1164437        &lt;br /&gt;
     22.4901377        2      3.6280263 &lt;br /&gt;
 S NONE&lt;br /&gt;
 S NONE&lt;br /&gt;
 S NONE&lt;br /&gt;
 S NONE&lt;br /&gt;
 S NONE&lt;br /&gt;
 S NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 C NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
 H NONE&lt;br /&gt;
  $END&lt;br /&gt;
&lt;br /&gt;
-- [[User:M.Zimmer-De Iuliis|mzd]]  16 November 2009&lt;br /&gt;
&lt;br /&gt;
====Using an External File to Define Basis Set in GAMESS(US)====&lt;br /&gt;
&lt;br /&gt;
Since GAMESS(US) has a limited number of built-in ECPs and basis sets, one may want to make GAMESS(US) read an external file that contains the basis set information and ECP data, using the &amp;quot;EXTFIL&amp;quot; keyword in the $BASIS group of the input file.  For many metal-containing compounds, it is very convenient and time-saving to use an effective core potential (ECP) for the core metal electrons, as they are usually not important to the reactivity of the complex or the geometry around the metal.  In addition, to make GAMESS(US) use this external file, one must copy the &amp;quot;rungms&amp;quot; file and modify it accordingly.  The following is a list of instructions with commands that will work from a terminal.  One could also use WinSCP to do all of this with a GUI rather than a TUI.  &lt;br /&gt;
&lt;br /&gt;
=====Modifiying rungms to Use Custom Basis Set File=====&lt;br /&gt;
1. Copy &amp;quot;rungms&amp;quot; from /scinet/gpc/Applications/gamess to one's own /scratch/$USER/ directory:&lt;br /&gt;
 cp /scinet/gpc/Applications/gamess/rungms /scratch/$USER/&lt;br /&gt;
&lt;br /&gt;
2. Change to the scratch directory and check to see if &amp;quot;rungms&amp;quot; has copied successfully.&lt;br /&gt;
 cd /scratch/$USER&lt;br /&gt;
 ls&lt;br /&gt;
&lt;br /&gt;
3. Edit line 147 of the script.  &lt;br /&gt;
 vi rungms&lt;br /&gt;
Move the cursor down to line 147 using the arrow keys.  It should say &amp;quot;setenv EXTBAS /dev/null&amp;quot;.  Using the arrow keys, move the cursor to the first &amp;quot;/&amp;quot; and then hit &amp;quot;i&amp;quot; to insert text.  Put the path to your external basis file here.  For example, /scratch/$USER/basisset.  Then hit &amp;quot;escape&amp;quot;.  To save the changes and exit vi, type &amp;quot;:&amp;quot; and you should see a colon appear at the bottom of the window.  Type &amp;quot;wq&amp;quot; (which should appear at the bottom of the window next to the colon) and then hit enter.  Now you are done with vi.&lt;br /&gt;
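&lt;br /&gt;
If you prefer a non-interactive alternative to vi, a single sed command can make the same edit to line 147; this assumes your external basis set file will live at /scratch/$USER/basisset (since rungms is a csh script, the literal $USER will be expanded when it runs):&lt;br /&gt;
&lt;br /&gt;
 sed -i 's|setenv EXTBAS /dev/null|setenv EXTBAS /scratch/$USER/basisset|' rungms&lt;br /&gt;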
&lt;br /&gt;
=====Creating a Custom Basis Set File=====&lt;br /&gt;
1. To create a custom basis set file, you need to create a new text document.  Our group's common practice is to comment out the first line of this file by inserting an exclamation mark (!) followed by a note of the specific basis sets and ECPs that are going to be used for each of the atoms.  Let us use the molecule Mo(CO)6, molybdenum hexacarbonyl, as an example.  Below is the first line of the external file, which we will call &amp;quot;CUSTOMMO&amp;quot;  (NOTE:  you can use any name for the external file that suits you, as long as it has no spaces and is 8 characters or less).&lt;br /&gt;
&lt;br /&gt;
 ! 6-31G on C and O and LANL2D2 ECP on Mo&lt;br /&gt;
&lt;br /&gt;
2. The next step is to visit the [https://bse.pnl.gov/bse/portal EMSL Basis Set exchange] and select C and O from the periodic table.  Then, on the left of the page, select &amp;quot;6-31G&amp;quot; as the basis set.  Finally, make sure the output is in GAMESS(US) format using the drop-down menu and then click &amp;quot;get basis set&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
[[File:C_O_6_31G_basisset.JPG|centre]]&lt;br /&gt;
&lt;br /&gt;
3. A new window should appear with text in it.  For our example case, the text looks like this:&lt;br /&gt;
 &lt;br /&gt;
 !  6-31G  EMSL  Basis Set Exchange Library   10/13/09 11:12 AM&lt;br /&gt;
 ! Elements                             References&lt;br /&gt;
 ! --------                             ----------&lt;br /&gt;
 ! H - He: W.J. Hehre, R. Ditchfield and J.A. Pople, J. Chem. Phys. 56,&lt;br /&gt;
 ! Li - Ne: 2257 (1972).  Note: Li and B come from J.D. Dill and J.A.&lt;br /&gt;
 ! Pople, J. Chem. Phys. 62, 2921 (1975).&lt;br /&gt;
 ! Na - Ar: M.M. Francl, W.J. Petro, W.J. Hehre, J.S. Binkley, M.S. Gordon,&lt;br /&gt;
 ! D.J. DeFrees and J.A. Pople, J. Chem. Phys. 77, 3654 (1982)&lt;br /&gt;
 ! K  - Zn: V. Rassolov, J.A. Pople, M. Ratner and T.L. Windus, J. Chem. Phys.&lt;br /&gt;
 ! 109, 1223 (1998)&lt;br /&gt;
 ! Note: He and Ne are unpublished basis sets taken from the Gaussian&lt;br /&gt;
 ! program&lt;br /&gt;
 ! &lt;br /&gt;
 $DATA&amp;lt;br /&amp;gt;&lt;br /&gt;
 CARBON&lt;br /&gt;
 S   6&lt;br /&gt;
   1   3047.5249000              0.0018347        &lt;br /&gt;
   2    457.3695100              0.0140373        &lt;br /&gt;
   3    103.9486900              0.0688426        &lt;br /&gt;
   4     29.2101550              0.2321844        &lt;br /&gt;
   5      9.2866630              0.4679413        &lt;br /&gt;
   6      3.1639270              0.3623120        &lt;br /&gt;
 L   3&lt;br /&gt;
   1      7.8682724             -0.1193324              0.0689991        &lt;br /&gt;
   2      1.8812885             -0.1608542              0.3164240        &lt;br /&gt;
   3      0.5442493              1.1434564              0.7443083        &lt;br /&gt;
 L   1&lt;br /&gt;
   1      0.1687144              1.0000000              1.0000000&amp;lt;br /&amp;gt;      &lt;br /&gt;
 OXYGEN&lt;br /&gt;
 S   6&lt;br /&gt;
   1   5484.6717000              0.0018311        &lt;br /&gt;
   2    825.2349500              0.0139501        &lt;br /&gt;
   3    188.0469600              0.0684451        &lt;br /&gt;
   4     52.9645000              0.2327143        &lt;br /&gt;
   5     16.8975700              0.4701930        &lt;br /&gt;
   6      5.7996353              0.3585209        &lt;br /&gt;
 L   3&lt;br /&gt;
   1     15.5396160             -0.1107775              0.0708743        &lt;br /&gt;
   2      3.5999336             -0.1480263              0.3397528        &lt;br /&gt;
   3      1.0137618              1.1307670              0.7271586        &lt;br /&gt;
 L   1&lt;br /&gt;
   1      0.2700058              1.0000000              1.0000000        &lt;br /&gt;
 $END&lt;br /&gt;
&lt;br /&gt;
4. Now, copy and paste the text between the $DATA and $END headings into our external text file, CUSTOMMO.  We also need to change the name of each element to the corresponding symbol in the periodic table.  Finally, we need to add the name of the external file next to the element symbol, separated by one space.  Note that there should be a blank line separating the basis set information and the first, commented-out line (the line starting with the '!').  The CUSTOMMO file should look like this:&lt;br /&gt;
 &lt;br /&gt;
 ! 6-31G on C and O and LANL2D2 ECP on Mo&amp;lt;br /&amp;gt;&lt;br /&gt;
 C CUSTOMMO&lt;br /&gt;
 S   6&lt;br /&gt;
   1   3047.5249000              0.0018347        &lt;br /&gt;
   2    457.3695100              0.0140373        &lt;br /&gt;
   3    103.9486900              0.0688426        &lt;br /&gt;
   4     29.2101550              0.2321844        &lt;br /&gt;
   5      9.2866630              0.4679413        &lt;br /&gt;
   6      3.1639270              0.3623120        &lt;br /&gt;
 L   3&lt;br /&gt;
   1      7.8682724             -0.1193324              0.0689991        &lt;br /&gt;
   2      1.8812885             -0.1608542              0.3164240        &lt;br /&gt;
   3      0.5442493              1.1434564              0.7443083        &lt;br /&gt;
 L   1&lt;br /&gt;
   1      0.1687144              1.0000000              1.0000000&amp;lt;br /&amp;gt;      &lt;br /&gt;
 O CUSTOMMO&lt;br /&gt;
 S   6&lt;br /&gt;
   1   5484.6717000              0.0018311        &lt;br /&gt;
   2    825.2349500              0.0139501        &lt;br /&gt;
   3    188.0469600              0.0684451        &lt;br /&gt;
   4     52.9645000              0.2327143        &lt;br /&gt;
   5     16.8975700              0.4701930        &lt;br /&gt;
   6      5.7996353              0.3585209        &lt;br /&gt;
 L   3&lt;br /&gt;
   1     15.5396160             -0.1107775              0.0708743        &lt;br /&gt;
   2      3.5999336             -0.1480263              0.3397528        &lt;br /&gt;
   3      1.0137618              1.1307670              0.7271586        &lt;br /&gt;
 L   1&lt;br /&gt;
   1      0.2700058              1.0000000              1.0000000 &lt;br /&gt;
&lt;br /&gt;
5. Repeat Step 2 above, but choose Mo and select the LANL2DZ ECP instead.  A new window will pop up with the basis set information as well as the ECP data we need, since we specified the LANL2DZ '''ECP'''.  The ECP data is not inserted into the external file; rather, it is placed into the input file itself (more on this later).  &lt;br /&gt;
&lt;br /&gt;
[[File:Mo_LANL2DZ_basisset.JPG|centre]]&lt;br /&gt;
&lt;br /&gt;
6.  After copying the molybdenum basis set information, your finished external basis set file should look like this:&lt;br /&gt;
 ! 6-31G on C and O and LANL2D2 ECP on Mo&amp;lt;br /&amp;gt;&lt;br /&gt;
 C CUSTOMMO&lt;br /&gt;
 S   6&lt;br /&gt;
   1   3047.5249000              0.0018347        &lt;br /&gt;
   2    457.3695100              0.0140373        &lt;br /&gt;
   3    103.9486900              0.0688426        &lt;br /&gt;
   4     29.2101550              0.2321844        &lt;br /&gt;
   5      9.2866630              0.4679413        &lt;br /&gt;
   6      3.1639270              0.3623120        &lt;br /&gt;
 L   3&lt;br /&gt;
   1      7.8682724             -0.1193324              0.0689991        &lt;br /&gt;
   2      1.8812885             -0.1608542              0.3164240        &lt;br /&gt;
   3      0.5442493              1.1434564              0.7443083        &lt;br /&gt;
 L   1&lt;br /&gt;
   1      0.1687144              1.0000000              1.0000000&amp;lt;br /&amp;gt;      &lt;br /&gt;
 O CUSTOMMO&lt;br /&gt;
 S   6&lt;br /&gt;
   1   5484.6717000              0.0018311        &lt;br /&gt;
   2    825.2349500              0.0139501        &lt;br /&gt;
   3    188.0469600              0.0684451        &lt;br /&gt;
   4     52.9645000              0.2327143        &lt;br /&gt;
   5     16.8975700              0.4701930        &lt;br /&gt;
   6      5.7996353              0.3585209        &lt;br /&gt;
 L   3&lt;br /&gt;
   1     15.5396160             -0.1107775              0.0708743        &lt;br /&gt;
   2      3.5999336             -0.1480263              0.3397528        &lt;br /&gt;
   3      1.0137618              1.1307670              0.7271586        &lt;br /&gt;
 L   1&lt;br /&gt;
   1      0.2700058              1.0000000              1.0000000&amp;lt;br /&amp;gt; &lt;br /&gt;
 Mo CUSTOMMO&lt;br /&gt;
 S   3&lt;br /&gt;
   1      2.3610000             -0.9121760        &lt;br /&gt;
   2      1.3090000              1.1477453        &lt;br /&gt;
   3      0.4500000              0.6097109        &lt;br /&gt;
 S   4&lt;br /&gt;
   1      2.3610000              0.8139259        &lt;br /&gt;
   2      1.3090000             -1.1360084        &lt;br /&gt;
   3      0.4500000             -1.1611592        &lt;br /&gt;
   4      0.1681000              1.0064786        &lt;br /&gt;
 S   1&lt;br /&gt;
   1      0.0423000              1.0000000        &lt;br /&gt;
 P   3&lt;br /&gt;
   1      4.8950000             -0.0908258        &lt;br /&gt;
   2      1.0440000              0.7042899        &lt;br /&gt;
   3      0.3877000              0.3973179        &lt;br /&gt;
 P   2&lt;br /&gt;
   1      0.4995000             -0.1081945        &lt;br /&gt;
   2      0.0780000              1.0368093        &lt;br /&gt;
 P   1&lt;br /&gt;
   1      0.0247000              1.0000000        &lt;br /&gt;
 D   3&lt;br /&gt;
   1      2.9930000              0.0527063        &lt;br /&gt;
   2      1.0630000              0.5003907        &lt;br /&gt;
   3      0.3721000              0.5794024        &lt;br /&gt;
 D   1&lt;br /&gt;
   1      0.1178000              1.0000000&lt;br /&gt;
&lt;br /&gt;
-- [[User:M.Zimmer-De Iuliis|mzd]]  21 September 2010&lt;br /&gt;
&lt;br /&gt;
====A Modified BASH Script for Running GAMESS(US)====&lt;br /&gt;
Below please find the bash script that we use to run GAMESS(US) on a single node with 8 processors.  &lt;br /&gt;
&lt;br /&gt;
One quirk of GAMESS(US) is that it will NOT write over old or failed jobs that have the same name as the input file you are submitting.  For example:  my input file name is &amp;quot;mo_opt.inp&amp;quot; and I submit this job to the queue.  However, it comes back seconds later with an error.  The log file says that I have typed an incorrect keyword, and lo and behold, I have a comma where it shouldn't be.  Such typos can be common.  If you simply try to re-submit, GAMESS(US) will fail again, because it has written a .log file and some other files to the /scratch/user/gamess-scratch/ directory.  These files must all be deleted before you re-submit your fixed input file.&lt;br /&gt;
&lt;br /&gt;
This script takes care of this annoying problem by deleting failed jobs with the same file name for you.&lt;br /&gt;
&lt;br /&gt;
Here it is:&lt;br /&gt;
&lt;br /&gt;
 #!/bin/bash&lt;br /&gt;
 #PBS -l nodes=1:ppn=8,walltime=48:00:00&lt;br /&gt;
 &lt;br /&gt;
 ## To submit type: qsub x.sh&lt;br /&gt;
 &lt;br /&gt;
 # If not an interactive job (i.e. -I), then cd into the directory where&lt;br /&gt;
 # I typed qsub.&lt;br /&gt;
 if [ &amp;quot;$PBS_ENVIRONMENT&amp;quot; != &amp;quot;PBS_INTERACTIVE&amp;quot; ]; then&lt;br /&gt;
   if [ -n &amp;quot;$PBS_O_WORKDIR&amp;quot; ]; then&lt;br /&gt;
     cd $PBS_O_WORKDIR&lt;br /&gt;
   fi&lt;br /&gt;
 fi&lt;br /&gt;
 &lt;br /&gt;
 # the input file is typically named something like &amp;quot;gamesjob.inp&amp;quot;&lt;br /&gt;
 # so the script will be run like &amp;quot;$SCINET_RUNGMS gamessjob 00 8 8&amp;quot;&lt;br /&gt;
 &lt;br /&gt;
 find /scratch/user/gamess-scratch -type f -name ${NAME:-safety_net}\* -exec /bin/rm {} \;&lt;br /&gt;
 &lt;br /&gt;
 # loading modules should be done in the submission script.&lt;br /&gt;
 # It is not recommended by SciNet staff to put this command&lt;br /&gt;
 # in the .bashrc file.&lt;br /&gt;
 module load gamess&lt;br /&gt;
 &lt;br /&gt;
 # run the program&lt;br /&gt;
 &lt;br /&gt;
 /scratch/user/rungms $NAME 00 8 8 &amp;gt;&amp;amp; $NAME.log&lt;br /&gt;
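&lt;br /&gt;
The script picks up the base name of the input file from the NAME environment variable, so (assuming the script is saved as &amp;quot;x.sh&amp;quot;) it would presumably be submitted along the lines of:&lt;br /&gt;
&lt;br /&gt;
 qsub x.sh -v NAME=mo_opt&lt;br /&gt;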
&lt;br /&gt;
====A Script to Add the $VIB Group for Hessian Restarts in GAMESS(US)====&lt;br /&gt;
&lt;br /&gt;
Sometimes, an optimization + vibrational analysis or just a plain vibrational analysis must be restarted.  This can be because the two-day time limit has been exceeded, or perhaps there was an error during the calculation.  In any case, when this happens, the job must be restarted.  In GAMESS(US), you can restart a vibrational analysis from a previous one, and it will utilize the frequencies that were already computed in the failed run.&lt;br /&gt;
&lt;br /&gt;
For example, if one submits the input file &amp;quot;job_name.inp&amp;quot; and it fails before it has finished, then one must utilize the file &amp;quot;job_name.rst&amp;quot;, which contains data that is required to restart the calculation.  This file is located in the /scratch/user/gamess-scratch directory.  Data from the &amp;quot;job_name.rst&amp;quot; file must be appended at the end of the new input file (after the coordinates and ECP section, if present) to restart the calculation; let us call this new file &amp;quot;job_name_restart.inp&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
A shortened version of the &amp;quot;job_name.rst&amp;quot; file looks like this:&lt;br /&gt;
&lt;br /&gt;
  ENERGY/GRADIENT/DIPOLE RESTART DATA FOR RUNTYP=HESSIAN&lt;br /&gt;
  job_name                           &lt;br /&gt;
  $VIB   &lt;br /&gt;
         IVIB=   0 IATOM=   0 ICOORD=   0 E=    -3717.1435124522&lt;br /&gt;
 -5.165258381E-04 1.584665821E-02-1.206270555E-02-2.241461728E-03 3.176050715E-03&lt;br /&gt;
 -5.706738823E-04 2.502034151E-03 5.130112290E-04-2.716945939E-03 1.357008279E-03&lt;br /&gt;
 -1.059915305E-03 1.693526456E-03-2.957638907E-04-5.994938737E-04 9.684054361E-04&lt;br /&gt;
 .&lt;br /&gt;
 .&lt;br /&gt;
 .&lt;br /&gt;
 .&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The text eventually ends with one blank line. The $VIB heading and all of the text after $VIB must be appended to the end of file &amp;quot;job_name_restart.inp&amp;quot; and then &amp;quot; $END&amp;quot; must be inserted at the very end of the file.&lt;br /&gt;
&lt;br /&gt;
One could cut and paste this in a text editor, but we have written a small script that will do it automatically.  We call it &amp;quot;vib.sh&amp;quot;, but you can call it whatever you like.  Here it is:&lt;br /&gt;
&lt;br /&gt;
 #!/bin/bash&lt;br /&gt;
 # script to add vibrational data for a hessian restart&lt;br /&gt;
 &lt;br /&gt;
 awk '/\$VIB/{p=1}p;END{print &amp;quot; $END&amp;quot;}' /scratch/user/gamess-scratch/$NAME1.rst &amp;gt;&amp;gt; $NAME2.inp&lt;br /&gt;
&lt;br /&gt;
To use it, simply copy it into a new text file with the extension &amp;quot;.sh&amp;quot; and make it executable.  Also, you will need to edit the location of the &amp;quot;/scratch/user/gamess-scratch/&amp;quot; directory to match your user name.  The two variables in the script, NAME1 and NAME2, represent the name of your &amp;quot;.rst&amp;quot; file and your new &amp;quot;.inp&amp;quot; file, respectively.  In the example above, NAME1=job_name (that is, the same name as the .rst file that contains the $VIB data and that was created in the /gamess-scratch/ directory) and NAME2=job_name_restart (that is, the name of the new input file that you have prepared and want to copy the $VIB data into).&lt;br /&gt;
&lt;br /&gt;
To run it on a gpc node without submitting it to the job queue, type:&lt;br /&gt;
&lt;br /&gt;
 NAME1=job_name NAME2=job_name_restart ./vib.sh&lt;br /&gt;
&lt;br /&gt;
To run it in the queue, type:&lt;br /&gt;
&lt;br /&gt;
 qsub vib.sh -v NAME1=job_name,NAME2=job_name_restart &lt;br /&gt;
&lt;br /&gt;
-special thanks to Ramses for help with this&lt;br /&gt;
&lt;br /&gt;
-- [[User:M.Zimmer-De Iuliis|mzd]]  30 September 2010&lt;br /&gt;
&lt;br /&gt;
====Most Commonly Used Headers in The Fekl Lab====&lt;br /&gt;
&lt;br /&gt;
After about a year of using GAMESS(US), we have found that we are most often doing optimizations, frequency analyses, transition state searches and IRC calculations using DFT methods.  Here are the input decks that we have found work well for inorganic and organometallic compounds.&lt;br /&gt;
&lt;br /&gt;
=====Optimization Plus Frequency (for a neutral, singlet)=====&lt;br /&gt;
 &lt;br /&gt;
 $CONTRL SCFTYP=RHF RUNTYP=OPTIMIZE DFTTYP=''FILL_IN_YOUR_PREFERENCE_HERE'' MAXIT=199 MULT=1 NOSYM=1&lt;br /&gt;
  ECP=READ $END&lt;br /&gt;
 $SYSTEM TIMLIM=2800 MWORDS=20 MEMDDI=50 PARALL=.TRUE. $END&lt;br /&gt;
 $SCF DIRSCF=.TRUE. FDIFF=.f. DIIS=.T. SOSCF=.F. DAMP=.T. $END&lt;br /&gt;
 $STATPT OPTTOL=0.00001 NSTEP=500 HSSEND=.t. $END&lt;br /&gt;
 $FORCE TEMP=298.15 PURIFY=.t. PROJCT=.t. $END&lt;br /&gt;
 $DATA&lt;br /&gt;
&lt;br /&gt;
=====Frequency Only (for a neutral, singlet)=====&lt;br /&gt;
&lt;br /&gt;
 $CONTRL SCFTYP=RHF RUNTYP=HESSIAN DFTTYP=''FILL_IN_YOUR_PREFERENCE_HERE'' MAXIT=199 MULT=1 NOSYM=1&lt;br /&gt;
  ECP=READ $END&lt;br /&gt;
 $SYSTEM TIMLIM=2800 MWORDS=20 MEMDDI=50 PARALL=.TRUE. $END&lt;br /&gt;
 $SCF DIRSCF=.TRUE. FDIFF=.f. DIIS=.T. SOSCF=.F. DAMP=.T. $END&lt;br /&gt;
 $FORCE METHOD=SEMINUM VIBANL=.TRUE. PROJCT=.T. PURIFY=.T. $END&lt;br /&gt;
 $DATA&lt;br /&gt;
&lt;br /&gt;
=====Transition State Search (for a neutral, singlet)=====&lt;br /&gt;
&lt;br /&gt;
 $CONTRL SCFTYP=RHF RUNTYP=SADPOINT DFTTYP=''FILL_IN_YOUR_PREFERENCE_HERE'' MAXIT=199 MULT=1 NOSYM=1&lt;br /&gt;
  ECP=READ $END&lt;br /&gt;
 $SYSTEM TIMLIM=2850 MWORDS=20 MEMDDI=50 PARALL=.TRUE. $END&lt;br /&gt;
 $SCF DIRSCF=.TRUE. FDIFF=.f. DIIS=.T. SOSCF=.F. $END&lt;br /&gt;
 $STATPT STSTEP=0.05 OPTTOL=0.00001 NSTEP=500 HESS=CALC HSSEND=.t. &lt;br /&gt;
  STPT=.FALSE. $END&lt;br /&gt;
 $FORCE METHOD=SEMINUM VIBANL=.TRUE. PURIFY=.T. PROJCT=.T. $END&lt;br /&gt;
 $DATA&lt;br /&gt;
&lt;br /&gt;
=====IRC (Intrinsic Reaction Coordinate following forward reaction) Calculation (for a neutral, singlet)=====&lt;br /&gt;
&lt;br /&gt;
 $CONTRL SCFTYP=RHF RUNTYP=IRC DFTTYP=''FILL_IN_YOUR_PREFERENCE_HERE'' MAXIT=199 MULT=1 NOSYM=1&lt;br /&gt;
  ECP=READ $END&lt;br /&gt;
 $IRC OPTTOL=0.00001 STRIDE=0.05 NPOINT=5000 SADDLE=.TRUE. FORWRD=.F.&lt;br /&gt;
 $END&lt;br /&gt;
 $SYSTEM TIMLIM=2850 MWORDS=20 MEMDDI=50 PARALL=.TRUE. $END&lt;br /&gt;
 $SCF DIRSCF=.TRUE. FDIFF=.f. $END&lt;br /&gt;
 $FORCE TEMP=298.15 PURIFY=.t. PROJCT=.t. $END&lt;br /&gt;
 $DATA&lt;br /&gt;
&lt;br /&gt;
-- [[User:M.Zimmer-De Iuliis|mzd]]  21 September 2010&lt;br /&gt;
&lt;br /&gt;
====How to Run an IRC Calculation Using GAMESS(US)====&lt;br /&gt;
&lt;br /&gt;
An IRC or Intrinsic Reaction Coordinate calculation follows the imaginary mode of the vibrational analysis of a transition state calculation.  In GAMESS(US), you can choose to follow the forward (towards the products) or backward (towards the reactants) direction.  As shown in the IRC header above, the direction of the calculation is controlled by the &amp;quot;FORWRD&amp;quot; keyword: &amp;quot;FORWRD=.T.&amp;quot; follows the forward direction, while &amp;quot;FORWRD=.F.&amp;quot; follows the backward direction.&lt;br /&gt;
&lt;br /&gt;
Let us say we want to perform an IRC calculation.  You must first perform a vibrational analysis of your molecule and check that there is exactly one imaginary (negative) frequency.  If that is the case, the vibrational analysis completed successfully and there will be a file with the extension &amp;quot;.dat&amp;quot;, let us call it &amp;quot;job_name.dat&amp;quot;, in the &amp;quot;/users/$USER/gamess-scratch/&amp;quot; directory (where $USER is your user name).  This file contains data that is required for the IRC input file.&lt;br /&gt;
&lt;br /&gt;
To prepare your IRC input file, prepare an input file using the coordinates of the optimized structure of the transition state.  This can be from ChemCraft or Avogadro or MacMolPlt - whatever you prefer to use.  Then copy and paste the IRC header above or use your own parameters.  Call it whatever you want, as long as it has an &amp;quot;.inp&amp;quot; extension; let us call it &amp;quot;irc_job.inp&amp;quot;.  &lt;br /&gt;
&lt;br /&gt;
For example, the &amp;quot;STRIDE&amp;quot; value determines the size of the steps between each point on the IRC graph.  If you increase the stride, say from 0.05 to 0.1, the steps between points become larger and you will approach the minimum faster (this will give you fewer data points should you choose to plot the IRC data).  Decreasing the stride, say from 0.05 to 0.01, makes the steps smaller, and you may not reach the minimum of the reaction coordinate in the allotted time.  Note that with the header above, STRIDE=0.05 and NPOINT=5000 allow the path to extend at most 0.05 x 5000 = 250 mass-weighted units from the saddle point.&lt;br /&gt;
&lt;br /&gt;
You should now have an input file with an IRC header, the coordinates of the transition state and basis set and ECP information called &amp;quot;irc_job.inp&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
Now you need the &amp;quot;job_name.dat&amp;quot; file in the &amp;quot;/users/$USER/gamess-scratch/&amp;quot; directory.  In this file are a number of blocks of data that are sandwiched between a line that contains only &amp;quot; $HESS&amp;quot; and a line that contains only &amp;quot; $END&amp;quot;.  What you need is the LAST of these blocks, and it has to be copied and pasted directly below the last entry of your input file.&lt;br /&gt;
&lt;br /&gt;
This can be difficult and time consuming, as the .dat files can be very large (sometimes over 150 MB) and cumbersome to navigate.  However, we have written a script, similar to the vib.sh script, that does all the copying and pasting for you.  &lt;br /&gt;
&lt;br /&gt;
Here it is:&lt;br /&gt;
 #!/bin/bash&lt;br /&gt;
 # script to add hessian data for an IRC calculation&lt;br /&gt;
 &lt;br /&gt;
 # accumulate each $HESS ... $END block as it is read; the END rule prints&lt;br /&gt;
 # only the last complete block and appends it to the new input file&lt;br /&gt;
 awk '/\$HESS/{arr=&amp;quot;&amp;quot;;f=1} f {arr=(arr)?arr ORS $0:$0} /\$END/{f=0} END {print arr}' /scratch/$USER/gamess-scratch/$DAT.dat &amp;gt;&amp;gt; $IN.inp&lt;br /&gt;
&lt;br /&gt;
To use it, simply copy it into a new text file with the name &amp;quot;irc.sh&amp;quot; and make it executable.  The two variables in the script, DAT and IN, represent the name of your &amp;quot;.dat&amp;quot; file and your new &amp;quot;.inp&amp;quot; file, respectively; note that this script uses $USER, so the gamess-scratch path should not need editing.  In our current example, DAT=job_name (the name of the .dat file that contains the $HESS data and that was created in the gamess-scratch directory) and IN=irc_job (the name of the new input file that you have prepared and want to copy the $HESS data into). &lt;br /&gt;
&lt;br /&gt;
To run it on a GPC node without submitting it to the job queue, type:&lt;br /&gt;
&lt;br /&gt;
 DAT=job_name IN=irc_job ./irc.sh&lt;br /&gt;
&lt;br /&gt;
To run it in the queue, type:&lt;br /&gt;
&lt;br /&gt;
 qsub irc.sh -v DAT=job_name,IN=irc_job &lt;br /&gt;
&lt;br /&gt;
-- [[User:M.Zimmer-De Iuliis|mzd]]  21 October 2010&lt;br /&gt;
&lt;br /&gt;
===Vienna Ab-initio Simulation Package (VASP)===&lt;br /&gt;
Please refer to the [http://www.vasp.at VASP page].&lt;br /&gt;
&lt;br /&gt;
User-supplied content below.&lt;br /&gt;
&lt;br /&gt;
====Tips from the Polanyi Lab====&lt;br /&gt;
Using VASP on SciNet&lt;br /&gt;
&lt;br /&gt;
Log on using SSH:&lt;br /&gt;
 ssh login.scinet.utoronto.ca&lt;br /&gt;
&lt;br /&gt;
then ssh to the TCS cluster:&lt;br /&gt;
 ssh tcs01&lt;br /&gt;
&lt;br /&gt;
and change to a convenient working directory, for example:&lt;br /&gt;
 cd /scratch/imcnab/test/Si111&lt;br /&gt;
&lt;br /&gt;
VASP is contained in the directory imcnab/bin.&lt;br /&gt;
&lt;br /&gt;
To submit a job, first edit (at least) the POSCAR file and other VASP input files as necessary.&lt;br /&gt;
&lt;br /&gt;
=====Input Files=====&lt;br /&gt;
The minimum set of input files is:&lt;br /&gt;
&lt;br /&gt;
'''vasp.script''' - script file telling TCS to run a VASP job - must be edited to run in current working directory.&lt;br /&gt;
&lt;br /&gt;
'''POSCAR''' - specifies supercell geometry and &amp;quot;ionic&amp;quot; positions (i.e. atomic centres) and whether relaxation is allowed. Ionic positions may be given in cartesian coordinates (x,y,z in A) or &amp;quot;absolute&amp;quot; coordinates, which are fractions of the unit cell vectors. CONTCAR is always in absolute coords, so after the first run of any job, you'll find yourself running in absolute coords. VMD can be used to change these back to cartesian coordinates (see the example fragment after this list).&lt;br /&gt;
&lt;br /&gt;
'''INCAR''' - specifies parameters to run the job. INCAR is free format - can put input commands in ANY order.&lt;br /&gt;
&lt;br /&gt;
'''POTCAR''' - specifies the potentials to use for each atomic type. Must be in the same order as the atoms are first met in POSCAR.&lt;br /&gt;
&lt;br /&gt;
'''KPOINTS''' - specifies the number and position of K-points to use in the calculation.&lt;br /&gt;
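&lt;br /&gt;
As an illustration of the POSCAR format, a fragment using selective dynamics might look like the following (hypothetical silicon-slab values, not from a real calculation). Each T/F triplet says whether that ion may relax along the corresponding lattice vector: &amp;quot;T T T&amp;quot; is fully relaxing, &amp;quot;F F F&amp;quot; is fully fixed, and &amp;quot;F F T&amp;quot; gives the &amp;quot;Z relax, XY fixed&amp;quot; behaviour used in the relaxation schemes further down.&lt;br /&gt;
&lt;br /&gt;
 Si(111) slab example&lt;br /&gt;
 1.0&lt;br /&gt;
   7.68  0.00  0.00&lt;br /&gt;
   0.00  7.68  0.00&lt;br /&gt;
   0.00  0.00 25.00&lt;br /&gt;
 8&lt;br /&gt;
 Selective dynamics&lt;br /&gt;
 Cartesian&lt;br /&gt;
   1.92  1.92  2.72  T T T&lt;br /&gt;
   0.00  0.00  0.00  F F F&lt;br /&gt;
   ...&lt;br /&gt;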
&lt;br /&gt;
Any change of name or directory needs to be edited into the job script. The job script name is &amp;quot;vasp.script&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
VASP attempts to read initial wavefunctions from WAVECAR, so if a job is run in steps, leaving the WAVECAR file in the working directory is an efficient way to start the next stage of the calculation.&lt;br /&gt;
&lt;br /&gt;
VASP also writes CONTCAR, which has the same format as POSCAR and can simply be renamed if it is to be used as the starting point for a new job.&lt;br /&gt;
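&lt;br /&gt;
A typical restart sequence therefore looks something like this (a sketch; the working directory is the example one from above):&lt;br /&gt;
&lt;br /&gt;
 cd /scratch/imcnab/test/Si111   # working directory of the previous run&lt;br /&gt;
 cp CONTCAR POSCAR               # continue from the last ionic positions&lt;br /&gt;
 # leave WAVECAR in place so the first SCF starts from the old wavefunctions&lt;br /&gt;
 llsubmit ./vasp.script&lt;br /&gt;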
&lt;br /&gt;
&lt;br /&gt;
Submit the job to LoadLeveler with the command llsubmit ./vasp.script from the correct working directory.&lt;br /&gt;
&lt;br /&gt;
You can check the status of a job with llq.&lt;br /&gt;
&lt;br /&gt;
You can cancel a job using llcancel tcs-fXXnYY.$PID, where the tcs node name and job number are shown by llq.&lt;br /&gt;
&lt;br /&gt;
===== GENERAL NOTES =====&lt;br /&gt;
&lt;br /&gt;
MUCH faster to use ISPIN=1, no-spin (corresponds to RHF, rather than ISPIN=2, which corresponds to UHF). So far, I've not found a system where the atom positions differ, or where the calculated electronic energy differs by more than 1E-4, which is the convergence criterion set.&lt;br /&gt;
&lt;br /&gt;
MUCH faster to use real space LREAL = A, NSIM=4. &lt;br /&gt;
&lt;br /&gt;
So, ''always'' optimize in real space first, then re-optimize in reciprocal space. This does NOT guarantee a one-step optimization in reciprocal space. You may still need to progressively relax a large system.&lt;br /&gt;
&lt;br /&gt;
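Putting the notes above together, a first-pass INCAR for the real-space optimization stage might contain the following (a sketch only, not a validated parameter set; the EDIFF line is my reading of the 1E-4 convergence criterion mentioned above):&lt;br /&gt;
&lt;br /&gt;
 ISPIN = 1      ! non-spin-polarized, per the note above&lt;br /&gt;
 LREAL = A      ! real-space projection for the first pass&lt;br /&gt;
 NSIM = 4&lt;br /&gt;
 EDIFF = 1E-4   ! electronic convergence criterion&lt;br /&gt;
&lt;br /&gt;
For the reciprocal-space re-optimization, switch to LREAL = .FALSE. (the default) and restart from the relaxed geometry.&lt;br /&gt;
&lt;br /&gt;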
'''Relaxing a large system.'''&lt;br /&gt;
If you attempt to relax a large system in one step, it will usually fail.&lt;br /&gt;
&lt;br /&gt;
The starting geometry is usually an unrelaxed molecule above an unrelaxed surface.&lt;br /&gt;
The bottom plane of the surface will NEVER be relaxed, because this corresponds to the fixed boundary condition of REALITY. &lt;br /&gt;
&lt;br /&gt;
First, relax the molecule alone (assuming you have already found a good starting position from single point calculations); place the molecule closer to the surface than you think it should be (say 0.9 VdW radii away).&lt;br /&gt;
&lt;br /&gt;
Then ALSO allow the top layer of the surface to relax.&lt;br /&gt;
Then ALSO allow the second top layer of the surface to relax... etc... etc.&lt;br /&gt;
&lt;br /&gt;
If this DOESN'T WORK: Then relax X,Y and Z separately in iterations.&lt;br /&gt;
Example. For the following problem, representing layers of the crystal going DOWN from the top (Z pointing to the top of the screen)&lt;br /&gt;
&lt;br /&gt;
Molecule&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 1&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 2&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 3&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 4&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 5 - fixed layer&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 6 - Valence H's, fixed layer&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
we can try the following relaxation schemes:&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Successive relaxation, Layer by Layer:&amp;lt;br /&amp;gt;&lt;br /&gt;
(1) &amp;lt;br /&amp;gt;&lt;br /&gt;
Molecule XYZ Relax&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 1  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 2  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 3  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 4  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 5 - fixed layer.&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 6 - Valence H's, fixed layer&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
(2)&amp;lt;br /&amp;gt; &lt;br /&gt;
Molecule XYZ Relax&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 1  XYZ Relax&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 2  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 3  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 4  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 5 - fixed layer.&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 6 - Valence H's, fixed layer&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
(3)&amp;lt;br /&amp;gt; &lt;br /&gt;
Molecule XYZ Relax&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 1  XYZ Relax&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 2  XYZ Relax&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 3  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 4  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 5 - fixed layer.&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 6 - Valence H's, fixed layer&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
etc. etc... if this works then you're fine. However, it can happen that even by Layer 2 you're running into real problems, and the ionic relaxation NEVER converges. In that case, I have found the following scheme (and variations thereof) useful:&lt;br /&gt;
&lt;br /&gt;
(1)&amp;lt;br /&amp;gt; &lt;br /&gt;
Molecule XYZ Relax&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 1  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 2  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 3  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 4  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 5 - fixed layer&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 6 - Valence H's, fixed layer&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
(2) &amp;lt;br /&amp;gt;&lt;br /&gt;
Molecule XYZ Relax&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 1  XYZ Relax&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 2  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 3  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 4  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 5 - fixed layer&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 6 - Valence H's, fixed layer&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
(3) &amp;lt;br /&amp;gt;&lt;br /&gt;
Molecule XYZ Relax&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 1  XYZ Relax&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 2  XYZ Relax&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 3  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 4  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 5 - fixed layer&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 6 - Valence H's, fixed layer&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
IF (3) DOESN'T converge THEN TRY&lt;br /&gt;
&lt;br /&gt;
(2')&amp;lt;br /&amp;gt;&lt;br /&gt;
Molecule Z   Relax, XY FIXED&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 1  Z   Relax, XY FIXED&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 2  XYZ Relax&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 3  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 4  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 5 - fixed layer&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 6 - Valence H's, fixed layer&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- you are allowing the top layers to move only UP or DOWN, while allowing the intermediate layer 2 to fully relax (actually, there is no way of telling VASP to move ALL atoms by the SAME deltaZ, but that appears to be the effect).&lt;br /&gt;
Followed by&lt;br /&gt;
&lt;br /&gt;
(2&amp;quot;)&amp;lt;br /&amp;gt;&lt;br /&gt;
Molecule XYZ Relax&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 1  XYZ Relax&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 2  XYZ Relax&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 3  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 4  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 5 - fixed layer&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 6 - Valence H's, fixed layer&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
If (2&amp;quot;) doesn't work, you need to go back to the output of (2') and vary the cycle - perhaps something like:&lt;br /&gt;
(2&amp;quot;')&amp;lt;br /&amp;gt;&lt;br /&gt;
Molecule XYZ Relax&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 1  XYZ Relax&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 2  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 3  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 4  XYZ fixed&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 5 - fixed layer&amp;lt;br /&amp;gt;&lt;br /&gt;
Layer 6 - Valence H's, fixed layer&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
then try (2&amp;quot;) again.&lt;br /&gt;
&lt;br /&gt;
Repeat as necessary. This scheme does appear to work quite well for big unit cells. It can be very difficult to relax as many layers as necessary in a big unit cell.&lt;br /&gt;
&lt;br /&gt;
Experience on the One Per Corner Hole problem shows that it may be necessary to have a large number of UNRELAXED (i.e. BULK silicon) layers underneath the relaxed layers in order to get physically meaningful answers. This is because silicon is so elastic.&lt;br /&gt;
&lt;br /&gt;
===== Problems and solutions: =====&lt;br /&gt;
&lt;br /&gt;
If you are getting ZBRENT errors, try changing ALGO: I usually use ALGO = Fast; change to ALGO = Normal. With ALGO = Normal, NFREE now DOES correspond to degrees of freedom (maximum suggested setting is 20). I haven't found this terribly helpful.&lt;br /&gt;
&lt;br /&gt;
Many calculations seem to fail after 20 or 30 ionic steps. I suspect a memory leak.&lt;br /&gt;
&lt;br /&gt;
Sometimes the calculation appears to lose WAVECAR... this is not a disaster, just means a slight increase in start time as the first wavefunction is calculated.&lt;br /&gt;
&lt;br /&gt;
If a calculation does not finish cleanly, you can force WAVECAR generation by doing a purely electronic calculation (these are pretty fast).&lt;br /&gt;
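&lt;br /&gt;
A minimal way to set that up in INCAR (assuming the standard VASP meaning of NSW and IBRION) is:&lt;br /&gt;
&lt;br /&gt;
 NSW = 0        ! zero ionic steps: purely electronic calculation&lt;br /&gt;
 IBRION = -1    ! no ionic updates&lt;br /&gt;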
&lt;br /&gt;
VASP is VERY slow at relaxing molecules at surfaces. This is because it doesn't know a molecule is a connected entity. It treats every atom independently. &lt;br /&gt;
&lt;br /&gt;
THEREFORE, MUCH MUCH faster to try molecular positions by hand first. &lt;br /&gt;
Do some sample calculations at a few geometries to find a good starting point.&lt;br /&gt;
&lt;br /&gt;
ALSO, once you think you know where the molecule is to be placed, put it too close to the surface, and let it relax outwards... the forces close to the surface are repulsive, and much steeper, so relaxation is FASTER in this direction.&lt;br /&gt;
&lt;br /&gt;
=='''Climate Modelling'''==&lt;br /&gt;
&lt;br /&gt;
The Community Earth System Model (CESM) is a fully-coupled, global climate model that provides state-of-the-art computer simulations of the Earth's past, present, and future climate states.&lt;br /&gt;
&lt;br /&gt;
Development of a comprehensive CESM that accurately represents the principal components of the climate system and their couplings requires both wide intellectual participation and computing capabilities beyond those available to most U.S. institutions. The CESM, therefore, must include an improved framework for coupling existing and future component models developed at multiple institutions, to permit rapid exploration of alternate formulations. This framework must be amenable to components of varying complexity and at varying resolutions, in accordance with a balance of scientific needs and resource demands. In particular, the CESM must accommodate an active program of simulations and evaluations, using an evolving model to address scientific issues and problems of national and international policy interest.&lt;br /&gt;
&lt;br /&gt;
User guides and information on each version of the model can be found at the following links:&lt;br /&gt;
&lt;br /&gt;
CCSM3: http://www.cesm.ucar.edu/models/ccsm3.0/&lt;br /&gt;
CCSM4: http://www.cesm.ucar.edu/models/ccsm4.0/&lt;br /&gt;
CESM1: http://www.cesm.ucar.edu/models/cesm1.0/&lt;br /&gt;
&lt;br /&gt;
Please see:&lt;br /&gt;
&lt;br /&gt;
===[[Installing CCSM3]]===&lt;br /&gt;
&lt;br /&gt;
===[[Running CCSM3]]===&lt;br /&gt;
&lt;br /&gt;
===[[Installing CCSM4]]===&lt;br /&gt;
&lt;br /&gt;
===[[Running CCSM4]]===&lt;br /&gt;
&lt;br /&gt;
===[[Post Processing CCSM Output]]===&lt;br /&gt;
&lt;br /&gt;
===[[CCSM4/CESM1 TCS Simulation List]]===&lt;br /&gt;
&lt;br /&gt;
==Engineering==&lt;br /&gt;
&lt;br /&gt;
===ANSYS===&lt;br /&gt;
&lt;br /&gt;
Please refer to the [[Ansys]] page.&lt;br /&gt;
&lt;br /&gt;
==Medicine/Bio==&lt;br /&gt;
&lt;br /&gt;
==High Energy Physics==&lt;br /&gt;
&lt;br /&gt;
==Structural Biology==&lt;br /&gt;
Molecular simulation of proteins, lipids, carbohydrates, and other biologically relevant molecules.&lt;br /&gt;
===Molecular Dynamics (MD) simulation===&lt;br /&gt;
====GROMACS====&lt;br /&gt;
Please refer to the [[gromacs|GROMACS]] page&lt;br /&gt;
====AMBER====&lt;br /&gt;
Please refer to the [[amber|AMBER]] page&lt;br /&gt;
====NAMD====&lt;br /&gt;
'''Note: the instructions below are rather old, and should not be needed by most users. A version of namd is available on the GPC in the module &amp;lt;tt&amp;gt;namd&amp;lt;/tt&amp;gt;. For current information see the [[Namd]] page'''&lt;br /&gt;
&lt;br /&gt;
NAMD is one of the better-scaling MD packages out there. With sufficiently large systems, it is able to scale to hundreds or thousands of cores on SciNet. Below are details for compiling and running NAMD on SciNet.&lt;br /&gt;
&lt;br /&gt;
More information regarding performance and different compile options coming soon...&lt;br /&gt;
&lt;br /&gt;
=====Compiling NAMD for GPC=====&lt;br /&gt;
Ensure the proper compiler/mpi modules are loaded.&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
module load intel&lt;br /&gt;
module load openmpi/1.3.3-intel-v11.0-ofed&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''Compile Charm++ and NAMD'''&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#Unpack source files and get required support libraries&lt;br /&gt;
tar -xzf NAMD_2.7b1_Source.tar.gz&lt;br /&gt;
cd NAMD_2.7b1_Source&lt;br /&gt;
tar -xf charm-6.1.tar&lt;br /&gt;
wget http://www.ks.uiuc.edu/Research/namd/libraries/fftw-linux-x86_64.tar.gz&lt;br /&gt;
wget http://www.ks.uiuc.edu/Research/namd/libraries/tcl-linux-x86_64.tar.gz&lt;br /&gt;
tar -xzf fftw-linux-x86_64.tar.gz; mv linux-x86_64 fftw&lt;br /&gt;
tar -xzf tcl-linux-x86_64.tar.gz; mv linux-x86_64 tcl&lt;br /&gt;
#Compile Charm++&lt;br /&gt;
cd charm-6.1&lt;br /&gt;
./build charm++ mpi-linux-x86_64 icc --basedir /scinet/gpc/mpi/openmpi/1.3.3-intel-v11.0-ofed/ --no-shared -O -DCMK_OPTIMIZE=1&lt;br /&gt;
cd ..&lt;br /&gt;
#Compile NAMD. &lt;br /&gt;
#Edit arch/Linux-x86_64-icc.arch and add &amp;quot;-lmpi&amp;quot; to the end of the CXXOPTS and COPTS line.&lt;br /&gt;
#Make a builds directory if you want different versions of NAMD compiled at the same time.&lt;br /&gt;
mkdir builds&lt;br /&gt;
./config builds/Linux-x86_64-icc --charm-arch mpi-linux-x86_64-icc&lt;br /&gt;
cd builds/Linux-x86_64-icc/&lt;br /&gt;
make -j4 namd2 # Adjust value of j as desired to specify number of simultaneous make targets. &lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
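Once built, a quick sanity check of the MPI binary might look like this (a sketch; &amp;quot;myconfig.namd&amp;quot; is a placeholder for your own NAMD configuration file):&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
mpirun -np 16 ./namd2 myconfig.namd &amp;gt; myconfig.log&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;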
--[[User:Cmadill|Cmadill]] 16:18, 27 August 2009 (UTC)&lt;br /&gt;
&lt;br /&gt;
=====Running Fortran=====&lt;br /&gt;
On the development nodes, there is an old gcc, and the associated libraries are not on the compute nodes. Ensure the line:&lt;br /&gt;
&lt;br /&gt;
module load gcc&lt;br /&gt;
&lt;br /&gt;
is in your .bashrc file.&lt;br /&gt;
&lt;br /&gt;
====LAMMPS====&lt;br /&gt;
[[Image:StrongScalingLAMMPS.png|thumb|320px|right|Strong scaling test on GPC with OpenMPI and IntelMPI on Ethernet and InfiniBand]]&lt;br /&gt;
[[Image:WeakScalingLAMMPS.png|thumb|320px|right|Weak scaling test on GPC with OpenMPI and IntelMPI on Ethernet and InfiniBand]]&lt;br /&gt;
LAMMPS is a parallel MD code that can be found [http://lammps.sandia.gov/ here].&lt;br /&gt;
&lt;br /&gt;
'''Compiling LAMMPS on GPC'''&lt;br /&gt;
&lt;br /&gt;
First, make sure the ''intel/intel-v11.1.072'' and ''openmpi/1.4.1-intel-v11.0-ofed'' modules are loaded.&lt;br /&gt;
&lt;br /&gt;
To download LAMMPS into your directory, instructions can be found [http://lammps.sandia.gov/download.html here].&lt;br /&gt;
&lt;br /&gt;
In ''src/MAKE/Makefile.openmpi'', make the following changes:&lt;br /&gt;
&lt;br /&gt;
FFT_INC = -DFFT_NONE&lt;br /&gt;
&lt;br /&gt;
&amp;lt;nowiki&amp;gt;# FFT_PATH =&amp;lt;/nowiki&amp;gt; &lt;br /&gt;
&lt;br /&gt;
&amp;lt;nowiki&amp;gt;# FFT_LIB =&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Note that an FFT library is required for using PPPM from the KSPACE package; with FFT_NONE, PPPM will not be available.&lt;br /&gt;
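&lt;br /&gt;
If you do need PPPM, the FFT lines would instead point at an FFTW installation, for example (the path is a placeholder for wherever FFTW is installed on your system):&lt;br /&gt;
&lt;br /&gt;
FFT_INC = -DFFT_FFTW&lt;br /&gt;
&lt;br /&gt;
FFT_PATH = -L/path/to/fftw/lib&lt;br /&gt;
&lt;br /&gt;
FFT_LIB = -lfftw&lt;br /&gt;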
&lt;br /&gt;
In the ''src'' directory, run ''make openmpi'', which should produce a ''lmp_openmpi'' executable.&lt;br /&gt;
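&lt;br /&gt;
To run the resulting executable on, say, 8 processors (a sketch; &amp;quot;in.test&amp;quot; stands in for your own LAMMPS input script):&lt;br /&gt;
&lt;br /&gt;
 mpirun -np 8 ./lmp_openmpi &amp;lt; in.test&lt;br /&gt;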
&lt;br /&gt;
--[[User:sellan|sellan]] 14:08 Jul 28 2011&lt;br /&gt;
&lt;br /&gt;
'''Scaling Tests on GPC'''&lt;br /&gt;
&lt;br /&gt;
Results from strong scaling tests for LAMMPS using EAM potentials on GPC are shown in the graph on the right.  Test simulation ran 500 timesteps for 4,000,000 atoms.&lt;br /&gt;
&lt;br /&gt;
Results from weak scaling tests for LAMMPS using EAM potentials on GPC are shown in the graph on the right.  Test simulation ran 500 timesteps for 32,000 atoms per processor.&lt;br /&gt;
&lt;br /&gt;
OpenMPI version used: openmpi/1.4.1-intel-v11.0-ofed&lt;br /&gt;
&lt;br /&gt;
IntelMPI version used: intelmpi/impi-4.0.0.013&lt;br /&gt;
&lt;br /&gt;
LAMMPS version used: 15 Jan 2010&lt;br /&gt;
&lt;br /&gt;
'''Summary of Scaling Tests'''&lt;br /&gt;
&lt;br /&gt;
Results show good scaling for both OpenMPI and IntelMPI on Ethernet up to 16 processors, after which performance begins to suffer.  On InfiniBand, excellent scaling is maintained to 512 processors.&lt;br /&gt;
&lt;br /&gt;
IntelMPI shows slightly better performance than OpenMPI when running on InfiniBand.&lt;br /&gt;
&lt;br /&gt;
--[[User:jchu|jchu]] 14:08 Feb 2, 2010&lt;br /&gt;
&lt;br /&gt;
===Monte Carlo (MC) simulation===&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=GPU_Benchmarks&amp;diff=9396</id>
		<title>GPU Benchmarks</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=GPU_Benchmarks&amp;diff=9396"/>
		<updated>2018-08-31T23:23:20Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Instructions for running the Gromacs GPU Benchmark Single Node test case on Linux: &lt;br /&gt;
&lt;br /&gt;
1. Download [https://wiki.scinet.utoronto.ca/wiki/images/a/aa/GROMACS_GPU_BENCH.tar.gz Gromacs GPU Benchmark]&lt;br /&gt;
&lt;br /&gt;
2. Untar and unzip the archive&lt;br /&gt;
&lt;br /&gt;
3. Run the &amp;quot;build.sh&amp;quot; script with either &amp;quot;OPENCL&amp;quot; or &amp;quot;CUDA&amp;quot; as the argument &lt;br /&gt;
&lt;br /&gt;
4. Run benchmark script run.sh&lt;br /&gt;
&lt;br /&gt;
5. Report the performance in ns/day on a single node.&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Running_CCSM4&amp;diff=9395</id>
		<title>Running CCSM4</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Running_CCSM4&amp;diff=9395"/>
		<updated>2018-08-31T23:22:53Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
'''It is important to point out that all updates to the model system will only occur with CESM1.0 updates, not with CCSM4.0. It is also important to note that CCSM4 is a subset of CESM1. Although CESM1 supersedes CCSM4, users can run all CCSM4 experiments from the CESM1 code base.&lt;br /&gt;
'''&lt;br /&gt;
&lt;br /&gt;
'''Note:''' To run interactively on the GPC interactive queue, or on TCS on a small subset of interactive processors, see the instructions below. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The scientifically validated CESM1 runs are found in the list below (including a complete list of the model resolutions):&lt;br /&gt;
&lt;br /&gt;
/project/ccsm/cesm1_current/scripts/create_newcase --list&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
--------------------------------------------------------------------------------&lt;br /&gt;
                             CESM1.0 README&lt;br /&gt;
  &lt;br /&gt;
 For both a quick start as well as a detailed summary of creating and running &lt;br /&gt;
 a CESM model case, see the CESM1.0 User's Guide at&lt;br /&gt;
 http://www.cesm.ucar.edu/models/cesm1.0&lt;br /&gt;
  &lt;br /&gt;
 IMPORTANT INFORMATION ABOUT SCIENTIFIC VALIDATION&lt;br /&gt;
  &lt;br /&gt;
   CESM1.0 has the flexibility to configure cases with many different &lt;br /&gt;
   combinations of component models, grids, and model settings, but this &lt;br /&gt;
   version of CESM has only been validated scientifically for the following &lt;br /&gt;
   fully active configurations:&lt;br /&gt;
 &lt;br /&gt;
      1.9x2.5_gx1v6   B_1850_CN&lt;br /&gt;
      1.9x2.5_gx1v6   B_1850_RAMPCO2_CN&lt;br /&gt;
      1.9x2.5_gx1v6   B_1850-2000_CN&lt;br /&gt;
 &lt;br /&gt;
      1.9x2.5_gx1v6   B_1850_CAM5&lt;br /&gt;
 &lt;br /&gt;
      0.9x1.25_gx1v6  B_1850_CN&lt;br /&gt;
      0.9x1.25_gx1v6  B_1850_RAMPCO2_CN&lt;br /&gt;
      0.9x1.25_gx1v6  B_1850-2000_CN&lt;br /&gt;
 &lt;br /&gt;
      0.9x1.25_gx1v6  B_1850_BGC-BPRP&lt;br /&gt;
      0.9x1.25_gx1v6  B_1850_BGC-BDRD&lt;br /&gt;
      0.9x1.25_gx1v6  B_1850-2000_BGC-BPRP&lt;br /&gt;
      0.9x1.25_gx1v6  B_1850-2000_BGC-BDRD&lt;br /&gt;
 &lt;br /&gt;
      0.9x1.25_gx1v6  B_1850_CN_CHEM &lt;br /&gt;
      0.9x1.25_gx1v6  B_1850-2000_CN_CHEM&lt;br /&gt;
 &lt;br /&gt;
      1.9x2.5_gx1v6   B_1850_WACCM_CN&lt;br /&gt;
      1.9x2.5_gx1v6   B_1850-2000_WACCM_CN&lt;br /&gt;
   &lt;br /&gt;
      T31_gx3v7       B_1850_CN&lt;br /&gt;
 &lt;br /&gt;
   If the user is interested in running a &amp;quot;stand-alone&amp;quot; component configuration, &lt;br /&gt;
   the following model configurations have been validated scientifically and &lt;br /&gt;
   have associated diagnostic output as part of the release:&lt;br /&gt;
 &lt;br /&gt;
      1.9x2.5_1.9x2.5    F_2000_WACCM&lt;br /&gt;
      1.9x2.5_1.9x2.5    F_AMIP_CAM5&lt;br /&gt;
      1.9x2.5_1.9x2.5    F_AMIP_CN&lt;br /&gt;
      0.9x1.25_0.9x1.25  F_AMIP_CN&lt;br /&gt;
 &lt;br /&gt;
      0.9x1.25_gx1v6  I_2000&lt;br /&gt;
      0.9x1.25_gx1v6  I_2000_CN&lt;br /&gt;
 &lt;br /&gt;
      T62_gx1v6       C_NORMAL_YEAR&lt;br /&gt;
 &lt;br /&gt;
   For more information regarding alternative component configurations, &lt;br /&gt;
   please refer to the individual component web pages at&lt;br /&gt;
   http://www.cesm.ucar.edu/models/cesm1.0&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
  CESM1 RESOLUTIONS:  name (shortname) &lt;br /&gt;
    pt1_pt1 (pt1)  &lt;br /&gt;
    0.23x0.31_0.23x0.31 (f02_f02)  &lt;br /&gt;
    0.23x0.31_gx1v6 (f02_g16)  &lt;br /&gt;
    0.23x0.31_tx0.1v2 (f02_t12)  &lt;br /&gt;
    0.47x0.63_0.47x0.63 (f05_f05)  &lt;br /&gt;
    0.47x0.63_gx1v6 (f05_g16)  &lt;br /&gt;
    0.47x0.63_tx0.1v2 (f05_t12)  &lt;br /&gt;
    0.9x1.25_0.9x1.25 (f09_f09)  &lt;br /&gt;
    0.9x1.25_gx1v6 (f09_g16)  &lt;br /&gt;
    1.9x2.5_1.9x2.5 (f19_f19)  &lt;br /&gt;
    1.9x2.5_gx1v6 (f19_g16)  &lt;br /&gt;
    4x5_4x5 (f45_f45)  &lt;br /&gt;
    4x5_gx3v7 (f45_g37)  &lt;br /&gt;
    T62_gx3v7 (T62_g37)  &lt;br /&gt;
    T62_tx0.1v2 (T62_t12)  &lt;br /&gt;
    T62_gx1v6 (T62_g16)  &lt;br /&gt;
    T31_T31 (T31_T31)  &lt;br /&gt;
    T31_gx3v7 (T31_g37)  &lt;br /&gt;
    T42_T42 (T42_T42)  &lt;br /&gt;
    10x15_10x15 (f10_f10)  &lt;br /&gt;
    ne30np4_1.9x2.5_gx1v6 (ne30_f19_g16)  &lt;br /&gt;
    ne240np4_0.23x0.31_gx1v6 (ne240_f02_g16)  &lt;br /&gt;
    T85_T85 (T85_T85)  &lt;br /&gt;
  &lt;br /&gt;
  COMPSETS:  name (shortname): description &lt;br /&gt;
    A_PRESENT_DAY (A) &lt;br /&gt;
         Description: All data model  &lt;br /&gt;
    A_GLC (AG) &lt;br /&gt;
         Description: All data model plus glc (glacier model)  &lt;br /&gt;
    B_2000 (B) &lt;br /&gt;
         Description: All active components, present day  &lt;br /&gt;
    B_2000_CN (BCN) &lt;br /&gt;
         Description: all active components, present day, with CN (Carbon Nitrogen) in clm  &lt;br /&gt;
    B_1850_CAM5 (B1850C5) &lt;br /&gt;
         Description: All active components, pre-industrial, cam5 physics  &lt;br /&gt;
    B_1850 (B1850) &lt;br /&gt;
         Description: All active components, pre-industrial  &lt;br /&gt;
    B_1850_CN (B1850CN) &lt;br /&gt;
         Description: all active components, pre-industrial, with CN (Carbon Nitrogen) in CLM  &lt;br /&gt;
    B_2000_CN_CHEM (B2000CNCHM) &lt;br /&gt;
         Description: All active components, pre-industrial, with CN (Carbon Nitrogen) in CLM and super_fast_llnl chem in atm  &lt;br /&gt;
    B_1850_CN_CHEM (B1850CNCHM) &lt;br /&gt;
         Description: All active components, pre-industrial, with CN (Carbon Nitrogen) in CLM and super_fast_llnl chem in atm  &lt;br /&gt;
    B_1850_RAMPCO2_CN (B1850RMCN) &lt;br /&gt;
         Description: All active components, pre-industirial with co2 ramp, with CN (Carbon Nitrogen) in CLM  &lt;br /&gt;
    B_1850-2000 (B20TR) &lt;br /&gt;
         Description: All active components, 1850 to 2000 transient  &lt;br /&gt;
    B_1850-2000_CN (B20TRCN) &lt;br /&gt;
         Description: All active components, 1850 to 2000 transient, with CN (Carbon Nitrogen) in CLM  &lt;br /&gt;
    B_1850-2000_CN_CHEM (B20TRCNCHM) &lt;br /&gt;
         Description: All active components, 1850 to 2000 transient, with CN (Carbon Nitrogen) in CLM and super_fast_llnl chem in atm  &lt;br /&gt;
    B_1850-2000_CAM5 (B20TRC5) &lt;br /&gt;
         Description: All active components, 1850 to 2000 transient, cam5 physics  &lt;br /&gt;
    B_2000_GLC (BG) &lt;br /&gt;
         Description: all active components, with active glc  &lt;br /&gt;
    B_2000_TROP_MOZART (BMOZ) &lt;br /&gt;
         Description: All active components, with trop_mozart  &lt;br /&gt;
    B_1850_WACCM (B1850W) &lt;br /&gt;
         Description: all active components, pre-industrial, with waccm  &lt;br /&gt;
    B_1850_WACCM_CN (B1850WCN) &lt;br /&gt;
         Description: all active components, pre-industrial, with waccm and CN  &lt;br /&gt;
    B_1850-2000_WACCM_CN (B20TRWCN) &lt;br /&gt;
         Description: All active components, 1850 to 2000 transient, WACCM with CN (Carbon Nitrogen) in CLM  &lt;br /&gt;
    B_1955-2005_WACCM_CN (B55TRWCN) &lt;br /&gt;
         Description: All active components, 1955 to 2000 transient, WACCM with  daily solar data and SPEs, CLM with CN  &lt;br /&gt;
    B_1850_BGC-BPRP (B1850BPRP) &lt;br /&gt;
         Description: All active components, pre-industrial, CN in CLM, ECO in POP, BGC CO2=prog, rad CO2=prog  &lt;br /&gt;
    B_1850_BGC-BDRD (B1850BDRD) &lt;br /&gt;
         Description: All active components, pre-industrial, CN in CLM, ECO in POP, BGC CO2=diag, rad CO2=diag  &lt;br /&gt;
    B_1850-2000_BGC-BPRP (B20TRBPRP) &lt;br /&gt;
         Description: All active components, 1850 to 2000 transient, CN in CLM, ECO in POP, BGC CO2=prog, rad CO2=prog  &lt;br /&gt;
    B_1850-2000_BGC-BDRD (B20TRBDRD) &lt;br /&gt;
         Description: All active components, 1850 to 2000 transient, CN in CLM, ECO in POP, BGC CO2=diag, rad CO2=diag  &lt;br /&gt;
    C_NORMAL_YEAR_ECOSYS (CECO) &lt;br /&gt;
         Description: Active ocean model with ecosys and with COREv2 normal year forcing  &lt;br /&gt;
    C_NORMAL_YEAR (C) &lt;br /&gt;
         Description: Active ocean model with COREv2 normal year forcing  &lt;br /&gt;
    D_NORMAL_YEAR (D) &lt;br /&gt;
         Description: Active ice model with COREv2 normal year forcing  &lt;br /&gt;
    E_2000 (E) &lt;br /&gt;
         Description: Fully active cam and ice with som ocean, present day  &lt;br /&gt;
    E_2000_GLC (EG) &lt;br /&gt;
         Description: Fully active cam and ice with som ocean and glc, present day  &lt;br /&gt;
    E_1850_CN (E1850CN) &lt;br /&gt;
         Description: Pre-industrial fully active ice and som ocean, with CN  &lt;br /&gt;
    E_1850_CAM5 (E1850C5) &lt;br /&gt;
         Description: Pre-industrial fully active ice and som ocean, cam5 physics   &lt;br /&gt;
    F_AMIP_CN (FAMIPCN) &lt;br /&gt;
         Description: AMIP run for CMIP5 protocol - valid only for 1 degree cam/clm/pres-cice  &lt;br /&gt;
    F_AMIP_CAM5 (FAMIPC5) &lt;br /&gt;
         Description: AMIP run for CMIP5 protocol with cam5  &lt;br /&gt;
    F_1850 (F1850) &lt;br /&gt;
         Description: Pre-industrial cam/clm with prescribed ice/ocn  &lt;br /&gt;
    F_1850_CAM5 (F1850C5) &lt;br /&gt;
         Description: Pre-industrial cam/clm with prescribed ice/ocn, cam5 physics  &lt;br /&gt;
    F_2000 (F) &lt;br /&gt;
         Description: Stand-alone cam default, prescribed ocn/ice  &lt;br /&gt;
    F_2000_CAM5 (FC5) &lt;br /&gt;
         Description: Stand-alone cam default, prescribed ocn/ice, cam5 physics  &lt;br /&gt;
    F_2000_CN (FCN) &lt;br /&gt;
         Description: Stand-alone cam default, prescribed ocn/ice with CN  &lt;br /&gt;
    F_1850-2000_CN (F20TRCN) &lt;br /&gt;
         Description: 20th Century transient stand-alone cam default, prescribed ocn/ice, with CN  &lt;br /&gt;
    F_2000_GLC (FG) &lt;br /&gt;
         Description: Stand-alone cam default, prescribed ocn/ice, glc (glacier model)  &lt;br /&gt;
    F_1850_CN_CHEM (F1850CNCHM) &lt;br /&gt;
         Description: stand-alone cam/clm, pre-industrial, with CN in CLM, super_fast_llnl chem in cam  &lt;br /&gt;
    F_1850_WACCM (F1850W) &lt;br /&gt;
         Description: Pre-industrial cam/clm with prescribed ice/ocn  &lt;br /&gt;
    F_2000_WACCM (FW) &lt;br /&gt;
         Description: present-day cam/clm with prescribed ice/ocn  &lt;br /&gt;
    G_1850_ECOSYS (G1850ECO) &lt;br /&gt;
         Description: 1850 control for pop-ecosystem/cice/datm7/dlnd-rx1  &lt;br /&gt;
    G_NORMAL_YEAR (G) &lt;br /&gt;
         Description: Coupled ocean ice with COREv2 normal year forcing  &lt;br /&gt;
    H_PRESENT_DAY (H) &lt;br /&gt;
         Description: Coupled ocean ice slnd  &lt;br /&gt;
    I_2000 (I) &lt;br /&gt;
         Description: Active land model with QIAN atm input data for 2003 and Satellite phenology (SP), CO2 level and Aerosol deposition for 2000  &lt;br /&gt;
    I_1850 (I1850) &lt;br /&gt;
         Description: Active land model with QIAN atm input data for 1948 to 1972 and Satellite phenology (SP), CO2 level and Aerosol deposition for 1850  &lt;br /&gt;
    I_2000_GLC (IG) &lt;br /&gt;
         Description: Active glacier model and active land model with QIAN atm input data for 2003 and Satellite phenology (SP), CO2 level and Aerosol deposition for 2000  &lt;br /&gt;
    I_1948-2004 (I4804) &lt;br /&gt;
         Description: Active land model with QIAN atm input data for 1948 to 2004 and Satellite phenology (SP), CO2 level and Aerosol deposition for 2000  &lt;br /&gt;
    I_1850-2000 (I8520) &lt;br /&gt;
         Description: Active land model with QIAN atm input data for 1948 to 2004 and transient Satellite phenology (SP), and Aerosol deposition from 1850 to 2000 and 2000 CO2 level  &lt;br /&gt;
    I_2000_CN (ICN) &lt;br /&gt;
         Description: Active land model with QIAN atm input data for 2003 and CN (Carbon Nitrogen) biogeochemistry, CO2 level and Aerosol deposition for 2000  &lt;br /&gt;
    I_1850_CN (I1850CN) &lt;br /&gt;
         Description: Active land model with QIAN atm input data for 1948 to 1972 and CN (Carbon Nitrogen) biogeochemistry, CO2 level and Aerosol deposition for 1850  &lt;br /&gt;
    I_1948-2004_CN (I4804CN) &lt;br /&gt;
         Description: Active land model with QIAN atm input data for 1948 to 2004 and CN (Carbon Nitrogen) biogeochemistry, CO2 level and Aerosol deposition for 2000  &lt;br /&gt;
    I_1850-2000_CN (I8520CN) &lt;br /&gt;
         Description: Active land model with QIAN atm input data for 1948 to 1972 and transient CN, Aerosol dep from 1850 to 2000 and 2000 CO2 level  &lt;br /&gt;
    S_PRESENT_DAY (S) &lt;br /&gt;
         Description: All stub models plus xatm  &lt;br /&gt;
    X_PRESENT_DAY (X) &lt;br /&gt;
         Description: All dead model  &lt;br /&gt;
    XG_PRESENT_DAY (XG) &lt;br /&gt;
         Description: All dead model and cism  &lt;br /&gt;
  &lt;br /&gt;
  MACHINES:  name (description)&lt;br /&gt;
    tcs (U of T IBM p6, os is AIX, 32 pes/node, batch system is Moab/LoadLeveler) &lt;br /&gt;
    gpc (U of T iDataPlex intel cluster, os is linux, 8 pes/node, batch system is Moab/Torque) &lt;br /&gt;
    bluefire (NCAR IBM p6, os is AIX, 32 pes/node, batch system is LSF) &lt;br /&gt;
    brutus_po (Brutus Linux Cluster ETH (pgi/9.0-1 with open_mpi/1.4.1), 16 pes/node, batch system LSF, added by UB) &lt;br /&gt;
    brutus_pm (Brutus Linux Cluster ETH (pgi/9.0-1 with mvapich2/1.4rc2), 16 pes/node, batch system LSF, added by UB) &lt;br /&gt;
    brutus_io (Brutus Linux Cluster ETH (intel/10.1.018 with open_mpi/1.4.1), 16 pes/node, batch system LSF, added by UB) &lt;br /&gt;
    brutus_im (Brutus Linux Cluster ETH (intel/10.1.018 with mvapich2/1.4rc2), 16 pes/node, batch system LSF, added by UB) &lt;br /&gt;
    edinburgh_lahey (NCAR CGD Linux Cluster (lahey), 8 pes/node, batch system is PBS) &lt;br /&gt;
    edinburgh_pgi (NCAR CGD Linux Cluster (pgi), 8 pes/node, batch system is PBS) &lt;br /&gt;
    edinburgh_intel (NCAR CGD Linux Cluster (intel), 8 pes/node, batch system is PBS) &lt;br /&gt;
    franklin (NERSC XT4, os is CNL, 4 pes/node, batch system is PBS) &lt;br /&gt;
    hadley (UCB Linux Cluster, os is Linux (ia64), batch system is PBS) &lt;br /&gt;
    hopper (NERSC XT5, os is CNL, 8 pes/node, batch system is PBS) &lt;br /&gt;
    intrepid (ANL IBM BG/P, os is BGP, 4 pes/node, batch system is cobalt) &lt;br /&gt;
    jaguar (ORNL XT4, os is CNL, 4 pes/node, batch system is PBS) &lt;br /&gt;
    jaguarpf (ORNL XT5, os is CNL, 12 pes/node, batch system is PBS) &lt;br /&gt;
    kraken (NICS/UT/teragrid XT5, os is CNL, 12 pes/node) &lt;br /&gt;
    lynx_pgi (NCAR XT5, os is CNL, 12 pes/node, batch system is PBS) &lt;br /&gt;
    midnight (ARSC Sun Cluster, os is Linux (pgi), batch system is PBS) &lt;br /&gt;
    pleiades (NASA/AMES Linux Cluster, Linux (ia64), Altix ICE, 3.0 GHz Harpertown processors, 8 pes/node and 8 GB of memory, batch system is PBS) &lt;br /&gt;
    pleiades_wes (NASA/AMES Linux Cluster, Linux (ia64), Altix ICE, 2.93 GHz Westmere processors, 12 pes/node and 24 GB of memory, batch system is PBS) &lt;br /&gt;
    prototype_atlas (LLNL Linux Cluster, Linux (pgi), 8 pes/node, batch system is Moab) &lt;br /&gt;
    prototype_hera (LLNL Linux Cluster, Linux (pgi), 16 pes/node, batch system is Moab) &lt;br /&gt;
    prototype_columbia (NASA Ames Linux Cluster, Linux (ia64), 2 pes/node, batch system is PBS) &lt;br /&gt;
    prototype_frost (NCAR IBM BG/L, os is BGL, 8 pes/node, batch system is cobalt) &lt;br /&gt;
    prototype_nyblue (SUNY IBM BG/L, os is BGL, 8 pes/node, batch system is cobalt) &lt;br /&gt;
    prototype_ranger (TACC Linux Cluster, Linux (pgi), 1 pes/node, batch system is SGE) &lt;br /&gt;
    prototype_ubgl (LLNL IBM BG/L, os is BGL, 2 pes/node, batch system is Moab) &lt;br /&gt;
    generic_ibm (generic ibm power system, os is AIX, batch system is LoadLeveler, user-defined) &lt;br /&gt;
    generic_xt (generic CRAY XT, os is CNL, batch system is PBS, user-defined) &lt;br /&gt;
    generic_linux_pgi (generic linux (pgi), os is Linux, batch system is PBS, user-defined) &lt;br /&gt;
    generic_linux_lahey (generic linux (lahey), os is Linux, batch system is PBS, user-defined) &lt;br /&gt;
    generic_linux_intel (generic linux (intel), os is Linux, batch system is PBS, user-defined) &lt;br /&gt;
    generic_linux_pathscale (generic linux (pathscale), os is Linux, batch system is PBS, user-defined) &lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
''Initializing the Model Setup:''&lt;br /&gt;
&lt;br /&gt;
The initial setup of the model on TCS is simplified with the short script below:&lt;br /&gt;
&lt;br /&gt;
 #!/bin/bash&lt;br /&gt;
 &lt;br /&gt;
 export CCSMROOT=/project/ccsm/ccsm4_0_current&lt;br /&gt;
 export SCRATCH=/scratch/$USER&lt;br /&gt;
 export MACH=tcs&lt;br /&gt;
 export COMPSET=B_1850_CN&lt;br /&gt;
 export RES=f19_g16&lt;br /&gt;
 export CASEROOT=~/runs/ccsm4_comp-${COMPSET}_res-${RES}&lt;br /&gt;
 &lt;br /&gt;
 cd $CCSMROOT/scripts&lt;br /&gt;
 ./create_newcase -verbose -case $CASEROOT -mach $MACH -compset $COMPSET -res $RES -scratchroot $SCRATCH &lt;br /&gt;
&lt;br /&gt;
'''NOTE:''' $CCSMROOT should point to the model code version in ''/project/ccsm'' with &amp;quot;_current&amp;quot; after it. The same applies for CESM1.&lt;br /&gt;
&lt;br /&gt;
'''NOTE:''' $CASEROOT, which specifies your model run naming convention, will not archive properly (to the short-term archiving directory, i.e. /scratch) if the CASEROOT name is too long. It is best to keep it short. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
This script creates an 1850 control with all components of the model fully active and carbon-nitrogen cycling in the land component. The resolution is 1.9x2.5 in the atmosphere and x1 in the ocean. The case is created in the ~/runs directory:&lt;br /&gt;
&lt;br /&gt;
For valid component sets see: http://www.cesm.ucar.edu/models/ccsm4.0/ccsm_doc/a2967.html &lt;br /&gt;
For information on resolution sets see: http://www.cesm.ucar.edu/models/ccsm4.0/ccsm_doc/x42.html#ccsm_grids &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
''Load Balancing:''&lt;br /&gt;
&lt;br /&gt;
For the NCAR bluefire load balancing table for a select set of simulations see:&lt;br /&gt;
CESM1: http://www.cesm.ucar.edu/models/cesm1.0/timing/&lt;br /&gt;
CCSM4: http://www.cesm.ucar.edu/models/ccsm4.0/timing/&lt;br /&gt;
&lt;br /&gt;
 cd ~/runs/ccsm4_comp-B_1850_CN_res-f19_g16&lt;br /&gt;
&lt;br /&gt;
edit env_mach_pes.xml&lt;br /&gt;
&lt;br /&gt;
 &amp;lt;entry id=&amp;quot;NTASKS_ATM&amp;quot;   value=&amp;quot;448&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;NTHRDS_ATM&amp;quot;   value=&amp;quot;1&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;ROOTPE_ATM&amp;quot;   value=&amp;quot;0&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;NTASKS_LND&amp;quot;   value=&amp;quot;320&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;NTHRDS_LND&amp;quot;   value=&amp;quot;1&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;ROOTPE_LND&amp;quot;   value=&amp;quot;160&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;NTASKS_ICE&amp;quot;   value=&amp;quot;64&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;NTHRDS_ICE&amp;quot;   value=&amp;quot;1&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;ROOTPE_ICE&amp;quot;   value=&amp;quot;0&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;NTASKS_OCN&amp;quot;   value=&amp;quot;256&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;NTHRDS_OCN&amp;quot;   value=&amp;quot;1&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;ROOTPE_OCN&amp;quot;   value=&amp;quot;224&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;NTASKS_CPL&amp;quot;   value=&amp;quot;224&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;NTHRDS_CPL&amp;quot;   value=&amp;quot;1&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;ROOTPE_CPL&amp;quot;   value=&amp;quot;0&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;NTASKS_GLC&amp;quot;   value=&amp;quot;1&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;NTHRDS_GLC&amp;quot;   value=&amp;quot;1&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;ROOTPE_GLC&amp;quot;   value=&amp;quot;0&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;PSTRID_ATM&amp;quot;   value=&amp;quot;1&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;PSTRID_LND&amp;quot;   value=&amp;quot;1&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;PSTRID_ICE&amp;quot;   value=&amp;quot;1&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;PSTRID_OCN&amp;quot;   value=&amp;quot;1&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;PSTRID_CPL&amp;quot;   value=&amp;quot;1&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;PSTRID_GLC&amp;quot;   value=&amp;quot;1&amp;quot;  /&amp;gt;   &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Once this file is modified you can configure the case&lt;br /&gt;
&lt;br /&gt;
 ./configure -case&lt;br /&gt;
 &lt;br /&gt;
You will notice that configure will change the file that you just edited, and you can see the total number of processors used by the simulation (704, or 11 nodes, in this case):&lt;br /&gt;
&lt;br /&gt;
 &amp;lt;entry id=&amp;quot;TOTALPES&amp;quot;   value=&amp;quot;704&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;PES_LEVEL&amp;quot;   value=&amp;quot;1r&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;MAX_TASKS_PER_NODE&amp;quot;   value=&amp;quot;64&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;PES_PER_NODE&amp;quot;   value=&amp;quot;64&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;CCSM_PCOST&amp;quot;   value=&amp;quot;-3&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;CCSM_TCOST&amp;quot;   value=&amp;quot;0&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;CCSM_ESTCOST&amp;quot;   value=&amp;quot;-3&amp;quot;  /&amp;gt;   &lt;br /&gt;
&lt;br /&gt;
'''Note:''' Rather than modifying the load balancing manually, NCAR provides the xmlchange script, which resides in your $CASE directory and lets you modify the individual component CPU allocations without editing the env_mach_pes.xml file directly:&lt;br /&gt;
&lt;br /&gt;
To try a different configuration, we might want 8 CPUs running the OCN component continuously and the remaining 24 CPUs running ATM, with LND, ICE and CPL each on 8 of those 24. To set this up you enter:&lt;br /&gt;
&lt;br /&gt;
 configure -cleanmach&lt;br /&gt;
 xmlchange -file env_mach_pes.xml -id NTASKS_ATM -val 24&lt;br /&gt;
 xmlchange -file env_mach_pes.xml -id NTASKS_LND -val 8 &lt;br /&gt;
 xmlchange -file env_mach_pes.xml -id NTASKS_ICE -val 8&lt;br /&gt;
 xmlchange -file env_mach_pes.xml -id ROOTPE_ICE -val 8&lt;br /&gt;
 xmlchange -file env_mach_pes.xml -id NTASKS_CPL -val 8&lt;br /&gt;
 xmlchange -file env_mach_pes.xml -id ROOTPE_CPL -val 16&lt;br /&gt;
 xmlchange -file env_mach_pes.xml -id NTASKS_OCN -val 8&lt;br /&gt;
 xmlchange -file env_mach_pes.xml -id ROOTPE_OCN -val 24&lt;br /&gt;
 configure -case&lt;br /&gt;
&lt;br /&gt;
Then build and resubmit.&lt;br /&gt;
&lt;br /&gt;
The task geometry used by loadleveler on TCS is located in the file: ccsm4_comp-B_1850_CN_res-f19_g16.tcs.run&lt;br /&gt;
&lt;br /&gt;
Ensure that the proper modules are loaded:&lt;br /&gt;
&lt;br /&gt;
Currently Loaded Modulefiles:&lt;br /&gt;
  1) ncl/5.1.1               3) netcdf/4.0.1_nc3        5) xlf/13.1&lt;br /&gt;
  2) nco/3.9.6               4) parallel-netcdf/1.1.1   6) vacpp/11.1&lt;br /&gt;
&lt;br /&gt;
Now compile the model with:&lt;br /&gt;
&lt;br /&gt;
 ./ccsm4_comp-B_1850_CN_res-f19_g16.tcs.build&lt;br /&gt;
&lt;br /&gt;
One of the pre-processing steps in this build sequence is to fetch input data sets (initial and boundary conditions) from the NCAR SVN server. You may want to do this yourself, before you build, on the ''datamover1'' node if there is a large amount of initial condition data to transfer from the NCAR repository; ''datamover1'' has a high-bandwidth connection to the outside.&lt;br /&gt;
Note: We have most of the input data on /project/ccsm already so this step will not be required for the more common configurations.&lt;br /&gt;
&lt;br /&gt;
 &amp;gt; ssh datamover1&lt;br /&gt;
 Last login: Wed Jul  7 16:38:14 2010 from tcs-f11n06-gpfs&lt;br /&gt;
 user@gpc-logindm01:~&amp;gt;cd ~/runs/ccsm4_comp-B_1850_CN_res-f19_g16&lt;br /&gt;
 user@gpc-logindm01:~/runs/ccsm4_comp-B_1850_CN_res-f19_g16&amp;gt;&lt;br /&gt;
 user@gpc-logindm01:~/runs/ccsm4_comp-B_1850_CN_res-f19_g16&amp;gt;./check_input_data -inputdata /project/ccsm/inputdata -export&lt;br /&gt;
 Input Data List Files Found:&lt;br /&gt;
 ./Buildconf/cam.input_data_list&lt;br /&gt;
 ./Buildconf/clm.input_data_list&lt;br /&gt;
 ./Buildconf/cice.input_data_list&lt;br /&gt;
 ./Buildconf/pop2.input_data_list&lt;br /&gt;
 ./Buildconf/cpl.input_data_list&lt;br /&gt;
 export https://svn-ccsm-inputdata.cgd.ucar.edu/trunk/inputdata/atm/cam/chem/trop_mozart_aero/aero/aero_1.9x2.5_L26_1850clim_c091112.nc /project/ccsm/inputdata/atm/cam/chem/trop_mozart_aero/aero/aero_1.9x2.5_L26_1850clim_c091112.nc ..... success&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
''Setting the Simulation Length:''&lt;br /&gt;
&lt;br /&gt;
The amount of time that you would like to run the model can be set by editing ''env_run.xml'' at anytime in the setup sequence&lt;br /&gt;
&lt;br /&gt;
 &amp;lt;!--&amp;quot;if RESUBMIT is greater than 0, then case will automatically resubmit (integer) &amp;quot; --&amp;gt;&lt;br /&gt;
 &amp;lt;entry id=&amp;quot;RESUBMIT&amp;quot;   value=&amp;quot;10&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &lt;br /&gt;
 &amp;lt;!--&amp;quot;sets the run length in conjuction with STOP_N and STOP_DATE, valid values: none,never,nsteps,nstep,nseconds,nsecond,nminutes,nminute,nhours,nhour,ndays,nday,nmonths,nmonth,nyears,nyear,date,ifdays0,end (char) &amp;quot; --&amp;gt;&lt;br /&gt;
 &amp;lt;entry id=&amp;quot;STOP_OPTION&amp;quot;   value=&amp;quot;nmonths&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &lt;br /&gt;
 &amp;lt;!--&amp;quot;sets the run length in conjunction with STOP_OPTION and STOP_DATE (integer) &amp;quot; --&amp;gt;&lt;br /&gt;
 &amp;lt;entry id=&amp;quot;STOP_N&amp;quot;   value=&amp;quot;12&amp;quot;  /&amp;gt;   &lt;br /&gt;
&lt;br /&gt;
These settings tell the model to run 12 months per submission (checkpointing after each model year) and to resubmit itself 10 times.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
''Running CCSM4 on the Distributed System (TCS):''&lt;br /&gt;
&lt;br /&gt;
The model is now ready to be submitted to the TCS batch queue:&lt;br /&gt;
&lt;br /&gt;
 llsubmit ccsm4_comp-B_1850_CN_res-f19_g16.tcs.run&lt;br /&gt;
&lt;br /&gt;
Once the model has run through a checkpoint, timing information on the simulation can be found in:&lt;br /&gt;
 ~/runs/ccsm4_comp-B_1850_CN_res-f19_g16/timing&lt;br /&gt;
&lt;br /&gt;
Standard output from the model can be followed during runtime by going to:&lt;br /&gt;
 /scratch/guido/ccsm4_comp-B_1850_CN_res-f19_g16/run&lt;br /&gt;
and running&lt;br /&gt;
 tail -f &amp;lt;component_log_file&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The model will archive the NetCDF output in:&lt;br /&gt;
 /scratch/$USER/archive&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
''Cloning Simulations''&lt;br /&gt;
&lt;br /&gt;
A useful command for setting up multiple runs quickly is ''create_clone'', which clones an existing case so that there is no need to run the setup script above every time:&lt;br /&gt;
 cd ~/runs&lt;br /&gt;
 /project/ccsm/ccsm4_0_current/scripts/create_clone -clone ccsm4_comp-B_1850_CN_res-f09_g16 -case ccsm4_comp-B_1850_CN_res-f09_g16_clone -v&lt;br /&gt;
&lt;br /&gt;
To change the load balancing (''env_mach_pes.xml'') or other parameters of an existing simulation setup, do a clean build to make sure the model is rebuilt properly:&lt;br /&gt;
 ./configure -cleanmach&lt;br /&gt;
 ./ccsm4_comp-B_1850_CN_res-f19_g16.tcs.clean_build&lt;br /&gt;
 ./configure -case&lt;br /&gt;
 ./ccsm4_comp-B_1850_CN_res-f19_g16.tcs.build&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
''Running CCSM4 on GPC''&lt;br /&gt;
&lt;br /&gt;
The setup script is almost identical:&lt;br /&gt;
&lt;br /&gt;
 #!/bin/bash&lt;br /&gt;
 &lt;br /&gt;
 export CCSMROOT=/project/ccsm/ccsm4_0_current&lt;br /&gt;
 export SCRATCH=/scratch/guido&lt;br /&gt;
 export MACH=gpc&lt;br /&gt;
 export COMPSET=B_1850_CN&lt;br /&gt;
 export RES=f09_g16&lt;br /&gt;
 export CASEROOT=~/runs/ccsm4gpc_comp-${COMPSET}_res-${RES}&lt;br /&gt;
 &lt;br /&gt;
 cd $CCSMROOT/scripts&lt;br /&gt;
 ./create_newcase -verbose -case $CASEROOT -mach $MACH -compset $COMPSET -res $RES &lt;br /&gt;
&lt;br /&gt;
To load balance and run the model, follow the steps above.&lt;br /&gt;
The env_mach_pes.xml configuration file needs to be modified as follows:&lt;br /&gt;
 &amp;lt;entry id=&amp;quot;MAX_TASKS_PER_NODE&amp;quot;   value=&amp;quot;8&amp;quot;  /&amp;gt;   &lt;br /&gt;
 &amp;lt;entry id=&amp;quot;PES_PER_NODE&amp;quot;   value=&amp;quot;8&amp;quot;  /&amp;gt;   &lt;br /&gt;
&lt;br /&gt;
Use qsub to submit the model to the GPC cluster:&lt;br /&gt;
 qsub ccsm4gpc_comp-B_1850_CN_res-f09_g16.gpc.run&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
''Running CCSM4 interactively on TCS''&lt;br /&gt;
&lt;br /&gt;
In the run script you need to set and unset the following environment variables. You also need a hostfile that lists the nodes, one line per MPI task, that you will be using.&lt;br /&gt;
&lt;br /&gt;
 unsetenv MP_EUILIB&lt;br /&gt;
 &lt;br /&gt;
 setenv MP_PROCS 16 &lt;br /&gt;
 &lt;br /&gt;
 setenv MP_NODES 1&lt;br /&gt;
   &lt;br /&gt;
 &lt;br /&gt;
 /usr/bin/poe /project/ccsm/bin/ccsm_launch ./ccsm.exe -hfile /project/&amp;lt;user&amp;gt;/runs/&amp;lt;casename&amp;gt;/hostfile&lt;br /&gt;
&lt;br /&gt;
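The hostfile is just the node name repeated once per task, so one way to generate it (a sketch; substitute the node your job was assigned) is&lt;br /&gt;
 for i in $(seq 16); do echo tcs-f11n06; done &amp;gt; hostfile&lt;br /&gt;
&lt;br /&gt;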
and the hostfile looks like this (here for 16 tasks on the node tcs-f11n06)&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt; more hostfile&lt;br /&gt;
 tcs-f11n06&lt;br /&gt;
 tcs-f11n06&lt;br /&gt;
 tcs-f11n06&lt;br /&gt;
 tcs-f11n06&lt;br /&gt;
 tcs-f11n06&lt;br /&gt;
 tcs-f11n06&lt;br /&gt;
 tcs-f11n06&lt;br /&gt;
 tcs-f11n06&lt;br /&gt;
 tcs-f11n06&lt;br /&gt;
 tcs-f11n06&lt;br /&gt;
 tcs-f11n06&lt;br /&gt;
 tcs-f11n06&lt;br /&gt;
 tcs-f11n06&lt;br /&gt;
 tcs-f11n06&lt;br /&gt;
 tcs-f11n06&lt;br /&gt;
 tcs-f11n06&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Installing_your_own_modules&amp;diff=9394</id>
		<title>Installing your own modules</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Installing_your_own_modules&amp;diff=9394"/>
		<updated>2018-08-31T23:22:33Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
''Note: This page deals with modules for the 'modulecmd' package. For python modules go to [[Installing your own Python modules]]. For R modules go to [[R Statistical Package]]. For local perl modules, see [[Perl]].''&lt;br /&gt;
&lt;br /&gt;
=== Rationale ===&lt;br /&gt;
There are many software packages that users may want. These packages can have conflicts and dependencies. For that reason, we install packages in nonstandard locations, and use the module command to load those locations into the appropriate environment variables. The usage of the module command is explained on the [[Software and Libraries]] page. &lt;br /&gt;
&lt;br /&gt;
It is possible for users to make their own collection of modules to supplement or replace the shared modules. Local user-space packages and modules are almost certainly the easiest way to deal with the wide range of packages, ensure they're up to date, and ensure that users' package choices don't conflict.  &lt;br /&gt;
&lt;br /&gt;
With local modules, you can load modules using &amp;lt;tt&amp;gt;module load [name]&amp;lt;/tt&amp;gt;, just as with system-wide modules. You'd have less clutter in your .bashrc, as paths can be set up in the module file, and you can install and try out conflicting versions of software packages. &lt;br /&gt;
&lt;br /&gt;
Another nice feature of modules is that loading them twice does not append to the paths twice. So you could re-source your .bashrc file, and not clutter up the environment variables.&lt;br /&gt;
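A quick way to see this for yourself (a sketch, using the example module described below):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load somemodule&lt;br /&gt;
$ echo $PATH&lt;br /&gt;
$ module load somemodule&lt;br /&gt;
$ echo $PATH   # unchanged: the directory is not appended a second time&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;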
&lt;br /&gt;
=== How to do it ===&lt;br /&gt;
&lt;br /&gt;
* Type&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load use.own&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
This creates a directory &amp;lt;tt&amp;gt;$HOME/privatemodules&amp;lt;/tt&amp;gt; if it doesn't exist yet, and adds it to the module path. It also makes the man page for &amp;lt;tt&amp;gt;modulefile&amp;lt;/tt&amp;gt; available.&lt;br /&gt;
* Write module files, and put them in that directory (see below)&lt;br /&gt;
* Put &amp;lt;tt&amp;gt;module load use.own&amp;lt;/tt&amp;gt; in your &amp;lt;tt&amp;gt;.bashrc&amp;lt;/tt&amp;gt;&lt;br /&gt;
* Use &amp;lt;tt&amp;gt;module load/unload/switch/avail/whatis/list&amp;lt;/tt&amp;gt; on your local modules.&lt;br /&gt;
&lt;br /&gt;
=== Writing modulefiles ===&lt;br /&gt;
&lt;br /&gt;
Modulefiles use commands which are an extension of the standard scripting language Tool Command Language (tcl). The name of the modulefile becomes the name of the module used in shell commands like &amp;lt;tt&amp;gt;module load [name]&amp;lt;/tt&amp;gt;. You can create a hierarchy of modules (for instance for different versions) by using subdirectories in &amp;lt;tt&amp;gt;~/privatemodules&amp;lt;/tt&amp;gt;. The module names are then prefixed by the relative path within &amp;lt;tt&amp;gt;~/privatemodules&amp;lt;/tt&amp;gt;.&lt;br /&gt;
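For example, two versions of a hypothetical package &amp;lt;tt&amp;gt;mypkg&amp;lt;/tt&amp;gt; (the name is just for illustration) could be provided as the modulefiles&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
~/privatemodules/mypkg/1.0&lt;br /&gt;
~/privatemodules/mypkg/2.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
after which a specific version is loaded with &amp;lt;tt&amp;gt;module load mypkg/2.0&amp;lt;/tt&amp;gt;.&lt;br /&gt;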
&lt;br /&gt;
In the modulefile, you can put commands to define or append environment variables and paths, and to express prerequisites and conflicts.&lt;br /&gt;
&lt;br /&gt;
Note that you do not have to learn much tcl to use modulefiles, as the following simple example shows. The file &amp;lt;tt&amp;gt;~/privatemodules/somemodule&amp;lt;/tt&amp;gt; could contain&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#%Module -*- tcl -*-&lt;br /&gt;
## This is a module to access something&lt;br /&gt;
proc ModulesHelp { } {&lt;br /&gt;
        puts stderr &amp;quot;This module sets up access to something&amp;quot; &lt;br /&gt;
}&lt;br /&gt;
module-whatis &amp;quot;sets up access to something&amp;quot;&lt;br /&gt;
prereq somethingelse&lt;br /&gt;
conflict thatothermodule&lt;br /&gt;
module load gcc&lt;br /&gt;
setenv       SOMEVERSION      0.95&lt;br /&gt;
append-path  PATH             /home/[user]/[somedir]/bin&lt;br /&gt;
append-path  MANPATH          /home/[user]/[somedir]/man&lt;br /&gt;
append-path  LD_LIBRARY_PATH  /home/[user]/[somedir]/lib&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
Explanation by line:&lt;br /&gt;
* line 1: identifies this as a module file&lt;br /&gt;
* line 2: a comment that explains what the module does.&lt;br /&gt;
* lines 3-5: a function definition so that &amp;lt;tt&amp;gt;module help somemodule&amp;lt;/tt&amp;gt; works&lt;br /&gt;
* line 6 makes &amp;lt;tt&amp;gt;module whatis somemodule&amp;lt;/tt&amp;gt; work&lt;br /&gt;
* line 7 expresses a prerequisite: the module &amp;lt;tt&amp;gt;somethingelse&amp;lt;/tt&amp;gt; should be loaded before this one.&lt;br /&gt;
* line 8 expresses a conflict, i.e., if &amp;lt;tt&amp;gt;thatothermodule&amp;lt;/tt&amp;gt; is loaded, this one cannot be loaded as well.&lt;br /&gt;
* lines 9 through 13 are the heart of the matter. They load another module, in this case, gcc, set an environment variable, and add directories to the paths.&lt;br /&gt;
&lt;br /&gt;
To learn more, simply type (if you have the use.own modules loaded):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ man modulefile&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Example ===&lt;br /&gt;
&lt;br /&gt;
An example of creating your own module can be found on the [[Brian]] page.&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=User_Serial_on_GPC&amp;diff=9393</id>
		<title>User Serial on GPC</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=User_Serial_on_GPC&amp;diff=9393"/>
		<updated>2018-08-31T23:22:00Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Running serial jobs on SciNet's GPC==&lt;br /&gt;
&lt;br /&gt;
See [[User Serial]].&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=User_Serial&amp;diff=9392</id>
		<title>User Serial</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=User_Serial&amp;diff=9392"/>
		<updated>2018-08-31T23:20:55Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===General considerations===&lt;br /&gt;
&lt;br /&gt;
====Use whole nodes...====&lt;br /&gt;
&lt;br /&gt;
When you submit a job on a SciNet system, it is run on one (or more than one) entire node, meaning that your job occupies at least 8 processors for the duration of its run.  The SciNet systems are usually busy, with many researchers waiting in the queue for computational resources, so we require that you make full use of the nodes that your job is allocated, so that other researchers don't have to wait unnecessarily, and so that your jobs get as much work done for you while they run as possible.&lt;br /&gt;
&lt;br /&gt;
Often, the best way to make full use of the node is to run one large parallel computation; but sometimes it is beneficial to run several serial codes at the same time.  On this page, we discuss ways to run suites of serial computations at once, as efficiently as possible, using the full resources of the node.&lt;br /&gt;
&lt;br /&gt;
====...but not more.====&lt;br /&gt;
&lt;br /&gt;
When running multiple jobs on the same node, it is essential to have a good idea of how much memory the jobs will require. The GPC compute nodes have about 14GB in total available &lt;br /&gt;
to user jobs running on the 8 cores (a bit less, say 13GB, on the devel nodes &amp;lt;tt&amp;gt;gpc01..04&amp;lt;/tt&amp;gt;, and [[GPC_Quickstart#Memory_Configuration|somewhat more for some compute nodes]]).&lt;br /&gt;
So the jobs also have to be bunched in ways that fit into 14GB.  If they use more than this, they will crash the node, inconveniencing you and other researchers waiting for that node.&lt;br /&gt;
&lt;br /&gt;
If that's not possible, because each individual job requires significantly in excess of ~1.75GB, then you can run fewer jobs so that they do fit; but then there is again an under-utilization problem.  In that case, the jobs are likely candidates for parallelization; you can contact us at [mailto:support@scinet.utoronto.ca &amp;lt;support@scinet.utoronto.ca&amp;gt;] and arrange a meeting with one of the technical analysts to help you do just that.&lt;br /&gt;
&lt;br /&gt;
If the memory requirements allow it, you could actually run more than 8 jobs at the same time, up to 16, exploiting the [[GPC_Quickstart#HyperThreading | HyperThreading]] feature of the Intel Nehalem cores.  It may seem counterintuitive, but running 16 jobs on 8 cores has, for certain types of tasks, increased some users' overall throughput by 10 to 30 percent.&lt;br /&gt;
&lt;br /&gt;
====Is your job really serial?====&lt;br /&gt;
&lt;br /&gt;
While your program may not be explicitly parallel, it may use some of SciNet's threaded libraries for numerical computations, which can make use of multiple processors.  In particular, SciNet's [[Python]] and [[R_Statistical_Package | R]] modules are compiled with aggressive optimization and with threaded numerical libraries, which by default will make use of multiple cores for computations such as large matrix operations.  This can greatly speed up individual runs, but by less (usually much less) than a factor of 8.  If you have many such computations to do, your [[Introduction_To_Performance#Throughput | throughput]] will be better - you will get more calculations done per unit time - if you turn off the threading and run multiple such computations at once.  Threading is turned off with the shell script line &amp;lt;tt&amp;gt;export OMP_NUM_THREADS=1&amp;lt;/tt&amp;gt;; that line is included in the scripts below.  &lt;br /&gt;
&lt;br /&gt;
If your calculations do implicitly use threading, you may want to experiment to see what gives you the best performance - you may find that running 4 (or even 8) jobs with 2 threads each (&amp;lt;tt&amp;gt;OMP_NUM_THREADS=2&amp;lt;/tt&amp;gt;), or 2 jobs with 4 threads, gives better performance than 8 jobs with 1 thread (and almost certainly better than 1 job with 8 threads).  We'd encourage you to perform exactly such a [[Introduction_To_Performance#Strong_Scaling_Tests | scaling test]]; for a small up-front investment of time you may significantly speed up all the computations you need to do.&lt;br /&gt;
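As a concrete illustration, here is a minimal sketch of the '4 jobs with 2 threads each' variant (assuming the same &amp;lt;tt&amp;gt;jobdirN/dojobN&amp;lt;/tt&amp;gt; layout as in the script below):&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
# Run 4 jobs concurrently, each allowed 2 threads, and wait for all of them&lt;br /&gt;
export OMP_NUM_THREADS=2&lt;br /&gt;
(cd jobdir1; ./dojob1) &amp;amp;&lt;br /&gt;
(cd jobdir2; ./dojob2) &amp;amp;&lt;br /&gt;
(cd jobdir3; ./dojob3) &amp;amp;&lt;br /&gt;
(cd jobdir4; ./dojob4) &amp;amp;&lt;br /&gt;
wait&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
Timing this for a few job/thread combinations will tell you which one gives the best throughput for your code.&lt;br /&gt;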
&lt;br /&gt;
===Serial jobs of similar duration===&lt;br /&gt;
&lt;br /&gt;
The most straightforward way to run multiple serial jobs is to bunch the jobs in groups of 8 or more that will take roughly the same amount of time, and create a job that looks a &lt;br /&gt;
bit like this&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# MOAB/Torque submission script for multiple serial jobs on&lt;br /&gt;
# SciNet GPC&lt;br /&gt;
#&lt;br /&gt;
#PBS -l nodes=1:ppn=8,walltime=1:00:00&lt;br /&gt;
#PBS -N serialx8&lt;br /&gt;
&lt;br /&gt;
# DIRECTORY TO RUN - $PBS_O_WORKDIR is directory job was submitted from&lt;br /&gt;
cd $PBS_O_WORKDIR&lt;br /&gt;
&lt;br /&gt;
# Turn off implicit threading in Python, R&lt;br /&gt;
export OMP_NUM_THREADS=1&lt;br /&gt;
&lt;br /&gt;
# EXECUTION COMMAND; ampersand off 8 jobs and wait&lt;br /&gt;
(cd jobdir1; ./dojob1) &amp;amp;&lt;br /&gt;
(cd jobdir2; ./dojob2) &amp;amp;&lt;br /&gt;
(cd jobdir3; ./dojob3) &amp;amp;&lt;br /&gt;
(cd jobdir4; ./dojob4) &amp;amp;&lt;br /&gt;
(cd jobdir5; ./dojob5) &amp;amp;&lt;br /&gt;
(cd jobdir6; ./dojob6) &amp;amp;&lt;br /&gt;
(cd jobdir7; ./dojob7) &amp;amp;&lt;br /&gt;
(cd jobdir8; ./dojob8) &amp;amp;&lt;br /&gt;
wait&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
There are three important things to take note of here.  First, the &amp;lt;tt&amp;gt;'''wait'''&amp;lt;/tt&amp;gt;&lt;br /&gt;
command at the end is crucial; without it the job will terminate &lt;br /&gt;
immediately, killing the 8 programs you just started.&lt;br /&gt;
&lt;br /&gt;
Second, it is important to group the programs by how long they &lt;br /&gt;
will take.   If (say) &amp;lt;tt&amp;gt;dojob8&amp;lt;/tt&amp;gt; takes 2 hours and the rest only take 1, &lt;br /&gt;
then for one hour 7 of the 8 cores on the GPC node are wasted; they are &lt;br /&gt;
sitting idle but are unavailable for other users, and the utilization of &lt;br /&gt;
this node over the whole run is only 56%.   This is the sort of thing &lt;br /&gt;
we'll notice, and users who don't make efficient use of the machine will &lt;br /&gt;
have their ability to use SciNet resources reduced.  If you have many serial jobs of varying length, &lt;br /&gt;
use the submission script to balance the computational load, as explained [[ #Serial jobs of varying duration | below]].&lt;br /&gt;
&lt;br /&gt;
Third, we reiterate that if memory requirements allow it, you should try to run more than 8 jobs at once, with a maximum of 16 jobs.&lt;br /&gt;
&lt;br /&gt;
===GNU Parallel===&lt;br /&gt;
&lt;br /&gt;
GNU parallel is a really nice tool written by Ole Tange to run multiple serial jobs in&lt;br /&gt;
parallel. It allows you to keep the processors on each 8-core node busy, if you provide enough jobs to do.&lt;br /&gt;
&lt;br /&gt;
GNU parallel is accessible on the GPC in the module&lt;br /&gt;
&amp;lt;tt&amp;gt;gnu-parallel&amp;lt;/tt&amp;gt;:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
module load gnu-parallel/20140622&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
Note that there are several versions of gnu-parallel installed on the GPC; we recommend using the newer one loaded above. &lt;br /&gt;
&lt;br /&gt;
The citation for GNU Parallel is: O. Tange (2011): GNU Parallel - The Command-Line Power Tool, '';login: The USENIX Magazine,'' February 2011:42-47.&lt;br /&gt;
&lt;br /&gt;
It is easiest to demonstrate the usage of GNU parallel by&lt;br /&gt;
examples. Suppose you have 16 jobs to do, whose durations vary quite a bit, but with an average job duration of around 10 hours. You could use the following script:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# MOAB/Torque submission script for multiple serial jobs on SciNet GPC&lt;br /&gt;
#&lt;br /&gt;
#PBS -l nodes=1:ppn=8,walltime=24:00:00&lt;br /&gt;
#PBS -N gnu-parallel-example&lt;br /&gt;
&lt;br /&gt;
# DIRECTORY TO RUN - $PBS_O_WORKDIR is directory job was submitted from&lt;br /&gt;
cd $PBS_O_WORKDIR&lt;br /&gt;
&lt;br /&gt;
# Turn off implicit threading in Python, R&lt;br /&gt;
export OMP_NUM_THREADS=1&lt;br /&gt;
&lt;br /&gt;
module load gnu-parallel/20140622  &lt;br /&gt;
&lt;br /&gt;
# EXECUTION COMMAND&lt;br /&gt;
parallel -j 8 &amp;lt;&amp;lt;EOF&lt;br /&gt;
  cd jobdir1; ./dojob1; echo &amp;quot;job 1 finished&amp;quot;&lt;br /&gt;
  cd jobdir2; ./dojob2; echo &amp;quot;job 2 finished&amp;quot;&lt;br /&gt;
  cd jobdir3; ./dojob3; echo &amp;quot;job 3 finished&amp;quot;&lt;br /&gt;
  cd jobdir4; ./dojob4; echo &amp;quot;job 4 finished&amp;quot;&lt;br /&gt;
  cd jobdir5; ./dojob5; echo &amp;quot;job 5 finished&amp;quot;&lt;br /&gt;
  cd jobdir6; ./dojob6; echo &amp;quot;job 6 finished&amp;quot;&lt;br /&gt;
  cd jobdir7; ./dojob7; echo &amp;quot;job 7 finished&amp;quot;&lt;br /&gt;
  cd jobdir8; ./dojob8; echo &amp;quot;job 8 finished&amp;quot;&lt;br /&gt;
  cd jobdir9; ./dojob9; echo &amp;quot;job 9 finished&amp;quot;&lt;br /&gt;
  cd jobdir10; ./dojob10; echo &amp;quot;job 10 finished&amp;quot;&lt;br /&gt;
  cd jobdir11; ./dojob11; echo &amp;quot;job 11 finished&amp;quot;&lt;br /&gt;
  cd jobdir12; ./dojob12; echo &amp;quot;job 12 finished&amp;quot;&lt;br /&gt;
  cd jobdir13; ./dojob13; echo &amp;quot;job 13 finished&amp;quot;&lt;br /&gt;
  cd jobdir14; ./dojob14; echo &amp;quot;job 14 finished&amp;quot;&lt;br /&gt;
  cd jobdir15; ./dojob15; echo &amp;quot;job 15 finished&amp;quot;&lt;br /&gt;
  cd jobdir16; ./dojob16; echo &amp;quot;job 16 finished&amp;quot;&lt;br /&gt;
EOF&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The &amp;lt;tt&amp;gt;-j 8&amp;lt;/tt&amp;gt; parameter sets the number of jobs to run at the same time, but 16 jobs are lined up. Initially, 8 jobs are given to the 8 processors on the node. When one of the processors is done with its assigned job, it will get the next job instead of sitting idle until the other processors are done. While you would expect that on average this script should take 20 hours (each processor on average has to complete two jobs of 10 hours), there's a good chance that one of the processors gets two jobs that take more than 10 hours, so the job script requests 24 hours. How much extra time you should ask for in practice depends on the spread in run times of the separate jobs.&lt;br /&gt;
&lt;br /&gt;
===Serial jobs of varying duration===&lt;br /&gt;
&lt;br /&gt;
If you have a lot (50+) of relatively short serial runs to do, '''of which the walltime varies''', and if you know that eight jobs fit in memory without issues, then writing out all the commands explicitly in the job script can get tedious. If you follow the convention that the jobs are all started by auxiliary scripts called job&amp;lt;something&amp;gt;.sh, the following strategy in your submission script will maximize the cpu utilization. &lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# MOAB/Torque submission script for multiple, dynamically-run &lt;br /&gt;
# serial jobs on SciNet GPC&lt;br /&gt;
#&lt;br /&gt;
#PBS -l nodes=1:ppn=8,walltime=1:00:00&lt;br /&gt;
#PBS -N serialdynamic&lt;br /&gt;
&lt;br /&gt;
# DIRECTORY TO RUN - $PBS_O_WORKDIR is directory job was submitted from&lt;br /&gt;
cd $PBS_O_WORKDIR&lt;br /&gt;
&lt;br /&gt;
# Turn off implicit threading in Python, R&lt;br /&gt;
export OMP_NUM_THREADS=1&lt;br /&gt;
&lt;br /&gt;
module load gnu-parallel/20140622  &lt;br /&gt;
&lt;br /&gt;
# COMMANDS ARE ASSUMED TO BE SCRIPTS CALLED job*.sh&lt;br /&gt;
echo ./job*.sh | tr ' ' '\n' | parallel -j 8&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
Notes:&lt;br /&gt;
* As before, GNU Parallel keeps 8 jobs running at a time, and if one finishes, starts the next. This is an easy way to do ''load balancing''.&lt;br /&gt;
* You can in fact run more or fewer than 8 processes per node by modifying &amp;lt;tt&amp;gt;parallel&amp;lt;/tt&amp;gt;'s &amp;lt;tt&amp;gt;-j 8&amp;lt;/tt&amp;gt; argument.&lt;br /&gt;
* Doing many serial jobs often entails doing many disk reads and writes, which can be detrimental to performance. In that case, running from the ramdisk may be an option.  &lt;br /&gt;
* When using a ramdisk, make sure you copy your results from the ramdisk back to the scratch after the runs, or when the job is killed because time has run out.&lt;br /&gt;
* More details on how to setup your script to use the ramdisk can be found on the [[User_Ramdisk|Ramdisk wiki page]].&lt;br /&gt;
* This script optimizes resource utility, but can only use 1 node (8 cores) at a time. The next section addresses how to use more nodes.&lt;br /&gt;
* While on the command line the option &amp;quot;--bar&amp;quot; can be nice for seeing the progress, you would not see this status bar when running as a job. Consider using the &amp;quot;&amp;lt;tt&amp;gt;--joblog FILENAME&amp;lt;/tt&amp;gt;&amp;quot; option, which records the finished jobs in the file FILENAME.&lt;br /&gt;
* The latter also keeps track of failed jobs, so you can later try to redo those with the same command, but with the option &amp;quot;--resume&amp;quot; added; see the sketch below.&lt;br /&gt;
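A minimal sketch of this bookkeeping (the log file name is arbitrary):&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
# record each finished job in progress.log while running&lt;br /&gt;
echo ./job*.sh | tr ' ' '\n' | parallel --joblog progress.log -j 8&lt;br /&gt;
# after a crash or timeout, rerun with --resume to skip the jobs&lt;br /&gt;
# that already completed according to the log&lt;br /&gt;
echo ./job*.sh | tr ' ' '\n' | parallel --resume --joblog progress.log -j 8&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;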
&lt;br /&gt;
===Version for more than 8 cores at once (still serial)===&lt;br /&gt;
&lt;br /&gt;
If you have hundreds of serial jobs that you want to run concurrently and the nodes are available, then the approach above, while useful, would require tens of scripts to be submitted separately. It is possible for you to request more than one node and to use the following routine to distribute your processes amongst the cores. In this case, it is important to use the newer version of GNU parallel installed on the GPC.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# MOAB/Torque submission script for multiple, dynamically-run &lt;br /&gt;
# serial jobs on SciNet GPC&lt;br /&gt;
#&lt;br /&gt;
#PBS -l nodes=25:ppn=8,walltime=1:00:00&lt;br /&gt;
#PBS -N serialdynamicMulti&lt;br /&gt;
&lt;br /&gt;
# DIRECTORY TO RUN - $PBS_O_WORKDIR is directory job was submitted from&lt;br /&gt;
cd $PBS_O_WORKDIR&lt;br /&gt;
&lt;br /&gt;
# Turn off implicit threading in Python, R&lt;br /&gt;
export OMP_NUM_THREADS=1&lt;br /&gt;
&lt;br /&gt;
module load gnu-parallel/20140622&lt;br /&gt;
&lt;br /&gt;
# START PARALLEL JOBS USING NODE LIST IN $PBS_NODEFILE&lt;br /&gt;
seq 800 | parallel -j8 --sshloginfile $PBS_NODEFILE --workdir $PWD ./myrun {}&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
Explanation:&lt;br /&gt;
* &amp;lt;tt&amp;gt;seq 800&amp;lt;/tt&amp;gt; outputs the numbers 1 through 800 on separate lines. This output is piped to (i.e., becomes the input of) the &amp;lt;tt&amp;gt;parallel&amp;lt;/tt&amp;gt; command.&lt;br /&gt;
* The point of the &amp;quot;seq 800&amp;quot; is that each input line given to &amp;lt;tt&amp;gt;parallel&amp;lt;/tt&amp;gt; defines a new job. So here, there are 800 jobs.&lt;br /&gt;
* Each job runs a command, but because the numbers generated by seq are not commands, a real command is constructed, in this case, by the argument &amp;lt;tt&amp;gt;./myrun {}&amp;lt;/tt&amp;gt;. Here &amp;lt;tt&amp;gt;myrun&amp;lt;/tt&amp;gt; is supposed to be the name of the application to run. The two curly brackets &amp;lt;tt&amp;gt;{}&amp;lt;/tt&amp;gt; get replaced by the line from the input, that is, by one of the numbers.&lt;br /&gt;
* So parallel will run the 800 commands:&amp;lt;br/&amp;gt;./myrun 1&amp;lt;br/&amp;gt;./myrun 2&amp;lt;br/&amp;gt;...&amp;lt;br/&amp;gt;./myrun 800&lt;br /&gt;
* The parameter &amp;lt;tt&amp;gt;--sshloginfile $PBS_NODEFILE&amp;lt;/tt&amp;gt; tells &amp;lt;tt&amp;gt;parallel&amp;lt;/tt&amp;gt; to look for the file named $PBS_NODEFILE, which contains the host names of the nodes assigned to the current job (it is automatically generated by the scheduler).&lt;br /&gt;
* The parameter &amp;lt;tt&amp;gt;-j8&amp;lt;/tt&amp;gt; tells &amp;lt;tt&amp;gt;parallel&amp;lt;/tt&amp;gt; to run 8 of these at a time on each of the hosts.&lt;br /&gt;
* The &amp;lt;tt&amp;gt;--workdir $PWD&amp;lt;/tt&amp;gt; sets the working directory on the other nodes to the working directory on the first node. Without this, the run tries to start from the wrong place and will most likely fail (unless using the latest gnu parallel module, 20130422, which by default puts you in $PWD on the remote node).&lt;br /&gt;
* Loaded modules should get automatically loaded on the remote nodes too for the latest gnu parallel module, but not for earlier ones.&lt;br /&gt;
* If you need an environment variable to be transfered from the job script to the remotely running subjobs, use &amp;lt;tt&amp;gt;--env ENVIRONMENTVARIABLE&amp;lt;/tt&amp;gt;.  SciNet's gnu-parallel modules automatically transfer &amp;lt;tt&amp;gt;OMP_NUM_THREADS&amp;lt;/tt&amp;gt;, and typical environment variables set by most modules.&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
* Of course, this is just an example of what you could do with gnu parallel. How you set up your specific run depends on how each of the runs would be started. One could for instance also prepare a file of commands to run and make that the input to parallel; see the sketch after these notes.&lt;br /&gt;
* Note that submitting several bunches to single nodes, as in the section above, is a more failsafe way of proceeding, since a node failure would only affect one of these bunches, rather than all runs. &lt;br /&gt;
* GNU Parallel can be passed a file with the list of nodes to which to ssh, using &amp;lt;tt&amp;gt;--sshloginfile&amp;lt;/tt&amp;gt; (thanks to Ole Tange for pointing this out). This list is automatically generated by the scheduler and its name is made available in the environment variable $PBS_NODEFILE.&lt;br /&gt;
* Alternatively, GNU Parallel can take a comma separated list of nodes given to its -S argument, but this would need to be constructed from the file $PBS_NODEFILE which contains all nodes assigned to the job, with each node duplicated 8x for the number of cores on each node.&lt;br /&gt;
* GNU Parallel reads lines of input and converts those to arguments in the execution command. The execution command is the last argument given to &amp;lt;tt&amp;gt;parallel&amp;lt;/tt&amp;gt;, with &amp;lt;tt&amp;gt;{}&amp;lt;/tt&amp;gt; replaced by the lines of input.&lt;br /&gt;
* &amp;lt;span style=&amp;quot;color:red;&amp;quot;&amp;gt;The --workdir argument is essential: it sets the working directory on the other nodes, which would default to your home directory if omitted. Since /home is read-only on the compute nodes, you would not get any output at all!&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;This is no longer true for the latest GNU Parallel modules (20130422), which put you in the current directory on the remote nodes.&lt;br /&gt;
* We reiterate that if memory requirements allow it, you should try to run more than 8 jobs at once, with a maximum of 16 jobs. You can run more or fewer than 8 processes per node by modifying the -j8 parameter to the parallel command.&lt;br /&gt;
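For example, here is a sketch of the commands-file variant mentioned in the notes above (the file name is arbitrary):&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
# write one complete command per line to a file...&lt;br /&gt;
for i in $(seq 800); do echo &amp;quot;./myrun $i&amp;quot;; done &amp;gt; commands.txt&lt;br /&gt;
# ...and feed that file to parallel instead of the seq pipeline&lt;br /&gt;
parallel -j8 --sshloginfile $PBS_NODEFILE --workdir $PWD &amp;lt; commands.txt&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;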
&lt;br /&gt;
===More on GNU parallel=== &lt;br /&gt;
* [[Media:Tech-talk-gnu-parallel.pdf|Slides of the SciNet TechTalk on Gnu Parallel (14 Nov 2012)]]&lt;br /&gt;
* The documentation for GNU parallel can be found at http://www.gnu.org/software/parallel/&lt;br /&gt;
* Its man page can be found here http://www.gnu.org/software/parallel/man.html&lt;br /&gt;
* The man page is also available on the GPC when the gnu-parallel module is loaded, with the command &amp;lt;code&amp;gt;$ man parallel&amp;lt;/code&amp;gt;. The man page contains options, such as how to make sure the output is not all scrambled, and examples.&lt;br /&gt;
&lt;br /&gt;
===GNU Parallel Reference===&lt;br /&gt;
* O. Tange (2011): GNU Parallel - The Command-Line Power Tool, '';login: The USENIX Magazine,'' February 2011:42-47.&lt;br /&gt;
&lt;br /&gt;
===Older scripts===&lt;br /&gt;
&lt;br /&gt;
Older scripts, which mimicked some of GNU parallel functionality, can be found on the [[Deprecated scripts]] page.&lt;br /&gt;
&lt;br /&gt;
--[[User:Rzon|Rzon]] 02:22, 14 Nov 2010 (UTC)&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Perl&amp;diff=9391</id>
		<title>Perl</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Perl&amp;diff=9391"/>
		<updated>2018-08-31T23:20:21Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Availability on SciNet systems==&lt;br /&gt;
Perl is installed on both the GPC (version 5.10.1) and the TCS (version 5.8.8). While perl can be run without the need to load any software modules, some of perl's functionality (especially installing modules from CPAN) relies on the GNU C compiler, so if you use perl on the GPC you most likely want to load the gcc module first:&lt;br /&gt;
&amp;lt;code&amp;gt;&lt;br /&gt;
   module load gcc&lt;br /&gt;
&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
''Note: if, as we encourage, you use the Intel compilers on the GPC as well, you should load the intel module after the gcc module to ensure that the environment variables are set properly.''&lt;br /&gt;
&lt;br /&gt;
==Perl modules==&lt;br /&gt;
One of the nice features of perl is the large number of perl modules that are out there.  The Comprehensive Perl Archive Network (CPAN) is a large repository for these modules.&lt;br /&gt;
&lt;br /&gt;
''Note: perl modules are internal to perl and should not be confused with the [[Software_and_Libraries|software module system]]''.&lt;br /&gt;
&lt;br /&gt;
There are so many perl modules that we cannot possibly maintain all of them.  A number of modules come with perl out-of-the-box and are therefore available. To find out which those are, you can use the command&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ perl -MFile::Find=find -MFile::Spec::Functions -Tlw -e 'find { wanted =&amp;gt; sub { print canonpath $_ if /\.pm\z/ },no_chdir =&amp;gt; 1 }, @INC'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Local perl modules from CPAN==&lt;br /&gt;
&lt;br /&gt;
Users are generally responsible for installing any other modules they need (although we can always help).  This poses a slight hurdle, as CPAN's package system would try to install these in system directories to which users have no access.  Fortunately, there is an easy way to set up CPAN to use a local folder for perl libraries. &lt;br /&gt;
&lt;br /&gt;
In the following, the local directory for perl libraries will be assumed to be &amp;lt;tt&amp;gt;~/perl5&amp;lt;/tt&amp;gt;.  For the most recent version of CPAN (which is required to install several modules and to use the local modules), follow the steps outlined below.&lt;br /&gt;
&lt;br /&gt;
===Modifications to .bashrc===&lt;br /&gt;
&lt;br /&gt;
A number of paths need to be set for perl to find the local modules you will install.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
   export PERL_LOCAL_LIB_ROOT=&amp;quot;$HOME/perl5&amp;quot;&lt;br /&gt;
   export PERL_MB_OPT=&amp;quot;--install_base $HOME/perl5&amp;quot;&lt;br /&gt;
   export PERL_MM_OPT=&amp;quot;INSTALL_BASE=$HOME/perl5&amp;quot;&lt;br /&gt;
   export PERL5LIB=&amp;quot;$HOME/perl5/lib/perl5/x86_64-linux-thread-multi:$HOME/perl5/lib/perl5&amp;quot;&lt;br /&gt;
   export PATH=&amp;quot;$HOME/perl5/bin:$PATH&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Setting up a user-space CPAN modules directory===&lt;br /&gt;
&lt;br /&gt;
The following only works for the GPC (for now).&lt;br /&gt;
&lt;br /&gt;
1. First, since installing perl modules may involve compilation, be sure to be on a devel node (e.g. &amp;lt;tt&amp;gt;ssh gpc01&amp;lt;/tt&amp;gt;). On the GPC, make sure that the gcc module is loaded (&amp;lt;tt&amp;gt;module load gcc&amp;lt;/tt&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
2. If this is not your first time using CPAN (for instance, you have tried before using it and found that you could not install any packages), you may have to remove your ~/.cpan directory. This will trigger a new configuration in the next step.&lt;br /&gt;
&lt;br /&gt;
3. Start a CPAN shell:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
   $ cpan&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
4. It will ask if you would like to get as much as possible configured automatically. Say '''no'''. (In older versions of perl, 5.8.8, it would ask you whether you are ready for a manual configuration, to which you'd say yes or &amp;lt;enter&amp;gt;.)&lt;br /&gt;
&lt;br /&gt;
5. It will now ask you a bunch of questions, almost all of which you can answer with the default (i.e. press &amp;lt;enter&amp;gt;), except for a few:&lt;br /&gt;
&lt;br /&gt;
* When the installation asks for ''Parameters for the 'perl Makefile.PL' command'', enter: &amp;lt;pre&amp;gt;   LIB=~/perl5/lib/perl5 INSTALLMAN1DIR=~/perl5/man/man1 INSTALLMAN3DIR=~/perl5/man/man3&lt;br /&gt;
&amp;lt;/pre&amp;gt;(all on one line).&lt;br /&gt;
* When it asks for ''Parameters for the 'perl Build.PL' command'', it is suggested '''not''' to add an '--install_base ' argument (put a space in instead).&lt;br /&gt;
* When it asks ''If no urllist has been chosen yet, would you prefer CPAN.pm to connect to the built-in default sites without asking? '' say '''yes'''.&lt;br /&gt;
* When it asks for the repository to use, you can pick what you want, but after choosing 'North America' and 'Canada', I chose site #6, for no particular reason.&lt;br /&gt;
* Warning messages regarding YAML and AddHistory should be harmless.&lt;br /&gt;
&lt;br /&gt;
6. A module called &amp;lt;tt&amp;gt; local::lib &amp;lt;/tt&amp;gt; is required to read the .bashrc environment variables properly.  To install the latest &amp;lt;tt&amp;gt; local::lib &amp;lt;/tt&amp;gt;, type &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
   notest install local::lib &lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
in the CPAN shell and press &amp;lt;enter&amp;gt;.  If you are not sure whether you should do this, do it anyway, as it installs a bunch of modules on which other modules you may be using likely depend. &lt;br /&gt;
&lt;br /&gt;
7. As the module is installing, numerous prerequisites will also be installed (including an updated version of CPAN).  Answer each with the default option.&lt;br /&gt;
&lt;br /&gt;
8. Type &amp;lt;tt&amp;gt; exit &amp;lt;/tt&amp;gt; into the cpan shell.&lt;br /&gt;
&lt;br /&gt;
With version 5.8.8 (the version we had with CentOS 5), this recipe would throw an error, and two more steps were required:&lt;br /&gt;
:9. Restart the cpan shell.  The shell will ask if you want to configure as much as possible automatically.  Choose yes.&lt;br /&gt;
:10. Install &amp;lt;tt&amp;gt; local::lib &amp;lt;/tt&amp;gt; again as you did in step 6 and now everything should install without errors.&lt;br /&gt;
&lt;br /&gt;
===Another way to set up a user-space CPAN modules directory===&lt;br /&gt;
&lt;br /&gt;
''Note: the previous, more involved, instructions appear to be necessary to install local::lib and larger packages such as BioPerl.  For some local packages, the instructions below may suffice.''&lt;br /&gt;
&lt;br /&gt;
The following works for me (ejspence) on GPC.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
1. First, since installing perl modules may involve compilation, be sure to be on a devel node (e.g. &amp;lt;tt&amp;gt;ssh gpc01&amp;lt;/tt&amp;gt;). &lt;br /&gt;
&lt;br /&gt;
2. On the GPC, make sure that the gcc module is loaded.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
 module load gcc&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
3. I only set one environment variable.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
  export PERL5LIB=$HOME/lib/perl5:$HOME/lib&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
4. If this is not your first time using CPAN (for instance, you have tried before using it and found that you could not install any packages), you may have to remove your ~/.cpan directory. This will trigger a new configuration in the next step.&lt;br /&gt;
&lt;br /&gt;
5. Start a CPAN shell:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
   $ cpan&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
6. It will ask if you would like to get as much as possible configured automatically.  Say '''yes'''. &lt;br /&gt;
&lt;br /&gt;
7. Now use cpan to set the default installation directory&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
 cpan&amp;gt;&lt;br /&gt;
 cpan&amp;gt; o conf makepl_arg INSTALL_BASE=$HOME&lt;br /&gt;
 cpan&amp;gt; o conf commit&lt;br /&gt;
 cpan&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You should now be able to install packages using cpan.&lt;br /&gt;
&lt;br /&gt;
===Installing CPAN modules===&lt;br /&gt;
&lt;br /&gt;
Either from the CPAN shell or from the command line, you can now install packages quite simply with &amp;lt;tt&amp;gt;perl -MCPAN -e 'install &amp;lt;perlmodule&amp;gt;'&amp;lt;/tt&amp;gt; or simply &amp;lt;tt&amp;gt;cpan &amp;lt;perlmodule&amp;gt;&amp;lt;/tt&amp;gt;.&lt;br /&gt;
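For example, to install a module and check that perl can find it (using &amp;lt;tt&amp;gt;List::MoreUtils&amp;lt;/tt&amp;gt; as an arbitrary example):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ cpan List::MoreUtils&lt;br /&gt;
$ perl -MList::MoreUtils -e 'print &amp;quot;module loaded fine\n&amp;quot;'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;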
&lt;br /&gt;
===Using the modules===&lt;br /&gt;
&lt;br /&gt;
You can use local perl modules just as you would use system-wide perl modules.&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=MATLAB&amp;diff=9390</id>
		<title>MATLAB</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=MATLAB&amp;diff=9390"/>
		<updated>2018-08-31T23:20:00Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
We often get questions about running MATLAB on GPC.  With a few exceptions for compilers and debuggers, SciNet does not purchase licenses for commercial software.  As such, SciNet does not have a license for MATLAB, nor will it in the future.  If users wish to run MATLAB they must supply their own license, or explore alternative options.  This page gives information about the options for getting your MATLAB code to run, in recommended order. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Not using MATLAB ==&lt;br /&gt;
&lt;br /&gt;
Users can attempt to run MATLAB code using the open-source program Octave, accessible through the octave module.  Though there are some differences between the two programs, Octave has been designed to interpret MATLAB code and can often be used in place of MATLAB.  If your MATLAB code does not use some of the fancier MATLAB toolboxes, you may be able to get away with using Octave instead.  Be sure to test your implementation in Octave thoroughly before committing to this option.&lt;br /&gt;
&lt;br /&gt;
It is worth observing that, while convenient for prototyping and running on a single workstation, there are reasons to avoid using MATLAB for larger HPC/ARC projects.  These include the prohibitive license cost for large-scale work, poor performance at scale, and portability issues.  If you can switch to a license-free option, such as Python, it may be worth the effort.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Creating a MATLAB executable ==&lt;br /&gt;
&lt;br /&gt;
If MATLAB must be used, you may be able to compile your MATLAB code into a stand-alone executable, and run this on a GPC compute node.  The version of MATLAB being used will require a compiler license, and the compilation must be done on a Linux machine.  Once the compilation is done, the executable can be copied to SciNet, and run using the MATLAB Compiler Runtime (MCR), which can be accessed using the MCR module.  The MCR used must be the same version of MATLAB as the compiler.  If the version of MCR that you need is not listed among the MCR module versions, contact us and we will install the version which you require.&lt;br /&gt;
&lt;br /&gt;
Here is an example script which uses the MCR:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# MOAB/Torque submission script for SciNet GPC &lt;br /&gt;
#&lt;br /&gt;
#PBS -l nodes=1:ppn=8,walltime=1:00:00&lt;br /&gt;
#PBS -N test_matlab&lt;br /&gt;
#PBS -q debug&lt;br /&gt;
&lt;br /&gt;
# load modules (gcc is needed for MCR/2015a)&lt;br /&gt;
module load Xlibraries gcc/5.2.0 MCR/2015a&lt;br /&gt;
&lt;br /&gt;
# DIRECTORY TO RUN - $PBS_O_WORKDIR is directory from which the job was submitted&lt;br /&gt;
cd $PBS_O_WORKDIR&lt;br /&gt;
&lt;br /&gt;
# Directory for the MCR to use to write temporary files.  Use whatever directory you wish.&lt;br /&gt;
mkdir -p $SCRATCH/temp&lt;br /&gt;
export MCR_CACHE_ROOT=$SCRATCH/temp&lt;br /&gt;
&lt;br /&gt;
# EXECUTION COMMAND (note that the MATLAB script may require that LD_LIBRARY_PATH be added&lt;br /&gt;
# to the script arguments).  Note that, if the calculations are serial, you must bundle 8 such&lt;br /&gt;
# calculations together for production runs!&lt;br /&gt;
./run_myscript.sh $MCR:$LD_LIBRARY_PATH&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Tunnelling to a license server ==&lt;br /&gt;
&lt;br /&gt;
If you have access to a non-SciNet MATLAB license server, and have installed MATLAB in your $HOME directory, you can set up your submission script to access the external license server.  The following lines should be added to the beginning of your submission script:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
PORT=XXX                                         # port number of the license server&lt;br /&gt;
LICENSE_IP=AAA.BBB.CCC.DDD                       # IP address of the license server&lt;br /&gt;
ssh gpc01 -L${PORT}:${LICENSE_IP}:${PORT} -N &amp;amp;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This last line will tunnel the port from the compute node back to the license server, through gpc01.  The port number and IP address of the licence server must be supplied by the system administrator of the license server.&lt;br /&gt;
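With the tunnel in place, MATLAB must also be told to look for its license on the forwarded local port. A hedged sketch (FlexLM-based tools such as MATLAB typically read the license location from an environment variable in &amp;lt;tt&amp;gt;port@host&amp;lt;/tt&amp;gt; form; confirm the variable name and any additional vendor-daemon port with your license administrator):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
export MLM_LICENSE_FILE=${PORT}@localhost&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;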
&lt;br /&gt;
&lt;br /&gt;
== Using a different Consortium ==&lt;br /&gt;
&lt;br /&gt;
Both [https://www.sharcnet.ca/my/software/show/54 Sharcnet] and [https://www.westgrid.ca/support/software/matlab Westgrid] have purchased different types of MATLAB licenses.  Users can contact those consortia if they wish to attempt to run on those systems.&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=BGQ_OpenFOAM&amp;diff=9389</id>
		<title>BGQ OpenFOAM</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=BGQ_OpenFOAM&amp;diff=9389"/>
		<updated>2018-08-31T23:03:10Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Using OpenFOAM on BG/Q ==&lt;br /&gt;
There are various OpenFOAM versions installed on BGQ. You can see the list by typing &amp;quot;module avail&amp;quot; on the terminal:&lt;br /&gt;
* OpenFOAM/2.3.1(default)&lt;br /&gt;
* OpenFOAM/2.4.0&lt;br /&gt;
* OpenFOAM/3.0.1&lt;br /&gt;
* OpenFOAM/5.0&lt;br /&gt;
and&lt;br /&gt;
* FEN/OpenFOAM/2.2.0&lt;br /&gt;
* FEN/OpenFOAM/2.3.0&lt;br /&gt;
* FEN/OpenFOAM/2.4.0&lt;br /&gt;
* FEN/OpenFOAM/3.0.1&lt;br /&gt;
* FEN/OpenFOAM/5.0 &lt;br /&gt;
&lt;br /&gt;
The modules starting with FEN refer to installations that can be used on the Front-End Nodes. Therefore, if you want to run serial tasks such as blockMesh, decomposePar or reconstructParMesh, please use the FEN/OpenFOAM/* modules. Do not forget that a FEN is not a dedicated resource: each Front-End Node is shared among the connected users and only has 32GB of memory. So if you try to decompose a case with 100 million cells, you will occupy the whole FEN machine, run out of memory, and make it unavailable for everyone.&lt;br /&gt;
&lt;br /&gt;
When you want to submit a job, you should do that on the FEN using a batch script, loading the modules you want inside the batch script. This is the only way of using compute nodes on BGQ. There is a sample batch script below; you can use it as a template and modify it according to your needs.&lt;br /&gt;
&lt;br /&gt;
== Running Serial OpenFOAM Tasks ==&lt;br /&gt;
&lt;br /&gt;
As noted in the previous section, if you want to run serial tasks you need to use one of the FEN based modules. The most common serial tasks are:&lt;br /&gt;
* blockMesh: Creates the block-structured computational volume consisting of hex elements.&lt;br /&gt;
* decomposePar: Parallelises a serial case (grid partitioning).&lt;br /&gt;
* reconstructPar: Reconstructs a parallel case (results). &lt;br /&gt;
* reconstructParMesh: Reconstructs a parallel case (mesh). &lt;br /&gt;
&lt;br /&gt;
These binaries are not available on the compute nodes, so you can only use these tools on the FEN anyway.&lt;br /&gt;
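For example, a typical serial pre-processing session on a FEN might look like this (a sketch; pick the FEN module version matching your case and substitute your own case directory):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load FEN/OpenFOAM/5.0&lt;br /&gt;
source $FOAM_DOT_FILE      # as in the batch scripts below&lt;br /&gt;
cd $SCRATCH/mycase         # illustrative case directory&lt;br /&gt;
blockMesh&lt;br /&gt;
decomposePar -force&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;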
&lt;br /&gt;
== Parallelizing OpenFOAM Cases ==&lt;br /&gt;
&lt;br /&gt;
In order to run OpenFOAM in parallel, the problem needs to be decomposed into a number of subdomains that matches the number of processors that will be used. OpenFOAM has a '''[http://www.openfoam.org/docs/user/running-applications-parallel.php decomposePar]''' utility that performs this operation. This is controlled by creating an OpenFOAM dictionary called decomposeParDict in the system directory of your case folder; decomposeParDict is the input file for the command &amp;quot;decomposePar -force&amp;quot;. Below is an example file for decomposing an OpenFOAM case for running on 4 cores.&lt;br /&gt;
&lt;br /&gt;
'''system/decomposeParDict'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
/*--------------------------------*- C++ -*----------------------------------*\&lt;br /&gt;
| =========                 |                                                 |&lt;br /&gt;
| \\      /  F ield         | OpenFOAM: The Open Source CFD Toolbox           |&lt;br /&gt;
|  \\    /   O peration     | Version:  2.4.0                                 |&lt;br /&gt;
|   \\  /    A nd           | Web:      www.OpenFOAM.org                      |&lt;br /&gt;
|    \\/     M anipulation  |                                                 |&lt;br /&gt;
\*---------------------------------------------------------------------------*/&lt;br /&gt;
FoamFile&lt;br /&gt;
{&lt;br /&gt;
    version     2.0;&lt;br /&gt;
    format      ascii;&lt;br /&gt;
    class       dictionary;&lt;br /&gt;
    location    &amp;quot;system&amp;quot;;&lt;br /&gt;
    object      decomposeParDict;&lt;br /&gt;
}&lt;br /&gt;
// * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * //&lt;br /&gt;
&lt;br /&gt;
numberOfSubdomains 4;&lt;br /&gt;
&lt;br /&gt;
method          simple;&lt;br /&gt;
&lt;br /&gt;
simpleCoeffs&lt;br /&gt;
{&lt;br /&gt;
    n               ( 2 2 1 );&lt;br /&gt;
    delta           0.001;&lt;br /&gt;
}&lt;br /&gt;
&lt;br /&gt;
// ************************************************************************* //&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Another option for decomposition is hierarchical. If you use this method, then similarly to simple you have to define hierarchicalCoeffs. The only difference between simple and hierarchical is that with the hierarchical method you can define the order of the decomposition operation (xyz or zyx). There are more complicated decomposition methods supported by OpenFOAM, but since this is a serial task that needs to be performed on the FEN, these two methods are suggested.&lt;br /&gt;
&lt;br /&gt;
The crucial part of the decomposeParDict is the numberOfSubdomains defined in the file. The intended number of cores should match this value: if one wants to run a case on 64 nodes using all cores, then numberOfSubdomains should be 1024. Also, for consistency, the product of the n values should be equal to this number; otherwise OpenFOAM will complain about the mismatch.&lt;br /&gt;
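For instance, the relevant entries for such a 64-node, 1024-core run would look like this (the split of n over the three directions is just one consistent choice, since 16*8*8 = 1024):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
numberOfSubdomains 1024;&lt;br /&gt;
&lt;br /&gt;
method          simple;&lt;br /&gt;
&lt;br /&gt;
simpleCoeffs&lt;br /&gt;
{&lt;br /&gt;
    n               ( 16 8 8 );&lt;br /&gt;
    delta           0.001;&lt;br /&gt;
}&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;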
&lt;br /&gt;
== Running Parallel Meshing ==&lt;br /&gt;
The built-in meshing tool that comes with the OpenFOAM package is called snappyHexMesh. This tool reads inputs from the &amp;quot;system/snappyHexMeshDict&amp;quot; file and writes outputs to the &amp;quot;constant/polyMesh&amp;quot; folder (if used with the -overwrite flag; otherwise it writes to separate time folders 1/, 2/). snappyHexMesh operates on the outputs of blockMesh, refines specified regions, snaps out solid areas from the volume and adds boundary layers if enabled. &lt;br /&gt;
&lt;br /&gt;
Before running mesh generation one needs to run &amp;quot;decomposePar -force&amp;quot;, so that the case is decomposed and parallel executions can be run on it. One can submit the script below to run parallel mesh generation on BG/Q:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/sh&lt;br /&gt;
# @ job_name           = motorBike_mesh&lt;br /&gt;
# @ job_type           = bluegene&lt;br /&gt;
# @ comment            = &amp;quot;BGQ Job By Size&amp;quot;&lt;br /&gt;
# @ error              = $(jobid).err&lt;br /&gt;
# @ output             = $(jobid).out&lt;br /&gt;
# @ bg_size            = 64&lt;br /&gt;
# @ wall_clock_limit   = 06:00:00&lt;br /&gt;
# @ bg_connectivity    = Torus&lt;br /&gt;
# @ queue &lt;br /&gt;
&lt;br /&gt;
# Load modules&lt;br /&gt;
module purge&lt;br /&gt;
module load binutils/2.23 bgqgcc/4.8.1 mpich2/gcc-4.8.1 OpenFOAM/5.0&lt;br /&gt;
source $FOAM_DOT_FILE&lt;br /&gt;
&lt;br /&gt;
# NOTE: when using --env-all there is a limit of 8192 characters that can be passed to runjob&lt;br /&gt;
# so removing LS_COLORS should free up enough space&lt;br /&gt;
export -n LS_COLORS&lt;br /&gt;
&lt;br /&gt;
# Disabling the pt2pt small message optimizations - Solves hanging problems&lt;br /&gt;
export PAMID_SHORT=0&lt;br /&gt;
&lt;br /&gt;
# Sets the cutoff point for switching from eager to rendezvous protocol at 50MB&lt;br /&gt;
export PAMID_EAGER=50M&lt;br /&gt;
&lt;br /&gt;
# Do not optimise collective comm. - Solves termination with signal 36 issue&lt;br /&gt;
export PAMID_COLLECTIVES=0&lt;br /&gt;
&lt;br /&gt;
# Do not generate core dump files&lt;br /&gt;
export BG_COREDUMPDISABLED=1&lt;br /&gt;
&lt;br /&gt;
# Run mesh generation&lt;br /&gt;
runjob --np 1024 --ranks-per-node=16 --env-all : $FOAM_APPBIN/snappyHexMesh -overwrite -parallel&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Loadleveler Submission Script for Solvers ==&lt;br /&gt;
&lt;br /&gt;
The following is a sample script for running the OpenFOAM tutorial case on BG/Q:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/sh&lt;br /&gt;
# @ job_name           = bgqopenfoam&lt;br /&gt;
# @ job_type           = bluegene&lt;br /&gt;
# @ comment            = &amp;quot;BGQ Job By Size&amp;quot;&lt;br /&gt;
# @ error              = $(job_name).$(Host).$(jobid).err&lt;br /&gt;
# @ output             = $(job_name).$(Host).$(jobid).out&lt;br /&gt;
# @ bg_size            = 64&lt;br /&gt;
# @ wall_clock_limit   = 06:00:00&lt;br /&gt;
# @ bg_connectivity    = Torus&lt;br /&gt;
# @ queue &lt;br /&gt;
&lt;br /&gt;
#------------------ Solver on BGQ --------------------&lt;br /&gt;
# Load BGQ OpenFOAM modules&lt;br /&gt;
module purge&lt;br /&gt;
module load binutils/2.23 bgqgcc/4.8.1 mpich2/gcc-4.8.1 OpenFOAM/5.0&lt;br /&gt;
source $FOAM_DOT_FILE&lt;br /&gt;
&lt;br /&gt;
# NOTE: when using --env-all there is a limit of 8192 characters that can be passed to runjob&lt;br /&gt;
# so removing LS_COLORS should free up enough space&lt;br /&gt;
export -n LS_COLORS&lt;br /&gt;
&lt;br /&gt;
# Some solvers, simpleFoam in particular, will hang on startup when using the default&lt;br /&gt;
# network parameters.  Disabling the pt2pt small message optimizations seems to allow it to run.&lt;br /&gt;
export PAMID_SHORT=0&lt;br /&gt;
export PAMID_EAGER=50M&lt;br /&gt;
&lt;br /&gt;
# Do not optimise collective comm.&lt;br /&gt;
export PAMID_COLLECTIVES=0&lt;br /&gt;
&lt;br /&gt;
# Do not generate core dump files&lt;br /&gt;
export BG_COREDUMPDISABLED=1&lt;br /&gt;
&lt;br /&gt;
# Run solver&lt;br /&gt;
runjob --np 1024 --env-all  : $FOAM_APPBIN/icoFoam -parallel&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Typical OpenFOAM Applications on BG/Q ==&lt;br /&gt;
A list of examples will be shared here. These sample cases are derived from applications that are run on BG/Q, but have been changed for confidentiality reasons. They can guide new users towards their own specific use cases. Most of the information here is OpenFOAM-specific, not BG/Q-specific.&lt;br /&gt;
&lt;br /&gt;
=== Wind Flow Around Buildings ===&lt;br /&gt;
This is a tutorial case that can be found in $FOAM_TUTORIALS/incompressible/simpleFoam/windAroundBuildings&lt;br /&gt;
&lt;br /&gt;
=== Rotational Flows in OpenFOAM ===&lt;br /&gt;
Information will be added soon!&lt;br /&gt;
&lt;br /&gt;
=== LES Models in OpenFOAM ===&lt;br /&gt;
Information will be added soon!&lt;br /&gt;
&lt;br /&gt;
=== Multiphase Flows in OpenFOAM ===&lt;br /&gt;
Information will be added soon!&lt;br /&gt;
&lt;br /&gt;
== Post-Processing == &lt;br /&gt;
&lt;br /&gt;
This section will be updated once the ParaView installation on Niagara is completed!&lt;br /&gt;
&lt;br /&gt;
https://support.scinet.utoronto.ca/wiki/index.php/Using_Paraview&lt;br /&gt;
&lt;br /&gt;
== General Tips and Tricks ==&lt;br /&gt;
&lt;br /&gt;
* Run serial tasks on the FEN using the FEN/OpenFOAM/* modules&lt;br /&gt;
* Check the quality of your mesh using the checkMesh tool. Be aware that if you run a serial checkMesh on a parallel (decomposed) case, it will only return results for &amp;quot;case/constant/polyMesh&amp;quot;, not for &amp;quot;case/processor*/constant/polyMesh&amp;quot;&lt;br /&gt;
* Perform test runs on the debug nodes before you submit large jobs. Request a debug session with &amp;quot;debugjob -i&amp;quot; and use runjob.&lt;br /&gt;
* Always work with binary files. This can be set in &amp;quot;case/system/controlDict&amp;quot;.&lt;br /&gt;
* You can convert cases from ASCII to binary using the foamFormatConvert command.&lt;br /&gt;
* Keep your simulations under $SCRATCH.&lt;br /&gt;
* If you write your own code, keep it under $HOME. Preferably create a directory &amp;quot;$HOME/OpenFOAM/username-X.Y/src&amp;quot; and work there.&lt;br /&gt;
* If you write your own code, do not forget to compile it into $FOAM_USER_APPBIN or $FOAM_USER_LIBBIN. You might need to compile shared objects on the debug nodes as well.&lt;br /&gt;
* OpenFOAM is a pure MPI code; there is no multithreading in OpenFOAM.&lt;br /&gt;
* Every node on BG/Q has 16 GB of memory and 16 compute cores. Some OpenFOAM functions, especially snappyHexMesh, consume a lot of memory, up to 4 GB per million cells. Use 8 ranks per node if you run out of memory, but be careful not to waste resources: solvers usually require about 1 GB per million cells, which allows all 16 compute cores on a node to be fully utilized.&lt;br /&gt;
* Try the collated file handling option available in version 5.0. It significantly reduces the number of files; however, the master processor gets overloaded.&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Python&amp;diff=9388</id>
		<title>Python</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Python&amp;diff=9388"/>
		<updated>2018-08-31T23:01:36Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
[http://www.python.org/ Python] is a programming language that continues to grow in popularity for scientific computing.   It is very fast to write code in, but the software that results is much slower than C or Fortran; one should be wary of doing too much compute-intensive work in pure Python.&lt;br /&gt;
&lt;br /&gt;
There is a dizzying amount of documentation available for programming in Python on the [http://python.org/ Python.org webpage]; SciNet gave a mini-course of 8 lectures on [[Research Computing with Python]] in the Fall of 2013.&lt;br /&gt;
An excellent set of material for teaching scientists to program in Python is also available at the [http://software-carpentry.org/4_0/python/ Software Carpentry homepage].&lt;br /&gt;
&lt;br /&gt;
__FORCETOC__ &lt;br /&gt;
&lt;br /&gt;
== Python on the GPC ==&lt;br /&gt;
&lt;br /&gt;
We currently have several versions of Python installed, compiled against the fast Intel math libraries.  To load a given version, type the corresponding command:&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
! Version&lt;br /&gt;
! Command&lt;br /&gt;
|-&lt;br /&gt;
|2.7.2&lt;br /&gt;
|&amp;lt;tt&amp;gt;module load gcc intel python&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|2.7.3&lt;br /&gt;
|&amp;lt;tt&amp;gt;module load gcc intel/13.1.1 python/2.7.3&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|2.7.5&lt;br /&gt;
|&amp;lt;tt&amp;gt;module load gcc intel/13.1.1 python/2.7.5&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|2.7.8&lt;br /&gt;
|&amp;lt;tt&amp;gt;module load intel/15.0.2 python/2.7.8&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|2.7.11&lt;br /&gt;
|&amp;lt;tt&amp;gt;module load anaconda2/4.0.0&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|2.7.13&lt;br /&gt;
|&amp;lt;tt&amp;gt;module load anaconda2/4.3.1&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|3.3.4&lt;br /&gt;
|&amp;lt;tt&amp;gt;module load gcc intel/14.0.1 python/3.3.4&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|3.5.1&lt;br /&gt;
|&amp;lt;tt&amp;gt;module load anaconda3/4.0.0&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|3.6.1&lt;br /&gt;
|&amp;lt;tt&amp;gt;module load anaconda3/4.4.0&amp;lt;/tt&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
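&lt;br /&gt;
For example, to load Python 2.7.5 and then confirm which interpreter ended up on your path (a quick sanity check; not required):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc intel/13.1.1 python/2.7.5&lt;br /&gt;
which python&lt;br /&gt;
python --version&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;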
&lt;br /&gt;
== Modules installed system-wide ==&lt;br /&gt;
&lt;br /&gt;
Many optional packages are available for Python which greatly extend the language, adding important new functionality.  Those packages which are likely to be important to all of our users (e.g., [http://numpy.scipy.org/ NumPy], [http://www.scipy.org/ SciPy], and [http://matplotlib.sourceforge.net/ Matplotlib]) are installed system-wide.&lt;br /&gt;
&lt;br /&gt;
Below is a list of the packages currently installed system-wide.&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;10&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
!{{Hl2}}| Module  &lt;br /&gt;
!{{Hl2}}| python/2.7.2 &lt;br /&gt;
!{{Hl2}}| python/2.7.3 &lt;br /&gt;
!{{Hl2}}| python/2.7.5 &lt;br /&gt;
!{{Hl2}}| python/2.7.8&lt;br /&gt;
!{{Hl2}}| python/3.3.4&lt;br /&gt;
!{{Hl2}}| Comments&lt;br /&gt;
|-  &lt;br /&gt;
|[http://www.scipy.org/ SciPy]&lt;br /&gt;
| 0.10.0&lt;br /&gt;
| 0.11.0&lt;br /&gt;
| 0.14.0&lt;br /&gt;
| 0.14.0&lt;br /&gt;
| 0.14.0&lt;br /&gt;
| Open-source software for mathematics, science, and engineering.  The Python 2.7.x versions are linked against the very fast MKL numerical libraries. &lt;br /&gt;
|-&lt;br /&gt;
|[http://numpy.scipy.org/ NumPy]&lt;br /&gt;
| 1.6.1&lt;br /&gt;
| 1.7.0&lt;br /&gt;
| 1.7.0&lt;br /&gt;
| 1.9.1&lt;br /&gt;
| 1.8.1&lt;br /&gt;
| NumPy is the fundamental package needed for scientific computing with Python. Contains fast arrays, tools for integrating C/C++ and Fortran code, linear algebra solvers, etc.  SciPy is built on top of NumPy.&lt;br /&gt;
|-&lt;br /&gt;
| [http://mpi4py.scipy.org/ mpi4py]&lt;br /&gt;
| 1.2.2&lt;br /&gt;
| 1.2.2&lt;br /&gt;
| 1.2.2&lt;br /&gt;
| 1.2.2&lt;br /&gt;
| 1.2.2&lt;br /&gt;
| A pythonic interface to MPI.   Built for openmpi; you must load an openmpi module for this to work. (There is an issue with openmpi 1.4.x + infiniband; it does, however, appear to work fine with IntelMPI)&lt;br /&gt;
|-&lt;br /&gt;
| [http://www.scipy.org/SciPyPackages/NumExpr Numexpr]&lt;br /&gt;
| 2.0&lt;br /&gt;
| 2.0.1&lt;br /&gt;
| 2.2.1&lt;br /&gt;
| 2.4&lt;br /&gt;
| 2.4_rc2&lt;br /&gt;
| Fast, memory-efficient elementwise operations on Numpy arrays.&lt;br /&gt;
|-&lt;br /&gt;
| [http://dirac.cnrs-orleans.fr/plone/software/scientificpython/ ScientificPython]&lt;br /&gt;
| 2.8 &lt;br /&gt;
| -&lt;br /&gt;
| -&lt;br /&gt;
| -&lt;br /&gt;
| -&lt;br /&gt;
| A collection of scientific python utilities.   Does not include MPI support.  No longer supported.&lt;br /&gt;
|-&lt;br /&gt;
| [http://yt.enzotools.org/ yt]&lt;br /&gt;
| 2.2&lt;br /&gt;
| 2.5.3&lt;br /&gt;
| 2.5.5&lt;br /&gt;
| -&lt;br /&gt;
| -&lt;br /&gt;
| A collection of python tools for analyzing astrophysical simulation output.&lt;br /&gt;
|-&lt;br /&gt;
| [http://ipython.scipy.org/moin/ iPython]&lt;br /&gt;
| 0.11 &lt;br /&gt;
| 0.13.1&lt;br /&gt;
| 1.0.0&lt;br /&gt;
| 2.3.0&lt;br /&gt;
| 1.2.1&lt;br /&gt;
| An enhanced interactive python.&lt;br /&gt;
|-&lt;br /&gt;
| [http://matplotlib.sourceforge.net/ Matplotlib], pylab&lt;br /&gt;
| 1.1.0&lt;br /&gt;
| 1.2.0&lt;br /&gt;
| 1.3.0&lt;br /&gt;
| 1.4.2&lt;br /&gt;
| 1.3.1&lt;br /&gt;
| Matlab-like plotting for python.&lt;br /&gt;
|-&lt;br /&gt;
| [http://www.pytables.org/moin PyTables]&lt;br /&gt;
| 2.3.1 &lt;br /&gt;
| 2.4.0&lt;br /&gt;
| 3.0.0&lt;br /&gt;
| 3.1.1&lt;br /&gt;
| 3.1.1&lt;br /&gt;
| Fast and efficient access to HDF5 files (and HDF5-format NetCDF4 files).   Requires the &amp;lt;tt&amp;gt;hdf5/184-p1-v18-serial-gcc&amp;lt;/tt&amp;gt; module to be loaded. &lt;br /&gt;
|-&lt;br /&gt;
| [http://code.google.com/p/netcdf4-python/ NetCDF4-python]&lt;br /&gt;
| 0.9.8&lt;br /&gt;
| 1.0.4&lt;br /&gt;
| 1.1.1&lt;br /&gt;
| -&lt;br /&gt;
| 1.1.0&lt;br /&gt;
| Python interface to NetCDF4 files.   Requires the &amp;lt;tt&amp;gt;netcdf/4.0.1_hdf5_v18-serial.shared-nofortran&amp;lt;/tt&amp;gt; module to be loaded. &lt;br /&gt;
|-&lt;br /&gt;
| [http://www.pyngl.ucar.edu/Nio.shtml pyNIO]&lt;br /&gt;
| 1.4.1&lt;br /&gt;
| -&lt;br /&gt;
| -&lt;br /&gt;
| -&lt;br /&gt;
| -&lt;br /&gt;
| Yet another Python interface to NetCDF4 files; again, requires the &amp;lt;tt&amp;gt;netcdf/4.0.1_hdf5_v18-serial.shared-nofortran&amp;lt;/tt&amp;gt; module.  No longer supported.&lt;br /&gt;
|-&lt;br /&gt;
| [http://alfven.org/wp/hdf5-for-python/ h5py]&lt;br /&gt;
| 2.0.1&lt;br /&gt;
| 2.1.3&lt;br /&gt;
| 2.2.0&lt;br /&gt;
| 2.3.1&lt;br /&gt;
| 2.3.0&lt;br /&gt;
| Yet another Python interface to HDF5 files; again, requires an HDF5 module to be loaded.&lt;br /&gt;
|-&lt;br /&gt;
| [http://pysvn.tigris.org/ PySVN]&lt;br /&gt;
| 1.7.1&lt;br /&gt;
| -&lt;br /&gt;
| -&lt;br /&gt;
| -&lt;br /&gt;
| -&lt;br /&gt;
| Python interface to the svn version control system. &lt;br /&gt;
|-&lt;br /&gt;
| [http://mercurial.selenic.com/ Mercurial]&lt;br /&gt;
| 2.0.1&lt;br /&gt;
| 2.6.2&lt;br /&gt;
| 2.7.1&lt;br /&gt;
| 3.2&lt;br /&gt;
| -&lt;br /&gt;
| A distributed version-control system written in Python.&lt;br /&gt;
|-&lt;br /&gt;
| [http://cython.org/ Cython]&lt;br /&gt;
| 0.15.1&lt;br /&gt;
| 0.18&lt;br /&gt;
| 0.19.1&lt;br /&gt;
| 0.21.1&lt;br /&gt;
| 0.20.1&lt;br /&gt;
| Cython is a compiler which compiles Python-like code files to C code and allows them to be easily called from Python.&lt;br /&gt;
|-&lt;br /&gt;
| [http://code.google.com/p/python-nose/ nose]&lt;br /&gt;
| 1.1.2&lt;br /&gt;
| 1.2.1&lt;br /&gt;
| 1.3.0&lt;br /&gt;
| 1.3.4&lt;br /&gt;
| 1.3.0&lt;br /&gt;
| A unit-testing framework for python.&lt;br /&gt;
|- &lt;br /&gt;
| [http://pypi.python.org/pypi/setuptools setuptools]&lt;br /&gt;
| 0.6c11&lt;br /&gt;
| 0.6c11&lt;br /&gt;
| 1.1&lt;br /&gt;
| 7.0&lt;br /&gt;
| 5.1&lt;br /&gt;
| Enables easy installation of new python modules&lt;br /&gt;
|-&lt;br /&gt;
| [http://pandas.pydata.org/ pandas]&lt;br /&gt;
| 0.13.0&lt;br /&gt;
| 0.13.0&lt;br /&gt;
| 0.13.0&lt;br /&gt;
| 0.15.0&lt;br /&gt;
| 0.14.1&lt;br /&gt;
| high-performance, easy-to-use data structures and data analysis tools.&lt;br /&gt;
|-&lt;br /&gt;
| [http://www.astropy.org astropy]&lt;br /&gt;
| -&lt;br /&gt;
| -&lt;br /&gt;
| 0.3&lt;br /&gt;
| 0.4.2&lt;br /&gt;
| 0.3.2&lt;br /&gt;
| astronomical routines&lt;br /&gt;
|-&lt;br /&gt;
| [http://briansimulator.org/ brian]&lt;br /&gt;
| 1.4.1&lt;br /&gt;
| 1.4.1&lt;br /&gt;
| 1.4.1&lt;br /&gt;
| 1.4.1&lt;br /&gt;
| -&lt;br /&gt;
| spiking neural network simulator&lt;br /&gt;
|- &lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Producing Matplotlib Figures on GPC Compute Nodes and in Job Scripts ==&lt;br /&gt;
&lt;br /&gt;
The conventional way of producing figures from Python using Matplotlib, i.e., &lt;br /&gt;
&lt;br /&gt;
    import matplotlib.pyplot as plt&lt;br /&gt;
    plt.plot(.....)&lt;br /&gt;
    plt.savefig(...)&lt;br /&gt;
&lt;br /&gt;
will not work on the GPC compute nodes. The reason is that pyplot will try to open the figure in a window on the screen, but the compute nodes do not have screens or window managers.  There is an easy workaround, however, that sets up a different 'backend' to matplotlib, one that does not try to open a window, as follows:&lt;br /&gt;
 &lt;br /&gt;
    import matplotlib as mpl&lt;br /&gt;
    mpl.use('Agg')&lt;br /&gt;
    import matplotlib.pyplot as plt&lt;br /&gt;
    plt.plot(.....)&lt;br /&gt;
    plt.savefig(...)&lt;br /&gt;
&lt;br /&gt;
It is essential that the &amp;lt;tt&amp;gt;mpl.use('Agg')&amp;lt;/tt&amp;gt; command precedes the importing of pyplot. &lt;br /&gt;
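&lt;br /&gt;
Putting this together, here is a minimal, self-contained sketch (the data and output file name are arbitrary) that can be run from a job script:&lt;br /&gt;
&lt;br /&gt;
    import matplotlib as mpl&lt;br /&gt;
    mpl.use('Agg')               # must come before the pyplot import&lt;br /&gt;
    import matplotlib.pyplot as plt&lt;br /&gt;
&lt;br /&gt;
    plt.plot([0, 1, 2, 3], [0, 1, 4, 9])&lt;br /&gt;
    plt.savefig('squares.png')   # written to disk; no window is opened&lt;br /&gt;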
&lt;br /&gt;
== Installing your own Python Modules ==&lt;br /&gt;
&lt;br /&gt;
Python provides an easy way for users to install the libraries they need in their home directories rather than having them installed system-wide. There are so many optional packages for Python that people could potentially want (see e.g. http://pypi.python.org/pypi) that we recommend users install these additional packages locally in their home directories.  This is almost certainly the easiest way to deal with the wide range of packages, ensure they're up to date, and ensure that users' package choices don't conflict. &lt;br /&gt;
&lt;br /&gt;
To install your own Python modules, follow the instructions below.   Where the instructions say &amp;lt;tt&amp;gt;python2.X&amp;lt;/tt&amp;gt;, type &amp;lt;tt&amp;gt;python2.6&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;python2.7&amp;lt;/tt&amp;gt; depending on the version of python you are using.&lt;br /&gt;
&lt;br /&gt;
* First, create a directory in your home directory, &amp;lt;tt&amp;gt;${HOME}/lib/python2.X/site-packages&amp;lt;/tt&amp;gt;, where the packages will go.&lt;br /&gt;
* Next, in your &amp;lt;tt&amp;gt;.bashrc&amp;lt;/tt&amp;gt;, *after* you &amp;lt;tt&amp;gt;module load python&amp;lt;/tt&amp;gt; and in the &amp;quot;GPC&amp;quot; section, add the following line:&lt;br /&gt;
&amp;lt;source lang=bash&amp;gt;&lt;br /&gt;
export PYTHONPATH=${PYTHONPATH}:${HOME}/lib/python2.X/site-packages/&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Re-load the modified .bashrc by typing &amp;lt;tt&amp;gt;source ~/.bashrc&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
* Now, if it's a standard python package and the instructions say that you can use easy_install to install it,&lt;br /&gt;
** install it with the following command, where &amp;lt;tt&amp;gt;packagename&amp;lt;/tt&amp;gt; is the name of the package you are installing: &lt;br /&gt;
&amp;lt;source lang=bash&amp;gt;&lt;br /&gt;
easy_install --prefix=${HOME} -O1 [packagename]&lt;br /&gt;
&amp;lt;/source&amp;gt; &lt;br /&gt;
&lt;br /&gt;
** Continue doing this until all of the packages you need to install are successfully installed.&lt;br /&gt;
** If, upon importing the new python package, you get error messages like &amp;lt;tt&amp;gt;undefined symbol: __stack_chk_guard&amp;lt;/tt&amp;gt;, you may need to use the following command instead:&lt;br /&gt;
&amp;lt;source lang=bash&amp;gt;&lt;br /&gt;
LDFLAGS=-fstack-protector easy_install --prefix=${HOME} -O1 [packagename]&lt;br /&gt;
&amp;lt;/source&amp;gt; &lt;br /&gt;
&lt;br /&gt;
* If easy_install isn't an option for your package, and the installation instructions instead talk about downloading a file and using &amp;lt;tt&amp;gt;python setup.py install&amp;lt;/tt&amp;gt;, then:&lt;br /&gt;
** Download the relevant files&lt;br /&gt;
** You will probably have to uncompress and untar them: &amp;lt;tt&amp;gt;tar -xzvf packagename.tgz&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;tar -xjvf packagename.bz2&amp;lt;/tt&amp;gt;.&lt;br /&gt;
** cd into the newly created directory, and run &lt;br /&gt;
&amp;lt;source lang=bash&amp;gt;&lt;br /&gt;
python setup.py install --prefix=${HOME}&lt;br /&gt;
&amp;lt;/source&amp;gt; &lt;br /&gt;
&lt;br /&gt;
* Now, the install process may have added some .egg files or directories to your path.  For each .egg directory, add that to your python path as well in your .bashrc, in the same place as you had updated PYTHONPATH before: eg,&lt;br /&gt;
&amp;lt;source lang=bash&amp;gt;&lt;br /&gt;
export PYTHONPATH=${PYTHONPATH}:${HOME}/lib/python2.X/site-packages:${HOME}/lib/python2.X/site-packages/packagename1-x.y.z-yy2.X.egg:${HOME}/lib/python2.X/site-packages/packagename2-a.b.c-py2.X.egg&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* You should now be done!   Re-source your .bashrc and test your new python modules.&lt;br /&gt;
&lt;br /&gt;
* In order to keep your .bashrc relatively uncluttered, and to avoid potential conflicts among software modules, we recommend that users create their own  modules (for the &amp;quot;module&amp;quot; system, not specifically python modules).  &lt;br /&gt;
&lt;br /&gt;
[[Brian|Here]] is an example module for the [[Brian]] package, including instructions for the installation of the python [[Brian]] package itself.&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=GPC_MPI_Versions&amp;diff=9387</id>
		<title>GPC MPI Versions</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=GPC_MPI_Versions&amp;diff=9387"/>
		<updated>2018-08-09T17:32:22Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
You can only use ONE MPI version at a time, as most versions use the same &lt;br /&gt;
names for mpirun and the compiler wrappers, so be careful which modules you &lt;br /&gt;
have loaded in your &amp;lt;tt&amp;gt;~/.bashrc&amp;lt;/tt&amp;gt;.  For this reason, we generally recommend not loading any modules in your .bashrc.&lt;br /&gt;
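&lt;br /&gt;
A quick way to verify which MPI stack is currently active (a sanity check, not required):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
which mpirun mpicc&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;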
&lt;br /&gt;
__TOC__&lt;br /&gt;
&lt;br /&gt;
===OpenMPI===&lt;br /&gt;
&lt;br /&gt;
To use [http://www.open-mpi.org/ OpenMPI] compiled with the Intel compilers, load the modules &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load intel openmpi&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&lt;br /&gt;
or for the gcc version use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc openmpi/1.4.2-gcc-v4.4.0-ofed &lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&lt;br /&gt;
The MPI library wrappers for compiling are  mpicc/mpicxx/mpif90/mpif77.&lt;br /&gt;
&lt;br /&gt;
OpenMPI has been built to support various communication methods and automatically uses the best method&lt;br /&gt;
depending on how and where it is run.  To explicitly specify the method, you can use &amp;lt;tt&amp;gt;--mca&amp;lt;/tt&amp;gt; flags:&lt;br /&gt;
the following for infiniband&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
mpirun --mca pml ob1 --mca btl self,sm,openib -np 16  ./a.out&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
and the following for IPoIB (not recommended)&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
mpirun --mca pml ob1 --mca btl self,sm,tcp --mca btl_tcp_if_exclude lo,eth0 -np 16 ./a.out&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
If your program runs into what appear to be memory issues with larger jobs, try using the XRC communications&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
mpirun --mca pml ob1 --mca btl_openib_receive_queues X,128,256,192,128:X,2048,256,128,32:X,12288,256,128,32 --mca btl_openib_max_send_size 12288 -np 16 ./a.out&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
If you are still having issues, you can try openmpi &amp;gt; 1.6, which uses the Mellanox Messaging library (MXM) by default for more than 128 MPI tasks&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load openmpi/intel/1.6.4&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
or MXM can be explicitly requested, if using fewer than 128 MPI tasks, with&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
mpirun --mca mtl mxm --mca mtl_mxm_np 2  -np 16 ./a.out&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To NOT use MXM, regardless of the number of MPI tasks, use:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
mpirun --mca pml ob1 -np 16 ./a.out&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For mixed OpenMP/MPI applications, set OMP_NUM_THREADS to the number of threads per process and add '&amp;lt;tt&amp;gt;--bynode&amp;lt;/tt&amp;gt;' to the mpirun command, e.g.,&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
export OMP_NUM_THREADS=4&lt;br /&gt;
mpirun -np 6 --bynode ./a.out&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
would start 6 MPI processes on different nodes, each with 4 OpenMP threads. If your script requests 3 nodes, each node gets 2 MPI processes. &lt;br /&gt;
&lt;br /&gt;
For more information on available flags see the OpenMPI [http://www.open-mpi.org/faq/ FAQ]&lt;br /&gt;
&lt;br /&gt;
===IntelMPI===&lt;br /&gt;
&lt;br /&gt;
[http://software.intel.com/en-us/intel-mpi-library/ IntelMPI] is also an [http://www.mcs.anl.gov/research/projects/mpich2/ MPICH2] derivative&lt;br /&gt;
customized by Intel. To use IntelMPI (4.x) compiled with the Intel compilers, load the modules  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load intel intelmpi&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&lt;br /&gt;
The MPI library wrappers for compiling are  mpicc/mpicxx/mpif90/mpif77.&lt;br /&gt;
&lt;br /&gt;
IntelMPI requires a .mpd.conf file containing the variable &amp;quot;MPD_SECRETWORD=...&amp;quot; in your $HOME directory. The secret word can be an arbitrary string (do not use your login password). To create this file use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
echo &amp;quot;MPD_SECRETWORD=ABC123&amp;quot; &amp;gt; ~/.mpd.conf&lt;br /&gt;
chmod 600  ~/.mpd.conf&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
IntelMPI, like OpenMPI, has been built to support various communication methods and automatically uses the best method&lt;br /&gt;
depending on how and where it is run.   &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
mpirun -np 16 ./a.out&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
To explicitly specify the method, you can use &lt;br /&gt;
the following flags for infiniband (dapl) and shared memory (best performance on IB)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
mpirun -np 16 -env I_MPI_FABRICS=shm:dapl ./a.out&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
or the following for openfabrics (ibverbs) infiniband&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
mpirun -np 16 -genv I_MPI_FABRICS=shm:ofa ./a.out&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
or the following for IPoIB&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
mpirun -np 16 -env I_MPI_TCP_NETMASK=ib -env I_MPI_FABRICS shm:tcp ./a.out&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
If you run into communication errors that appear to be related to memory, try some of the following flags&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load intelmpi/4.0.3.008&lt;br /&gt;
mpirun -np 16 -genv I_MPI_FABRICS=shm:ofa -genv I_MPI_OFA_USE_XRC=1 -genv I_MPI_OFA_DYNAMIC_QPS=1 -genv I_MPI_OFA_NUM_RDMA_CONNECTIONS=-1 ./a.out&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For hybrid OpenMP/MPI runs, set &amp;lt;tt&amp;gt;OMP_NUM_THREADS&amp;lt;/tt&amp;gt; to the desired number of OpenMP threads per MPI process and specify the number of MPI processes per node on the mpirun command line with &amp;lt;tt&amp;gt;-ppn &amp;lt;num&amp;gt;&amp;lt;/tt&amp;gt;. E.g.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
export OMP_NUM_THREADS=4&lt;br /&gt;
mpirun -ppn 2 -np 6 ./a.out&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
would start a total of 6 MPI processes, each with 4 threads, with each node running 2 MPI processes. Your script should request 3 nodes in this case.&lt;br /&gt;
''Note: to compile for hybrid OpenMP/MPI with IntelMPI, you need to add the flag &amp;lt;tt&amp;gt;-mt_mpi&amp;lt;/tt&amp;gt; to your compilation command (i.e. mpicc/mpif90/mpicxx).''&lt;br /&gt;
&lt;br /&gt;
For more information on these and other flags see Intel's [http://software.intel.com/en-us/articles/intel-mpi-library-documentation/ Documentation]&lt;br /&gt;
page, especially the &amp;quot;Getting Started (Linux)&amp;quot; Guide.&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Data_Management&amp;diff=9386</id>
		<title>Data Management</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Data_Management&amp;diff=9386"/>
		<updated>2018-08-09T17:32:13Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=='''Storage Space'''==&lt;br /&gt;
SciNet's storage system is based on IBM's [http://en.wikipedia.org/wiki/IBM_General_Parallel_File_System GPFS] (General Parallel File System).   There are two main systems for user data: &amp;lt;tt&amp;gt;/home&amp;lt;/tt&amp;gt;, a small, backed-up space where user home directories are located, and &amp;lt;tt&amp;gt;/scratch&amp;lt;/tt&amp;gt;, a large system for input and output data for jobs; data on &amp;lt;tt&amp;gt;/scratch&amp;lt;/tt&amp;gt; is not backed up. (A third storage system, /project, exists only for groups with LRAC/NRAC allocations.) Data placed on scratch will be deleted if it has not been accessed in 3 months.  SciNet does not provide long-term storage for large data sets.  &lt;br /&gt;
&lt;br /&gt;
===Overview of the different file systems===&lt;br /&gt;
&lt;br /&gt;
{|border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;10&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! {{Hl2}} | file system &lt;br /&gt;
! {{Hl2}} | purpose &lt;br /&gt;
! {{Hl2}} | user quota &lt;br /&gt;
! {{Hl2}} | block size &lt;br /&gt;
! {{Hl2}} | backed up&lt;br /&gt;
! {{Hl2}} | purged&lt;br /&gt;
! {{Hl2}} | access &lt;br /&gt;
|- &lt;br /&gt;
| /home&lt;br /&gt;
| development&lt;br /&gt;
| 50 GB&lt;br /&gt;
| 256 KB&lt;br /&gt;
| yes&lt;br /&gt;
| never&lt;br /&gt;
| read-only on compute nodes (r/w on login, devel and datamover1) &lt;br /&gt;
|- &lt;br /&gt;
| /scratch&lt;br /&gt;
| computation&lt;br /&gt;
| 20 TB or 1 million files, whichever is reached first&lt;br /&gt;
| 4 MB&lt;br /&gt;
| no&lt;br /&gt;
| files &amp;gt; 3 months&lt;br /&gt;
| read/write on all nodes&lt;br /&gt;
|- &lt;br /&gt;
| /project&lt;br /&gt;
| computation&lt;br /&gt;
| by allocation&lt;br /&gt;
| 256 KB&lt;br /&gt;
| yes&lt;br /&gt;
| never&lt;br /&gt;
| read/write on all nodes &lt;br /&gt;
|}&lt;br /&gt;
Note: project is included in scratch; its contents count against the scratch limits.&lt;br /&gt;
&lt;br /&gt;
===Home Disk Space===&lt;br /&gt;
&lt;br /&gt;
Every SciNet user gets a 50GB directory on &amp;lt;tt&amp;gt;/home&amp;lt;/tt&amp;gt; in a directory &amp;lt;tt&amp;gt;/home/G/GROUP/USER&amp;lt;/tt&amp;gt;, which is regularly backed up.   Home is visible from &amp;lt;tt&amp;gt;login.scinet&amp;lt;/tt&amp;gt; nodes, and from the development nodes on [[GPC_Quickstart | GPC]] and the [[TCS_Quickstart | TCS]].  However, on the compute nodes of the GPC clusters -- as when jobs are running -- &amp;lt;tt&amp;gt;/home&amp;lt;/tt&amp;gt; is mounted '''''read-only'''''; thus GPC jobs can read files in /home but cannot write to files there.   &amp;lt;tt&amp;gt;/home&amp;lt;/tt&amp;gt; is a good place to put code, input files for runs, and anything else that needs to be kept to reproduce runs.  On the other hand, &amp;lt;tt&amp;gt;/home&amp;lt;/tt&amp;gt; is not a good place to put many small files, since&lt;br /&gt;
the block size for the file system is 256KB; you would quickly run out of disk quota, and you would make the backup system very slow.&lt;br /&gt;
&lt;br /&gt;
If your application absolutely insists on writing material to your home account and you can't find a way to instruct it to write somewhere else, an alternative is to create a link pointing from your account under /home to a location under /scratch.&lt;br /&gt;
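&lt;br /&gt;
For example (the directory name &amp;lt;tt&amp;gt;appdata&amp;lt;/tt&amp;gt; here is hypothetical):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
mkdir -p ${SCRATCH}/appdata&lt;br /&gt;
ln -s ${SCRATCH}/appdata ${HOME}/appdata&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;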
&lt;br /&gt;
===Scratch Disk Space===&lt;br /&gt;
&lt;br /&gt;
Every SciNet user also gets a directory in &amp;lt;tt&amp;gt;/scratch&amp;lt;/tt&amp;gt; called &amp;lt;tt&amp;gt;/scratch/G/GROUP/USER&amp;lt;/tt&amp;gt;.   Scratch is visible from the &amp;lt;tt&amp;gt;login.scinet&amp;lt;/tt&amp;gt; nodes,  the development nodes on [[GPC_Quickstart | GPC]] and the [[TCS_Quickstart | TCS]], and on the compute nodes of the clusters, mounted as read-write.   Thus jobs would normally write their output somewhere in &amp;lt;tt&amp;gt;/scratch&amp;lt;/tt&amp;gt;.  There are '''NO''' backups of anything on &amp;lt;tt&amp;gt;/scratch&amp;lt;/tt&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
There is a large amount of space available on &amp;lt;tt&amp;gt;/scratch&amp;lt;/tt&amp;gt;, but it is purged routinely so that all users running jobs and generating large outputs will have room to store their data temporarily.  Computational results which you want to keep longer than this must be copied off of SciNet entirely (using &amp;lt;tt&amp;gt;scp&amp;lt;/tt&amp;gt;) to your local system.   SciNet does not routinely provide long-term storage for large data sets. &lt;br /&gt;
&lt;br /&gt;
Also note that the shared parallel file system was not designed to do many small file transactions. For that reason, the number of files that any user can have on scratch is limited to 1 million. This limit should be thought of as a safeguard, not an invitation to create one million files. Please see [[File System and I/O dos and don'ts]]. &lt;br /&gt;
&lt;br /&gt;
===Scratch Disk Purging Policy===&lt;br /&gt;
&lt;br /&gt;
In order to ensure that there is always significant space available for running jobs, '''we automatically delete files in /scratch that have not been accessed or modified for more than 3 months; the actual deletion takes place on the 15th of each month'''. Note that we recently changed the cut-off reference to ''MostRecentOf(atime,mtime)''. This policy is subject to revision depending on its effectiveness. More details about the purging process, and how users can check whether their files will be deleted, follow. If you have files scheduled for deletion you should move them to a more permanent location, such as your departmental server or your /project space (for PIs who have either been allocated disk space by the LRAC or have bought disk space).&lt;br /&gt;
&lt;br /&gt;
On the '''first''' of each month, a list of files scheduled for purging is produced, and an email notification is sent to each user on that list. Furthermore, on or about the '''12th''' of each month a second scan produces a more current assessment and another email notification is sent. This way users can double-check that they have indeed taken care of all the files they needed to relocate before the purging deadline. Those files will be automatically deleted on the '''15th''' of the same month unless they have been accessed or relocated in the interim. If you have files scheduled for deletion, they will be listed in a file in /scratch/t/todelete/current, which has your userid and groupid in the filename. For example, if user xxyz wants to check whether they have files scheduled for deletion, they can issue the following command on a system which mounts /scratch (e.g. a scinet login node): '''ls -1 /scratch/t/todelete/current |grep xxyz'''. In the example below, the name of this file indicates that user xxyz is part of group abc and has 9,560 files scheduled for deletion, taking up 1.0TB of space:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
 [xxyz@scinet04 ~]$ ls -1 /scratch/t/todelete/current |grep xxyz&lt;br /&gt;
 -rw-r----- 1 xxyz     root       1733059 Jan 12 11:46 10001___xxyz_______abc_________1.00T_____9560files&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The file itself contains a list of all files scheduled for deletion (in the last column) and can be viewed with standard commands like more/less/cat - e.g. '''more /scratch/t/todelete/current/10001___xxyz_______abc_________1.00T_____9560files'''&lt;br /&gt;
&lt;br /&gt;
Similarly, you can also check all other users in your group by using the ls command with grep on your group. For example, '''ls -1 /scratch/t/todelete/current |grep abc''' will list all other users in the same group that xxyz is part of who have files to be purged on the 15th. Members of the same group have access to each other's contents.&lt;br /&gt;
&lt;br /&gt;
'''NOTE:''' Preparing these assessments takes several hours. If you change the access/modification time of a file in the interim, that will not be detected until the next cycle. A way for you to get immediate feedback is to use the ''''ls -lu'''' command on the file to verify the atime and ''''ls -la'''' for the mtime. If the file's atime/mtime has been updated in the meantime, then come the purging date on the 15th it will no longer be deleted.&lt;br /&gt;
&lt;br /&gt;
===Project Disk Space===&lt;br /&gt;
&lt;br /&gt;
Investigators who have been granted allocations through the [http://wiki.scinethpc.ca/wiki/index.php/Application_Process LRAC/NRAC Application Process] may have been allocated disk space in addition to compute time.   For the period of time that the allocation is granted, they will have disk space on the &amp;lt;tt&amp;gt;/project&amp;lt;/tt&amp;gt; disk system.  Space on project is a subset of scratch, but it is not purged and it is backed up. All members of the investigator's group will have access to this disk system, which is mounted read/write everywhere.&lt;br /&gt;
&lt;br /&gt;
===How much Disk Space Do I have left?===&lt;br /&gt;
&lt;br /&gt;
The &amp;lt;tt&amp;gt;'''/scinet/gpc/bin6/diskUsage'''&amp;lt;/tt&amp;gt; command, available on the login nodes, datamovers and the GPC devel nodes, provides information in a number of ways on the home, scratch, and project file systems: for instance, how much disk space is being used by yourself and your group (with the -a option), or how much your usage has changed over a certain period (&amp;quot;delta information&amp;quot;); it can also generate plots of your usage over time. Please see the usage help below for more details.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
Usage: diskUsage [-h|-?] [-a] [-u &amp;lt;user&amp;gt;] [-de|-plot]&lt;br /&gt;
       -h|-?: help&lt;br /&gt;
       -a: list usages of all members on the group&lt;br /&gt;
       -u &amp;lt;user&amp;gt;: as another user on your group&lt;br /&gt;
       -de: include delta information&lt;br /&gt;
       -plot: create plots of disk usages&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Did you know that you can check which of your directories have more than 1000 files with the &amp;lt;tt&amp;gt;'''/scinet/gpc/bin6/topUserDirOver1000list'''&amp;lt;/tt&amp;gt; command and which have more than 1GB of material with the &amp;lt;tt&amp;gt;'''/scinet/gpc/bin6/topUserDirOver1GBlist'''&amp;lt;/tt&amp;gt; command?&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
* information on usage and quota is only updated hourly!&lt;br /&gt;
* contents of project count against space and #files limits on scratch&lt;br /&gt;
&lt;br /&gt;
===Performance===&lt;br /&gt;
&lt;br /&gt;
[http://en.wikipedia.org/wiki/IBM_General_Parallel_File_System GPFS] is a high-performance filesystem which provides rapid reads and writes of large datasets in parallel from many nodes.  As a consequence of this design, however, '''the file system performs quite ''poorly'' at accessing data sets which consist of many small files.'''  For instance, you will find that reading data in from one 16MB file is enormously faster than from 400 40KB files. Such small files are also quite wasteful of space, as the block size for the scratch filesystem is 4MB. This is something you should keep in mind when planning your input/output strategy for runs on SciNet.&lt;br /&gt;
&lt;br /&gt;
For instance, if you run multi-process jobs, having each process write to a file of its own is not a scalable I/O solution. A directory gets locked by the first process accessing it, so all other processes have to wait for it. Not only has the code just become considerably less parallel; chances are the file system will time out while waiting for your other processes, leading your program to crash mysteriously.&lt;br /&gt;
Consider using MPI-IO (part of the MPI-2 standard), which allows files to be opened simultaneously by different processes, or using a dedicated process for I/O to which all other processes send their data, and which subsequently writes this data to a single file.&lt;br /&gt;
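&lt;br /&gt;
As a sketch of the MPI-IO approach, using the mpi4py package described on the [[Python]] page (the file name and block size here are arbitrary), every rank writes its own block of one shared file:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
from mpi4py import MPI&lt;br /&gt;
import numpy as np&lt;br /&gt;
&lt;br /&gt;
comm = MPI.COMM_WORLD&lt;br /&gt;
rank = comm.Get_rank()&lt;br /&gt;
&lt;br /&gt;
data = np.ones(1000, dtype='d') * rank   # this rank's block of results&lt;br /&gt;
&lt;br /&gt;
# all ranks open the same file; each writes at its own byte offset&lt;br /&gt;
fh = MPI.File.Open(comm, 'output.dat', MPI.MODE_WRONLY | MPI.MODE_CREATE)&lt;br /&gt;
fh.Write_at_all(rank * data.nbytes, data)&lt;br /&gt;
fh.Close()&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;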
&lt;br /&gt;
===Local Disk===&lt;br /&gt;
&lt;br /&gt;
The compute nodes on the GPC '''do not contain hard drives''', so there is no local disk available to use during your computation.  You can, however, use part of a compute node's RAM like a local disk ('ramdisk'), but this will reduce how much memory is available for your&lt;br /&gt;
program.  The ramdisk can be accessed using &amp;lt;tt&amp;gt;/dev/shm/&amp;lt;/tt&amp;gt; and is currently set to 8GB.  Anything written&lt;br /&gt;
to this location that you want to keep must be copied back to the &amp;lt;tt&amp;gt;/scratch&amp;lt;/tt&amp;gt; filesystem, as &amp;lt;tt&amp;gt;/dev/shm&amp;lt;/tt&amp;gt; is wiped after each job and, since it is in memory, will not survive a reboot of the node. More on ramdisk usage can be found [[User_Ramdisk | here]].&lt;br /&gt;
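&lt;br /&gt;
A typical job-script pattern would be the following sketch (the directory and program names are hypothetical):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
mkdir -p /dev/shm/$USER/run&lt;br /&gt;
cd /dev/shm/$USER/run&lt;br /&gt;
$HOME/bin/myprogram        # writes its temporary files to the ramdisk&lt;br /&gt;
cp -r /dev/shm/$USER/run $SCRATCH/results/&lt;br /&gt;
rm -rf /dev/shm/$USER      # clean up before the job ends&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;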
&lt;br /&gt;
Note that the absence of hard drives also means that the nodes cannot swap memory, so be sure that your computation fits within memory.&lt;br /&gt;
&lt;br /&gt;
===Buying storage space on GPFS or HPSS===&lt;br /&gt;
&lt;br /&gt;
Groups can buy space on GPFS or HPSS rather than rely on the [http://wiki.scinethpc.ca/wiki/index.php/Application_Process annual allocation process].  A good budgetary number would be:&lt;br /&gt;
&lt;br /&gt;
GPFS $400/TB&lt;br /&gt;
&lt;br /&gt;
HPSS $150/TB&lt;br /&gt;
 &lt;br /&gt;
This is a one-time cost. We have no formal, written data retention policy at this point but the intent is to keep any HPSS data (including migrating to new tape technologies) as long as SciNet is in operation. These numbers are for budgetary purposes only and subject to change (e.g. as markets and technologies evolve).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=='''Data Transfer'''==&lt;br /&gt;
{{:Data_Transfer}}&lt;br /&gt;
&lt;br /&gt;
=='''File/Ownership Management (ACL)'''==&lt;br /&gt;
* By default, at SciNet, users within the same group have read permission to each other's files (not write)&lt;br /&gt;
* You may use access control lists ('''ACL''') to allow your supervisor (or another user within your group) to manage files for you (i.e., create, move, rename, delete), while still retaining your access and permission as the original owner of the files/directories.&lt;br /&gt;
* '''NOTE''': We highly recommend that you never give write permission to other users on the top level of your home directory (/home/G/GROUP/[owner]), since that would seriously compromise your privacy, in addition to disabling ssh key authentication, among other things. If necessary, make specific sub-directories under your home directory so that other users can manipulate/access files from those.&lt;br /&gt;
* If you need to set up permissions across groups [mailto:support@scinet.utoronto.ca contact us] (and the other group's supervisor!).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!--&lt;br /&gt;
&lt;br /&gt;
===Using  setfacl/getfacl===&lt;br /&gt;
* To allow [supervisor] to manage files in /project/g/group/[owner] using '''setfacl''' and '''getfacl''' commands, follow the 3-steps below as the [owner] account from a shell:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
1) $ /scinet/gpc/bin/setfacl -d -m user:[supervisor]:rwx /project/g/group/[owner]&lt;br /&gt;
   (every *new* file/directory inside [owner] will inherit [supervisor] ownership by default from now on)&lt;br /&gt;
&lt;br /&gt;
2) $ /scinet/gpc/bin/setfacl -d -m user:[owner]:rwx /project/g/group/[owner]&lt;br /&gt;
   (but will also inherit [owner] ownership, ie, ownership of both by default, for files/directories created by [supervisor])&lt;br /&gt;
&lt;br /&gt;
3) $ /scinet/gpc/bin/setfacl -Rm user:[supervisor]:rwx /project/g/group/[owner]&lt;br /&gt;
   (recursively modify all *existing* files/directories inside [owner] to also be rwx by [supervisor])&lt;br /&gt;
&lt;br /&gt;
   $ /scinet/gpc/bin/getfacl /project/g/group/[owner]&lt;br /&gt;
   (to determine the current ACL attributes)&lt;br /&gt;
&lt;br /&gt;
   $ /scinet/gpc/bin/setfacl -b /project/g/group/[owner]&lt;br /&gt;
   (to remove any previously set ACL)&lt;br /&gt;
&lt;br /&gt;
PS: on the datamovers getfacl, setfacl and chacl will be on your path&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
For more information on using [http://linux.die.net/man/1/setfacl &amp;lt;tt&amp;gt;setfacl&amp;lt;/tt&amp;gt;] or [http://linux.die.net/man/1/getfacl &amp;lt;tt&amp;gt;getfacl&amp;lt;/tt&amp;gt;] see their man pages.&lt;br /&gt;
&lt;br /&gt;
--&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Using mmputacl/mmgetacl===&lt;br /&gt;
* You may use gpfs' native '''mmputacl''' and '''mmgetacl''' commands. The advantages are that you can set &amp;quot;control&amp;quot; permission and that [http://publib.boulder.ibm.com/infocenter/clresctr/vxrx/index.jsp?topic=%2Fcom.ibm.cluster.gpfs.doc%2Fgpfs31%2Fbl1adm1160.html POSIX or NFS v4 style ACLs] are supported. You will first need to create a /tmp/supervisor.acl file with the following contents:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
user::rwxc&lt;br /&gt;
group::----&lt;br /&gt;
other::----&lt;br /&gt;
mask::rwxc&lt;br /&gt;
user:[owner]:rwxc&lt;br /&gt;
user:[supervisor]:rwxc&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Then issue the following 2 commands:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
1) $ mmputacl -i /tmp/supervisor.acl /project/g/group/[owner]&lt;br /&gt;
2) $ mmputacl -d -i /tmp/supervisor.acl /project/g/group/[owner]&lt;br /&gt;
   (every *new* file/directory inside [owner] will inherit [supervisor] ownership by default as well as &lt;br /&gt;
   [owner] ownership, ie, ownership of both by default, for files/directories created by [supervisor])&lt;br /&gt;
&lt;br /&gt;
   $ mmgetacl /project/g/group/[owner]&lt;br /&gt;
   (to determine the current ACL attributes)&lt;br /&gt;
&lt;br /&gt;
   $ mmdelacl -d /project/g/group/[owner]&lt;br /&gt;
   (to remove any previously set ACL)&lt;br /&gt;
&lt;br /&gt;
   $ mmeditacl /project/g/group/[owner]&lt;br /&gt;
   (to create or change a GPFS access control list)&lt;br /&gt;
   (for this command to work set the EDITOR environment variable: export EDITOR=/usr/bin/vi)&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
NOTES:&lt;br /&gt;
* There is no gpfs built-in command option to recursively add or remove ACL attributes on existing files. You'll need to use the -i option as above on each file or directory individually. [[Data_Management#bash_script_that_you_may_adapt_to_recursively_add_or_remove_ACL_attributes_using_gpfs_built-in_commands | Here is a sample bash script you may use for that purpose]]&lt;br /&gt;
&lt;br /&gt;
* mmputacl/setfacl will not overwrite the original linux group permissions for a directory when copied to another directory that already has ACLs, hence the &amp;quot;#effective:r-x&amp;quot; note you may see from time to time with mmgetacl/getfacl. If you want to give rwx permissions to everyone in your group you should simply rely on the plain unix 'chmod g+rwx' command. You may do that before or after copying the original material to another folder with the ACLs.&lt;br /&gt;
&lt;br /&gt;
For more information on using [http://publib.boulder.ibm.com/infocenter/clresctr/vxrx/index.jsp?topic=%2Fcom.ibm.cluster.gpfs.doc%2Fgpfs31%2Fbl1adm11120.html &amp;lt;tt&amp;gt;mmputacl&amp;lt;/tt&amp;gt;] or [http://publib.boulder.ibm.com/infocenter/clresctr/vxrx/index.jsp?topic=%2Fcom.ibm.cluster.gpfs.doc%2Fgpfs31%2Fbl1adm11120.html &amp;lt;tt&amp;gt;mmgetacl&amp;lt;/tt&amp;gt;] see their man pages.&lt;br /&gt;
&lt;br /&gt;
===Appendix (ACL)===&lt;br /&gt;
====bash script that you may adapt to recursively add or remove ACL attributes using gpfs built-in commands====&lt;br /&gt;
Courtesy of Agata Disks (http://csngwinfo.in2p3.fr/mediawiki/index.php/GPFS_ACL)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# USAGE&lt;br /&gt;
#     - on one directory:     ./set_acl.sh dir_name&lt;br /&gt;
#     - on more directories:  ./set_acl.sh 'dir_nam*'&lt;br /&gt;
#&lt;br /&gt;
&lt;br /&gt;
# Path of the file that contains the ACL&lt;br /&gt;
ACL_FILE_PATH=/agatadisks/data/acl_file.acl&lt;br /&gt;
&lt;br /&gt;
# Directories on which the ACLs have to be set&lt;br /&gt;
dirs=$1&lt;br /&gt;
&lt;br /&gt;
# Recursive function that sets the ACL on files and directories&lt;br /&gt;
set_acl () {&lt;br /&gt;
  curr_dir=$1&lt;br /&gt;
  for args in &amp;quot;$curr_dir&amp;quot;/*&lt;br /&gt;
  do&lt;br /&gt;
    if [ -f &amp;quot;$args&amp;quot; ]; then&lt;br /&gt;
      echo &amp;quot;ACL set on file $args&amp;quot;&lt;br /&gt;
      mmputacl -i $ACL_FILE_PATH &amp;quot;$args&amp;quot;&lt;br /&gt;
      if [ $? -ne 0 ]; then&lt;br /&gt;
        echo &amp;quot;ERROR: ACL not set on $args&amp;quot;&lt;br /&gt;
        exit 1&lt;br /&gt;
      fi&lt;br /&gt;
    fi&lt;br /&gt;
    if [ -d &amp;quot;$args&amp;quot; ]; then&lt;br /&gt;
      # Set the default ACL on the directory&lt;br /&gt;
      mmputacl -d -i $ACL_FILE_PATH &amp;quot;$args&amp;quot;&lt;br /&gt;
      if [ $? -ne 0 ]; then&lt;br /&gt;
        echo &amp;quot;ERROR: Default ACL not set on $args&amp;quot;&lt;br /&gt;
        exit 1&lt;br /&gt;
      fi&lt;br /&gt;
      echo &amp;quot;Default ACL set on directory $args&amp;quot;&lt;br /&gt;
      # Set the ACL on the directory itself&lt;br /&gt;
      mmputacl -i $ACL_FILE_PATH &amp;quot;$args&amp;quot;&lt;br /&gt;
      if [ $? -ne 0 ]; then&lt;br /&gt;
        echo &amp;quot;ERROR: ACL not set on $args&amp;quot;&lt;br /&gt;
        exit 1&lt;br /&gt;
      fi&lt;br /&gt;
      echo &amp;quot;ACL set on directory $args&amp;quot;&lt;br /&gt;
      set_acl &amp;quot;$args&amp;quot;&lt;br /&gt;
    fi&lt;br /&gt;
  done&lt;br /&gt;
}&lt;br /&gt;
for dir in $dirs&lt;br /&gt;
do&lt;br /&gt;
  if [ ! -d &amp;quot;$dir&amp;quot; ]; then&lt;br /&gt;
    echo &amp;quot;ERROR: $dir is not a directory&amp;quot;&lt;br /&gt;
    exit 1&lt;br /&gt;
  fi&lt;br /&gt;
  set_acl &amp;quot;$dir&amp;quot;&lt;br /&gt;
done&lt;br /&gt;
exit 0&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==[[HPSS|'''High Performance Storage System (HPSS)''']]==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==More questions on data management?==&lt;br /&gt;
&lt;br /&gt;
Check out the [[FAQ#Data_on_SciNet_disks|FAQ]].&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Scheduler&amp;diff=9385</id>
		<title>Scheduler</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Scheduler&amp;diff=9385"/>
		<updated>2018-08-09T17:32:02Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
The queueing system used at SciNet is based around Cluster Resources' [http://www.adaptivecomputing.com/products/hpc-products/moab-hpc-basic-edition/ Moab Workload Manager].&lt;br /&gt;
Moab is used on both the GPC and the TCS; however, [http://www.adaptivecomputing.com/products/open-source/torque/ Torque] is used as the backend resource manager on the GPC, while IBM's [http://www.ibm.com/software/products/en/tivoliworkloadschedulerloadleveler/ LoadLeveler] is used on the TCS.&lt;br /&gt;
&lt;br /&gt;
This page outlines some of the most common Moab commands, with full documentation available from Moab [http://docs.adaptivecomputing.com/mwm/7-2-9/help.htm#a.gcommandoverview.html%3FTocPath%3D4.0%2520Scheduler%2520Commands|_____0 here]; the full documentation for the Torque (PBS) commands is [http://docs.adaptivecomputing.com/torque/4-2-9/help.htm here].&lt;br /&gt;
&lt;br /&gt;
Some common questions about the queuing system can be found on the [[FAQ]] as well.&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
==== GPC ====&lt;br /&gt;
&lt;br /&gt;
===== batch =====&lt;br /&gt;
&lt;br /&gt;
The batch queue is the default queue on the GPC, allowing the user access to all the &lt;br /&gt;
resources for jobs between 15 minutes and up to 48 hours.  If a specific queue is not specified with the &amp;lt;tt&amp;gt;-q&amp;lt;/tt&amp;gt; flag,&lt;br /&gt;
a job is submitted to the batch queue.  Most jobs will run in batch, and users can use&lt;br /&gt;
feature flags to select the type of nodes they require.&lt;br /&gt;
&lt;br /&gt;
For example, to request two nodes anywhere on the GPC (QDR or DDR), use&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#PBS -l nodes=2:ppn=8,walltime=1:00:00&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For two nodes using DDR, use&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#PBS -l nodes=2:ddr:ppn=8,walltime=1:00:00&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To get two nodes using QDR, instead, you would use&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#PBS -l nodes=2:qdr:ppn=8,walltime=1:00:00&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===== debug =====&lt;br /&gt;
&lt;br /&gt;
A debug queue has been set up primarily for code developers to quickly test&lt;br /&gt;
and evaluate their codes and configurations without having to wait in the batch queue.  There are 10 nodes&lt;br /&gt;
currently reserved for the debug queue.  It has quite restrictive limits to promote high turnover&lt;br /&gt;
and availability: a user can use at most 2 nodes (16 cores) for 2 hours, up to a maximum&lt;br /&gt;
of 8 nodes (64 cores) for half an hour, and can only have one job in the debug queue at a time.  There&lt;br /&gt;
is no minimum time limit on this queue.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ qsub -l nodes=1:ppn=8,walltime=1:00:00 -q debug -I&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===== largemem =====&lt;br /&gt;
&lt;br /&gt;
The largemem queue gives access to one of two 16-core Intel Xeon (non-Nehalem) nodes with 128 GB of memory. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ qsub -l nodes=1:ppn=16,walltime=1:00:00 -q largemem -I&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== TCS ====&lt;br /&gt;
&lt;br /&gt;
The TCS currently has only one queue (or class) in use, called &amp;quot;verylong&amp;quot;; its only&lt;br /&gt;
limitation is that jobs must be under 48 hours.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#@ class           = verylong&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job Info===&lt;br /&gt;
&lt;br /&gt;
To see all jobs queued on a system use&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ showq&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Three sections are shown: running, idle, and blocked.  Idle jobs are commonly referred to as queued jobs, &lt;br /&gt;
as they meet all the requirements but are waiting for available resources.  Blocked jobs &lt;br /&gt;
are caused either by improper resource requests or, more commonly, by exceeding a user's or group's allowable&lt;br /&gt;
resources.   For example, if you are allowed to submit 10 jobs and you submit 20, the first 10&lt;br /&gt;
jobs will be submitted properly and either run right away or be queued; the other 10 jobs&lt;br /&gt;
will be blocked and won't be submitted to the queue until one of the first 10 finishes.&lt;br /&gt;
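&lt;br /&gt;
To list only your own blocked jobs, for example (showq also accepts -r and -i for the running and idle sections):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ showq -b -u username&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;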
&lt;br /&gt;
=== Available Resources ===&lt;br /&gt;
&lt;br /&gt;
Determining when your job will run can be tricky as it involves a combination of queue type, node type, system reservations, and job priority. The following commands are provided to help you figure out what resources are currently available, however they may not tell you exactly when your job will run for the aforementioned reasons.&lt;br /&gt;
&lt;br /&gt;
==== GPC ====&lt;br /&gt;
To show how many qdr nodes are currently free, use the showbf (&amp;quot;show backfill&amp;quot;) command&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ showbf -f qdr&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To show how many ddr nodes are free, use&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ showbf -f ddr&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== TCS ====&lt;br /&gt;
To show how many TCS nodes are free, use&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ showbf -c verylong&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For example, checking for a qdr job:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ showbf -f qdr&lt;br /&gt;
Partition     Tasks  Nodes      Duration   StartOffset       StartDate&lt;br /&gt;
---------     -----  -----  ------------  ------------  --------------&lt;br /&gt;
ALL           14728   1839       7:36:23      00:00:00  00:23:37_09/24&lt;br /&gt;
ALL             256     30      INFINITY      00:00:00  00:23:37_09/24&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
shows that for jobs under 7:36:23 you can use 1839 nodes, but if you submit&lt;br /&gt;
a job over that time, only 30 will be available.  In this case this is&lt;br /&gt;
due to a large reservation made by SciNet staff, but from a user's point&lt;br /&gt;
of view, showbf tells you very simply what is available and at what time point.&lt;br /&gt;
In this case, a user may wish to set #PBS -l walltime=7:30:00 in their script, or add -l walltime=7:30:00 to their qsub command, in order to ensure that the jobs backfill the reserved nodes.&lt;br /&gt;
&lt;br /&gt;
'''NOTE:''' showbf shows currently available nodes; however, just because nodes are available&lt;br /&gt;
doesn't mean that your job will start right away.  Job priority and system reservations, &lt;br /&gt;
along with dedicated nodes, such as those for the debug queue, all affect when jobs &lt;br /&gt;
run, so even if enough nodes appear &amp;quot;free&amp;quot;, your job may not actually run right &lt;br /&gt;
away.&lt;br /&gt;
&lt;br /&gt;
=== Job Submission ===&lt;br /&gt;
&lt;br /&gt;
==== Interactive ====&lt;br /&gt;
&lt;br /&gt;
On the GPC, an interactive queue session can be requested as follows:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ qsub -l nodes=2:ppn=8,walltime=1:00:00 -I&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
If you experience longer-than-usual delays when requesting an interactive session, try specifying the &amp;quot;debug&amp;quot; queue:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ qsub -l nodes=2:ppn=8,walltime=1:00:00 -I -q debug&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Non-interactive (Batch) ====&lt;br /&gt;
&lt;br /&gt;
For a non-interactive job submission, you require a submission script formatted for the appropriate resource manager. Examples&lt;br /&gt;
are provided for the [[GPC_Quickstart#Submitting_A_Batch_Job | GPC]] and [[TCS_Quickstart#Submitting_A_Job | TCS]].&lt;br /&gt;
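&lt;br /&gt;
As a minimal sketch of a GPC submission script (the resource request and program name are placeholders; see the linked pages for complete examples):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#PBS -l nodes=1:ppn=8,walltime=1:00:00&lt;br /&gt;
#PBS -N myjob&lt;br /&gt;
cd ${PBS_O_WORKDIR}&lt;br /&gt;
./mycode&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Such a script would be submitted with &amp;lt;tt&amp;gt;qsub jobscript.sh&amp;lt;/tt&amp;gt;.&lt;br /&gt;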
&lt;br /&gt;
=== Job Status ===&lt;br /&gt;
&lt;br /&gt;
To see the status of a particular job, use&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ qstat jobid&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
To see the status of all your jobs, use&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
showq -u username&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
or&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qstat -u username&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
If your job appears to be blocked, you can try the following command&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ checkjob jobid&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
which gives more verbose (though often somewhat confusing) output.&lt;br /&gt;
&lt;br /&gt;
=== Cancel a Job ===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ canceljob jobid&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To cancel all of your jobs, try this command:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ canceljob `showq | grep yourusername | cut -c -8` &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
or using the torque commands (these tend to work faster for larger numbers of jobs):&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ qdel `qstat | grep yourusername | cut -c -8` &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Accounting ===&lt;br /&gt;
&lt;br /&gt;
For any user with an NRAC/LRAC allocation, a special account with the Resource Allocation Project (RAP) identifier (RAPI) from the Compute Canada Database (CCDB) is set up in order to access the allocated resources.  Please use the following instructions to run your job using your special allocation.  This is necessary both for accounting purposes and to assign the appropriate priority to your jobs.&lt;br /&gt;
&lt;br /&gt;
Each job run on the system will have a default RAP associated with it.  Most users already have their default RAP properly set.  However, if you have more than one allocation (different RAPs),  you may need/want to change your default RAP in order to charge your jobs to a particular RAP.&lt;br /&gt;
&lt;br /&gt;
==== Changing your default RAP ====&lt;br /&gt;
&lt;br /&gt;
# Go to the [https://portal.scinet.utoronto.ca portal], login with your SciNet username and password.&lt;br /&gt;
# Click on &amp;quot;Change SciNet default RAP&amp;quot; and change your default RAP.&lt;br /&gt;
&lt;br /&gt;
==== Specifying the RAP for GPC ====&lt;br /&gt;
&lt;br /&gt;
Alternatively, you may want to assign a RAP to each particular job you run.  There are two ways to specify an account for Moab/Torque: from the command line or inside the batch submission script.&lt;br /&gt;
&lt;br /&gt;
===== Command line =====&lt;br /&gt;
&lt;br /&gt;
Use the '-A RAPI' flag when you submit your job using qsub.  Note that the command line option will override the submission script if an account is specified on both the submission script and the command line.  &amp;quot;RAPI&amp;quot; is the RAP Identifier, e.g. abc-123-de.&lt;br /&gt;
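&lt;br /&gt;
For example (a sketch; the script name is a placeholder, and abc-123-de is the example RAPI from above):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ qsub -A abc-123-de jobscript.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;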
&lt;br /&gt;
===== Submission Script =====&lt;br /&gt;
&lt;br /&gt;
Add a line in your submit script as follows:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#PBS -A RAPI&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Please replace &amp;quot;RAPI&amp;quot; with your RAP Identifier.&lt;br /&gt;
&lt;br /&gt;
==== Specifying the RAP for TCS ====&lt;br /&gt;
&lt;br /&gt;
Add a line in your submit script as follows:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# @ account_no = RAPI&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Please replace &amp;quot;RAPI&amp;quot; with your RAP Identifier.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== User Stats ===&lt;br /&gt;
&lt;br /&gt;
To show the current usage stats for $USER, use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ showstats -u $USER&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Reservations ===&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ showres&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Standard users can only see their own reservations, not those of other users or the system.&lt;br /&gt;
To determine what is available, a user can use &amp;quot;showbf&amp;quot;, which shows what resources are&lt;br /&gt;
available and until what point in time, taking into account running jobs and all reservations. Refer to the [[Moab#Available_Resources | Available Resources]] section of this page for more details.&lt;br /&gt;
&lt;br /&gt;
=== Job Dependencies ===&lt;br /&gt;
&lt;br /&gt;
Sometimes you may want one job not to start until another job finishes, while&lt;br /&gt;
still submitting both at the same time.  This can be done&lt;br /&gt;
using job dependencies on both the GPC and TCS; the commands, &lt;br /&gt;
however, differ because the underlying resource managers are different.&lt;br /&gt;
&lt;br /&gt;
==== GPC ====&lt;br /&gt;
&lt;br /&gt;
Use the -W flag with the following syntax in your submission script to have this job not start&lt;br /&gt;
until the job with id jobid has successfully finished:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
-W depend=afterok:jobid&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This functionality also allows you to add dependencies on several jobs, e.g.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
-W depend=afterok:jobid1:jobid2:...:jobidN&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
in which case the job will not start until all of the jobs jobid1, jobid2, ..., jobidN have successfully finished.  Note that the length of the string &amp;quot;afterok:jobid1:jobid2...:jobidN&amp;quot; cannot exceed 1024 characters.&lt;br /&gt;
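&lt;br /&gt;
Since qsub prints the id of a newly submitted job, a dependency chain can be set up entirely at submission time; a minimal sketch (the script names are placeholders):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
FIRST=$(qsub first.sh)&lt;br /&gt;
qsub -W depend=afterok:${FIRST} second.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;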
&lt;br /&gt;
&lt;br /&gt;
More detailed syntax, dependency options and examples can be found &lt;br /&gt;
[http://docs.adaptivecomputing.com/torque/4-1-4/Content/topics/commands/qsub.htm#-W here].&lt;br /&gt;
&lt;br /&gt;
==== TCS ====&lt;br /&gt;
&lt;br /&gt;
LoadLeveler handles job dependencies using what it calls steps.&lt;br /&gt;
See the [[TCS_Quickstart#Steps | TCS Quickstart]] guide for an example.&lt;br /&gt;
&lt;br /&gt;
=== Adjusting Job Priority ===&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The ability to adjust job priorities downwards can also be used to adjust the relative priorities of jobs between users who are running jobs under the same allocation (e.g., a default or NRAC allocation of the same PI).   Priorities are determined by how much of that allocation's time has currently been used, and all users using that account will have identical priorities.   This mechanism allows users to voluntarily reduce their priority so that other users of the same allocation can run ahead of them.&lt;br /&gt;
&lt;br /&gt;
In principle, by adjusting a job's priority downwards, you could reduce your job's priority to the point that a job from an entirely different allocation could go ahead of yours.  In practice, however, this is extremely unlikely.   Users with NRAC allocations have priorities that are extremely large positive numbers, which depend on their allocation and how much of it they have already used during the past fairshare window (2 weeks); it is very unlikely that two groups would have priorities within 10, 100, or even 1000 of each other.&lt;br /&gt;
&lt;br /&gt;
Note that at the moment, we do not allow priorities to go negative; they are integers that can go no lower than 1.  (This may change in the future.)  This means that users of accounts that have already used their full allocation during the current fairshare period (e.g., over the past two weeks), and whose priority would normally be negative but is capped at 1, cannot lower their priority any further.   Similarly, users with a `default' allocation have priority 1, and cannot lower their priorities any further.&lt;br /&gt;
&lt;br /&gt;
==== GPC ====&lt;br /&gt;
&lt;br /&gt;
Moab allows users to adjust their jobs' priority moderately downwards, with the &amp;lt;tt&amp;gt;-p&amp;lt;/tt&amp;gt; flag; that is, on a qsub line&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ qsub -p -10 ... jobscript.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
or in a script&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
...&lt;br /&gt;
#PBS -p -10&lt;br /&gt;
...&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The number used (-10 in the examples above) can be any negative number down to -1024.   &lt;br /&gt;
&lt;br /&gt;
The ability to adjust job priorities downwards can be useful when you are running a number of jobs and want some to enter the queue at higher priorities than others.   Note that if you absolutely require some jobs to start before others, you could use [[#Job Dependencies | job dependencies]] instead.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For a job that is currently queued, one can adjust its priority with&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ qalter -p -10 JOBID&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!--&lt;br /&gt;
This is OLD, pre-fairshare info&lt;br /&gt;
&lt;br /&gt;
==== TCS ====&lt;br /&gt;
&lt;br /&gt;
TCS users can adjust their priorities by putting the following line in their scripts&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#@ user_priority = XXX&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
where the number can be between 0 (which is 50 below the default priority) to 1 (the default priority).&lt;br /&gt;
--&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Suspending a Running Job ===&lt;br /&gt;
&lt;br /&gt;
Separate from, and in addition to, the ability to place a hold on a queued job, you may want to suspend a running job. For example, you may want to test the timing of events in a weakly coupled parallel environment.&lt;br /&gt;
&lt;br /&gt;
==== GPC ====&lt;br /&gt;
&lt;br /&gt;
To suspend a job:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qsig -s STOP &amp;lt;jobid&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
and to start it again:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qsig -s CONT &amp;lt;jobid&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Scripts are suspendable by default, so you don't need to add any signal handling for this to work.&lt;br /&gt;
As far as we can tell, the result is identical to using ctrl-Z and fg (or kill -STOP &amp;lt;PID&amp;gt; and kill -CONT &amp;lt;PID&amp;gt;) in an interactive run.&lt;br /&gt;
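&lt;br /&gt;
A typical suspend/resume cycle might look like this (a sketch; the job id is hypothetical):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ qsig -s STOP 123456&lt;br /&gt;
$ checkjob 123456&lt;br /&gt;
$ qsig -s CONT 123456&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here checkjob can be used to verify the job's state before resuming it.&lt;br /&gt;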
&lt;br /&gt;
More about using (and trapping) signals can be found on the [[Using Signals]] page.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== QDR Network Switch Affinity ===&lt;br /&gt;
&lt;br /&gt;
The QDR network is globally 5:1 oversubscribed but has a full 1:1 cross-section within each switch, whereas the DDR network is completely 1:1 non-blocking. &lt;br /&gt;
When a job is submitted to the GPC QDR nodes, the queuing system tries to fulfill the requirements with nodes on the same switch to improve network &lt;br /&gt;
performance. If not enough nodes are available on the same switch to satisfy the job request, the queue will then use any available nodes. This behavior can be changed &lt;br /&gt;
by using the submission flag &amp;quot; -l nodesetisoptional=false &amp;quot;, which forces the queuing system to&lt;br /&gt;
run the job only on a single switch; the job will stay queued until enough nodes on one switch are available&lt;br /&gt;
to satisfy the request.  Note that the maximum number of nodes on one switch is 30, so &lt;br /&gt;
a request for more than 30 nodes with this flag will never run. &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qsub -l nodes=4:ppn=8,walltime=10:00,nodesetisoptional=false&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Multiple Job Submissions ===&lt;br /&gt;
&lt;br /&gt;
If you are doing batch processing of a number of similar jobs on the GPC, torque has a feature called job arrays that can simplify the process. &lt;br /&gt;
By using the &amp;quot;-t 0-N&amp;quot; option on the command line during job submission, or by putting #PBS -t 0-N in the job script file, torque will expand your &lt;br /&gt;
single job submission into N+1 jobs and set the environment variable PBS_ARRAYID equal to each job's specific number, i.e. 0-N.  This &lt;br /&gt;
reduces the number of calls to qsub, and can allow the user to maintain far fewer submission scripts. Job arrays also have the benefit of batching &lt;br /&gt;
groups of jobs, allowing commands like qalter, qdel, and qhold to work on all or a subset of the jobs in the array with one command, instead of having &lt;br /&gt;
to run the command for each job.&lt;br /&gt;
&lt;br /&gt;
In the following example, 10 jobs are submitted using a single command &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qsub -t 0-9 jobscript.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
and the submission script then modifies the job based on the PBS_ARRAYID.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#PBS -l nodes=1:ppn=8,walltime=10:00:00&lt;br /&gt;
#PBS -N array_jobs&lt;br /&gt;
&lt;br /&gt;
cd ${PBS_O_WORKDIR}&lt;br /&gt;
mkdir job.${PBS_ARRAYID}&lt;br /&gt;
cd job.${PBS_ARRAYID}&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;Running job ${PBS_ARRAYID}&amp;quot;&lt;br /&gt;
mpirun -np 8 ./mycode &amp;gt;&amp;amp; array_job.${PBS_ARRAYID}.out&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The JOBID and the job name both get the ARRAYID appended to them, separated by a hyphen, i.e. JOBID-ARRAYID.  If, for example, you wanted&lt;br /&gt;
to cancel all the jobs in a job array, you would use &amp;quot;qdel JOBID&amp;quot;, whereas if you wanted to cancel just one of the jobs, you would use  &lt;br /&gt;
&amp;quot;qdel JOBID-ARRAYID&amp;quot;.  &lt;br /&gt;
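&lt;br /&gt;
For example (a sketch with a hypothetical job id):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ qdel 123456      # cancels the entire job array&lt;br /&gt;
$ qdel 123456-3    # cancels only array job 3&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;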
&lt;br /&gt;
See [http://docs.adaptivecomputing.com/torque/4-1-3/Content/topics/2-jobs/multiJobSubmission.htm here] and [http://docs.adaptivecomputing.com/torque/4-1-4/Content/topics/commands/qsub.htm here]&lt;br /&gt;
for full details.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Job Tools and Scripts ===&lt;br /&gt;
&lt;br /&gt;
The following commands and scripts are useful for monitoring and managing submissions and jobs.&lt;br /&gt;
We have discussed them in a recent [https://support.scinet.utoronto.ca/education/go.php/183/file_storage/index.php/download/1/files%5B%5D/3702/ TechTalk].&lt;br /&gt;
More details can be found [https://wiki.scinet.utoronto.ca/wiki/index.php/SciNet_Command_Line_Utilities here].&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- showq --&amp;gt;&lt;br /&gt;
&amp;lt;!-- showbf --&amp;gt;&lt;br /&gt;
&amp;lt;!-- checkjob --&amp;gt;&lt;br /&gt;
&amp;lt;!-- showstats --&amp;gt;&lt;br /&gt;
&amp;lt;!--&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
showstart jobID&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
--&amp;gt;&lt;br /&gt;
&lt;br /&gt;
In addition, the following scripts are available:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
diskUsage&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
reports the user's and the group's file system usage.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
quota&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
offers a shorter version of this information for the user.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qsum&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
similar to showq, summarizes by user.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
jobError &amp;lt;jobID | jobNAME&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
displays in real time the error output of a given job.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
jobOutput &amp;lt;jobID | jobNAME&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
displays in real time the standard output of a given job.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
jobcd &amp;lt;jobID | jobNAME&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
allows users to quickly move into the working directory of a given job.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
jobscript &amp;lt;jobID | jobNAME&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
displays the submission script used when submitting a given job.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
jobssh &amp;lt;jobID | jobNAME&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
allows users to connect to the head-node of a given job.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
jobtop &amp;lt;jobID | jobNAME&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
allows users to run &amp;quot;top&amp;quot; on the head-node of a given job.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
jobtree [user]&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
displays the tree of job dependencies for a given user.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
jobdep &amp;lt;jobID&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
displays the dependencies of a given job.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
jobperf &amp;lt;jobID&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
displays current performance statistics of a given job (must be running).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Monitoring Jobs===&lt;br /&gt;
[https://wiki.scinet.utoronto.ca/wiki/images/a/a0/TechTalkJobMonitoring.pdf Tech Talk on Monitoring Jobs]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Checking the memory usage from jobs===&lt;br /&gt;
&lt;br /&gt;
On many occasions it can be really useful to take a look at how much memory your job is using while it is running.&lt;br /&gt;
There are a couple of ways to do so:&lt;br /&gt;
&lt;br /&gt;
1) using some of the [https://wiki.scinet.utoronto.ca/wiki/index.php/SciNet_Command_Line_Utilities command line utilities] we have developed, e.g. the '''jobperf''' and '''jobtop''' utilities, which let you check the job's performance and the head node's utilization, respectively.&lt;br /&gt;
&lt;br /&gt;
2) ''ssh'' into the nodes where your job is running and check the memory usage and system stats right there, for instance with the 'top' or 'free' commands on those nodes (see the sketch below).&lt;br /&gt;
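&lt;br /&gt;
For instance, using the '''jobssh''' utility described above to reach the job's head node (a sketch; the job id is hypothetical):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ jobssh 123456&lt;br /&gt;
$ top&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;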
&lt;br /&gt;
Also, it is always a good idea, and strongly encouraged, to inspect the standard output log and error log generated by your job submissions.&lt;br /&gt;
These files are named ''JobName.{o|e}JobIdNumber'', where ''JobName'' is the name you gave to the job (via the '-N' PBS flag) and ''JobIdNumber'' is the id number of the job.&lt;br /&gt;
These files are saved in the working directory after the job is finished, but they can also be accessed in real time using the '''jobError''' and '''jobOutput''' [https://wiki.scinet.utoronto.ca/wiki/index.php/SciNet_Command_Line_Utilities command line utilities].&lt;br /&gt;
&lt;br /&gt;
Other topics related to memory usage: &amp;lt;br&amp;gt;&lt;br /&gt;
[https://wiki.scinet.utoronto.ca/wiki/index.php/GPC_Quickstart#Ram_Disk Using Ram Disk]&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
[https://wiki.scinet.utoronto.ca/wiki/index.php/GPC_Quickstart#Memory_Configuration Different Memory Configuration nodes]&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
[https://wiki.scinet.utoronto.ca/wiki/index.php/FAQ#Monitoring_jobs_in_the_queue Monitoring Jobs in the Queue]&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Software_and_Libraries&amp;diff=9384</id>
		<title>Software and Libraries</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Software_and_Libraries&amp;diff=9384"/>
		<updated>2018-08-09T17:31:23Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
= Software Module System =&lt;br /&gt;
&lt;br /&gt;
All the software listed on this page is accessed using a modules system.  This means that much of the software is not &lt;br /&gt;
accessible by default but has to be loaded using the module command. The&lt;br /&gt;
reason is that&lt;br /&gt;
* it allows us to easily keep multiple versions of software for different users on the system;&lt;br /&gt;
* it allows users to easily switch between versions.&lt;br /&gt;
The module system works similarly on the GPC and the TCS, although different modules are installed on these two systems.&lt;br /&gt;
&lt;br /&gt;
Note that, generally, if you compile a program with a module loaded, you will have to run it with that same module loaded, to make dynamically linked libraries accessible.&lt;br /&gt;
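&lt;br /&gt;
For example, a job script that runs a program compiled with the intel module would load that module again before running it; a minimal sketch (the module version and program name are placeholders):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load intel/15.0.2&lt;br /&gt;
./mycode&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;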
&lt;br /&gt;
{|&lt;br /&gt;
!{{Hl2}}|Function&lt;br /&gt;
!{{Hl2}}|Command&lt;br /&gt;
!{{Hl2}}|Comments&lt;br /&gt;
|-&lt;br /&gt;
|List available software packages:&lt;br /&gt;
|&amp;lt;pre&amp;gt;$ module avail&amp;lt;/pre&amp;gt;&lt;br /&gt;
|&lt;br /&gt;
*If a module is not listed here, it is not supported.&lt;br /&gt;
*The flag &amp;quot;(default)&amp;quot; is never part of the name.&lt;br /&gt;
|-&lt;br /&gt;
|Use particular software:&lt;br /&gt;
|&amp;lt;pre&amp;gt; $ module load [module-name] &amp;lt;/pre&amp;gt;&lt;br /&gt;
|&lt;br /&gt;
*If possible, specify only the short name (the part before the &amp;quot;/&amp;quot;). &lt;br /&gt;
*When ambiguous, this loads the default one. &lt;br /&gt;
|-&lt;br /&gt;
|List available versions of a specific software package:&lt;br /&gt;
|&amp;lt;pre&amp;gt;$ module avail [short-module-name]&amp;lt;/pre&amp;gt;&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|List currently loaded modules:&lt;br /&gt;
|&amp;lt;pre&amp;gt;$ module list&amp;lt;/pre&amp;gt;&lt;br /&gt;
|For reproducibility, it is a good idea to put this in your job scripts, so you know exactly which modules (and versions) were used.&lt;br /&gt;
|-&lt;br /&gt;
|Get description of a particular module:&lt;br /&gt;
|&amp;lt;pre&amp;gt;$ module help [module-name]&amp;lt;/pre&amp;gt;&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Remove a module from your shell:&lt;br /&gt;
|&amp;lt;pre&amp;gt;$ module unload [module-name]&amp;lt;/pre&amp;gt;&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Remove all modules:&lt;br /&gt;
|&amp;lt;pre&amp;gt;$ module purge&amp;lt;/pre&amp;gt;&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Replace one loaded module with another:&lt;br /&gt;
|&amp;lt;pre&amp;gt;$ module switch [old-module-name] [new-module-name]&amp;lt;/pre&amp;gt;&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Find a module by name and check its dependencies:&lt;br /&gt;
|&amp;lt;pre&amp;gt;$ module find [part-of-module-name]&amp;lt;/pre&amp;gt;&lt;br /&gt;
| This is a module extension by SciNet for the GPC&lt;br /&gt;
|-&lt;br /&gt;
|Find one way to resolve module dependencies:&lt;br /&gt;
|&amp;lt;pre&amp;gt;$ module advice [module-name]&amp;lt;/pre&amp;gt;&lt;br /&gt;
| This is a module extension by SciNet for the GPC&lt;br /&gt;
|}&lt;br /&gt;
Modules that load libraries define environment variables pointing to the location of the library files and include files, for use in Makefiles. These environment variables follow the naming convention&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
SCINET_[short-module-name]_BASE&lt;br /&gt;
SCINET_[short-module-name]_LIB&lt;br /&gt;
SCINET_[short-module-name]_INC&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
for the base location of the module's files, the location of the library binaries, and the location of the header files, respectively.&lt;br /&gt;
&lt;br /&gt;
So to compile and link against the library, you will have to add &amp;lt;tt&amp;gt;-I${SCINET_[short-module-name]_INC}&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;-L${SCINET_[short-module-name]_LIB}&amp;lt;/tt&amp;gt;, respectively, in addition to the usual &amp;lt;tt&amp;gt;-l[libname]&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
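&lt;br /&gt;
For example, to compile and link a C code against the gsl module (a sketch; the source file name is a placeholder, and the intel module is assumed to provide the compiler):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load intel gsl&lt;br /&gt;
$ icc -I${SCINET_GSL_INC} mycode.c -L${SCINET_GSL_LIB} -lgsl -lgslcblas -o mycode&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;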
&lt;br /&gt;
Errors when loading modules can arise for a few reasons, for instance:&lt;br /&gt;
* A module by that name may not exist.&lt;br /&gt;
* Some modules require other modules to have been loaded; if this requirement is not met when you try to load that module, an error message will be printed explaining which module is needed.&lt;br /&gt;
* Some modules cannot be loaded together: an error message will be printed explaining which modules conflict.&lt;br /&gt;
&lt;br /&gt;
It is no longer recommended to load modules in the file [[Important_.bashrc_guidelines|.bashrc]] in your home directory; rather, load them explicitly on the command line and in your job scripts.&lt;br /&gt;
&lt;br /&gt;
===Makefile example of using modules===&lt;br /&gt;
&lt;br /&gt;
For a single-file C++ code using boost's threading library, the Makefile could contain&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
CPPFLAGS=-I${SCINET_BOOST_INC}&lt;br /&gt;
LDFLAGS=-L${SCINET_BOOST_LIB}&lt;br /&gt;
LDLIBS=-lboost_thread&lt;br /&gt;
all: boostthreadexample&lt;br /&gt;
boostthreadexample: boostthreadexample.o&lt;br /&gt;
boostthreadexample.o: boostthreadexample.cpp&lt;br /&gt;
clean:&lt;br /&gt;
  \rm -f boostthreadexample.o&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
On the command line, you'd type&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load intel/15.0.2 openmpi/intel/1.6.4 cxxlibraries/boost/1.55.0-intel&lt;br /&gt;
$ make&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
(Note: openmpi is loaded here because the boost module includes the boost:mpi package, which is however not used in this example).&lt;br /&gt;
&lt;br /&gt;
===CMake example of using modules===&lt;br /&gt;
&lt;br /&gt;
Several versions of CMake are installed on the GPC: the default version is 2.6 (which may be too old in many cases), but more versions are available as modules. Currently (Dec 2015), the most recent installed version is &amp;lt;tt&amp;gt;cmake/3.4.0&amp;lt;/tt&amp;gt; (type &amp;lt;tt&amp;gt;module avail cmake&amp;lt;/tt&amp;gt; to see all available versions).&lt;br /&gt;
&lt;br /&gt;
Using cmake with the environment module ecosystem requires a few tweaks. Because these tweaks are almost never discussed in the compilation instructions of packages that use cmake, and are even hard to find in the CMake documentation itself, we explain them here.&lt;br /&gt;
&lt;br /&gt;
To compile the above example of a single-file C++ code using boost's threading library with CMake, one way is to have the CMakeLists.txt contain&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
include_directories($ENV{SCINET_BOOST_INC})&lt;br /&gt;
link_directories($ENV{SCINET_BOOST_LIB})&lt;br /&gt;
cmake_minimum_required(VERSION 2.6)&lt;br /&gt;
project(boostthreadexample)&lt;br /&gt;
add_executable(boostthreadexample boostthreadexample.cpp)&lt;br /&gt;
target_link_libraries(boostthreadexample boost_thread)&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Make sure the include_directories and link_directories commands are given before the add_executable, or they may not take effect.&lt;br /&gt;
&lt;br /&gt;
On the command line, you could then compile as follows:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load intel/15.0.2 openmpi/intel/1.6.4 cxxlibraries/boost/1.55.0-intel&lt;br /&gt;
$ cmake .&lt;br /&gt;
$ make&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
It is, however, better to use a separate build directory. (Note: openmpi is loaded here because the boost module includes the boost:mpi package, which is not used in this example.)&lt;br /&gt;
&lt;br /&gt;
An &amp;lt;i&amp;gt;alternative setup&amp;lt;/i&amp;gt; for working with cmake is to keep the CMakeLists.txt simple and standard:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
cmake_minimum_required(VERSION 2.6)&lt;br /&gt;
project(boostthreadexample)&lt;br /&gt;
add_executable(boostthreadexample boostthreadexample.cpp)&lt;br /&gt;
target_link_libraries(boostthreadexample boost_thread)&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
And then to use the environment variables to supply the paths:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load intel/15.0.2 openmpi/intel/1.6.4 cxxlibraries/boost/1.55.0-intel&lt;br /&gt;
$ export CMAKE_PREFIX_PATH=${SCINET_BOOST_BASE}&lt;br /&gt;
$ export CMAKE_INCLUDE_PATH=${SCINET_BOOST_INC}&lt;br /&gt;
$ export CMAKE_LIBRARY_PATH=${SCINET_BOOST_LIB}&lt;br /&gt;
$ export CMAKE_PROGRAM_PATH=${SCINET_BOOST_BIN}&lt;br /&gt;
$ cmake .&lt;br /&gt;
$ make&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Default and non-default modules ==&lt;br /&gt;
&lt;br /&gt;
When you load a module with its 'short' name, you will get the ''default'' version, which is (usually) the most recent, recommended version of that library or piece of software.  In general, using the short module name is the way to go. However, you may have code that depends on the intricacies of a non-default version.  For that reason, the most common older versions are also available as modules.  You can find all available modules using the &amp;lt;tt&amp;gt;module avail&amp;lt;/tt&amp;gt; command.&lt;br /&gt;
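&lt;br /&gt;
For example (the gcc versions shown are illustrative):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load gcc          # loads the default version&lt;br /&gt;
$ module load gcc/4.8.1    # loads a specific, non-default version&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;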
&lt;br /&gt;
== Naming convention ==&lt;br /&gt;
&lt;br /&gt;
For modules that access applications, the full name of a module is as follows.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
  [short-module-name]/[version-number]&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To have all modules conform to this convention, a number of modules' names changed on Nov 3, 2010:&lt;br /&gt;
{|border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;2&amp;quot;&lt;br /&gt;
| '''old name'''&lt;br /&gt;
| '''new name'''&lt;br /&gt;
| '''remarks'''&lt;br /&gt;
|-&lt;br /&gt;
|autoconf/autoconf-2.64 &amp;amp;nbsp; &amp;amp;nbsp;&amp;amp;nbsp;&lt;br /&gt;
|autoconf/2.64&lt;br /&gt;
|''short name unchanged''&lt;br /&gt;
|-&lt;br /&gt;
|cuda/cuda-3.0           &lt;br /&gt;
|cuda/3.0&lt;br /&gt;
|''default's short name unchanged''&lt;br /&gt;
|-&lt;br /&gt;
|cuda/cuda-3.1          &lt;br /&gt;
|cuda/3.1&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|debuggers/ddd-3.3.12   &lt;br /&gt;
|ddd/3.3.12&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|debuggers/gdb-7.1       &lt;br /&gt;
|gdb/7.1&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|editors/nano/2.2.4      &lt;br /&gt;
|nano/2.2.4&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|emacs/emacs-23.1        &lt;br /&gt;
|emacs/23.1.1&lt;br /&gt;
|''short name unchanged''&lt;br /&gt;
|-&lt;br /&gt;
|gcc/gcc-4.4.0           &lt;br /&gt;
|gcc/4.4.0&lt;br /&gt;
|''short name unchanged''&lt;br /&gt;
|-&lt;br /&gt;
|graphics/ncview         &lt;br /&gt;
|ncview/1.93&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|graphics/graphics       &lt;br /&gt;
|grace/5.1.22&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|                        &lt;br /&gt;
|gnuplot/4.2.6&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|svn/svn165              &lt;br /&gt;
|svn/1.6.5&lt;br /&gt;
|''short name unchanged''&lt;br /&gt;
|-&lt;br /&gt;
|visualization/paraview  &lt;br /&gt;
|paraview/3.8&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|amber10/amber10         &lt;br /&gt;
|amber/10.0.30 &lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|gamess/gamess           &lt;br /&gt;
|gamess/May2209 &amp;amp;nbsp;&lt;br /&gt;
|''default's short name unchanged''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==modulefind - Finding modules by name==&lt;br /&gt;
&lt;br /&gt;
The &amp;lt;tt&amp;gt;module avail&amp;lt;/tt&amp;gt; command will only show you modules whose names start with the argument that you give it, and will also return modules that you cannot load due to conflicts with already loaded modules.&lt;br /&gt;
&lt;br /&gt;
A little SciNet utility called &amp;lt;tt&amp;gt;modulefind&amp;lt;/tt&amp;gt; (one word) or &amp;lt;tt&amp;gt;module find&amp;lt;/tt&amp;gt; (two words) addresses this. It will list all installed modules whose names contain the argument, and will determine whether those modules have &lt;br /&gt;
been loaded, could be loaded, cannot be loaded because of conflicts with&lt;br /&gt;
already loaded modules, or have unresolved dependencies &lt;br /&gt;
(i.e. other modules need to be loaded first).  This is especially useful in cases like the &amp;quot;boost&amp;quot; libraries, whose module names are cxxlibraries/boost/1.47.0-gcc and cxxlibraries/boost/1.47.0-intel, for the gcc and intel compilers, respectively.  &amp;lt;tt&amp;gt;modulefind boost&amp;lt;/tt&amp;gt; will find those, whereas &amp;lt;tt&amp;gt;module avail boost&amp;lt;/tt&amp;gt; will not.&lt;br /&gt;
&lt;br /&gt;
Note that just &amp;lt;tt&amp;gt;module find&amp;lt;/tt&amp;gt; will list all top-level modules.&lt;br /&gt;
&lt;br /&gt;
== Making your own modules ==&lt;br /&gt;
&lt;br /&gt;
Making your own modules (e.g. for local installations or to access optional perl modules, ...) is possible, and is described on the [[Installing your own modules]] page.&lt;br /&gt;
&lt;br /&gt;
== Deprecated modules ==&lt;br /&gt;
&lt;br /&gt;
Some older software modules for which newer versions exist get deprecated, which means they are no longer maintained.  Since deprecated modules should only be needed in rare, exceptional cases, they are not listed by the &amp;lt;tt&amp;gt;module avail&amp;lt;/tt&amp;gt; command.  However, if you have a piece of legacy code that really depends on a deprecated version of a library (and we urge you to check that it does not work with newer versions!), then you can load the deprecated version with &amp;lt;pre&amp;gt;module load use.deprecated [deprecated-module-name]&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!--&lt;br /&gt;
&lt;br /&gt;
Currently (Oct 5,2010), the following modules are deprecated on the GPC: &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
gcc/gcc-4.3.2          hdf5/184-v16-serial     intel/intel-v11.1.046               openmpi/1.3.3-intel-v11.0-ofed&lt;br /&gt;
hdf5/183-v16-openmpi   hdf5/184-v18-intelmpi   intelmpi/impi-3.2.1.009             openmpi/1.3.2-intel-v11.0-ofed.orig&lt;br /&gt;
hdf5/183-v18-openmpi   hdf5/184-v18-openmpi    intelmpi/impi-3.2.2.006             pgplot/5.2.2-gcc.old            &lt;br /&gt;
hdf5/184-v16-intelmpi  hdf5/184-v18-serial     intelmpi/impi-4.0.0.013             pgplot/5.2.2-intel.old&lt;br /&gt;
hdf5/184-v16-openmpi   intel/intel-v11.0.081   intelmpi/impi-4.0.0.025               &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
On the TCS, currently (Oct 5,2010) the only deprecated module is:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ncl/5.1.1old&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
--&amp;gt;&lt;br /&gt;
Before using any of these deprecated modules, make sure that there is not a regular module that satisfies your needs, likely with a ''very similar name''.&lt;br /&gt;
&lt;br /&gt;
== Commercial software ==&lt;br /&gt;
&lt;br /&gt;
Apart from the compilers on our systems and the ddt parallel debugger, we generally do not provide licensed application software, e.g., no Gaussian, IDL, Matlab, etc. &lt;br /&gt;
See the [https://support.scinet.utoronto.ca/wiki/index.php/FAQ#How_can_I_run_Matlab_.2F_IDL_.2F_Gaussian_.2F_my_favourite_commercial_software_at_SciNet.3F FAQ].&lt;br /&gt;
&lt;br /&gt;
== Other software and libraries ==&lt;br /&gt;
&lt;br /&gt;
If you want to use a piece of software or a library that is not on the list, you can in principle install it yourself in your &amp;lt;tt&amp;gt;/home&amp;lt;/tt&amp;gt; directory.&lt;br /&gt;
Note, however, that building libraries and software from source often uses a lot of files. To avoid running out of disk space, building software is therefore best done on &amp;lt;tt&amp;gt;/scratch&amp;lt;/tt&amp;gt;, from which&lt;br /&gt;
you can copy/install only the libraries, header files and binaries to your &amp;lt;tt&amp;gt;/home&amp;lt;/tt&amp;gt; directory.&lt;br /&gt;
&lt;br /&gt;
If you suspect that a particular piece of software or a library would be of use to other users of SciNet as well, contact us, and we will consider adding it to the system.&lt;br /&gt;
&lt;br /&gt;
== Software lists ==&lt;br /&gt;
=== Gravity.Viz GPU Software ===&lt;br /&gt;
&lt;br /&gt;
The CPUs in the GPU nodes of the Gravity cluster are of the same kind as those of the GPC, so all modules available on the GPC are also available on the GPU nodes with a CentOS 6 image. Conversely, the different cuda variants that are available as modules can be loaded on the GPC nodes as well, although they are of little use on that system.&lt;br /&gt;
&lt;br /&gt;
=== GPC Software ===&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;10&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
!{{Hl2}}| Software  &lt;br /&gt;
!{{Hl2}}| Versions&lt;br /&gt;
!{{Hl2}}| Comments&lt;br /&gt;
!{{Hl2}}| Command/Library&lt;br /&gt;
!{{Hl2}}| Module Name&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Compilers'''''&lt;br /&gt;
|-  &lt;br /&gt;
|Intel Compiler&lt;br /&gt;
|12.1.3*, 12.1.5, 13.1.1, 14.0.1, 15.0, 15.0.1, 15.0.2, 15.0.6, 16.0.3&lt;br /&gt;
| includes MKL library, which includes BLAS, LAPACK, FFT, ...&lt;br /&gt;
| &amp;lt;tt&amp;gt;icpc,icc,ifort&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;intel&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| GCC Compiler&lt;br /&gt;
| 4.4.6, 4.6.1*, 4.7.0, 4.7.2, 4.8.1, 4.9.0, 5.2.0&lt;br /&gt;
| Version 5.2.0 supports [[Co-array_Fortran_on_the_GPC|coarrays]]&lt;br /&gt;
| &amp;lt;tt&amp;gt;gcc,g++,gfortran&amp;lt;br&amp;gt;caf,cafrun (gcc/5.2.0 only)&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;gcc&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Cuda&lt;br /&gt;
| 3.2, 4.0, 4.1*, 4.2, 5.0, 5.5, 6.0&lt;br /&gt;
| NVIDIA's extension to C for GPGPU programming&lt;br /&gt;
| &amp;lt;tt&amp;gt;nvcc&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;cuda&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| PGI Compiler&lt;br /&gt;
| 13.2&lt;br /&gt;
| supports OpenACC and CUDA Fortran &lt;br /&gt;
| &amp;lt;tt&amp;gt;pgcc,pgcpp,pgfortran&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;pgi&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| IntelMPI&lt;br /&gt;
| 4.0.2, 4.0.3, 4.1.2*, 5.0.1, 5.0.2&lt;br /&gt;
| MPICH2 based MPI (intelmpi/5.* are in the 'experimental' modules)&lt;br /&gt;
| &amp;lt;tt&amp;gt;mpicc,mpiCC,mpif77,mpif90&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;intelmpi&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| OpenMPI&lt;br /&gt;
| 1.4.4*, 1.5.4, 1.6.4, 1.8.3&lt;br /&gt;
|&lt;br /&gt;
| &amp;lt;tt&amp;gt;mpicc,mpiCC,mpif77,mpif90&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;openmpi&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| UPC&lt;br /&gt;
| 2.12.2&lt;br /&gt;
| Berkeley Unified Parallel C Implementation&lt;br /&gt;
| &amp;lt;tt&amp;gt;upcc&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;upc&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Haskell&lt;br /&gt;
| 7.4.2*, 6.12.3&lt;br /&gt;
| Haskell compiler and Cabal package manager&lt;br /&gt;
| &amp;lt;tt&amp;gt;ghc,cabal&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;haskell&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Editors'''''&lt;br /&gt;
|- &lt;br /&gt;
| Nano&lt;br /&gt;
| 2.2.4&lt;br /&gt;
| Nano's another editor&lt;br /&gt;
| &amp;lt;tt&amp;gt;nano&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;nano&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| Emacs&lt;br /&gt;
| 23.1.1, 24.4*&lt;br /&gt;
| New version of popular text editor&lt;br /&gt;
| &amp;lt;tt&amp;gt;emacs&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;emacs&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Vim&lt;br /&gt;
| 7.2.4*, 7.4.5&lt;br /&gt;
| Vi editor. No module needs to be loaded for the default version&lt;br /&gt;
| &amp;lt;tt&amp;gt;vim&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;vim&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| XEmacs&lt;br /&gt;
| 21.4.22&lt;br /&gt;
| XEmacs editor&lt;br /&gt;
| &amp;lt;tt&amp;gt;xemacs&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;xemacs&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Midnight Commander&lt;br /&gt;
| 4.7.0*, 4.8.14&lt;br /&gt;
| A visual file manager within the terminal with builtin editor and viewer&lt;br /&gt;
| &amp;lt;tt&amp;gt;mc, mcedit, mcview&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;mc&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Development tools'''''&lt;br /&gt;
|-&lt;br /&gt;
| Autoconf&lt;br /&gt;
| 2.68&lt;br /&gt;
|&lt;br /&gt;
| &amp;lt;tt&amp;gt;autoconf, ...&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;autoconf&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| Automake&lt;br /&gt;
| 1.11.2&lt;br /&gt;
|&lt;br /&gt;
| &amp;lt;tt&amp;gt;aclocal, automake&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;automake&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| CMake&lt;br /&gt;
| 2.8.6, 2.8.8*, 3.1.0&lt;br /&gt;
| cross-platform, open-source build system&lt;br /&gt;
| &amp;lt;tt&amp;gt;cmake&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;cmake&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Scons&lt;br /&gt;
| 2.0&lt;br /&gt;
| Software construction tool&lt;br /&gt;
| &amp;lt;tt&amp;gt;scons&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;scons&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| Git&lt;br /&gt;
| 1.7.1, 1.7.10*, 1.9.5&lt;br /&gt;
| Revision control system&lt;br /&gt;
| &amp;lt;tt&amp;gt;git,gitk&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;git&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| Git-annex&lt;br /&gt;
| 4.20130827, 5.20150219*&lt;br /&gt;
| Revision control system&lt;br /&gt;
| &amp;lt;tt&amp;gt;git,gitk&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;git-annex&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Intel tools&lt;br /&gt;
| 2013, 2015&lt;br /&gt;
| Intel Code Analysis Tools&lt;br /&gt;
| Vtune Amplifier XE, Inspector XE&lt;br /&gt;
| &amp;lt;tt&amp;gt;inteltools&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Mercurial&lt;br /&gt;
| 1.8.2&lt;br /&gt;
| Version control system&amp;lt;br&amp;gt;(part of the python module!)&lt;br /&gt;
| &amp;lt;tt&amp;gt;hg&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;python&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Debug and performance tools'''''&lt;br /&gt;
|-&lt;br /&gt;
| [http://www.allinea.com/products/ddt/ DDT]&lt;br /&gt;
| 4.0, 4.1, 4.2.1*&lt;br /&gt;
| Allinea's Distributed Debugging Tool, + MAP MPI Profiler&lt;br /&gt;
| &amp;lt;tt&amp;gt;ddt, map&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;ddt&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| DDD&lt;br /&gt;
| 3.3.12&lt;br /&gt;
| Data Display Debugger&lt;br /&gt;
| &amp;lt;tt&amp;gt;ddd&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;ddd&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| GDB&lt;br /&gt;
| 7.3.1*, 7.6&lt;br /&gt;
| GNU debugger (the intel idbc debugger is available by default)&lt;br /&gt;
| &amp;lt;tt&amp;gt;gdb&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;gdb&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| MPE2&lt;br /&gt;
| 2.4.5&lt;br /&gt;
| Multi-Processing Environment with intel + OpenMPI&lt;br /&gt;
| &amp;lt;tt&amp;gt;mpecc, mpefc, jumpshot&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;mpe&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [[Introduction_To_Performance#OpenSpeedShop_.28profiling.2C_MPI_tracing:_GPC.29 | OpenSpeedShop]]&lt;br /&gt;
| 2.1&lt;br /&gt;
| sampling and MPI tracing&lt;br /&gt;
| &amp;lt;tt&amp;gt;openss, ...&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;openspeedshop&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [[Introduction_To_Performance#Scalasca_.28profiling.2C_tracing:_TCS.2C_GPC.29 | Scalasca]]&lt;br /&gt;
| 1.3.3, 1.4.3*&lt;br /&gt;
| SCalable performance Analysis of LArge SCale Applications (Compiled with OpenMPI)&lt;br /&gt;
| &amp;lt;tt&amp;gt;scalasca&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;scalasca&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [http://ipm-hpc.sourceforge.net IPM]&lt;br /&gt;
| 0.983&lt;br /&gt;
| Integrated Performance Monitoring&lt;br /&gt;
| &amp;lt;tt&amp;gt;ipm, ipm_parse, ploticus,...&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;ipm&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| [[Performance_And_Debugging_Tools:_GPC#Valgrind | Valgrind]]&lt;br /&gt;
| 3.7.0*, 3.9.0&lt;br /&gt;
| Memory checking utility&lt;br /&gt;
| &amp;lt;tt&amp;gt;valgrind,cachegrind&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;valgrind&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Padb&lt;br /&gt;
| 3.2 &lt;br /&gt;
| examine and debug parallel programs&lt;br /&gt;
| &amp;lt;tt&amp;gt;padb&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;padb&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|&amp;lt;span id=&amp;quot;anchor_viz&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;'''''Visualization tools'''''&lt;br /&gt;
|- &lt;br /&gt;
| Grace&lt;br /&gt;
| 5.1.22&lt;br /&gt;
| Plotting utility&lt;br /&gt;
| &amp;lt;tt&amp;gt;xmgrace&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;grace&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| Gnuplot&lt;br /&gt;
| 4.2.6, 4.6.1*&lt;br /&gt;
| Plotting utility&amp;lt;br&amp;gt;Requires 'extras' module if used on compute nodes.&lt;br /&gt;
| &amp;lt;tt&amp;gt;gnuplot&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;gnuplot&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| [[ Using_Paraview | ParaView ]]&lt;br /&gt;
| 3.12.0*, 3.14.1, 4.1.0&lt;br /&gt;
| Scientific visualization, server only&lt;br /&gt;
| &amp;lt;tt&amp;gt;pvserver,pvbatch,pvpython&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;paraview&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| VMD&lt;br /&gt;
| 1.8.6, 1.9*&lt;br /&gt;
| Visualization and analysis utility&lt;br /&gt;
| &amp;lt;tt&amp;gt;vmd&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;vmd&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [https://visit.llnl.gov/ VisIt]&lt;br /&gt;
| 2.6.3, 2.6.3-parallel*, 2.10.0-bin, 2.10.0-bin_mesa&lt;br /&gt;
| Interactive, scalable, visualization, animation and analysis tool&lt;br /&gt;
| &amp;lt;tt&amp;gt;visit&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;visit&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| NCL/NCARG&lt;br /&gt;
| 6.0.0&lt;br /&gt;
| NCARG graphics and ncl utilities&lt;br /&gt;
| &amp;lt;tt&amp;gt;ncl&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;ncl&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| ROOT&lt;br /&gt;
| 5.30.00, 5.30.03*, 5.34.03, 6.02.02&lt;br /&gt;
| ROOT Analysis Framework from CERN&lt;br /&gt;
| &amp;lt;tt&amp;gt;root&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;ROOT&amp;lt;/tt&amp;gt; &lt;br /&gt;
|-&lt;br /&gt;
| ImageMagick&lt;br /&gt;
| 6.6.7&lt;br /&gt;
| Image manipulation tools&lt;br /&gt;
| &amp;lt;tt&amp;gt;convert,animate,composite,...&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;ImageMagick&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| PGPLOT&lt;br /&gt;
| 5.2.2&lt;br /&gt;
| Graphics subroutine library&lt;br /&gt;
| &amp;lt;tt&amp;gt;libcpgplot,libpgplot,libtkpgplot&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;pgplot&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [http://users.monash.edu.au/~dprice/splash Splash ]&lt;br /&gt;
| 2.6.0&lt;br /&gt;
| Visualization suite for SPH simulations&lt;br /&gt;
| &amp;lt;tt&amp;gt;asplash, dsplash, gsplash, nsplash, rsplash, srsplash, ssplash, tsplash, vsplash&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;splash&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Storage tools and libraries'''''&lt;br /&gt;
|- &lt;br /&gt;
| NetCDF&lt;br /&gt;
| 4.1.3*, 4.2.1, 4.3.2&lt;br /&gt;
| Scientific data storage and retrieval&lt;br /&gt;
| &amp;lt;tt&amp;gt;ncdump,ncgen,libnetcdf&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;netcdf&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| Ncview&lt;br /&gt;
| 2.1.1, 2.1.&lt;br /&gt;
| Visualization for NetCDF files&lt;br /&gt;
| &amp;lt;tt&amp;gt;ncview&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;ncview&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| NCO&lt;br /&gt;
| 4.0.8, 4.3.2&lt;br /&gt;
| NCO utilities to manipulate netCDF files&lt;br /&gt;
| &amp;lt;tt&amp;gt;ncap, ncap2, ncatted, etc.&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;nco&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| CDO&lt;br /&gt;
| 1.5.1, 1.5.4*, 1.6.1&lt;br /&gt;
| Climate Data Operators&lt;br /&gt;
| &amp;lt;tt&amp;gt;cdo&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;cdo&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| UDUNITS&lt;br /&gt;
| 2.1.11&lt;br /&gt;
| unit conversion utilities&lt;br /&gt;
| &amp;lt;tt&amp;gt;libudunits2&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;udunits&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| HDF4&lt;br /&gt;
| 4.2.6&lt;br /&gt;
| Scientific data storage and retrieval&lt;br /&gt;
| &amp;lt;tt&amp;gt;h4fc,hdiff,...,libdf,libsz&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;hdf4&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| [[Hdf5 | HDF5]]&lt;br /&gt;
| 1.8.7-v18*&lt;br /&gt;
| Scientific data storage and retrieval, parallel I/O&lt;br /&gt;
| &amp;lt;tt&amp;gt;h5ls, h5diff, ..., libhdf5&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;hdf5&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| [http://www.arg0.net/encfs EncFS ]&lt;br /&gt;
| 1.74&lt;br /&gt;
| EncFS provides an encrypted filesystem in user-space, (works ONLY on gpc01..04)&lt;br /&gt;
| &amp;lt;tt&amp;gt;encfs&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;encfs&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Applications'''''&lt;br /&gt;
|- &lt;br /&gt;
| [[amber|AMBER 10]]&lt;br /&gt;
| Amber 10 + Amber Tools 1.3&lt;br /&gt;
| Amber Molecular Dynamics Package&lt;br /&gt;
| &amp;lt;tt&amp;gt;sander, sander.MPI&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;amber&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| antlr&lt;br /&gt;
| 2.7.7&lt;br /&gt;
| ANother Tool for Language Recognition&lt;br /&gt;
| &amp;lt;tt&amp;gt;antlr, antlr-config&amp;lt;br&amp;gt;libantlr, antlr.jar, antlr.py&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;antlr&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [[gamess|GAMESS (US)]]&lt;br /&gt;
| August 18, 2011 R1&lt;br /&gt;
| General Atomic and Molecular Electronic Structure System&lt;br /&gt;
| &amp;lt;tt&amp;gt;rungms&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;gamess&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [[gromacs|GROMACS]]&lt;br /&gt;
| 4.5.5, 4.5.7, 4.6.2, 4.6.3, 4.6.7, 5.0.4&lt;br /&gt;
| GROMACS molecular dynamics, single precision, MPI&lt;br /&gt;
| &amp;lt;tt&amp;gt;grompp, mdrun, gmx, gmx_mpi&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;gromacs&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [[namd|NAMD]]&lt;br /&gt;
| 2.8, 2.9*&lt;br /&gt;
| NAMD - Scalable Molecular Dynamics&lt;br /&gt;
| &amp;lt;tt&amp;gt;namdmpiexec, namd2&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;namd&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [[nwchem|NWChem]]&lt;br /&gt;
| 6.0&lt;br /&gt;
| NWChem Quantum Chemistry&lt;br /&gt;
| &amp;lt;tt&amp;gt;nwchem&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;nwchem&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [http://www.quantum-espresso.org/index.php Quantum Espresso]&lt;br /&gt;
| 4.3.2, 5.0.3&lt;br /&gt;
| Quantum Chemistry &lt;br /&gt;
| &amp;lt;tt&amp;gt;pw.x, ...&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;espresso&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [http://blast.ncbi.nlm.nih.gov BLAST]&lt;br /&gt;
| 2.2.23+&lt;br /&gt;
| Basic Local Alignment Search Tool&lt;br /&gt;
| &amp;lt;tt&amp;gt;blastn,blastp,blastx,psiblast,tblastn...&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;blast&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|-&lt;br /&gt;
| [http://denovoassembler.sourceforge.net RAY]&lt;br /&gt;
| 2.1.0 (small k-mer), 2.2.0, 2.3.1&lt;br /&gt;
| Parallel de novo genome assemblies&lt;br /&gt;
| &amp;lt;tt&amp;gt;Ray&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;ray&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [https://www.cp2k.org CP2K]&lt;br /&gt;
| 3.0&lt;br /&gt;
| DFT molecular dynamics, MPI&lt;br /&gt;
| &amp;lt;tt&amp;gt;cp2k.psmp&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;cp2k&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [[cpmd|CPMD]]&lt;br /&gt;
| 3.13.2&lt;br /&gt;
| Car-Parrinello molecular dynamics, MPI&lt;br /&gt;
| &amp;lt;tt&amp;gt;cpmd.x&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;cpmd&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [[R Statistical Package|R]] &lt;br /&gt;
| 2.13.1, 2.14.1, 2.15.1*, 3.0.0, 3.0.1, 3.1.1&lt;br /&gt;
| statistical computing&lt;br /&gt;
| &amp;lt;tt&amp;gt;R&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;R&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Octave&lt;br /&gt;
| 3.4.3*, 3.8.1&lt;br /&gt;
| Matlab-like environment&lt;br /&gt;
| &amp;lt;tt&amp;gt;octave&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;octave&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [http://www.openfoam.org OpenFOAM ]&lt;br /&gt;
| 2.1.0, 2.3.0&lt;br /&gt;
| Open Source CFD Package&lt;br /&gt;
| &amp;lt;tt&amp;gt;*foam&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;openfoam&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Bedtools&lt;br /&gt;
| 2.21.0&lt;br /&gt;
| Toolset for genome arithmetic&lt;br /&gt;
| &amp;lt;tt&amp;gt;bedtools ...&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;bedtools&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Samtools&lt;br /&gt;
| 0.1.19&lt;br /&gt;
| Suite of programs for interacting with high-throughput sequencing data&lt;br /&gt;
|&lt;br /&gt;
| &amp;lt;tt&amp;gt;samtools&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [http://mrbayes.sourceforge.net/  MrBayes ]&lt;br /&gt;
| 3.2.4&lt;br /&gt;
| MrBayes is a program for Bayesian inference and model choice across a wide range of phylogenetic and evolutionary models.&lt;br /&gt;
| &amp;lt;tt&amp;gt;mb&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;mrbayes&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Stacks&lt;br /&gt;
| 1.28 1.29*&lt;br /&gt;
| A software pipeline for building loci from short-read sequences.&lt;br /&gt;
| &lt;br /&gt;
| &amp;lt;tt&amp;gt;stacks&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Libraries'''''&lt;br /&gt;
|- &lt;br /&gt;
| [http://www.mcs.anl.gov/petsc/petsc-as/  PETSc ]&lt;br /&gt;
| 3.1*, 3.2, 3.3, 3.4.4&lt;br /&gt;
| Portable, Extensible Toolkit for Scientific Computation (PETSc)&lt;br /&gt;
| &amp;lt;tt&amp;gt;libpetsc, etc.. &amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;petsc&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| BOOST&lt;br /&gt;
| 1.47.0, 1.54, 1.55&lt;br /&gt;
| C++ Boost libraries&lt;br /&gt;
| &amp;lt;tt&amp;gt;libboost...&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;cxxlibraries/boost&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Rarray&lt;br /&gt;
| 1.0&lt;br /&gt;
| C++ multidimensional array library&lt;br /&gt;
| &amp;lt;tt&amp;gt;rarray&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;cxxlibraries/rarray&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| [[Armadillo C++ linear algebra library | Armadillo]]&lt;br /&gt;
| 3.910.0&lt;br /&gt;
| C++ armadillo libraries (implement Matlab-like syntax)&lt;br /&gt;
| &amp;lt;tt&amp;gt;armadillo&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;armadillo&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| [http://www.tacc.utexas.edu/tacc-projects/gotoblas2/ GotoBLAS]&lt;br /&gt;
| 1.13&lt;br /&gt;
| Optimized BLAS implementation &lt;br /&gt;
| &amp;lt;tt&amp;gt;libgoto2&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;gotoblas&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| OpenBLAS&lt;br /&gt;
| 1.13, 0.2.13*&lt;br /&gt;
| Open BLAS implementation including lapack, cblas and lapacke. Note that compilations with the Intel c/c++ compiler will need to link with &amp;lt;tt&amp;gt;-lifcore&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;libopenblas&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;openblas&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| GSL&lt;br /&gt;
| 1.13*, 1.15&lt;br /&gt;
| GNU Scientific Library&lt;br /&gt;
| &amp;lt;tt&amp;gt;libgsl, libgslcblas&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;gsl&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| FFTW&lt;br /&gt;
| 2.1.5, 3.3.0, 3.3.3*&lt;br /&gt;
| fast Fourier transform library&lt;br /&gt;
''Be careful in combining fftw3 and MKL: you need to link fftw3 first, with'' &amp;lt;tt&amp;gt;-L${SCINET_FFTW_LIB} -lfftw3&amp;lt;/tt&amp;gt;, then link MKL&lt;br /&gt;
| &amp;lt;tt&amp;gt;libfftw3&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;fftw&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| LAPACK&lt;br /&gt;
| &lt;br /&gt;
| Provided by the Intel MKL library&lt;br /&gt;
| See http://software.intel.com/en-us/articles/intel-mkl-link-line-advisor/&lt;br /&gt;
| &amp;lt;tt&amp;gt;intel&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| [http://freshmeat.net/projects/rlog  RLog ]&lt;br /&gt;
| 1.4&lt;br /&gt;
| RLog provides a flexible message logging facility for C++ programs and libraries.&lt;br /&gt;
| &amp;lt;tt&amp;gt;librlog&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;cxxlibraries/rlog&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Scripting/interpreted languages'''''&lt;br /&gt;
|-&lt;br /&gt;
| [[GNU Parallel]]&lt;br /&gt;
| 2012-10-22, 20140622*&lt;br /&gt;
| execute commands in parallel&lt;br /&gt;
| &amp;lt;tt&amp;gt;parallel&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;gnu-parallel&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| [[Python]]&lt;br /&gt;
| 2.7.2*, 2.7.3, 2.7.5, 2.7.8&lt;br /&gt;
| Python programming language. See [[Python]] page for details on installed packages.&lt;br /&gt;
| &amp;lt;tt&amp;gt;python&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;python&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| [[Python]]&lt;br /&gt;
| 3.3.4&lt;br /&gt;
| Python programming language. Modules included: numpy-1.8.1, scipy-0.14.0, matplotlib-1.3.1, ipython-1.2.1, cython-0.20.1, h5py-2.3.0, tables-3.1.1, netCDF4-1.1.0, astropy-0.3.2, scikit_learn-0.15.0b1&lt;br /&gt;
| &amp;lt;tt&amp;gt;python&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;python&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| Ruby&lt;br /&gt;
| 1.9.1*, 1.9.3&lt;br /&gt;
| Ruby programming language&lt;br /&gt;
| &amp;lt;tt&amp;gt;ruby&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;ruby&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Java&lt;br /&gt;
| 1.6.0, 1.7.1*&lt;br /&gt;
| IBM's Java JRE and SDK&lt;br /&gt;
| &amp;lt;tt&amp;gt;java&amp;lt;/tt&amp;gt;, &amp;lt;tt&amp;gt;javac&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;java&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Other'''''&lt;br /&gt;
|-&lt;br /&gt;
| [[VNC]]&lt;br /&gt;
|&lt;br /&gt;
| 'Virtual Network Computing', an alternative to X forwarding. Only works on the devel nodes. See our [[VNC]] wiki page.&lt;br /&gt;
| &amp;lt;tt&amp;gt;vncstart, vncstop, vncstatus,...&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt; vnc&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Xlibraries&lt;br /&gt;
|&lt;br /&gt;
| A collection of X graphics libraries and tools&lt;br /&gt;
| &amp;lt;tt&amp;gt;xterm&amp;lt;/tt&amp;gt;, &amp;lt;tt&amp;gt;xpdf&amp;lt;/tt&amp;gt;, ...&lt;br /&gt;
| &amp;lt;tt&amp;gt;Xlibraries&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Extras&lt;br /&gt;
|&lt;br /&gt;
| A collection of standard linux and home-grown tools&lt;br /&gt;
| &amp;lt;tt&amp;gt;bc&amp;lt;/tt&amp;gt;, &amp;lt;tt&amp;gt;screen&amp;lt;/tt&amp;gt;, &amp;lt;tt&amp;gt;xxdiff&amp;lt;/tt&amp;gt;, &amp;lt;tt&amp;gt;modulefind&amp;lt;/tt&amp;gt;, &amp;lt;tt&amp;gt;ish&amp;lt;/tt&amp;gt;, ...&lt;br /&gt;
| &amp;lt;tt&amp;gt;extras&amp;lt;/tt&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
''* Several versions of this module are installed; listed is the default version.''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!--&lt;br /&gt;
&lt;br /&gt;
=== TCS Software ===&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;10&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
!{{Hl2}} |Software  &lt;br /&gt;
!{{Hl2}}| Version&lt;br /&gt;
!{{Hl2}}| Comments&lt;br /&gt;
!{{Hl2}}| Command/Library&lt;br /&gt;
!{{Hl2}}| Module Name&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Compilers'''''&lt;br /&gt;
|-&lt;br /&gt;
|IBM compilers&lt;br /&gt;
|10.1,11.1,12.1(c/c++)&amp;lt;br&amp;gt;12.1,13.1,14.1(fortran)&lt;br /&gt;
| See [[TCS Quickstart]]&lt;br /&gt;
| &amp;lt;tt&amp;gt;xlc,xlC,xlf,xlc_r,xlC_r,xlf_r&amp;lt;/tt&amp;gt;&lt;br /&gt;
| ''oldest versions are standard available'' ''xlf'' ''vacpp''&lt;br /&gt;
|-&lt;br /&gt;
|IBM MPI library&lt;br /&gt;
|&lt;br /&gt;
| See [[TCS Quickstart]]&lt;br /&gt;
| &amp;lt;tt&amp;gt;mpcc,mpCC,mpxlf,mpcc_r,mpCC_r,mpxlf_r&amp;lt;/tt&amp;gt;&lt;br /&gt;
| ''standard available''&lt;br /&gt;
|-&lt;br /&gt;
| UPC&lt;br /&gt;
| 1.2&lt;br /&gt;
| Unified Parallel C&lt;br /&gt;
| &amp;lt;tt&amp;gt;xlupc&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;upc&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|IBM fortran compiler&lt;br /&gt;
|13.1, 14.1&lt;br /&gt;
| newer version &lt;br /&gt;
| &amp;lt;tt&amp;gt;xlf,xlf_r&amp;lt;/tt&amp;gt;&lt;br /&gt;
| xlf/13.1&lt;br /&gt;
|-&lt;br /&gt;
|IBM c/c++ compilers&lt;br /&gt;
|11.1, 12.1&lt;br /&gt;
| new versions&lt;br /&gt;
| &amp;lt;tt&amp;gt;xlc,xlC,xlc_r,xlC_r&amp;lt;/tt&amp;gt;&lt;br /&gt;
| vacpp&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Debug/performance tools'''''&lt;br /&gt;
|- &lt;br /&gt;
| [http://www.allinea.com/products/ddt/ DDT]&lt;br /&gt;
| 4.0, 4.1, 4.2.1*&lt;br /&gt;
| Allinea's Distributed Debugging Tool&lt;br /&gt;
| &amp;lt;tt&amp;gt;ddt&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;ddt&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| MPE2&lt;br /&gt;
| 1.0.6&lt;br /&gt;
| Performance Visualization for Parallel Programs   &lt;br /&gt;
| &amp;lt;tt&amp;gt;libmpe&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;mpe&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Scalasca&lt;br /&gt;
| 1.2&lt;br /&gt;
| SCalable performance Analysis of LArge SCale Applications&lt;br /&gt;
| &amp;lt;tt&amp;gt;scalasca, ...&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;scalasca&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Storage tools and libraries'''''&lt;br /&gt;
|-&lt;br /&gt;
| HDF4&lt;br /&gt;
| 4.2.5&lt;br /&gt;
| Scientific data storage and retrieval&lt;br /&gt;
| &amp;lt;tt&amp;gt;h4fc, hdiff, ..., libdf, libsz&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;hdf4&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| HDF5&lt;br /&gt;
|&lt;br /&gt;
| Scientific data storage and retrieval, parallel I/O&amp;lt;br&amp;gt;Part of the extras module on the tcs:&amp;lt;br&amp;gt;compile with &amp;lt;tt&amp;gt;-I${SCINET_EXTRAS_INC}&amp;lt;/tt&amp;gt;&amp;lt;br&amp;gt; link with &amp;lt;tt&amp;gt;-L${SCINET_EXTRAS_LIB}&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;libhdf5&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;extras&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| NetCDF + ncview&lt;br /&gt;
| 4.0.1*&lt;br /&gt;
| Scientific data storage and retrieval&lt;br /&gt;
| &amp;lt;tt&amp;gt;ncdump, ncgen, libnetcdf, ncview&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;netcdf&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| parallel netCDF&lt;br /&gt;
| 1.1.1*&lt;br /&gt;
| Scientific data storage and retrieval using MPI-IO&lt;br /&gt;
| &amp;lt;tt&amp;gt;libpnetcdf.a&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;parallel-netcdf&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| NCO&lt;br /&gt;
| 3.9.6*&lt;br /&gt;
| NCO utilities to manipulate netCDF files&lt;br /&gt;
| &amp;lt;tt&amp;gt;ncap, ncap2, ncatted, &amp;lt;/tt&amp;gt; etc.&lt;br /&gt;
| &amp;lt;tt&amp;gt;nco&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Libraries'''''&lt;br /&gt;
|-&lt;br /&gt;
| FFTW&lt;br /&gt;
| 3.2.2&lt;br /&gt;
| Fast Fourier transform library&amp;lt;br&amp;gt;Part of the extras module on the tcs:&amp;lt;br&amp;gt;compile with &amp;lt;tt&amp;gt;-I${SCINET_EXTRAS_INC}&amp;lt;/tt&amp;gt;&amp;lt;br&amp;gt; link with &amp;lt;tt&amp;gt;-L${SCINET_EXTRAS_LIB}&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;libfftw, libfftw_mpi,libfftw3&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;extras&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| GSL&lt;br /&gt;
| 1.13&lt;br /&gt;
| GNU Scientific Library&lt;br /&gt;
| &amp;lt;tt&amp;gt;libgsl, libgslcblas&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;gsl&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| LAPACK+SCALAPACK&lt;br /&gt;
| 3.4.2+2.0.2&lt;br /&gt;
| Linear algebra package. Note that essl, which comes with the ibm compilers contains a large part of lapack as well.&lt;br /&gt;
| &amp;lt;tt&amp;gt;liblapack,libscalapack,libblacs&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;lapack&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| PetSc&lt;br /&gt;
| 3.2&lt;br /&gt;
| Portable, Extensible Toolkit for Scientific Computation. With external packages mumps, chaco, hypre, parmetis, prometheus, plapack, superlu, sprng.&lt;br /&gt;
| &amp;lt;tt&amp;gt;libpetsc,...&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;petsc&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| extras&lt;br /&gt;
|&lt;br /&gt;
| Adds paths to a fuller set of libraries to your user environment&amp;lt;br&amp;gt; compile with &amp;lt;tt&amp;gt;-I${SCINET_EXTRAS_INC}&amp;lt;/tt&amp;gt;&amp;lt;br&amp;gt; link with &amp;lt;tt&amp;gt;-L${SCINET_EXTRAS_LIB}&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;libfftw, libfftw_mpi, libfftw3, libhdf5, liblapack, ...&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;extras&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Applications'''''&lt;br /&gt;
|-&lt;br /&gt;
| [http://www.quantum-espresso.org/index.php Quantum Espresso]&lt;br /&gt;
| 5.4.0&lt;br /&gt;
| Quantum Chemistry &lt;br /&gt;
| &amp;lt;tt&amp;gt;pw.x, ...&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;espresso&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Other'''''&lt;br /&gt;
|-&lt;br /&gt;
| gmake&lt;br /&gt;
| 3.82&lt;br /&gt;
| GNU's make. Replaces AIX make or gmake 3.80.&lt;br /&gt;
| &amp;lt;tt&amp;gt;make&amp;lt;/tt&amp;gt;, &amp;lt;tt&amp;gt;gmake&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;gmake&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| antlr&lt;br /&gt;
| 2.7.7&lt;br /&gt;
| ANother Tool for Language Recognition&lt;br /&gt;
| &amp;lt;tt&amp;gt;antlr, antlr-config&amp;lt;br&amp;gt;libantlr, antlr.jar, antlr.py&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;antlr&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| NCL&lt;br /&gt;
| 5.1.1&lt;br /&gt;
| NCAR Command Language&lt;br /&gt;
| &amp;lt;tt&amp;gt;ncl, libncl, ...&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;ncl&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
''* Several versions of this module are installed; listed is the default version.''&lt;br /&gt;
&lt;br /&gt;
--&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== P7 Software ===&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;10&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
!{{Hl2}} |Software  &lt;br /&gt;
!{{Hl2}}| Version&lt;br /&gt;
!{{Hl2}}| Comments&lt;br /&gt;
!{{Hl2}}| Command/Library&lt;br /&gt;
!{{Hl2}}| Module Name&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Compilers'''''&lt;br /&gt;
|-&lt;br /&gt;
|IBM fortran compiler&lt;br /&gt;
|14.1, 13.1&lt;br /&gt;
|See [[P7 Linux Cluster]]&lt;br /&gt;
|&amp;lt;tt&amp;gt;xlf,xlf_r,xlf90,...&amp;lt;/tt&amp;gt;&lt;br /&gt;
|xlf&lt;br /&gt;
|-&lt;br /&gt;
|IBM c/c++ compilers&lt;br /&gt;
|12.1, 11.1&lt;br /&gt;
|See [[P7 Linux Cluster]]&lt;br /&gt;
|&amp;lt;tt&amp;gt;xlc,xlC,xlc_r,xlC_r,...&amp;lt;/tt&amp;gt;&lt;br /&gt;
|vacpp&lt;br /&gt;
|-&lt;br /&gt;
| Binutils&lt;br /&gt;
| 2.23.2&lt;br /&gt;
| &lt;br /&gt;
| &amp;lt;tt&amp;gt;addr2line, ar, ld, ...&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;binutils&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
|IBM MPI library&lt;br /&gt;
|5.2.2&lt;br /&gt;
|IBM's Parallel Environment&lt;br /&gt;
|&amp;lt;tt&amp;gt;mpcc,mpCC,mpfort,mpiexec&amp;lt;/tt&amp;gt;&lt;br /&gt;
|pe&lt;br /&gt;
|- &lt;br /&gt;
| GCC Compiler&lt;br /&gt;
| 4.6.1 , 4.8.1&lt;br /&gt;
| GNU Compiler Collection&lt;br /&gt;
| &amp;lt;tt&amp;gt;gcc,g++,gfortran&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;gcc&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Java&lt;br /&gt;
| 7.0&lt;br /&gt;
| IBM Java 1.7 implementation&lt;br /&gt;
| &amp;lt;tt&amp;gt;javac&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;jdk&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Debug/performance tools'''''&lt;br /&gt;
|-&lt;br /&gt;
| [http://www.allinea.com/products/ddt/ DDT]&lt;br /&gt;
| 4.0, 4.1, 4.2.1*&lt;br /&gt;
| Allinea's Distributed Debugging Tool&lt;br /&gt;
| &amp;lt;tt&amp;gt;ddt&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;ddt&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Storage tools and libraries'''''&lt;br /&gt;
|-&lt;br /&gt;
| HDF5&lt;br /&gt;
| 1.8.7&lt;br /&gt;
| Scientific data storage and retrieval, parallel I/O&lt;br /&gt;
| &amp;lt;tt&amp;gt;libhdf5&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;hdf5&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| NetCDF&lt;br /&gt;
| 4.1.3&lt;br /&gt;
| Scientific data storage and retrieval&lt;br /&gt;
| &amp;lt;tt&amp;gt;ncdump, ncgen, libnetcdf&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;netcdf&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| parallel netCDF&lt;br /&gt;
| 1.2.0&lt;br /&gt;
| Scientific data storage and retrieval using MPI-IO&lt;br /&gt;
| &amp;lt;tt&amp;gt;libpnetcdf.a&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;parallel-netcdf&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| NCO&lt;br /&gt;
| 4.0.8&lt;br /&gt;
| NCO utilities to manipulate netCDF files&lt;br /&gt;
| &amp;lt;tt&amp;gt;ncap2, ncatted, &amp;lt;/tt&amp;gt; etc.&lt;br /&gt;
| &amp;lt;tt&amp;gt;nco&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Libraries'''''&lt;br /&gt;
|-&lt;br /&gt;
| GSL&lt;br /&gt;
| 1.13&lt;br /&gt;
| GNU Scientific Library&lt;br /&gt;
| &amp;lt;tt&amp;gt;libgsl, libgslcblas&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;gsl&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Scripting/interpreted languages'''''&lt;br /&gt;
|-&lt;br /&gt;
| [[Python]]&lt;br /&gt;
| 2.7.5&lt;br /&gt;
| Python programming language. Modules included : numpy-1.8.0 , scipy-0.13.2 , matplotlib-1.3.1 , pyfits-3.2 , h5py-2.2.1&lt;br /&gt;
| &amp;lt;tt&amp;gt;python&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;python&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Other'''''&lt;br /&gt;
|-&lt;br /&gt;
| gnuplot&lt;br /&gt;
| 4.6.1&lt;br /&gt;
| command-driven interactive function and data plotting program&lt;br /&gt;
| &amp;lt;tt&amp;gt;gnuplot&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;gnuplot&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| antlr&lt;br /&gt;
| 2.7.7&lt;br /&gt;
| ANother Tool for Language Recognition&lt;br /&gt;
| &amp;lt;tt&amp;gt;antlr, antlr-config&amp;lt;br&amp;gt;libantlr, antlr.jar, antlr.py&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;antlr&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| udunits&lt;br /&gt;
| 2.1.11&lt;br /&gt;
| unit conversion utilities&lt;br /&gt;
| &amp;lt;tt&amp;gt;libudunits2&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;udunits&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| extras&lt;br /&gt;
|&lt;br /&gt;
| Adds paths to a fuller set of applications and libraries to your user environment&lt;br /&gt;
| &amp;lt;tt&amp;gt;bindlaunch, ...&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;extras&amp;lt;/tt&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Manuals=&lt;br /&gt;
{{:Manuals}}&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=FAQ&amp;diff=9383</id>
		<title>FAQ</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=FAQ&amp;diff=9383"/>
		<updated>2018-08-09T17:30:57Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
__TOC__&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==The Basics==&lt;br /&gt;
===Whom do I contact for support?===&lt;br /&gt;
&lt;br /&gt;
Whom do I contact if I have problems or questions about how to use the SciNet systems?&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
E-mail [mailto:support@scinet.utoronto.ca &amp;lt;support@scinet.utoronto.ca&amp;gt;]  &lt;br /&gt;
&lt;br /&gt;
In your email, please include the following information:&lt;br /&gt;
&lt;br /&gt;
* your username on SciNet&lt;br /&gt;
* the cluster that your question pertains to (GPC or TCS; SciNet is not a cluster!),&lt;br /&gt;
* any relevant error messages&lt;br /&gt;
* the commands you typed before the errors occurred&lt;br /&gt;
* the path to your code (if applicable)&lt;br /&gt;
* the location of the job scripts (if applicable)&lt;br /&gt;
* the directory from which it was submitted (if applicable)&lt;br /&gt;
* a description of what it is supposed to do (if applicable)&lt;br /&gt;
* if your problem is about connecting to SciNet, the type of computer you are connecting from.&lt;br /&gt;
&lt;br /&gt;
Note that your password should never, never, never be sent to us, even if your question is about your account.&lt;br /&gt;
&lt;br /&gt;
Try to avoid sending email only to specific individuals at SciNet. Your chances of a quick reply increase significantly if you email our team!&lt;br /&gt;
&lt;br /&gt;
===What does ''code scaling'' mean?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
Please see [[Introduction_To_Performance#Parallel_Speedup|A Performance Primer]]&lt;br /&gt;
&lt;br /&gt;
===What do you mean by ''throughput''?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
Please see [[Introduction_To_Performance#Throughput|A Performance Primer]].&lt;br /&gt;
&lt;br /&gt;
Here is a simple example:&lt;br /&gt;
&lt;br /&gt;
Suppose you need to do 10 computations.  Say each of these runs for&lt;br /&gt;
1 day on 8 cores, but they take &amp;quot;only&amp;quot; 18 hours on 16 cores.  What is the&lt;br /&gt;
fastest way to get all 10 computations done - as 8-core jobs or as&lt;br /&gt;
16-core jobs?  Let us assume you have 2 nodes at your disposal.&lt;br /&gt;
The answer, after some simple arithmetic, is that running your 10&lt;br /&gt;
jobs as 8-core jobs will take 5 days, whereas if you ran them&lt;br /&gt;
as 16-core jobs it would take 7.5 days.  Draw your own conclusions.&lt;br /&gt;
&lt;br /&gt;
===I changed my .bashrc/.bash_profile and now nothing works===&lt;br /&gt;
&lt;br /&gt;
The default startup scripts provided by SciNet, and guidelines for them, can be found [[Important_.bashrc_guidelines|here]].  Certain things - like sourcing &amp;lt;tt&amp;gt;/etc/profile&amp;lt;/tt&amp;gt;&lt;br /&gt;
and &amp;lt;tt&amp;gt;/etc/bashrc&amp;lt;/tt&amp;gt; - are ''required'' for various SciNet routines to work!   &lt;br /&gt;
&lt;br /&gt;
If the situation is so bad that you cannot even log in, please send an email to [mailto:support@scinet.utoronto.ca support].&lt;br /&gt;
&lt;br /&gt;
===Could I have my login shell changed to (t)csh?===&lt;br /&gt;
&lt;br /&gt;
The login shell used on our systems is bash. While tcsh is available on the GPC and the TCS, we do not support it as the default login shell at present.  So &amp;quot;chsh&amp;quot; will not work, but you can always run tcsh interactively. Also, csh scripts will be executed correctly provided that they have the correct &amp;quot;shebang&amp;quot; &amp;lt;tt&amp;gt;#!/bin/tcsh&amp;lt;/tt&amp;gt; at the top.&lt;br /&gt;
&lt;br /&gt;
===How can I run Matlab / IDL / Gaussian / my favourite commercial software at SciNet?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
Because SciNet serves such a disparate group of user communities, there is just no way we can buy licenses for everyone's commercial package.   The only commercial software we have purchased is that which in principle can benefit everyone -- fast compilers and math libraries (Intel's on GPC, and IBM's on TCS).&lt;br /&gt;
&lt;br /&gt;
If your research group requires a commercial package that you already have or are willing to buy licenses for, contact us at [mailto:support@scinet.utoronto.ca support@scinet] and we can work together to find out if it is feasible to implement the package's licensing arrangement on the SciNet clusters, and if so, what the best way is to do it.&lt;br /&gt;
&lt;br /&gt;
Note that it is important that you contact us before installing commercially licensed software on SciNet machines, even if you have a way to do it in your own directory without requiring sysadmin intervention.   It puts us in a very awkward position if someone is found to be running unlicensed or invalidly licensed software on our systems, so we need to be aware of what is being installed where.&lt;br /&gt;
&lt;br /&gt;
===Do you have a recommended ssh program that will allow scinet access from Windows machines?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
The [[Ssh#SSH_for_Windows_Users | SSH for Windows users]] programs we recommend are:&lt;br /&gt;
&lt;br /&gt;
* [http://mobaxterm.mobatek.net/en/ MobaXterm] is a tabbed ssh client with some Cygwin tools, including ssh and X, all wrapped up into one executable.&lt;br /&gt;
* [http://www.chiark.greenend.org.uk/~sgtatham/putty/ PuTTY]  - this is a terminal for Windows that connects via ssh.  It is a quick install and will get you up and running quickly.&amp;lt;br/&amp;gt; '''WARNING:''' Make sure you download putty from the official website, because there are &amp;quot;trojanized&amp;quot; versions of putty around that will send your login information to a site in Russia (as reported [http://blogs.cisco.com/security/trojanized-putty-software here]).&amp;lt;br&amp;gt;To set up your passphrase protected ssh key with putty, see [http://the.earth.li/~sgtatham/putty/0.61/htmldoc/Chapter8.html#pubkey here].&lt;br /&gt;
* [http://www.cygwin.com/ CygWin] - this is a whole Linux-like environment for Windows, which also includes an X window server so that you can display remote windows on your desktop.  Make sure you include the openssh and X window system in the installation for full functionality.  This is recommended if you will be doing a lot of work on Linux machines, as it makes a very similar environment available on your computer.&amp;lt;br&amp;gt;To set up your ssh keys, follow the Linux instructions on the [[Ssh keys]] page.&lt;br /&gt;
&lt;br /&gt;
===My ssh key does not work! WARNING: REMOTE HOST IDENTIFICATION HAS CHANGED! ===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
[[Ssh_keys#Testing_Your_Key | Testing Your Key]]&lt;br /&gt;
&lt;br /&gt;
* If this doesn't work, you should be able to log in using your password, and investigate the problem. For example, if during a login session you get a message similar to the one below, just follow the instructions and delete the offending key on line 3 (you can use vi to jump to that line with ESC plus : plus 3). That only means that you may have logged in from your home computer to SciNet in the past, and that key is now obsolete.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ ssh USERNAME@login.scinet.utoronto.ca&lt;br /&gt;
&lt;br /&gt;
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@&lt;br /&gt;
@    WARNING: REMOTE HOST IDENTIFICATION HAS CHANGED!     @&lt;br /&gt;
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@&lt;br /&gt;
IT IS POSSIBLE THAT SOMEONE IS DOING SOMETHING NASTY!&lt;br /&gt;
Someone could be eavesdropping on you right now (man-in-the-middle&lt;br /&gt;
attack)!&lt;br /&gt;
It is also possible that the RSA host key has just been changed.&lt;br /&gt;
The fingerprint for the RSA key sent by the remote host is&lt;br /&gt;
53:f9:60:71:a8:0b:5d:74:83:52:fe:ea:1a:9e:cc:d3.&lt;br /&gt;
Please contact your system administrator.&lt;br /&gt;
Add correct host key in /home/&amp;lt;user&amp;gt;/.ssh/known_hosts to get rid of&lt;br /&gt;
this message.&lt;br /&gt;
Offending key in /home/&amp;lt;user&amp;gt;/.ssh/known_hosts:3&lt;br /&gt;
RSA host key for login.scinet.utoronto.ca has&lt;br /&gt;
changed and you have requested strict checking.&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* If you get the message below, you may need to log out of your gnome session and log back in, since the ssh-agent needs to be&lt;br /&gt;
restarted with the new passphrase-protected ssh key.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ ssh USERNAME@login.scinet.utoronto.ca&lt;br /&gt;
&lt;br /&gt;
Agent admitted failure to sign using the key.&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Can't get graphics: &amp;quot;Can't open display/DISPLAY is not set&amp;quot;===&lt;br /&gt;
&lt;br /&gt;
To use graphics on SciNet machines and have it displayed on your machine, you need to have an X server running on your computer (an X server is the standard way graphics is done on Linux). Once an X server is running, you can log in with the &amp;quot;-Y&amp;quot; option to ssh (&amp;quot;-X&amp;quot; sometimes also works).&lt;br /&gt;
&lt;br /&gt;
How to get an X server running on your computer depends on the operating system.  On Linux machines with a graphical interface, X will already be running.  On Windows, the easiest solution is using MobaXterm, which comes with an X server (alternatives, such as Cygwin with the X11 server installed, or running PuTTY+Xming, can also work, but are a bit more work to set up).  For Macs, you will need to install XQuartz. &lt;br /&gt;
  &lt;br /&gt;
===Remote graphics stops working after a while: &amp;quot;Can't open display&amp;quot;===&lt;br /&gt;
&lt;br /&gt;
If you still cannot get graphics, or it works only for a while and then suddenly it &amp;quot;can't open display localhost:....&amp;quot;, your X11 graphics connection may have timed out (Macs seem to be particularly prone to this).  You'll have to tell your own computer to keep the X11 graphics connection alive rather than letting it time out.&lt;br /&gt;
&lt;br /&gt;
The following should fix it. The ssh configuration settings are in a file called /etc/ssh/ssh_config (or /etc/ssh_config in older OS X versions, or $HOME/.ssh/config for specific users). In the config file, find (or create) the section &amp;quot;Host *&amp;quot; (meaning all hosts) and add the following lines:&lt;br /&gt;
&lt;br /&gt;
  Host *&lt;br /&gt;
   ServerAliveInterval 60&lt;br /&gt;
   ServerAliveCountMax 3&lt;br /&gt;
   ForwardX11 yes&lt;br /&gt;
   ForwardX11Trusted yes&lt;br /&gt;
   ForwardX11Timeout 596h&lt;br /&gt;
&lt;br /&gt;
(The &amp;lt;tt&amp;gt;Host *&amp;lt;/tt&amp;gt; is only needed if there was no Host section yet to append these settings to.)&lt;br /&gt;
&lt;br /&gt;
If this does not resolve it, try it again with &amp;quot;ssh -vvv -Y ....&amp;quot;.  The &amp;quot;-vvv&amp;quot; spews out a lot of diagnostic messages. Look for anything resembling a timeout, and let us know (support AT scinet DOT utoronto DOT ca).&lt;br /&gt;
&lt;br /&gt;
===Can't forward X:  &amp;quot;Warning: No xauth data; using fake authentication data&amp;quot;, or &amp;quot;X11 connection rejected because of wrong authentication.&amp;quot;===&lt;br /&gt;
&lt;br /&gt;
I used to be able to forward X11 windows from SciNet to my home machine, but now I'm getting these messages; what's wrong?&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
This very likely means that ssh/xauth can't update your ${HOME}/.Xauthority file. &lt;br /&gt;
&lt;br /&gt;
The simplest possible reason for this is that you've filled your 10GB /home quota and so can't write anything to your home directory.   Use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load extras&lt;br /&gt;
$ diskUsage&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&lt;br /&gt;
to check how close you are to your disk quota on ${HOME}.&lt;br /&gt;
&lt;br /&gt;
Alternately, this could mean your .Xauthority file has become broken/corrupted/confused somehow, in which case you can delete that file; when you next log in you'll get a similar warning message about creating .Xauthority, but things should work.&lt;br /&gt;
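&lt;br /&gt;
That is, from a login node:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ rm ${HOME}/.Xauthority&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;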
&lt;br /&gt;
===I have a CCDB account, but I can't login to SciNet. How can I get a SciNet account?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
You must extend your CCDB application process to also get a SciNet account:&lt;br /&gt;
&lt;br /&gt;
https://wiki.scinet.utoronto.ca/wiki/index.php/Application_Process&lt;br /&gt;
&lt;br /&gt;
https://www.scinethpc.ca/getting-a-scinet-account/&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Why can't I log in to the TCS?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
A SciNet account doesn't automatically entitle you to TCS access. At a minimum, TCS jobs need to run on at least 32 cores (64 preferred because of Simultaneous Multi Threading - [[TCS_Quickstart#Node_configuration|SMT]] - on these nodes) and need the large memory (4GB/core) and bandwidth on the system. Essentially you need to be able to explain why the work can't be done on the GPC.&lt;br /&gt;
&lt;br /&gt;
===How can I reset the password for my Compute Canada account?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
You can reset your password for your Compute Canada account here:&lt;br /&gt;
&lt;br /&gt;
https://ccdb.computecanada.ca/security/forgot&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===How can I change or reset the password for my SciNet account?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
To reset your password at SciNet please go to [https://portal.scinet.utoronto.ca/password_resets Password reset page].&lt;br /&gt;
&lt;br /&gt;
If you know your old password and want to change it, that can be done here after logging in on the portal:&lt;br /&gt;
&lt;br /&gt;
https://portal.scinet.utoronto.ca&lt;br /&gt;
&lt;br /&gt;
===Why am I getting the error &amp;quot;Permission denied (publickey,gssapi-with-mic,password)&amp;quot;?===&lt;br /&gt;
&lt;br /&gt;
This error can pop up in a variety of situations: when trying to log in, or when after a job has finished, when the error and output files fail to be copied (there are other possible reasons for this failure as well -- see [[FAQ#My_GPC_job_died.2C_telling_me_.60Copy_Stageout_Files_Failed.27|My GPC job died, telling me:Copy Stageout Files Failed]]).&lt;br /&gt;
In most cases, the &amp;quot;Permission denied&amp;quot; error is caused by incorrect permissions on the (hidden) .ssh directory. Ssh is used for logging in as well as for the copying of the standard error and output files after a job. &lt;br /&gt;
&lt;br /&gt;
For security reasons, &lt;br /&gt;
the directory .ssh should be readable and writable only by you; if it &lt;br /&gt;
has read permission for everybody, authentication fails.  You can fix &lt;br /&gt;
this with&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
   chmod 700 ~/.ssh&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
And to be sure, also do&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
   chmod 600 ~/.ssh/id_rsa ~/.ssh/authorized_keys&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===&amp;quot;ERROR:102: Tcl command execution failed&amp;quot; when loading modules===&lt;br /&gt;
Modules sometimes require other modules to be loaded first.&lt;br /&gt;
The module command will let you know if you didn't.&lt;br /&gt;
For example:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module purge&lt;br /&gt;
$ module load python&lt;br /&gt;
python/2.6.2(11):ERROR:151: Module ’python/2.6.2’ depends on one of the module(s) ’gcc/4.4.0’&lt;br /&gt;
python/2.6.2(11):ERROR:102: Tcl command execution failed: prereq gcc/4.4.0&lt;br /&gt;
$ module load gcc python&lt;br /&gt;
$&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== How do I compute the core-years usage of my code? ===&lt;br /&gt;
&lt;br /&gt;
The &amp;quot;core-years&amp;quot; quantity is a way to account for the time your code runs, considering the total number of cores and hours used relative to the total number of hours in a year.&lt;br /&gt;
For instance, if your code runs for ''HH'' hours on ''NN'' nodes, where each node has ''CC'' cores, then the core-years usage can be computed as follows:&lt;br /&gt;
&lt;br /&gt;
''HH*(NN*CC)/(365*24)''&lt;br /&gt;
&lt;br /&gt;
If you have several independent instances (batches) running on different nodes, with ''BB'' batches each running for ''HH'' hours, then your core-years usage can be computed as&lt;br /&gt;
&lt;br /&gt;
''BB*HH*(NN*CC)/(365*24)''&lt;br /&gt;
&lt;br /&gt;
As a general rule, in our GPC system each node has only 8 cores, so ''CC'' will always be 8.&lt;br /&gt;
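&lt;br /&gt;
For example, 10 batches, each running for 48 hours on 1 GPC node (8 cores), amount to ''10*48*(1*8)/(365*24)'', or about 0.44 core-years.&lt;br /&gt;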
&lt;br /&gt;
==Compiling your Code==&lt;br /&gt;
&lt;br /&gt;
===How can I get g77 to work?===&lt;br /&gt;
&lt;br /&gt;
The fortran 77 compilers on the GPC are ifort and gfortran. We have dropped support for g77.  This has been a conscious decision. g77 (and the associated library libg2c) were completely replaced six years ago (Apr 2005) by the gcc 4.x branch, and haven't undergone any updates at all, even bug fixes, for over five years.  &lt;br /&gt;
If we installed g77 and libg2c, we would have to deal with the inevitable confusion caused when users accidentally link against the old, broken, wrong versions of the gcc libraries instead of the correct current versions.   &lt;br /&gt;
&lt;br /&gt;
If your code for some reason specifically requires five-plus-year-old libraries,  availability, compatibility, and unfixed-known-bug problems are only going to get worse for you over time, and this might be as good an opportunity as any to address those issues. &lt;br /&gt;
&lt;br /&gt;
''A note on porting to gfortran or ifort:''&lt;br /&gt;
&lt;br /&gt;
While gfortran and ifort are rather compatible with g77, one &lt;br /&gt;
important difference is that by default, gfortran does not preserve &lt;br /&gt;
local variables between function calls, while g77 does.   Preserved &lt;br /&gt;
local variables are for instance often used in implementations of quasi-random number &lt;br /&gt;
generators.  Proper Fortran requires such variables to be declared SAVE, &lt;br /&gt;
but not all old code does this.&lt;br /&gt;
Luckily, you can change gfortran's default behavior with the flag &lt;br /&gt;
&amp;lt;tt&amp;gt;-fno-automatic&amp;lt;/tt&amp;gt;.   For ifort, the corresponding flag is &amp;lt;tt&amp;gt;-noautomatic&amp;lt;/tt&amp;gt;.&lt;br /&gt;
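&lt;br /&gt;
For example (a sketch; substitute your own source file name):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
gfortran -fno-automatic -O2 -o mycode mycode.f&lt;br /&gt;
ifort -noautomatic -O2 -o mycode mycode.f&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;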
&lt;br /&gt;
===Where is libg2c.so?===&lt;br /&gt;
&lt;br /&gt;
libg2c.so is part of the g77 compiler, for which we dropped support. See [[#How can I get g77 to work?]] for our reasons.&lt;br /&gt;
&lt;br /&gt;
===Autoparallelization does not work!===&lt;br /&gt;
&lt;br /&gt;
I compiled my code with the &amp;lt;tt&amp;gt;-qsmp=omp,auto&amp;lt;/tt&amp;gt; option, and then I specified that it should be run with 64 threads - with &lt;br /&gt;
 export OMP_NUM_THREADS=64&lt;br /&gt;
&lt;br /&gt;
However, when I check the load using &amp;lt;tt&amp;gt;llq1 -n&amp;lt;/tt&amp;gt;, it shows a load on the node of 1.37.  Why?&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
Using the autoparallelization will only get you so far.  In fact, it usually does not do too much.  What is helpful is to run the compiler with the &amp;lt;tt&amp;gt;-qreport&amp;lt;/tt&amp;gt; option, and then read the output listing carefully to see where the compiler thought it could parallelize, where it could not, and the reasons for this.  Then you can go back to your code and carefully try to address each of the issues brought up by the compiler.&lt;br /&gt;
We ''emphasize'' that this is just a rough first guide, and that the compilers are still not magical!   For more sophisticated approaches to parallelizing your code, email us at [mailto:support@scinet.utoronto.ca &amp;lt;support@scinet.utoronto.ca&amp;gt;]  to set up an appointment with one&lt;br /&gt;
of our technical analysts.&lt;br /&gt;
&lt;br /&gt;
===How do I link against the Intel Math Kernel Library?===&lt;br /&gt;
&lt;br /&gt;
If you need to link to the Intel Math Kernel Library (MKL) with the intel compilers, just add the &amp;lt;tt&amp;gt;-mkl&amp;lt;/tt&amp;gt; flag. There are in fact three flavours: &amp;lt;tt&amp;gt;-mkl=sequential&amp;lt;/tt&amp;gt;, &amp;lt;tt&amp;gt;-mkl=parallel&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;-mkl=cluster&amp;lt;/tt&amp;gt;, for the serial version, the threaded version and the mpi version, respectively. (Note: The cluster version is available only when using the intelmpi module and mpi compilation wrappers.)&lt;br /&gt;
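&lt;br /&gt;
For example (a sketch; the source file names are hypothetical, and the cluster variant assumes the intelmpi module and its compilation wrappers):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
icpc -O2 -o myprog myprog.cpp -mkl=sequential&lt;br /&gt;
mpicc -O2 -o myprog_mpi myprog_mpi.c -mkl=cluster&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;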
&lt;br /&gt;
If you need to link in the Intel Math Kernel Library (MKL) libraries to gcc/gfortran/c++, you are well advised to use the Intel(R) Math Kernel Library Link Line Advisor: http://software.intel.com/en-us/articles/intel-mkl-link-line-advisor/ for help in devising the list of libraries to link with your code.&lt;br /&gt;
&lt;br /&gt;
'''''Note that this gives the link line for the command line. When using this in Makefiles, replace $MKLPATH by ${MKLPATH}.'''''&lt;br /&gt;
&lt;br /&gt;
'''''Note too that, unless the integer arguments you will be passing to the MKL libraries are actually 64-bit integers, rather than the normal int or INTEGER types, you want to specify 32-bit integers (lp64).'''''&lt;br /&gt;
&lt;br /&gt;
===Can the compilers on the login nodes be disabled to prevent accidentally using them?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
You can accomplish this by modifying your .bashrc to not load the compiler modules. See [[Important .bashrc guidelines]].&lt;br /&gt;
&lt;br /&gt;
===&amp;quot;relocation truncated to fit: R_X86_64_PC32&amp;quot;: Huh?===&lt;br /&gt;
&lt;br /&gt;
What does this mean, and why can't I compile this code?&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
Welcome to the joys of the x86 architecture!  You're probably having trouble building arrays larger than 2GB, individually or together.   Generally, you have to try to use the medium or large x86 `memory model'.   For the intel compilers, this is specified with the compile options&lt;br /&gt;
&lt;br /&gt;
  -mcmodel=medium -shared-intel&lt;br /&gt;
&lt;br /&gt;
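For instance, with the Intel Fortran compiler (a sketch; the file names are hypothetical):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ifort -mcmodel=medium -shared-intel -O2 -o bigarrays bigarrays.f90&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;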
===&amp;quot;feupdateenv is not implemented and will always fail&amp;quot;===&lt;br /&gt;
&lt;br /&gt;
How do I get rid of this and what does it mean?&lt;br /&gt;
 &lt;br /&gt;
'''Answer:'''&lt;br /&gt;
First note that, as ominous as it sounds, this is really just a warning, and has to do with the intel math library. You can ignore it (unless you really are trying to manually change the exception handlers for floating point exceptions such as divide by zero), or take the safe road and get rid of it by linking with the intel math functions library: &amp;lt;tt&amp;gt;-limf&amp;lt;/tt&amp;gt;. See also [[#How do I link against the Intel Math Kernel Library?]]&lt;br /&gt;
&lt;br /&gt;
===Cannot find rdmacm library when compiling on GPC===&lt;br /&gt;
&lt;br /&gt;
I get the following error building my code on GPC: &amp;quot;&amp;lt;tt&amp;gt;ld: cannot find -lrdmacm&amp;lt;/tt&amp;gt;&amp;quot;.  Where can I find this library?&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
This library is part of the MPI libraries; if your compiler is having problems picking it up, it probably means you are mistakenly trying to compile on the login nodes (scinet01..scinet04).  The login nodes aren't part of the GPC; they are for logging into the data centre only.  From there you must go to the GPC or TCS development nodes to do any real work.&lt;br /&gt;
&lt;br /&gt;
=== Why do I get this error when I try to compile: &amp;quot;icpc: error #10001: could not find directory in which /usr/bin/g++41 resides&amp;quot; ?===&lt;br /&gt;
&lt;br /&gt;
You are trying to compile on the login nodes.   As described in the wiki ( https://support.scinet.utoronto.ca/wiki/index.php/GPC_Quickstart#Login ), or in the users guide you received with your account, SciNet supports two main clusters with very different architectures.  Compilation must be done on the development nodes of the appropriate cluster (in this case, gpc01-04).   Thus, log into gpc01, gpc02, gpc03, or gpc04, and compile from there.&lt;br /&gt;
&lt;br /&gt;
==Testing your Code==&lt;br /&gt;
&lt;br /&gt;
=== Can I run something for a short time on the development nodes? ===&lt;br /&gt;
&lt;br /&gt;
I am in the process of playing around with the mpi calls in my code to get it to work. I do a lot of tests and each of them takes a couple of seconds only.  Can I do this on the development nodes?&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
Yes, as long as it's very brief (a few minutes).   People use the development nodes&lt;br /&gt;
for their work, and you don't want to bog those nodes down for everyone, and testing a real&lt;br /&gt;
code can chew up a lot more resources than compiling, etc.    The procedures differ&lt;br /&gt;
depending on which machine you're using.&lt;br /&gt;
&lt;br /&gt;
==== TCS ====&lt;br /&gt;
&lt;br /&gt;
On the TCS you can run small MPI jobs on the tcs02 node, which is meant for &lt;br /&gt;
development use.  But even for this test run on one node, you'll need a host file --&lt;br /&gt;
a list of hosts (in this case, all tcs-f11n06, which is the `real' name of tcs02)&lt;br /&gt;
that the job will run on.  Create a file called `hostfile' containing the following:&lt;br /&gt;
&lt;br /&gt;
 tcs-f11n06&lt;br /&gt;
 tcs-f11n06&lt;br /&gt;
 tcs-f11n06&lt;br /&gt;
 tcs-f11n06&lt;br /&gt;
&lt;br /&gt;
for a 4-task run.  When you invoke &amp;quot;poe&amp;quot; or &amp;quot;mpirun&amp;quot;, there are runtime&lt;br /&gt;
arguments that you specify pointing to this file.  You can also specify it&lt;br /&gt;
in an environment variable MP_HOSTFILE, so, if your file is in your /scratch directory, say &lt;br /&gt;
${SCRATCH}/hostfile, then you would do&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
 export MP_HOSTFILE=${SCRATCH}/hostfile&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
in your shell.  You will also need to create a &amp;lt;tt&amp;gt;.rhosts&amp;lt;/tt&amp;gt; file in your &lt;br /&gt;
home directory, again listing &amp;lt;tt&amp;gt;tcs-f11n06&amp;lt;/tt&amp;gt; so that &amp;lt;tt&amp;gt;poe&amp;lt;/tt&amp;gt;&lt;br /&gt;
can start jobs.   After that you can simply run your program.  You can use&lt;br /&gt;
mpiexec:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
 mpiexec -n 4 my_test_program&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
adding &amp;lt;tt&amp;gt; -hostfile /path/to/my/hostfile&amp;lt;/tt&amp;gt; if you did not set the environment&lt;br /&gt;
variable above.  Alternatively, you can run it with the poe command (do a &amp;quot;man poe&amp;quot; for details), or even by&lt;br /&gt;
just directly running it.  In this case the number of MPI processes will by default&lt;br /&gt;
be the number of entries in your hostfile.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==== GPC ====&lt;br /&gt;
&lt;br /&gt;
On the GPC one can run short test jobs on the GPC [[GPC_Quickstart#Compile.2FDevel_Nodes | development nodes ]]&amp;lt;tt&amp;gt;gpc01&amp;lt;/tt&amp;gt;..&amp;lt;tt&amp;gt;gpc04&amp;lt;/tt&amp;gt;;&lt;br /&gt;
if they are single-node jobs (which they should be) they don't need a hostfile.  Even better, though, is to request an [[ Moab#Interactive | interactive ]] job and run the tests either in regular batch queue or using a short high availability [[ Moab#debug | debug ]] queue that is reserved for this purpose.&lt;br /&gt;
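&lt;br /&gt;
For example, a one-node interactive session in the debug queue can be requested with (a sketch; adjust the walltime to your needs):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qsub -l nodes=1:ppn=8,walltime=30:00 -q debug -I&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;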
&lt;br /&gt;
=== How do I run a longer (but still shorter than an hour) test job quickly ? ===&lt;br /&gt;
&lt;br /&gt;
'''Answer'''&lt;br /&gt;
&lt;br /&gt;
On the GPC there is a high turnover short queue called [[ Moab#debug | debug ]] that is designed for&lt;br /&gt;
this purpose.  You can use it by adding &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#PBS -q debug&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
to your submission script.&lt;br /&gt;
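&lt;br /&gt;
Equivalently, you can specify the queue on the qsub command line:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qsub -q debug -l nodes=1:ppn=8,walltime=1:00:00 script_name.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;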
&lt;br /&gt;
==Submitting your jobs==&lt;br /&gt;
&lt;br /&gt;
===Error Submitting My Job: qsub: Bad UID for job execution MSG=ruserok failed ===&lt;br /&gt;
&lt;br /&gt;
I write up a submission script as in the examples, but when I attempt to submit the job, I get the above error.  What's wrong?&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
This error will occur if you try to submit a job from the login nodes.   The login nodes are the gateway to all of SciNet's systems (GPC, TCS, P7, ARC), which have different hardware and queueing systems.  To submit a job, you must log into a development node for the particular cluster you are submitting to and submit from there.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== How do I charge jobs to my RAC allocation? ===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
Please see the [[Moab#Accounting|accounting section of Moab page]].&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===How can I automatically resubmit a job?===&lt;br /&gt;
&lt;br /&gt;
Commonly you may have a job that you know will take longer to run than what is &lt;br /&gt;
permissible in the queue.  As long as your program contains [[Checkpoints|checkpoint]] or &lt;br /&gt;
restart capability, you can have one job automatically submit the next. In&lt;br /&gt;
the following example it is assumed that the program finishes before &lt;br /&gt;
the 48 hour limit and then resubmits itself by logging into one&lt;br /&gt;
of the development nodes.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# MOAB/Torque example submission script for auto resubmission&lt;br /&gt;
# SciNet GPC&lt;br /&gt;
#&lt;br /&gt;
#PBS -l nodes=1:ppn=8,walltime=48:00:00&lt;br /&gt;
#PBS -N my_job&lt;br /&gt;
&lt;br /&gt;
# DIRECTORY TO RUN - $PBS_O_WORKDIR is directory job was submitted from&lt;br /&gt;
cd $PBS_O_WORKDIR&lt;br /&gt;
&lt;br /&gt;
# YOUR CODE HERE&lt;br /&gt;
./run_my_code&lt;br /&gt;
&lt;br /&gt;
# RESUBMIT 10 TIMES HERE&lt;br /&gt;
num=$NUM&lt;br /&gt;
if [ &amp;quot;$num&amp;quot; -lt 10 ]; then&lt;br /&gt;
      num=$(($num+1))&lt;br /&gt;
      ssh gpc01 &amp;quot;cd $PBS_O_WORKDIR; qsub ./script_name.sh -v NUM=$num&amp;quot;;&lt;br /&gt;
fi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qsub script_name.sh -v NUM=0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can alternatively use [[ Moab#Job_Dependencies | Job dependencies ]] through the queuing system which will not start one job until another job has completed.&lt;br /&gt;
&lt;br /&gt;
If your job can't be made to automatically stop before the 48 hour queue window, but it does write out checkpoints, you can use the timeout command to stop the program while you still have time to resubmit; for instance&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
    timeout 2850m ./run_my_code argument1 argument2&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
will run the program for 47.5 hours (2850 minutes), and then send it a SIGTERM to make it exit while there is still time to resubmit.&lt;br /&gt;
&lt;br /&gt;
===How can I pass in arguments to my submission script?===&lt;br /&gt;
&lt;br /&gt;
If you wish to make your scripts more generic, you can use qsub's ability &lt;br /&gt;
to pass environment variables in as arguments to your script.&lt;br /&gt;
The following example shows a case where an input and an output &lt;br /&gt;
file are passed in on the qsub line. Multiple variables can be &lt;br /&gt;
passed in using the qsub &amp;quot;-v&amp;quot; option, comma delimited. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# MOAB/Torque example of passing in arguments&lt;br /&gt;
# SciNet GPC&lt;br /&gt;
# &lt;br /&gt;
#PBS -l nodes=1:ppn=8,walltime=48:00:00&lt;br /&gt;
#PBS -N my_job&lt;br /&gt;
&lt;br /&gt;
# DIRECTORY TO RUN - $PBS_O_WORKDIR is directory job was submitted from&lt;br /&gt;
cd $PBS_O_WORKDIR&lt;br /&gt;
&lt;br /&gt;
# YOUR CODE HERE&lt;br /&gt;
./run_my_code -f $INFILE -o $OUTFILE&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qsub script_name.sh -v INFILE=input.txt,OUTFILE=outfile.txt&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===I submit my GPC job, and I get an email saying it was rejected===&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
This happens because the job you've submitted breaks one of the rules of the queues and is rejected. An email&lt;br /&gt;
is sent with the JOBID, JOBNAME, and the reason it was rejected.  The following is an example where a job&lt;br /&gt;
requested more than 48 hours and was rejected.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
PBS Job Id: 3462493.gpc-sched&lt;br /&gt;
Job Name:   STDIN&lt;br /&gt;
job deleted&lt;br /&gt;
Job deleted at request of root@gpc-sched&lt;br /&gt;
MOAB_INFO:  job was rejected - job violates class configuration 'wclimit too high for class 'batch_ib' (345600 &amp;gt; 172800)'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Jobs on the TCS or GPC may only run for 48 hours at a time; this restriction greatly increases responsiveness of the queue and queue throughput for all our users.  If your computation requires longer than that, as many do, you will have to [[ Checkpoints | checkpoint ]] your job and restart it after each 48-hour queue window.   You can manually re-submit jobs, or if you can have your job cleanly exit before the 48 hour window, there are ways to [[ FAQ#How_can_I_automatically_resubmit_a_job.3F | automatically resubmit jobs ]].&lt;br /&gt;
&lt;br /&gt;
Other rejections return a more cryptic error saying &amp;quot;job violates class configuration&amp;quot; such as follows:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
PBS Job Id: 3462409.gpc-sched&lt;br /&gt;
Job Name:   STDIN&lt;br /&gt;
job deleted&lt;br /&gt;
Job deleted at request of root@gpc-sched&lt;br /&gt;
MOAB_INFO:  job was rejected - job violates class configuration 'user required by class 'batch''&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The most common problems that result in this error are:&lt;br /&gt;
&lt;br /&gt;
* '''Incorrect number of processors per node''': Jobs on the GPC are scheduled per-node not per-core and since each node has 8 processor cores (ppn=8) the smallest job allowed is one node with 8 cores (nodes=1:ppn=8).  For serial jobs users must bundle or batch them together in groups of 8. See [[ FAQ#How_do_I_run_serial_jobs_on_GPC.3F | How do I run serial jobs on GPC? ]]&lt;br /&gt;
* '''No number of nodes specified''': Jobs submitted to the main queue must request a specific number of nodes, either in the submission script (with a line like &amp;lt;tt&amp;gt;#PBS -l nodes=2:ppn=8&amp;lt;/tt&amp;gt;) or on the command line (eg, &amp;lt;tt&amp;gt;qsub -l nodes=2:ppn=8,walltime=5:00:00 script.pbs&amp;lt;/tt&amp;gt;).  Note that for the debug queue, you can get away without specifying a number of nodes and a default of one will be assigned; for both technical and policy reasons, we do not enforce such a default for the main (&amp;quot;batch&amp;quot;) queue.&lt;br /&gt;
* '''There is a 15 minute walltime minimum''' on all queues except debug; if you set your walltime to less than this, the job will be rejected.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Submitting my job fails, saying: &amp;quot;script is written in DOS/Windows text format&amp;quot; ===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
Very likely you wrote your script on a Windows machine, so to fix this you just need to convert your submission script from Windows/DOS format to Unix format.&lt;br /&gt;
Use the command below on all your script files:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
dos2unix &amp;lt;pbs-script-file&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
where &amp;lt;pbs-script-file&amp;gt; should be replaced with the name of your script file.&lt;br /&gt;
&lt;br /&gt;
==Running your jobs==&lt;br /&gt;
&lt;br /&gt;
===My job can't write to /home===&lt;br /&gt;
&lt;br /&gt;
My code works fine when I test on the development nodes, but when I submit a job, or even run interactively in the development queue on GPC, it fails.  What's wrong?&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
As [[Data_Management#Home_Disk_Space | discussed]] [https://support.scinet.utoronto.ca/wiki/images/5/54/SciNet_Tutorial.pdf elsewhere], &amp;lt;tt&amp;gt;/home&amp;lt;/tt&amp;gt; is mounted read-only on the compute nodes; you can only write to &amp;lt;tt&amp;gt;/home&amp;lt;/tt&amp;gt; from the login nodes and devel nodes.  (The [[GPC_Quickstart#128Glargemem | largemem nodes]] on GPC, in this respect, are more like devel nodes than compute nodes).   In general, to run jobs you can read from &amp;lt;tt&amp;gt;/home&amp;lt;/tt&amp;gt; but you'll have to write to &amp;lt;tt&amp;gt;/scratch&amp;lt;/tt&amp;gt; (or, if you were allocated space through the RAC process, on &amp;lt;tt&amp;gt;/project&amp;lt;/tt&amp;gt;).  More information on SciNet filesytems can be found on our [[Data_Management | Data Management]] page.&lt;br /&gt;
&lt;br /&gt;
===OpenMP on the TCS===&lt;br /&gt;
&lt;br /&gt;
How do I run an OpenMP job on the TCS?&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
Please look at the [[TCS_Quickstart#Submission_Script_for_an_OpenMP_Job | TCS Quickstart ]] page.&lt;br /&gt;
&lt;br /&gt;
===Can I use hybrid codes consisting of MPI and OpenMP on the GPC?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
Yes. Please look at the [[GPC_Quickstart#Hybrid_MPI.2FOpenMP_jobs | GPC Quickstart ]] page.&lt;br /&gt;
&lt;br /&gt;
===How do I run serial jobs on GPC?===&lt;br /&gt;
&lt;br /&gt;
'''Answer''':&lt;br /&gt;
&lt;br /&gt;
It should be said first that SciNet is a parallel computing resource, &lt;br /&gt;
and our priority will always be parallel jobs.   Having said that, if &lt;br /&gt;
you can make efficient use of the resources using serial jobs and get &lt;br /&gt;
good science done, that's good too, and we're happy to help you.&lt;br /&gt;
&lt;br /&gt;
The GPC nodes each have 8 processing cores, and making efficient use of these &lt;br /&gt;
nodes means using all eight cores.  As a result, we'd like to have the &lt;br /&gt;
users take up whole nodes (eg, run multiples of 8 jobs) at a time.  &lt;br /&gt;
&lt;br /&gt;
It depends on the nature of your job what the best strategy is. Several approaches are presented on the [[User_Serial|serial run wiki page]].&lt;br /&gt;
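&lt;br /&gt;
As a minimal sketch of one common approach (assuming 8 independent serial runs, each prepared in its own subdirectory &amp;lt;tt&amp;gt;run1&amp;lt;/tt&amp;gt;..&amp;lt;tt&amp;gt;run8&amp;lt;/tt&amp;gt;; see the [[User_Serial|serial run wiki page]] for the full story):&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# MOAB/Torque sketch: bundle 8 serial runs on one 8-core GPC node&lt;br /&gt;
#PBS -l nodes=1:ppn=8,walltime=12:00:00&lt;br /&gt;
#PBS -N serial_bundle&lt;br /&gt;
&lt;br /&gt;
cd $PBS_O_WORKDIR&lt;br /&gt;
&lt;br /&gt;
# launch one serial run per core, in the background&lt;br /&gt;
for i in 1 2 3 4 5 6 7 8; do&lt;br /&gt;
  (cd run$i; ./my_serial_code) &amp;amp;&lt;br /&gt;
done&lt;br /&gt;
&lt;br /&gt;
# wait for all 8 runs to finish before the job exits&lt;br /&gt;
wait&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;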
&lt;br /&gt;
===Why can't I request only a single cpu for my job on GPC?===&lt;br /&gt;
&lt;br /&gt;
'''Answer''':&lt;br /&gt;
&lt;br /&gt;
On the GPC, compute resources are allocated by the node - that is, in chunks of 8 processors.   If you want to run a job that requires only one processor, you need to bundle the jobs into groups of 8, so as not to waste the other 7 for 48 hours. See the [[User_Serial|serial run wiki page]].&lt;br /&gt;
&lt;br /&gt;
===How do I run serial jobs on TCS?===&lt;br /&gt;
&lt;br /&gt;
'''Answer''': You don't.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===But in the queue I found a user who is running jobs on GPC, each of which is using only one processor, so why can't I?===&lt;br /&gt;
&lt;br /&gt;
'''Answer''':&lt;br /&gt;
&lt;br /&gt;
The pradat* and atlaspt* jobs, amongst others, are jobs of the ATLAS high energy physics project. That they are reported as single cpu jobs is an artifact of the moab scheduler. They are in fact being automatically bundled into 8-job bundles but have to run individually to be compatible with their international grid-based systems.&lt;br /&gt;
&lt;br /&gt;
===How do I use the ramdisk on GPC?===&lt;br /&gt;
&lt;br /&gt;
To use the ramdisk, create and write to / read from files in /dev/shm/ just as one would on (e.g.) ${SCRATCH}. Only the amount of RAM needed to store the files will be taken up by the temporary file system; thus if you have 8 serial jobs each requiring 1 GB of RAM, and 1GB is taken up by various OS services, you would still have approximately 7GB available to use as ramdisk on a 16GB node. However, if you were to write 8 GB of data to the RAM disk, this would exceed the available memory and your job would likely crash.&lt;br /&gt;
&lt;br /&gt;
It is very important to delete your files from ram disk at the end of your job. If you do not do this, the next user to use that node will have less RAM available than they might expect, and this might kill their jobs.&lt;br /&gt;
&lt;br /&gt;
''More details on how to set up your script to use the ramdisk can be found on the [[User_Ramdisk|Ramdisk wiki page]].''&lt;br /&gt;
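&lt;br /&gt;
As a rough sketch (file and program names hypothetical), the body of a job using the ramdisk might look like:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# copy input to the ramdisk, run there, copy results back, then clean up&lt;br /&gt;
cp ${SCRATCH}/input.dat /dev/shm/&lt;br /&gt;
cd /dev/shm&lt;br /&gt;
./myprogram input.dat output.dat&lt;br /&gt;
cp output.dat ${SCRATCH}/&lt;br /&gt;
rm -f /dev/shm/input.dat /dev/shm/output.dat   # always clean the ramdisk at the end&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;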
&lt;br /&gt;
&lt;br /&gt;
=== How can I run a job longer than 48 hours? ===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
The SciNet queues have a limit of 48 hours.   This is pretty typical for systems of this size in Canada and elsewhere, and larger systems commonly have shorter limits.   The limits are there to ensure that every user gets a fair share of the system (so that no one user ties up lots of nodes for a long time), and for safety (so that if one memory board in one node fails in the middle of a very long job, you haven't lost a month's worth of work).&lt;br /&gt;
&lt;br /&gt;
Since many of us have simulations that require more time than that, most widely-used scientific applications have &amp;quot;checkpoint-restart&amp;quot; functionality, where every so often the complete state of the calculation is stored as a checkpoint file, and one can restart a simulation from one of these.   In fact, these restart files tend to be quite useful for a number of purposes.&lt;br /&gt;
&lt;br /&gt;
If your job will take longer, you will have to submit your job in multiple parts, restarting from a checkpoint each time.  In this way, one can run a simulation much longer than the queue limit.  In fact, one can even write job scripts which automatically re-submit themselves until a run is completed, using [[FAQ#How_can_I_automatically_resubmit_a_job.3F | automatic resubmission. ]]&lt;br /&gt;
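&lt;br /&gt;
The idea, in a minimal sketch (the script name, program, and completion marker are hypothetical):&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#PBS -l nodes=1:ppn=8,walltime=48:00:00&lt;br /&gt;
cd $PBS_O_WORKDIR&lt;br /&gt;
./mysimulation     # restarts from the latest checkpoint; creates sim.done when finished&lt;br /&gt;
if [ ! -e sim.done ]; then&lt;br /&gt;
    qsub $PBS_O_WORKDIR/myjob.pbs   # not finished yet: resubmit this same script&lt;br /&gt;
fi&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;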
&lt;br /&gt;
=== Why did showstart say it would take 3 hours for my job to start before, and now it says my job will start in 10 hours? ===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
Please look at the [[FAQ#How_do_priorities_work.2Fwhy_did_that_job_jump_ahead_of_mine_in_the_queue.3F | How do priorities work/why did that job jump ahead of mine in the queue? ]] page.&lt;br /&gt;
&lt;br /&gt;
===How do priorities work/why did that job jump ahead of mine in the queue?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
The [[Moab | queueing system]] used on SciNet machines is a [http://en.wikipedia.org/wiki/Priority_queue Priority Queue].  Jobs enter at the back of the queue and slowly make their way to the front as those ahead of them are run; but a job that enters with a higher priority can `cut in line'.&lt;br /&gt;
&lt;br /&gt;
The main factor which determines priority is whether or not the user (or their PI) has a [http://wiki.scinethpc.ca/wiki/index.php/Application_Process RAC allocation].  These are competitively allocated grants of computer time; there is a call for proposals towards the end of every calendar year.    Users with an allocation have high priorities in an attempt to make sure that they can use the amount of computer time the committees granted them.   Their priority decreases as they approach their allotted usage over the current window of time; by the time they have exhausted that allotted usage, their priority is the same as that of users with no allocation (unallocated, or `default' users).    Unallocated users have a fixed, low priority.&lt;br /&gt;
&lt;br /&gt;
This priority system is called `fairshare'; the scheduler attempts to make sure everyone has their fair share of the machines, where the share that's fair has been determined by the allocation committee.    The fairshare window is a rolling window of two weeks; that is, any time you have a job in the queue, the fairshare calculation of its priority is given by how much of your allocation of the machine has been used in the last 14 days.&lt;br /&gt;
&lt;br /&gt;
A particular allocation might have some fraction of GPC - say 4% of the machine (if the PI had been allocated 10 million CPU hours on GPC). The allocations have labels (called `Resource Allocation Proposal Identifiers', or RAPIs); they look something like&lt;br /&gt;
&lt;br /&gt;
  abc-123-ab&lt;br /&gt;
&lt;br /&gt;
where abc-123 is the PI's CCRI, and the suffix specifies which of the allocations granted to the PI is to be used.  These can be specified on a job-by-job basis.  On GPC, one adds the line&lt;br /&gt;
 #PBS -A RAPI&lt;br /&gt;
to your script; on TCS, one uses&lt;br /&gt;
 # @ account_no = RAPI&lt;br /&gt;
If the allocation to charge isn't specified, a default is used; each user has such a default, which can be changed at the same portal where one changes one's password:&lt;br /&gt;
&lt;br /&gt;
 https://portal.scinet.utoronto.ca/&lt;br /&gt;
&lt;br /&gt;
A job's priority is determined primarily by the fairshare priority of the allocation it is being charged to; the previous 14 days' worth of use under that allocation is calculated and compared to the allocated fraction (here, 4%) of the machine over that window (here, 14 days).   The fairshare priority is a decreasing function of the allocation left; if there is no allocation left (eg, jobs running under that allocation have already used 379,038 CPU hours in the past 14 days), the priority is the same as that of a user with no granted allocation.   (This last part has been the topic of some debate; as the machine gets more utilized, it will probably be the case that we allow RAC users who have greatly overused their quota to have their priorities drop below that of unallocated users, to give the unallocated users some chance to run on our increasingly crowded system; this would have no undue effect on our allocated users, as they still would be able to use the amount of resources they had been allocated by the committees.)   Note that all jobs charging the same allocation get the same fairshare priority.&lt;br /&gt;
&lt;br /&gt;
There are other factors that go into calculating priority, but fairshare is the most significant.   Other factors include&lt;br /&gt;
* amount of time waiting in the queue (measured in units of the requested runtime). A queued job gains priority as it waits, to avoid job starvation. &lt;br /&gt;
* User adjustment of priorities ( See below ).&lt;br /&gt;
&lt;br /&gt;
The major effect of these subdominant terms is to shuffle the order of jobs running under the same allocation.&lt;br /&gt;
&lt;br /&gt;
===How do we manage job priorities within our research group?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
Obviously, managing shared resources within a large group - whether it &lt;br /&gt;
is conference funding or CPU time - takes some doing.   &lt;br /&gt;
&lt;br /&gt;
It's important to note that the fairshare periods are intentionally kept &lt;br /&gt;
quite short - just two weeks long. So, for example, let us say that in your resource &lt;br /&gt;
allocation you have about 10% of the machine.   Then for someone to use &lt;br /&gt;
up the whole two week amount of time in 2 days, they'd have to use 70% &lt;br /&gt;
of the machine in those two days - which is unlikely to happen by &lt;br /&gt;
accident.  If that does happen,  &lt;br /&gt;
those using the same allocation as the person who used 70% of the &lt;br /&gt;
machine over the two days will suffer by having much lower priority for &lt;br /&gt;
their jobs, but only for the next 12 days - and even then, if there are &lt;br /&gt;
idle cpus they'll still be able to compute.&lt;br /&gt;
&lt;br /&gt;
There will be online tools for seeing how the allocation is being used, &lt;br /&gt;
and those people who are in charge in your group will be able to use &lt;br /&gt;
that information to manage the users, telling them to dial it down or &lt;br /&gt;
up.   We know that managing a large research group is hard, and we want &lt;br /&gt;
to make sure we provide you the information you need to do your job &lt;br /&gt;
effectively.&lt;br /&gt;
&lt;br /&gt;
One way for users within a group to manage their priorities within the group&lt;br /&gt;
is with [[Moab#Adjusting_Job_Priority | user-adjusted priorities]]; this is&lt;br /&gt;
described in more detail on the [[Moab | Scheduling System]] page.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!--&lt;br /&gt;
==Errors in running jobs==&lt;br /&gt;
--&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== I couldn't find the  .o output file in the .pbs_spool directory as I used to ===&lt;br /&gt;
&lt;br /&gt;
On Feb 24 2011, the temporary location of standard output and error files was moved from the shared file system ${SCRATCH}/.pbs_spool to the&lt;br /&gt;
node-local directory /var/spool/torque/spool (which resides in ram). The final location after a job has finished is unchanged,&lt;br /&gt;
but to check the output/error of running jobs, users will now have to ssh into the (first) node assigned to the job and look in&lt;br /&gt;
/var/spool/torque/spool.&lt;br /&gt;
&lt;br /&gt;
This alleviates access contention to the temporary directory, especially for those users that are running a lot of jobs, and  reduces the burden on the file system in general.&lt;br /&gt;
&lt;br /&gt;
Note that it is good practice to redirect output to a file rather than to count on the scheduler to do this for you.&lt;br /&gt;
&lt;br /&gt;
=== My GPC job died, telling me `Copy Stageout Files Failed' ===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
When a job runs on GPC, the script's standard output and error are redirected to &lt;br /&gt;
&amp;lt;tt&amp;gt;$PBS_JOBID.gpc-sched.OU&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;$PBS_JOBID.gpc-sched.ER&amp;lt;/tt&amp;gt; in&lt;br /&gt;
/var/spool/torque/spool on the (first) node on which your job is running.  At the end of the job, those .OU and .ER files are copied to where the batch script tells them to be copied, by default &amp;lt;tt&amp;gt;$PBS_JOBNAME.o$PBS_JOBID&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;$PBS_JOBNAME.e$PBS_JOBID&amp;lt;/tt&amp;gt;.   (You can set those filenames to be something clearer with the -e and -o options in your PBS script.)&lt;br /&gt;
&lt;br /&gt;
When you get errors like this:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
An error has occurred processing your job, see below.&lt;br /&gt;
request to copy stageout files failed on node&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
it means that the copying-back process has failed in some way.  There could be a few reasons for this. The first thing is to '''make sure that your .bashrc does not produce any output''', as the output stageout is performed by bash and further output can cause it to fail.&lt;br /&gt;
But it could also have just been a random filesystem error, or it could be that your job failed spectacularly enough to short-circuit the normal job-termination process (e.g. it ran out of memory very quickly) and those files just never got copied.&lt;br /&gt;
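&lt;br /&gt;
A quick sketch of a test: a non-interactive shell on a node you can reach should produce no output at all, e.g.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ssh &amp;lt;node&amp;gt; /bin/true   # anything printed here comes from your .bashrc&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;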
&lt;br /&gt;
Write to [mailto:support@scinet.utoronto.ca &amp;lt;support@scinet.utoronto.ca&amp;gt;] if your input/output files got lost, as we will probably be able to retrieve them for you (please supply at least the jobid, and any other information that may be relevant). &lt;br /&gt;
&lt;br /&gt;
Mind you that it is good practice to redirect output to a file rather than depending on the job scheduler to do this for you.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!--&lt;br /&gt;
&lt;br /&gt;
===Another transport will be used instead===&lt;br /&gt;
&lt;br /&gt;
I get error messages like the following when running on the GPC at the start of the run, although the job seems to proceed OK.   Is this a problem?&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
--------------------------------------------------------------------------&lt;br /&gt;
[[45588,1],0]: A high-performance Open MPI point-to-point messaging module&lt;br /&gt;
was unable to find any relevant network interfaces:&lt;br /&gt;
&lt;br /&gt;
Module: OpenFabrics (openib)&lt;br /&gt;
  Host: gpc-f101n005&lt;br /&gt;
&lt;br /&gt;
Another transport will be used instead, although this may result in&lt;br /&gt;
lower performance.&lt;br /&gt;
--------------------------------------------------------------------------&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
Everything's fine.   The two MPI libraries SciNet provides work for both the InfiniBand and the Gigabit Ethernet interconnects, and will always try to use the fastest interconnect available.   In this case, you ran on normal gigabit GPC nodes with no infiniband; but the MPI libraries have no way of knowing this, and try the infiniband first anyway.  This is just a harmless `failover' message; it tried to use the infiniband, which doesn't exist on this node, then fell back on using Gigabit ethernet (`another transport').&lt;br /&gt;
&lt;br /&gt;
With OpenMPI, this can be avoided by not looking for infiniband; eg, by using the option&lt;br /&gt;
&lt;br /&gt;
--mca btl ^openib&lt;br /&gt;
&lt;br /&gt;
--&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===IB Memory Errors, eg &amp;lt;tt&amp;gt; reg_mr Cannot allocate memory &amp;lt;/tt&amp;gt;===&lt;br /&gt;
&lt;br /&gt;
Infiniband requires more memory than ethernet; it can use RDMA (remote direct memory access) transport for which it sets aside registered memory to transfer data.&lt;br /&gt;
&lt;br /&gt;
In our current network configuration, it requires a _lot_ more memory, particularly as you go to larger process counts; unfortunately, that means you can't get around the &amp;quot;I need more memory&amp;quot; problem the usual way, by running on more nodes.   Machines with different memory or &lt;br /&gt;
network configurations may exhibit this problem at higher or lower MPI &lt;br /&gt;
task counts.&lt;br /&gt;
&lt;br /&gt;
Right now, the best workaround is to reduce the number and size of OpenIB queues, using XRC: with the OpenMPI, add the following options to your mpirun command:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
-mca btl_openib_receive_queues X,128,256,192,128:X,2048,256,128,32:X,12288,256,128,32 -mca btl_openib_max_send_size 12288&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
With Intel MPI, you should be able to do&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load intelmpi/4.0.3.008&lt;br /&gt;
mpirun -genv I_MPI_FABRICS=shm:ofa  -genv I_MPI_OFA_USE_XRC=1 -genv I_MPI_OFA_DYNAMIC_QPS=1 -genv I_MPI_DEBUG=5 -np XX ./mycode&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
to the same end.  &lt;br /&gt;
&lt;br /&gt;
For more information see [[GPC MPI Versions]].&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===My compute job fails, saying &amp;lt;tt&amp;gt;libpng12.so.0: cannot open shared object file&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;libjpeg.so.62: cannot open shared object file&amp;lt;/tt&amp;gt;===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
To maximize the amount of memory available for compute jobs, the compute nodes have a less complete system image than the development nodes.   In particular, since graphics packages like matplotlib and gnuplot are usually used interactively, the libraries they depend on are included in the devel nodes' image but not on the compute nodes.&lt;br /&gt;
&lt;br /&gt;
Many of these extra libraries are, however, available in the &amp;quot;extras&amp;quot; module.   So adding a &amp;quot;module load extras&amp;quot; to your job submission  script - or, for overkill, to your .bashrc - should enable these scripts to run on the compute nodes.&lt;br /&gt;
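&lt;br /&gt;
For example (a sketch; the program name is hypothetical):&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# in your submission script, before starting your program:&lt;br /&gt;
module load extras&lt;br /&gt;
./myplotting-program&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;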
&lt;br /&gt;
==Monitoring jobs in the queue==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Why hasn't my job started?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
Use the moab command &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
checkjob -v jobid&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
and the last couple of lines should explain why a job hasn't started.  &lt;br /&gt;
&lt;br /&gt;
Please see [[Moab| Job Scheduling System (Moab) ]] for more detailed information.&lt;br /&gt;
&lt;br /&gt;
===How do I figure out when my job will run?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
Please see [[Moab#Available_Resources| Job Scheduling System (Moab) ]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- ===My GPC job is Held, and checkjob says &amp;quot;Batch:PolicyViolation&amp;quot; ===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
When this happens, you'll see your job stuck in a BatchHold state.  &lt;br /&gt;
This happens because the job you've submitted breaks one of the rules of the queues, and is being held until you modify it or kill it and re-submit a conforming job.  The most common problems are:&lt;br /&gt;
--&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Running checkjob on my job gives me messages about JobFail and rejected===&lt;br /&gt;
&lt;br /&gt;
Running checkjob on my job gives me messages that suggest my job has failed, as below: what did I do wrong?&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
AName: test&lt;br /&gt;
State: Idle &lt;br /&gt;
Creds:  user:xxxxxx  group:xxxxxxxx  account:xxxxxxxx  class:batch_ib  qos:ibqos&lt;br /&gt;
WallTime:   00:00:00 of 8:00:00&lt;br /&gt;
BecameEligible: Wed Jul 23 10:39:27&lt;br /&gt;
SubmitTime: Wed Jul 23 10:38:22&lt;br /&gt;
  (Time Queued  Total: 00:01:47  Eligible: 00:01:05)&lt;br /&gt;
&lt;br /&gt;
Total Requested Tasks: 8&lt;br /&gt;
&lt;br /&gt;
Req[0]  TaskCount: 8  Partition: ALL  &lt;br /&gt;
Opsys: centos6computeA  Arch: ---  Features: ---&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Notification Events: JobFail&lt;br /&gt;
&lt;br /&gt;
IWD:            /scratch/x/xxxxxxxx/xxxxxxx/xxxxxxx&lt;br /&gt;
Partition List: torque,DDR&lt;br /&gt;
Flags:          RESTARTABLE&lt;br /&gt;
Attr:           checkpoint&lt;br /&gt;
StartPriority:  76&lt;br /&gt;
rejected for Opsys        - (null)&lt;br /&gt;
rejected for State        - (null)&lt;br /&gt;
rejected for Reserved     - (null)&lt;br /&gt;
NOTE:  job req cannot run in partition torque (available procs do not meet requirements : 0 of 8 procs found)&lt;br /&gt;
idle procs: 793  feasible procs:   0&lt;br /&gt;
&lt;br /&gt;
Node Rejection Summary: [Opsys: 117][State: 2895][Reserved: 19]&lt;br /&gt;
&lt;br /&gt;
NOTE:  job violates constraints for partition SANDY (partition SANDY not in job partition mask)&lt;br /&gt;
&lt;br /&gt;
NOTE:  job violates constraints for partition GRAVITY (partition GRAVITY not in job partition mask)&lt;br /&gt;
&lt;br /&gt;
rejected for State        - (null)&lt;br /&gt;
NOTE:  &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
The output from checkjob is a little cryptic in places, and if you are wondering why your job hasn't started yet, you might think that &amp;quot;rejection&amp;quot; and &amp;quot;JobFail&amp;quot; suggest that there's something wrong.  But the above message is actually normal; you can use the &amp;lt;tt&amp;gt;showstart&amp;lt;/tt&amp;gt; command on your job to get a (preliminary, subject to change) estimate as to when the job will start, and you'll find that it is in fact scheduled to start in the near future.&lt;br /&gt;
&lt;br /&gt;
In the above message:&lt;br /&gt;
&lt;br /&gt;
* `Notification Events: JobFail` just means that, if notifications are enabled, you'll get a message if the job fails;&lt;br /&gt;
* `job req cannot run in partition torque` just means that the job cannot run just yet (that's why it's queued);&lt;br /&gt;
* `job req cannot run in dynamic partition DDR now (insufficient procs available: 0 &amp;lt; 8)` says why: there aren't processors available; and&lt;br /&gt;
* `job violates constraints for partition SANDY/GRAVITY` just means that the job isn't eligible to run in those particular (small) sections of the cluster.&lt;br /&gt;
&lt;br /&gt;
That is, the above output is the normal and expected (if somewhat cryptic) explanation of why the job is waiting - nothing to worry about.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===How can I monitor my running jobs on TCS?===&lt;br /&gt;
&lt;br /&gt;
How can I monitor the load of TCS jobs?&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
You can get more information with the command &lt;br /&gt;
 /xcat/tools/tcs-scripts/LL/jobState.sh&lt;br /&gt;
which I alias as:&lt;br /&gt;
 alias llq1='/xcat/tools/tcs-scripts/LL/jobState.sh'&lt;br /&gt;
If you run &amp;quot;llq1 -n&amp;quot; you will see a listing of jobs together with a lot of information, including the load.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===How can I check the memory usage from my jobs?===&lt;br /&gt;
&lt;br /&gt;
How can I check the memory usage from my jobs?&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
On many occasions it can be really useful to take a look at how much memory your job is using while it is running.&lt;br /&gt;
There are a couple of ways to do so:&lt;br /&gt;
&lt;br /&gt;
1) use some of the [https://wiki.scinet.utoronto.ca/wiki/index.php/SciNet_Command_Line_Utilities command line utilities] we have developed: the '''jobperf''' and '''jobtop''' utilities let you check the job's performance and the head node's utilization, respectively.&lt;br /&gt;
&lt;br /&gt;
2) ''ssh'' into the nodes where your job is running and check the memory usage and system stats right there, for instance with the 'top' or 'free' commands on those nodes.&lt;br /&gt;
&lt;br /&gt;
Also, it is always a good idea, and strongly encouraged, to inspect the standard output and error logs generated by your job submissions.&lt;br /&gt;
These files are named ''JobName.{o|e}jobIdNumber'', where ''JobName'' is the name you gave to the job (via the '-N' PBS flag) and ''jobIdNumber'' is the id number of the job.&lt;br /&gt;
These files are saved in the working directory after the job has finished, but they can also be accessed in real time using the '''jobError''' and '''jobOutput''' [https://wiki.scinet.utoronto.ca/wiki/index.php/SciNet_Command_Line_Utilities command line utilities] available by loading the ''extras'' module.&lt;br /&gt;
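&lt;br /&gt;
For option 2), a sketch (the job id and node name are hypothetical):&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qstat -f &amp;lt;jobid&amp;gt; | grep exec_host   # find the nodes your job is running on&lt;br /&gt;
ssh gpc-f120n011                      # one of the nodes listed in exec_host&lt;br /&gt;
top                                   # or: free -m&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;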
&lt;br /&gt;
Other related topics to memory usage: &amp;lt;br&amp;gt;&lt;br /&gt;
[https://wiki.scinet.utoronto.ca/wiki/index.php/GPC_Quickstart#Ram_Disk Using Ram Disk]&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
[https://wiki.scinet.utoronto.ca/wiki/index.php/GPC_Quickstart#Memory_Configuration Different Memory Configuration nodes]&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
[https://wiki.scinet.utoronto.ca/wiki/index.php/FAQ#Monitoring_jobs_in_the_queue Monitoring Jobs in the Queue]&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
[https://wiki.scinet.utoronto.ca/wiki/images/a/a0/TechTalkJobMonitoring.pdf Tech Talk on Monitoring Jobs]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Can I run cron jobs on devel nodes to monitor my jobs?===&lt;br /&gt;
&lt;br /&gt;
Can I run cron jobs on devel nodes to monitor my jobs?&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
No, we do not permit cron jobs to be run by users.  To monitor the status of your jobs using a cron job running on your own machine, use the command&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ssh myusername@login.scinet.utoronto.ca &amp;quot;qstat -u myusername&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
or some variation of this command.  Of course, you will need to have SSH keys set up on the machine running the cron job, so that password entry won't be necessary.&lt;br /&gt;
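&lt;br /&gt;
For example, a (hypothetical) crontab entry on your own machine could look like:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# check the queue every hour and append the result to a log file&lt;br /&gt;
0 * * * * ssh myusername@login.scinet.utoronto.ca &amp;quot;qstat -u myusername&amp;quot; &amp;gt;&amp;gt; $HOME/scinet-queue.log 2&amp;gt;&amp;amp;1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;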
&lt;br /&gt;
&lt;br /&gt;
=== How does one check the amount of used CPU-hours in a project, and how does one get statistics for each user in the project? ===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
This information is available on the SciNet portal, https://portal.scinet.utoronto.ca. See also [[SciNet Usage Reports]].&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Data on SciNet disks==&lt;br /&gt;
&lt;br /&gt;
===How do I find out my disk usage?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
The standard unix/linux utilities for finding the amount of disk space used by a directory are very slow, and notoriously inefficient on the GPFS filesystems that we run on the SciNet systems.  There are utilities that very quickly report your disk usage:&lt;br /&gt;
&lt;br /&gt;
The &amp;lt;tt&amp;gt;'''diskUsage'''&amp;lt;/tt&amp;gt; command, available with the 'extras' module on the login nodes, datamovers and the GPC devel nodes, provides information in a number of ways on the home, scratch, and project file systems. For instance, it can report how much disk space is being used by yourself and your group (with the -a option), or how much your usage has changed over a certain period (&amp;quot;delta information&amp;quot;), or generate plots of your usage over time.&lt;br /&gt;
Note that this information is only updated hourly!&lt;br /&gt;
&lt;br /&gt;
More information about these filesystems is available on the [[Data_Management | Data Management]] page.&lt;br /&gt;
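&lt;br /&gt;
For example, a minimal sketch of its use:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load extras&lt;br /&gt;
diskUsage      # your usage on the home, scratch and project file systems&lt;br /&gt;
diskUsage -a   # also include the usage of your group&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;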
&lt;br /&gt;
===How do I transfer data to/from SciNet?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
All incoming connections to SciNet go through relatively low-speed connections to the &amp;lt;tt&amp;gt;login.scinet&amp;lt;/tt&amp;gt; gateways, so using scp to copy files the same way you ssh in is not an effective way to move lots of data.  Better tools are described in our page on [[Data_Management#Data_Transfer | Data Transfer]].&lt;br /&gt;
&lt;br /&gt;
===My group works with data files of size 1-2 GB. Is this too large to transfer by scp to login.scinet.utoronto.ca?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
Generally, occasional transfers of data of less than 10GB are perfectly acceptable to do through the login nodes. See [[Data_Management#Data_Transfer | Data Transfer]].&lt;br /&gt;
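&lt;br /&gt;
For such occasional transfers, a sketch (user name, file and target directory hypothetical):&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
scp mydata.tar.gz myusername@login.scinet.utoronto.ca:/scratch/&amp;lt;initial&amp;gt;/&amp;lt;group&amp;gt;/&amp;lt;user&amp;gt;/&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;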
&lt;br /&gt;
===How can I check if I have files in /scratch that are scheduled for automatic deletion?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
Please see [[Storage_Quickstart#Scratch_Disk_Purging_Policy | Storage At SciNet]]&lt;br /&gt;
&lt;br /&gt;
===How to allow my supervisor to manage files for me using ACL-based commands?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
&lt;br /&gt;
Please see [[Data_Management#File.2FOwnership_Management_.28ACL.29 | File/Ownership Management]]&lt;br /&gt;
&lt;br /&gt;
===Can we buy extra storage space on SciNet?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
Yes, please see [[Data_Management#Buying_storage_space_on_GPFS_or_HPSS | Buying storage space on GPFS or HPSS ]] for more details.&lt;br /&gt;
&lt;br /&gt;
===Can I transfer files between BGQ and HPSS?===&lt;br /&gt;
&lt;br /&gt;
'''Answer:'''&lt;br /&gt;
Yes, please see [https://support.scinet.utoronto.ca/wiki/index.php/BGQ#Bridge_to_HPSS Bridge to HPSS ]  for more details.&lt;br /&gt;
&lt;br /&gt;
==Keep 'em Coming!==&lt;br /&gt;
&lt;br /&gt;
===Next question, please===&lt;br /&gt;
&lt;br /&gt;
Send your question to [mailto:support@scinet.utoronto.ca &amp;lt;support@scinet.utoronto.ca&amp;gt;];  we'll answer it asap!&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Knowledge_Base:_Tutorials_and_Manuals&amp;diff=9382</id>
		<title>Knowledge Base: Tutorials and Manuals</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Knowledge_Base:_Tutorials_and_Manuals&amp;diff=9382"/>
		<updated>2018-08-09T17:30:23Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;__NOINDEX__&lt;br /&gt;
&lt;br /&gt;
{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. The links below may still contain valuable information, but for current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
__TOC__&lt;br /&gt;
&lt;br /&gt;
=Training material=&lt;br /&gt;
&lt;br /&gt;
For upcoming classes, see our [https://support.scinet.utoronto.ca/education/ Training and Education website]!&lt;br /&gt;
==SciNet Basics==&lt;br /&gt;
* [[Media:SciNet_Tutorial.pdf|SciNet User Tutorial]]&lt;br /&gt;
* [[SciNet Command Line Utilities]]&lt;br /&gt;
* Intro to SciNet May 2014: [http://support.scinet.utoronto.ca/CourseVideo/SciNetIntroMay2014/IntroMay2014.html Video]/[[Media:IntroScinetMay2014.pdf|Slides]]&lt;br /&gt;
* SciNet Resources: [http://support.scinet.utoronto.ca/CourseVideo/PPPcourse/Monday_Morning_SciNet_Resources/Monday_Morning_SciNet_Resources.mp4 Video]/ [[Media:Monday_Morning_SciNet_Resources.pdf|Slides]] &lt;br /&gt;
* [[Essentials]]&lt;br /&gt;
* [[FAQ|Frequently asked questions]]&lt;br /&gt;
* [[Ssh]]&lt;br /&gt;
* [[GPC_Quickstart|GPC quickstart]]&lt;br /&gt;
* [[TCS_Quickstart|TCS quickstart]]&lt;br /&gt;
* [[GPU_Devel_Nodes|ARC/GPU quickstart]]&lt;br /&gt;
* [[Cell_Devel_Nodes|ARC/Cell quickstart]]&lt;br /&gt;
* [[Important .bashrc guidelines]]&lt;br /&gt;
* [[Media:LargeScaleBio.pdf‎|Workflow Optimization (w/focus on Large Scale BioInformatics)]]&lt;br /&gt;
* [[Software_and_Libraries | Software and libraries]]&lt;br /&gt;
* [[Installing your own modules]]&lt;br /&gt;
* [[Media:SNUGlocalsetup.pdf|User-space modules and packages (April 2011 SNUG TechTalk)]]&lt;br /&gt;
* [[Media:HPSS_rationale.pdf|HPSS - SciNet's storage capacity expansion]]&lt;br /&gt;
* BGQ Hardware Overview [https://support.scinet.utoronto.ca/~northrup/bgqhardware.pdf Slides ]/ [https://support.scinet.utoronto.ca/CourseVideo/BGQ/bgqhardware/bgqhardware.mp4 Video Recording ]&lt;br /&gt;
* Intro to Using the BGQ [[Media:Bgqintro.pdf|Slides ]]/[https://support.scinet.utoronto.ca/CourseVideo/BGQ/bgqintro/bgqintro.mp4 Video Recording ]&lt;br /&gt;
&lt;br /&gt;
==Linux==&lt;br /&gt;
* [http://www.ee.surrey.ac.uk/Teaching/Unix/index.html Linux Tutorial] (warning: the last part of this tutorial is specific to the csh shell, whereas SciNet uses the bash shell).&lt;br /&gt;
* Linux Command Line: A Primer (June 2012) [[Media:SS_IntroToShell.pdf|Slides,]] [[Media:SS_IntroToShell.tgz|Files]]&lt;br /&gt;
* Introduction to the Linux Shell, SciNet, Mar 2012: [[Media:IntroToShell.pdf|Slides]] and [[Media:Shell-data.tgz|Data files]]&lt;br /&gt;
&lt;br /&gt;
==Batch job management==&lt;br /&gt;
* [https://support.scinet.utoronto.ca/education/go.php/183/file_storage/index.php/download/1/files%5B%5D/3702/ Job and Queue Management Tools] (Techtalk Mar.11, 2015)&lt;br /&gt;
* [[Media:LargeScaleBio.pdf‎|Workflow Optimization (w/focus on Large Scale BioInformatics)]]&lt;br /&gt;
* [[Media:Tech-talk-gnu-parallel.pdf|GNU Parallel (Techtalk Nov 14, 2012)]]&lt;br /&gt;
* [[Media:TechTalkJobMonitoring.pdf|Job Monitoring on SciNet and Job Efficiency]]&lt;br /&gt;
&amp;lt;!-- * [[Media:Snugtrackjob.pdf|Job Monitoring on SciNet and Job Efficiency]] --&amp;gt;&lt;br /&gt;
* [[Wallclock time]]&lt;br /&gt;
* [[Checkpoints]]&lt;br /&gt;
* [[Using_Signals|Signals]]&lt;br /&gt;
* [[Moab]]&lt;br /&gt;
* [[User_Serial|Serial Jobs (including GNU Parallel)]]&lt;br /&gt;
* [[User_Ramdisk|Ramdisk]]&lt;br /&gt;
* [http://www.clusterresources.com/products/mwm/docs/index.shtml Moab workload manager]&lt;br /&gt;
* [http://www.clusterresources.com/products/mwm/docs/a.gcommandoverview.shtml Moab commands]&lt;br /&gt;
* [http://www.clusterresources.com/products/torque/docs/ Torque resource manager] &lt;br /&gt;
* [http://www.clusterresources.com/products/torque/docs/a.acommands.shtml Torque PBS commands]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/PE5.1-operationanduse.pdf Parallel environment]&lt;br /&gt;
* [http://publib.boulder.ibm.com/infocenter/clresctr/vxrx/index.jsp Cluster information center] (with error codes)&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/LL-usingandadministering.pdf LoadLeveler: using &amp;amp; administering]&lt;br /&gt;
&lt;br /&gt;
==Programming==&lt;br /&gt;
===General===&lt;br /&gt;
* [[Media:SciDev-XLCompilers.pdf|Performance Tuning with the IBM XL Compilers]]: Slides from the SciNet Developer Seminar by Kit Barton, Sep 17, 2012.&lt;br /&gt;
* [[Media:Remotescinet.pdf‎|Remote Development]], slides from TechTalk Jun 13, 2012&lt;br /&gt;
* [[Scientific Software Development Course]], part I of the SciNet's Scientific Computing Course&lt;br /&gt;
* [http://software-carpentry.org Software Carpentry Resources]&lt;br /&gt;
* Version Control: [http://support.scinet.utoronto.ca/CourseVideo/PPPcourse/Thursday_Morning_BP_Revision_Control/Thursday_Morning_BP_Revision_Control.mp4 Video]/ [[Media:Snug_techtalk_revcontrol.pdf | Slides]]&lt;br /&gt;
* [[IBM_Nov_Workshop | IBM AIX Workshop, SciNet, Nov 2008 ]] &lt;br /&gt;
* [[IBM_Compiler_Workshop | IBM Compiler Workshop, SciNet, Feb 2009]]&lt;br /&gt;
* SNUG Techtalk Dec 2011 [[Media:Snug_techtalk_compiler.pdf | Intel Compiler Optimizations]]&lt;br /&gt;
&lt;br /&gt;
===Fortran===&lt;br /&gt;
* Modern Fortran Course (1 day), SciNet, 19 Apr 2011&lt;br /&gt;
** [[Media:ModernFortran.pdf | Slides]]&lt;br /&gt;
** [[Media:ModernFortran.tgz | Source Code]]&lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/hpc/compilerpro/en-us/fortran/lin/compiler_f/index.htm Intel Fortran compiler]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/xlf-compiler.pdf IBM Fortran compiler], [http://support.scinet.utoronto.ca/Manuals/xlf-langref.pdf language], [http://support.scinet.utoronto.ca/Manuals/xlf-proguide.pdf optimization]&lt;br /&gt;
&lt;br /&gt;
===C++===&lt;br /&gt;
* [[Media:Cpp11.pdf|Slides]] and [http://support.scinet.utoronto.ca/CourseVideo/Cpp11/cpp11.html recording] of the SciNet Developer Seminar on C++11, March 20, 2013&lt;br /&gt;
* Scientific C++ Course (1 day), SciNet, 15 March 2011 &lt;br /&gt;
** [[Media:Scientific-c%2B%2B.pdf|Slides]] (updated on Apr 26, 2012)&lt;br /&gt;
** [[Media:Scinetcppexamples.tgz|Example source code]]&lt;br /&gt;
** [[Videos_of_the_One-Day_Scientific_C%2B%2B_Class | Videos of the Scientific C++ class]] &lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/hpc/compilerpro/en-us/cpp/lin/compiler_c/index.htm Intel C &amp;amp; C++ compiler]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/xlC++-compiler.pdf IBM C++ compiler], [http://support.scinet.utoronto.ca/Manuals/xlC++-langref.pdf language], [http://support.scinet.utoronto.ca/Manuals/xlC++-proguide.pdf optimization]&lt;br /&gt;
&lt;br /&gt;
===C===&lt;br /&gt;
* C refresher: [http://support.scinet.utoronto.ca/CourseVideo/PPPcourse/Monday_Morning_C_Review/Monday_Morning_C_Review.mp4 Video]/ [[Media:Monday_Morning_C_Review.pdf| Slides]]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/xlc-compiler.pdf IBM C compiler], [http://support.scinet.utoronto.ca/Manuals/xlc-langref.pdf language], [http://support.scinet.utoronto.ca/Manuals/xlc-proguide.pdf optimization]&lt;br /&gt;
&lt;br /&gt;
===Cilk Plus===&lt;br /&gt;
* [[Media:TechTalkSzalwinskiCilkPlus.pdf|CilkPlus - Getting Started]] (slides of the May 2014 TechTalk by Chris Szalwinski)&lt;br /&gt;
&lt;br /&gt;
===Hadoop===&lt;br /&gt;
* Introduction to Hadoop for HPCers, Part I - MapReduce: [[Media:Hadoop-PartI.pdf|Slides]], [[Media:HadoopPart1examples.tgz|Source Code]], [http://support.scinet.utoronto.ca/~ljdursi/SciNetHadoopVM.zip Virtual Machine]&lt;br /&gt;
&lt;br /&gt;
===Perl===&lt;br /&gt;
* [[Perl]]&lt;br /&gt;
===Python===&lt;br /&gt;
* [[Python]]&lt;br /&gt;
* [[IPython Notebook on GPC]] (January 2014 TechTalk)&lt;br /&gt;
* [[Research Computing with Python]] (Modular Course, Fall 2013)&lt;br /&gt;
* [[Intro to Tkinter|Python GUIs with Python and TkInter]] [[Media:Tkinter.pdf|slides]] [[Media:Tkinter_code.tgz|code]]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Snug/scinet-f2py/scinet-f2py.html f2py: Fortran and Python] (June 2011 TechTalk by Pierre de Buyl)&lt;br /&gt;
&lt;br /&gt;
===R===&lt;br /&gt;
* [[R Statistical Package]]&lt;br /&gt;
===Lua===&lt;br /&gt;
* [[Media:PeterColberg_Lua_scinet.pdf | Scripting HALMD with Lua and Luabind]] (May 2011 TechTalk by Peter Colberg)&lt;br /&gt;
&lt;br /&gt;
==Parallel Programming==&lt;br /&gt;
* [[2014 Ontario Summerschool on High Performance Computing Central]] (incl. OpenMP and MPI)&lt;br /&gt;
* [[High Performance Scientific Computing Course 2014]] (OpenMP and MPI, with video recordings of lectures)&lt;br /&gt;
* [[Media:SciNet_MPI3.0_seminar.pdf | MPI 3 Developer Seminar]]&lt;br /&gt;
* [[Media:Ds-openmp.pdf| OpenMP 4 Developer Seminar]]&lt;br /&gt;
* [[Ontario Summerschool on High Performance Computing Central]] (2013)&lt;br /&gt;
* [[High Performance Scientific Computing]], part 3 of SciNet's Scientific Computing Course (Winter 2012)&lt;br /&gt;
* Parallel Programming Course (5 days), SciNet, May 2011&lt;br /&gt;
** [[Parallel_Scientific_Computing_-_May_2011 | Videos, slides and code]]&lt;br /&gt;
* Parallel Computing for Computational Fluid Dynamics (CFD), SciNet, 23 March 2011&lt;br /&gt;
** [[Media:parCFD-mpi.pdf | Slides]]&lt;br /&gt;
** [[Media:parCFD.tgz | Source Code]]&lt;br /&gt;
* Intro to Practical Parallel Programming (1 day), SciNet, 22 Sept 2010: &lt;br /&gt;
**[[Media:PPP-Intro-Morning.pdf|Morning Slides, Intro and OpenMP ]]&lt;br /&gt;
**[[Media:PPP-Intro-Afternoon.pdf|Afternoon Slides, MPI]]&lt;br /&gt;
**[[Media:Intro-ppp.tgz|Example source code]]&lt;br /&gt;
* Parallel Scientific Computing Workshop (5 days), SciNet, Aug 2009: &lt;br /&gt;
**[[ Parallel_Scientific_Computing_-_Aug_09 | Slides ]]&lt;br /&gt;
**[http://www.cita.utoronto.ca/~ljdursi/PSP/ Video]&lt;br /&gt;
* [http://www.vscse.org/  Virtual School for CSE] Web courses (Jul/Aug 2010):&lt;br /&gt;
** Petascale programming environments and tools&lt;br /&gt;
** Big data for science&lt;br /&gt;
** Proven algorithmic techniques for many-core processors&lt;br /&gt;
* [https://computing.llnl.gov/tutorials/mpi/ LLNL MPI Tutorial]: This was the basis for the MPI workshop at SciNet. &lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/hpc/mpi/linux/reference_manual.pdf Intel MPI library]&lt;br /&gt;
* [[GPC MPI Versions]]&lt;br /&gt;
* [[Co-array Fortran on the GPC]]&lt;br /&gt;
* [[IBM_Feb_Workshop | IBM MPI Workshop, SciNet, Feb 2009]]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/UPC/compiler.pdf IBM UPC compiler], [http://support.scinet.utoronto.ca/Manuals/UPC/langref.pdf language], [http://support.scinet.utoronto.ca/Manuals/UPC/upcopt.pdf optimization], [http://support.scinet.utoronto.ca/Manuals/UPC/standlib.pdf library], [http://support.scinet.utoronto.ca/Manuals/UPC/upcusersguide.pdf user's guide], [http://support.scinet.utoronto.ca/Manuals/UPC/proguide.pdf programmer's guide]&lt;br /&gt;
&lt;br /&gt;
==GPU Computing==&lt;br /&gt;
* 1-day &amp;quot;Introduction to GPGPU Programming with CUDA&amp;quot; Course (May 2015): [https://support.scinet.utoronto.ca/education/go.php/251/index.php/ib/1//p_course/251].&lt;br /&gt;
* 2-day &amp;quot;Programming GPUs with CUDA&amp;quot; Course (2015 Summer School): [https://wiki.scinet.utoronto.ca/wiki/index.php/2015_Ontario_Summer_School_for_High_Performance_Computing_Central#Programming_GPUs_with_CUDA]&lt;br /&gt;
* [[Media:SNUG_NOV_CUDA.pdf | New Features in CUDA 5 &amp;amp; 6, Nov. 2014]]&lt;br /&gt;
* [[Media:Westgrid_CUDA.pdf | Intro to GPU Computing Using CUDA]] (WestGrid Spring 2014 Seminar Series)&lt;br /&gt;
* 1.5 hour intro to CUDA, March 2013: [[Media:CUDA-Graphics-Intro-2013.pdf | Slides]] and [[Media:CUDA-Graphics-Intro-2013.tgz | Source Code]]&lt;br /&gt;
* [[CUDA_Minicourse_Fall_2012 | CITA/SciNet CUDA Minicourse, Fall 2012]]&lt;br /&gt;
* [[SciNet GPU Workshop July 2010]]&lt;br /&gt;
* Intro to GPGPU Programming: [http://support.scinet.utoronto.ca/CourseVideo/PPPcourse/Friday_Morning_GPGPU/Friday_Morning_GPGPU.mp4 Video]/ [[Media:Gpgpu.pdf | Slides]]&amp;lt;br /&amp;gt;(from 5 day parallel programming course at SciNet, May 2011)&lt;br /&gt;
* 1-day intro to GPGPU using CUDA Course (Aug 2011): [[Media:Intro-gpu.tgz | Source Code]], [[Media:IntroGPGPU-Aug2011.pdf | Slides]].&lt;br /&gt;
* [http://developer.nvidia.com/object/cuda_training.html  NVidia archived courses for GPGPU Programming]&lt;br /&gt;
* [http://www.pgroup.com/doc/pgiug.pdf PGI Compiler User's Guide]&lt;br /&gt;
* [http://www.pgroup.com/doc/pgiref.pdf PGI Compiler Reference Manual]&lt;br /&gt;
* [http://www.pgroup.com/doc/pgifortref.pdf PGI Fortran reference]&lt;br /&gt;
* [http://www.pgroup.com/doc/pgicudaforug.pdf PGI CUDA Fortran Programming Guide and Reference]&lt;br /&gt;
* [http://www.pgroup.com/doc/openACC_gs.pdf PGI OpenACC Getting Started Guide]&lt;br /&gt;
&lt;br /&gt;
==Performance Tuning==&lt;br /&gt;
* [[Performance and Profiling Course, April 2013]]&lt;br /&gt;
* [[Introduction To Performance]]&lt;br /&gt;
* Performance tools for [[Performance_And_Debugging_Tools:_GPC | GPC ]] and [[Performance_And_Debugging_Tools:_TCS | TCS ]]&lt;br /&gt;
* Dec 2010 SNUG TechTalk: [[Media:ProfillingTechTalk-Dec2010.pdf | Profiling Tools on GPC]]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/JUMP-AIX-POWER6-AppsPerformanceTuning-wp032008.pdf Performance tuning]&lt;br /&gt;
* [[Media:Mpi-tuning-parameters.pdf‎ | MPI Tuning Parameters]] - SNUG TechTalk, Feb 2012&lt;br /&gt;
* [http://cnx.org/content/col11136/latest/  High Performance Computing Book]&amp;lt;br /&amp;gt;Online version of an older O'Reilly book which covers the basics of (mostly serial) programming for performance.  Covers the most important issues today (such as cache) very clearly.&lt;br /&gt;
* [http://www.ece.cmu.edu/~franzf/papers/gttse07.pdf  How to Write Fast Numerical Code ]&amp;lt;br /&amp;gt;Good introduction to thinking about performance and cache.&lt;br /&gt;
&lt;br /&gt;
==Debugging==&lt;br /&gt;
* [[Parallel Debugging with DDT]]&lt;br /&gt;
* [[Media:SS_Debug.pdf|Debugging with GDB and DDT, half-day session at the Ontario HPC Summerschool 2012 Central&amp;lt;br&amp;gt;Slides]], [[Media:SS_Debug.tgz|Code]].&lt;br /&gt;
* [[Media:Snugdebug.pdf|TechTalk: Debuggers &amp;amp; Parallel Debugging on SciNet - gdb, ddd, padb, SciNet User Group Meeting, Nov 2010]]&amp;lt;br/ &amp;gt; [http://support.scinet.utoronto.ca/CourseVideo/PPPcourse/Thursday_Morning_Debugging/Thursday_Morning_Debugging.mp4 Video]&lt;br /&gt;
* [http://www.allinea.com/downloads/userguide.pdf Allinea DDT (Distributed Debugging Tool) User Guide]&lt;br /&gt;
&lt;br /&gt;
==Math libraries (BLAS, LAPACK, FFT)==&lt;br /&gt;
* [[Media:MKLTechTalkMarch2012.pdf|Intel Math Kernel Library (MKL): An overview]] (TechTalk, March, 2012)&lt;br /&gt;
* [[Numerical Tools for Physical Scientists]], part 2 of SciNet's Scientific Computing Course; covers random numbers, BLAS, LAPACK, FFT, ...&lt;br /&gt;
* [[Media:FP_Consistency.pdf|Intel Compiler Floating Point Consistency]]&lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/hpc/mkl/lin/index.htm Math Kernel Library (MKL)] &lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/hpc/mkl/vsl/vslnotes.pdf Math Kernel Library's Vector Statistical Library]&lt;br /&gt;
* [http://software.intel.com/en-us/articles/intel-mkl-link-line-advisor Math Kernel Library link line advisor]&amp;lt;br/&amp;gt;($MKLPATH &amp;amp;rarr; ${MKLPATH} in makefiles)&lt;br /&gt;
* [http://publib.boulder.ibm.com/epubs/pdf/am501405.pdf ESSL high performance math library V4] ([http://publib.boulder.ibm.com/epubs/pdf/am601305.pdf V3])&lt;br /&gt;
* [http://publib.boulder.ibm.com/epubs/pdf/am601305.pdf Parallel ESSL high performance math library V3.3]&lt;br /&gt;
* [http://hal.inria.fr/inria-00576469 Linear Algebra Libraries] by Claire Mouton. 2009 INRIA Technical Report on existing linear algebra libraries for C++ (also here: [http://arxiv.org/abs/1103.3020])&lt;br /&gt;
&lt;br /&gt;
==I/O==&lt;br /&gt;
&lt;br /&gt;
* [[Media:NetCDF.pdf|Introduction to NetCDF4 binary files with Python, C++ and R (TechTalk March 2014)]]&lt;br /&gt;
* [[Using MySQL on the GPC]]&lt;br /&gt;
* [https://support.scinet.utoronto.ca/education/go.php/19/content.php/cid/59/ Relational Database Basics]&lt;br /&gt;
* [[Media:SCIENCEDATA.pdf‎|Sep 2012 SNUG TechTalk: Science=Data]]&lt;br /&gt;
* [[Data_Management|Data management]]&lt;br /&gt;
* Intro to Parallel I/O, SciNet, Oct 6th, 2010: &lt;br /&gt;
**[[Media:Parallel_io_course.pdf‎|Morning &amp;amp; MPI-IO Slides]]&lt;br /&gt;
**[[Media:Netcdfhdf5.pdf|NetCDF/HDF5 Slides]]&lt;br /&gt;
**[[Media:ParIO.tgz|Source Code]].  &lt;br /&gt;
* Intro to Parallel I/O, SciNet, Feb, 2013 &lt;br /&gt;
** [http://support.scinet.utoronto.ca/CourseVideo/ParallelIOcourse/pario-intro/pario-intro.mp4 Video of the Introduction]&lt;br /&gt;
** [http://support.scinet.utoronto.ca/CourseVideo/ParallelIOcourse/pario-netcdfhdf5/pario-netcdfhdf5.mp4 Video of the parallel hdf5 and netcdf part] [[Media:Netcdfhdf5.pdf|NetCDF/HDF5 Slides]]&lt;br /&gt;
** [http://support.scinet.utoronto.ca/CourseVideo/ParallelIOcourse/pario-mpiio/pario-mpiio.mp4 Video of the MPI-IO part]&lt;br /&gt;
* Half-day HPCS2012 Parallel I/O tutorial, covering MPI-IO, HDF5, NetCDF, based on the above:  [[Media:ParIO-HPCS2012.pdf|slides (pdf)]] and [[Media:ParIO-HPCS2012.tgz|source code]].&lt;br /&gt;
* [[Media:Snugio.pdf|Sept 2010 SNUG TechTalk: Parallel File System and IO]] &amp;lt;br/ &amp;gt;[http://support.scinet.utoronto.ca/CourseVideo/PPPcourse/Friday_Morning_IO/Friday_Morning_IO.mp4 Video]&lt;br /&gt;
* [[File System and I/O dos and don'ts]]&lt;br /&gt;
* [[Media:40TB.pdf|So you have 40TB of Data]] -- an overview of things to consider with large data sets.&lt;br /&gt;
* [[Media:Adios-techtalk-may2012.pdf|May 2012 SNUG TechTalk: ADIOS for Parallel IO slides]] and [[Media:Adios-techtalk-may2012-src.tgz|source code]]&lt;br /&gt;
* [[hdf5_table|Writing / Reading a table in HDF5]]&lt;br /&gt;
* [[NetCDF_table|Writing / Reading a table in NetCDF]]&lt;br /&gt;
* Intro to Parallel I/O, SciNet, Sept 2015&lt;br /&gt;
** MPI-IO [[Media:ParIO.MPIIO.2015.pdf |Slides]] and [[Media:ParIO.MPIIO.tar.gz | Sample codes]]&lt;br /&gt;
** [[Media:netcdf2015.pdf | NETCDF]] &amp;amp; [[Media:hfd5.pdf | HDF5]]&lt;br /&gt;
&lt;br /&gt;
==Infiniband Networking==&lt;br /&gt;
* [[Media:Snug_techtalk_Infiniband.pdf | TechTalk on SciNet's Infiniband Network &amp;amp; MPI options ]] &lt;br /&gt;
&lt;br /&gt;
==Visualization==&lt;br /&gt;
* [[Using Paraview]]&lt;br /&gt;
* [[VNC|VNC on the GPC]] (see also [[Media:Ttvnc.pdf|slides of the TechTalk on VNC]])&lt;br /&gt;
* [[Software_and_Libraries#anchor_viz|Visualization Software on the GPC]]&lt;br /&gt;
* [https://wiki.scinet.utoronto.ca/wiki/images/5/51/Remoteviz.pdf Remote visualization (X-forwarding and VNC)]&lt;br /&gt;
* [https://support.scinet.utoronto.ca/~mponce/ss2016/ss2016_visualization-I.pdf gnuplot, xmgrace, remote visualization tools (X-forwarding and VNC), python's matplotlib]&lt;br /&gt;
* [https://support.scinet.utoronto.ca/~mponce/ss2016/ss2016_visualization-II.pdf  Brief overview of ParaView &amp;amp; VisIt]&lt;br /&gt;
* [https://support.scinet.utoronto.ca/education/go.php/242/file_storage/index.php/download/1/files%5B%5D/6399/ VisIt Basics]&lt;br /&gt;
* [https://support.scinet.utoronto.ca/education/go.php/273/file_storage/index.php/download/1/files%5B%5D/7363/ Scientific Visualization using VisIt (Basics and Advanced)], M.Ponce and A.Razoumov&lt;br /&gt;
* [https://support.scinet.utoronto.ca/education/get.php/8_ComplexNetworks.pdf Intro to Complex Networks Visualization, with Python]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* [http://scienceillustrated.ca Science Illustrated:] Two-day symposium on Visualizing Science, Feb 2011&lt;br /&gt;
* [http://www.kmdi.utoronto.ca/story/2011/03/si-science-illustrated-symposium-success Videos of the talks given at Science Illustrated] (recorded by [http://www.kmdi.utoronto.ca KMDI] at [http://www.utoronto.ca UoT]):&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=94ff5cd5-be6e-4fc6-9be6-dd2222342bcd Opening remarks] by Paul Young&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=4255c34e-15e7-4b24-ba99-78f5c8fa4381 Information Visualization and the Myth of Information Overload] by Christopher Collins&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=adcf02bf-16cb-46cc-8cdc-1a65e9071d6b Beyond Basic Visualization] by Ramses Van Zon&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=47baf346-3599-4fa3-9b10-6a58faa6b33c Network Visualization &amp;amp;amp; Analysis] by Igor Jurisica&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=7e754a2e-7be5-476e-bb54-37def37bc07e Simulation and Visualization of Blood Flow] by David Steinman&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=d7622587-2c31-49c1-99d7-f0f16c078801 Scientific Visualizations: Does the Science Matter?] by Thomas Lucas&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=9963c637-6840-454f-a57f-9a1be6456616 How can visualization impact public perception of science?] Panelists: Jay Ingram, Peter Calamai, Reni Barlow, Hooley McLaughlin&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=88e50cff-db9b-4c71-b10d-781fec60a2c0 How Info Graphics are Created for the Mainstream Media] by Peter Calamai&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=3897e2a3-1fda-42be-ab78-edbab090fd9e Design Boot Camp] by Graham Huber&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=4c27ed76-7292-407e-83a6-814e1461eccd Visualizing Large Datasets] by Jonathan Dursi&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=7d49c845-3937-44e2-a300-9b8ffe57a857 Visualizing Colliding Black Holes] by Harald Pfeiffer&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=7d7e4803-39cd-4c8e-a443-bc2a7b1b3c28 Closing remarks] by Mubdi Rahman&lt;br /&gt;
&lt;br /&gt;
==Applications==&lt;br /&gt;
{{:Knowledge Base: Applications}}&lt;br /&gt;
* See also [[User Codes]]&lt;br /&gt;
&lt;br /&gt;
== Development Systems ==&lt;br /&gt;
&lt;br /&gt;
=== KNL ===&lt;br /&gt;
&lt;br /&gt;
* SNUG TechTalk October 2016 -  [[Media:Snug_techtalk_KNL.pdf | Intel Xeon Phi Knights Landing  ]]&lt;br /&gt;
&lt;br /&gt;
=Manuals=&lt;br /&gt;
&lt;br /&gt;
==Intel compilers and libraries (GPC)==&lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/doclib/stdxe/2013/composerxe/compiler/cpp-lin/index.htm C &amp;amp; C++ compiler]&lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/doclib/stdxe/2013/composerxe/compiler/fortran-lin/index.htm Fortran compiler]&lt;br /&gt;
* [[Media:FP_Consistency.pdf|Intel Compiler Floating Point Consistency]]&lt;br /&gt;
* [[Media:Compiler_qrg12.pdf|Intel Compiler Optimization Guide]]&lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/hpc/mkl/lin/index.htm Math Kernel Library (MKL)] &lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/hpc/mkl/vsl/vslnotes.pdf Math Kernel Library's Vector Statistical Library]&lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/hpc/mpi/linux/reference_manual.pdf Intel MPI library]&lt;br /&gt;
* [http://software.intel.com/en-us/articles/intel-mkl-link-line-advisor Math Kernel Library link line advisor]&amp;lt;br/&amp;gt;($MKLPATH &amp;amp;rarr; ${MKLPATH} in makefiles)&lt;br /&gt;
&lt;br /&gt;
==IBM compilers and libraries (TCS/P7)==&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/xlc-compiler.pdf C compiler], [http://support.scinet.utoronto.ca/Manuals/xlc-langref.pdf language], [http://support.scinet.utoronto.ca/Manuals/xlc-proguide.pdf optimization]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/xlC++-compiler.pdf C++ compiler], [http://support.scinet.utoronto.ca/Manuals/xlC++-langref.pdf language], [http://support.scinet.utoronto.ca/Manuals/xlC++-proguide.pdf optimization]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/xlf-compiler.pdf Fortran compiler] [http://support.scinet.utoronto.ca/Manuals/xlf-langref.pdf language], [http://support.scinet.utoronto.ca/Manuals/xlf-proguide.pdf optimization]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/UPC/compiler.pdf UPC compiler], [http://support.scinet.utoronto.ca/Manuals/UPC/langref.pdf language], [http://support.scinet.utoronto.ca/Manuals/UPC/upcopt.pdf optimization], [http://support.scinet.utoronto.ca/Manuals/UPC/standlib.pdf library], [http://support.scinet.utoronto.ca/Manuals/UPC/upcusersguide.pdf user's guide], [http://support.scinet.utoronto.ca/Manuals/UPC/proguide.pdf programmer's guide]&lt;br /&gt;
* [http://publib.boulder.ibm.com/epubs/pdf/am501405.pdf ESSL high performance math library V4] ([http://publib.boulder.ibm.com/epubs/pdf/am601305.pdf V3])&lt;br /&gt;
* [[Media:essl51.pdf|ESSL high performance math library V5.1 for Linux on Power]]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/JUMP-AIX-POWER6-AppsPerformanceTuning-wp032008.pdf Performance tuning]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/PE5.1-operationanduse.pdf Parallel environment]&lt;br /&gt;
* [http://publib.boulder.ibm.com/infocenter/clresctr/vxrx/index.jsp Cluster information center] (with error codes)&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/LL-usingandadministering.pdf LoadLeveler: using &amp;amp; administering]&lt;br /&gt;
&lt;br /&gt;
==PGI compilers (ARC)==&lt;br /&gt;
* [http://www.pgroup.com/doc/pgiug.pdf Compiler User's Guide]&lt;br /&gt;
* [http://www.pgroup.com/doc/pgiref.pdf Compiler Reference Manual]&lt;br /&gt;
* [http://www.pgroup.com/doc/pgifortref.pdf Fortran reference]&lt;br /&gt;
* [http://www.pgroup.com/doc/pgicudaforug.pdf CUDA Fortran Programming Guide and Reference]&lt;br /&gt;
* [http://www.pgroup.com/doc/openACC_gs.pdf OpenACC Getting Started Guide]&amp;lt;br&amp;gt;(Note: $PGI/linux86-64/12.5/doc contains a newer version.)&lt;br /&gt;
&lt;br /&gt;
==Scheduler (Adaptive Computing/Cluster Resources)==&lt;br /&gt;
* [http://docs.adaptivecomputing.com/mwm/archive/7-0/help.htm Moab workload manager]&lt;br /&gt;
* [http://docs.adaptivecomputing.com/mwm/archive/7-0/Content/a.gcommandoverview.html Moab commands]&lt;br /&gt;
* [http://docs.adaptivecomputing.com/torque/archive/4-0-1/help.htm Torque resource manager] &lt;br /&gt;
* [http://docs.adaptivecomputing.com/torque/archive/4-0-1/help.htm#topics/12-appendices/commandsOverview.htm Torque PBS commands]&lt;br /&gt;
&lt;br /&gt;
==DDT Debugger (Allinea)==&lt;br /&gt;
* [http://www.allinea.com/downloads/userguide.pdf Distributed Debugging Tool User Guide]&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Knowledge_Base:_Tutorials_and_Manuals&amp;diff=9381</id>
		<title>Knowledge Base: Tutorials and Manuals</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Knowledge_Base:_Tutorials_and_Manuals&amp;diff=9381"/>
		<updated>2018-08-09T17:30:10Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;__NOINDEX__&lt;br /&gt;
__TOC__&lt;br /&gt;
&lt;br /&gt;
{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. The links below may still contain valuable information, but for current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Training material=&lt;br /&gt;
&lt;br /&gt;
For upcoming classes, see our [https://support.scinet.utoronto.ca/education/ Training and Education website]!&lt;br /&gt;
==SciNet Basics==&lt;br /&gt;
* [[Media:SciNet_Tutorial.pdf|SciNet User Tutorial]]&lt;br /&gt;
* [[SciNet Command Line Utilities]]&lt;br /&gt;
* Intro to SciNet May 2014: [http://support.scinet.utoronto.ca/CourseVideo/SciNetIntroMay2014/IntroMay2014.html Video]/[[Media:IntroScinetMay2014.pdf|Slides]]&lt;br /&gt;
* SciNet Resources: [http://support.scinet.utoronto.ca/CourseVideo/PPPcourse/Monday_Morning_SciNet_Resources/Monday_Morning_SciNet_Resources.mp4 Video]/ [[Media:Monday_Morning_SciNet_Resources.pdf|Slides]] &lt;br /&gt;
* [[Essentials]]&lt;br /&gt;
* [[FAQ|Frequently asked questions]]&lt;br /&gt;
* [[Ssh]]&lt;br /&gt;
* [[GPC_Quickstart|GPC quickstart]]&lt;br /&gt;
* [[TCS_Quickstart|TCS quickstart]]&lt;br /&gt;
* [[GPU_Devel_Nodes|ARC/GPU quickstart]]&lt;br /&gt;
* [[Cell_Devel_Nodes|ARC/Cell quickstart]]&lt;br /&gt;
* [[Important .bashrc guidelines]]&lt;br /&gt;
* [[Media:LargeScaleBio.pdf|Workflow Optimization (w/focus on Large Scale BioInformatics)]]&lt;br /&gt;
* [[Software_and_Libraries | Software and libraries]]&lt;br /&gt;
* [[Installing your own modules]]&lt;br /&gt;
* [[Media:SNUGlocalsetup.pdf|User-space modules and packages (April 2011 SNUG TechTalk)]]&lt;br /&gt;
* [[Media:HPSS_rationale.pdf|HPSS - SciNet's storage capacity expansion]]&lt;br /&gt;
* BGQ Hardware Overview [https://support.scinet.utoronto.ca/~northrup/bgqhardware.pdf Slides ]/ [https://support.scinet.utoronto.ca/CourseVideo/BGQ/bgqhardware/bgqhardware.mp4 Video Recording ]&lt;br /&gt;
* Intro to Using the BGQ [[Media:Bgqintro.pdf|Slides ]]/[https://support.scinet.utoronto.ca/CourseVideo/BGQ/bgqintro/bgqintro.mp4 Video Recording ]&lt;br /&gt;
&lt;br /&gt;
==Linux==&lt;br /&gt;
* [http://www.ee.surrey.ac.uk/Teaching/Unix/index.html Linux Tutorial] (warning: the last part of this tutorial is specific to the csh shell, whereas SciNet uses the bash shell).&lt;br /&gt;
* Linux Command Line: A Primer (June 2012) [[Media:SS_IntroToShell.pdf|Slides,]] [[Media:SS_IntroToShell.tgz|Files]]&lt;br /&gt;
* Introduction to the Linux Shell, SciNet, Mar 2012: [[Media:IntroToShell.pdf|Slides]] and [[Media:Shell-data.tgz|Data files]]&lt;br /&gt;
&lt;br /&gt;
==Batch job management==&lt;br /&gt;
* [https://support.scinet.utoronto.ca/education/go.php/183/file_storage/index.php/download/1/files%5B%5D/3702/ Job and Queue Management Tools] (Techtalk Mar.11, 2015)&lt;br /&gt;
* [[Media:LargeScaleBio.pdf|Workflow Optimization (w/focus on Large Scale BioInformatics)]]&lt;br /&gt;
* [[Media:Tech-talk-gnu-parallel.pdf|GNU Parallel (Techtalk Nov 14, 2012)]]&lt;br /&gt;
* [[Media:TechTalkJobMonitoring.pdf|Job Monitoring on SciNet and Job Efficiency]]&lt;br /&gt;
&amp;lt;!-- * [[Media:Snugtrackjob.pdf|Job Monitoring on SciNet and Job Efficiency]] --&amp;gt;&lt;br /&gt;
* [[Wallclock time]]&lt;br /&gt;
* [[Checkpoints]]&lt;br /&gt;
* [[Using_Signals|Signals]]&lt;br /&gt;
* [[Moab]]&lt;br /&gt;
* [[User_Serial|Serial Jobs (including GNU Parallel)]]&lt;br /&gt;
* [[User_Ramdisk|Ramdisk]]&lt;br /&gt;
* [http://www.clusterresources.com/products/mwm/docs/index.shtml Moab workload manager]&lt;br /&gt;
* [http://www.clusterresources.com/products/mwm/docs/a.gcommandoverview.shtml Moab commands]&lt;br /&gt;
* [http://www.clusterresources.com/products/torque/docs/ Torque resource manager] &lt;br /&gt;
* [http://www.clusterresources.com/products/torque/docs/a.acommands.shtml Torque PBS commands]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/PE5.1-operationanduse.pdf Parallel environment]&lt;br /&gt;
* [http://publib.boulder.ibm.com/infocenter/clresctr/vxrx/index.jsp Cluster information center] (with error codes)&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/LL-usingandadministering.pdf LoadLeveler: using &amp;amp; administering]&lt;br /&gt;
&lt;br /&gt;
==Programming==&lt;br /&gt;
===General===&lt;br /&gt;
* [[Media:SciDev-XLCompilers.pdf|Performance Tuning with the IBM XL Compilers]]: Slides from the SciNet Developer Seminar by Kit Barton, Sep 17, 2012.&lt;br /&gt;
* [[Media:Remotescinet.pdf|Remote Development]], slides from TechTalk Jun 13, 2012&lt;br /&gt;
* [[Scientific Software Development Course]], part I of the SciNet's Scientific Computing Course&lt;br /&gt;
* [http://software-carpentry.org Software Carpentry Resources]&lt;br /&gt;
* Version Control: [http://support.scinet.utoronto.ca/CourseVideo/PPPcourse/Thursday_Morning_BP_Revision_Control/Thursday_Morning_BP_Revision_Control.mp4 Video]/ [[Media:Snug_techtalk_revcontrol.pdf | Slides]]&lt;br /&gt;
* [[IBM_Nov_Workshop | IBM AIX Workshop, SciNet, Nov 2008 ]] &lt;br /&gt;
* [[IBM_Compiler_Workshop | IBM Compiler Workshop, SciNet, Feb 2009]]&lt;br /&gt;
* SNUG Techtalk Dec 2011 [[Media:Snug_techtalk_compiler.pdf | Intel Compiler Optimizations]]&lt;br /&gt;
&lt;br /&gt;
===Fortran===&lt;br /&gt;
* Modern Fortran Course (1 day), SciNet, 19 Apr 2011&lt;br /&gt;
** [[Media:ModernFortran.pdf | Slides]]&lt;br /&gt;
** [[Media:ModernFortran.tgz | Source Code]]&lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/hpc/compilerpro/en-us/fortran/lin/compiler_f/index.htm Intel Fortran compiler]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/xlf-compiler.pdf IBM Fortran compiler] [http://support.scinet.utoronto.ca/Manuals/xlf-langref.pdf language], [http://support.scinet.utoronto.ca/Manuals/xlf-proguide.pdf optimization]&lt;br /&gt;
&lt;br /&gt;
===C++===&lt;br /&gt;
* [[Media:Cpp11.pdf|Slides]] and [http://support.scinet.utoronto.ca/CourseVideo/Cpp11/cpp11.html recording] of the SciNet Developer Seminar on C++11, March 20, 2013&lt;br /&gt;
* Scientific C++ Course (1 day), SciNet, 15 March 2011 &lt;br /&gt;
** [[Media:Scientific-c%2B%2B.pdf|Slides]] (updated on Apr 26, 2012)&lt;br /&gt;
** [[Media:Scinetcppexamples.tgz|Example source code]]&lt;br /&gt;
** [[Videos_of_the_One-Day_Scientific_C%2B%2B_Class | Videos of the Scientific C++ class]] &lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/hpc/compilerpro/en-us/cpp/lin/compiler_c/index.htm Intel C &amp;amp; C++ compiler]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/xlC++-compiler.pdf IBM C++ compiler], [http://support.scinet.utoronto.ca/Manuals/xlC++-langref.pdf language], [http://support.scinet.utoronto.ca/Manuals/xlC++-proguide.pdf optimization]&lt;br /&gt;
&lt;br /&gt;
===C===&lt;br /&gt;
* C refresher: [http://support.scinet.utoronto.ca/CourseVideo/PPPcourse/Monday_Morning_C_Review/Monday_Morning_C_Review.mp4 Video]/ [[Media:Monday_Morning_C_Review.pdf| Slides]]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/xlc-compiler.pdf IBM C compiler], [http://support.scinet.utoronto.ca/Manuals/xlc-langref.pdf language], [http://support.scinet.utoronto.ca/Manuals/xlc-proguide.pdf optimization]&lt;br /&gt;
&lt;br /&gt;
===Cilk Plus===&lt;br /&gt;
* [[Media:TechTalkSzalwinskiCilkPlus.pdf|CilkPlus - Getting Started]] (slides of the May 2014 TechTalk by Chris Szalwinski)&lt;br /&gt;
&lt;br /&gt;
===Hadoop===&lt;br /&gt;
* Introduction to Hadoop for HPCers, Part I - MapReduce: [[Media:Hadoop-PartI.pdf|Slides]], [[Media:HadoopPart1examples.tgz|Source Code]], [http://support.scinet.utoronto.ca/~ljdursi/SciNetHadoopVM.zip Virtual Machine]&lt;br /&gt;
&lt;br /&gt;
===Perl===&lt;br /&gt;
* [[Perl]]&lt;br /&gt;
===Python===&lt;br /&gt;
* [[Python]]&lt;br /&gt;
* [[IPython Notebook on GPC]] (January 2014 TechTalk)&lt;br /&gt;
* [[Research Computing with Python]] (Modular Course, Fall 2013)&lt;br /&gt;
* [[Intro to Tkinter|Python GUIs with Python and TkInter]] [[Media:Tkinter.pdf|slides]] [[Media:Tkinter_code.tgz|code]]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Snug/scinet-f2py/scinet-f2py.html f2py: Fortran and Python] (June 2011 TechTalk by Pierre de Buyl)&lt;br /&gt;
&lt;br /&gt;
===R===&lt;br /&gt;
* [[R Statistical Package]]&lt;br /&gt;
===Lua===&lt;br /&gt;
* [[Media:PeterColberg_Lua_scinet.pdf | Scripting HALMD with Lua and Luabind]] (May 2011 TechTalk by Peter Colberg)&lt;br /&gt;
&lt;br /&gt;
==Parallel Programming==&lt;br /&gt;
* [[2014 Ontario Summerschool on High Performance Computing Central]] (incl. OpenMP and MPI)&lt;br /&gt;
* [[High Performance Scientific Computing Course 2014]] (OpenMP and MPI, with video recordings of lectures)&lt;br /&gt;
* [[Media:SciNet_MPI3.0_seminar.pdf | MPI 3 Developer Seminar]]&lt;br /&gt;
* [[Media:Ds-openmp.pdf| OpenMP 4 Developer Seminar]]&lt;br /&gt;
* [[Ontario Summerschool on High Performance Computing Central]] (2013)&lt;br /&gt;
* [[High Performance Scientific Computing]], part 3 of SciNet's Scientific Computing Course (Winter 2012)&lt;br /&gt;
* Parallel Programming Course (5 days), SciNet, May 2011&lt;br /&gt;
** [[Parallel_Scientific_Computing_-_May_2011 | Videos, slides and code]]&lt;br /&gt;
* Parallel Computing for Computational Fluid Dynamics (CFD), SciNet, 23 March 2011&lt;br /&gt;
** [[Media:parCFD-mpi.pdf | Slides]]&lt;br /&gt;
** [[Media:parCFD.tgz | Source Code]]&lt;br /&gt;
* Intro to Practical Parallel Programming (1 day), SciNet, 22 Sept 2010: &lt;br /&gt;
**[[Media:PPP-Intro-Morning.pdf|Morning Slides, Intro and OpenMP ]]&lt;br /&gt;
**[[Media:PPP-Intro-Afternoon.pdf|Afternoon Slides, MPI]]&lt;br /&gt;
**[[Media:Intro-ppp.tgz|Example source code]]&lt;br /&gt;
* Parallel Scientific Computing Workshop (5 days), SciNet, Aug 2009: &lt;br /&gt;
**[[ Parallel_Scientific_Computing_-_Aug_09 | Slides ]]&lt;br /&gt;
**[http://www.cita.utoronto.ca/~ljdursi/PSP/ Video]&lt;br /&gt;
* [http://www.vscse.org/  Virtual School for CSE] Web courses (Jul/Aug 2010):&lt;br /&gt;
** Petascale programming environments and tools&lt;br /&gt;
** Big data for science&lt;br /&gt;
** Proven algorithmic techniques for many-core processors&lt;br /&gt;
* [https://computing.llnl.gov/tutorials/mpi/ LLNL MPI Tutorial]: This was the basis for the MPI workshop at SciNet. &lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/hpc/mpi/linux/reference_manual.pdf Intel MPI library]&lt;br /&gt;
* [[GPC MPI Versions]]&lt;br /&gt;
* [[Co-array Fortran on the GPC]]&lt;br /&gt;
* [[IBM_Feb_Workshop | IBM MPI Workshop, SciNet, Feb 2009]]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/UPC/compiler.pdf IBM UPC compiler], [http://support.scinet.utoronto.ca/Manuals/UPC/langref.pdf language], [http://support.scinet.utoronto.ca/Manuals/UPC/upcopt.pdf optimization], [http://support.scinet.utoronto.ca/Manuals/UPC/standlib.pdf library], [http://support.scinet.utoronto.ca/Manuals/UPC/upcusersguide.pdf user's guide], [http://support.scinet.utoronto.ca/Manuals/UPC/proguide.pdf programmer's guide]&lt;br /&gt;
&lt;br /&gt;
==GPU Computing==&lt;br /&gt;
* 1-day &amp;quot;Introduction to GPGPU Programming with CUDA&amp;quot; Course (May 2015): [https://support.scinet.utoronto.ca/education/go.php/251/index.php/ib/1//p_course/251].&lt;br /&gt;
* 2-day &amp;quot;Programming GPUs with CUDA&amp;quot; Course (2015 Summer School): [https://wiki.scinet.utoronto.ca/wiki/index.php/2015_Ontario_Summer_School_for_High_Performance_Computing_Central#Programming_GPUs_with_CUDA]&lt;br /&gt;
* [[Media:SNUG_NOV_CUDA.pdf | New Features in CUDA 5 &amp;amp; 6, Nov. 2014]]&lt;br /&gt;
* [[Media:Westgrid_CUDA.pdf | Intro to GPU Computing Using CUDA]] (WestGrid Spring 2014 Seminar Series)&lt;br /&gt;
* 1.5 hour intro to CUDA, March 2013: [[Media:CUDA-Graphics-Intro-2013.pdf | Slides]] and [[Media:CUDA-Graphics-Intro-2013.tgz | Source Code]]&lt;br /&gt;
* [[CUDA_Minicourse_Fall_2012 | CITA/SciNet CUDA Minicourse, Fall 2012]]&lt;br /&gt;
* [[SciNet GPU Workshop July 2010]]&lt;br /&gt;
* Intro to GPGPU Programming: [http://support.scinet.utoronto.ca/CourseVideo/PPPcourse/Friday_Morning_GPGPU/Friday_Morning_GPGPU.mp4 Video]/ [[Media:Gpgpu.pdf | Slides]]&amp;lt;br /&amp;gt;(from 5 day parallel programming course at SciNet, May 2011)&lt;br /&gt;
* 1-day intro to GPGPU using CUDA Course (Aug 2011): [[Media:Intro-gpu.tgz | Source Code]], [[Media:IntroGPGPU-Aug2011.pdf | Slides]].&lt;br /&gt;
* [http://developer.nvidia.com/object/cuda_training.html  NVidia archived courses for GPGPU Programming]&lt;br /&gt;
* [http://www.pgroup.com/doc/pgiug.pdf PGI Compiler User's Guide]&lt;br /&gt;
* [http://www.pgroup.com/doc/pgiref.pdf PGI Compiler Reference Manual]&lt;br /&gt;
* [http://www.pgroup.com/doc/pgifortref.pdf PGI Fortran reference]&lt;br /&gt;
* [http://www.pgroup.com/doc/pgicudaforug.pdf PGI CUDA Fortran Programming Guide and Reference]&lt;br /&gt;
* [http://www.pgroup.com/doc/openACC_gs.pdf PGI OpenACC Getting Started Guide]&lt;br /&gt;
&lt;br /&gt;
==Performance Tuning==&lt;br /&gt;
* [[Performance and Profiling Course, April 2013]]&lt;br /&gt;
* [[Introduction To Performance]]&lt;br /&gt;
* Performance tools for [[Performance_And_Debugging_Tools:_GPC | GPC ]] and [[Performance_And_Debugging_Tools:_TCS | TCS ]]&lt;br /&gt;
* Dec 2010 SNUG TechTalk: [[Media:ProfillingTechTalk-Dec2010.pdf | Profiling Tools on GPC]]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/JUMP-AIX-POWER6-AppsPerformanceTuning-wp032008.pdf Performance tuning]&lt;br /&gt;
* [[Media:Mpi-tuning-parameters.pdf | MPI Tuning Parameters]] - SNUG TechTalk, Feb 2012&lt;br /&gt;
* [http://cnx.org/content/col11136/latest/  High Performance Computing Book]&amp;lt;br /&amp;gt;Online version of an older O'Reilly book which covers the basics of (mostly serial) programming for performance.  Covers the most important issues today (such as cache) very clearly.&lt;br /&gt;
* [http://www.ece.cmu.edu/~franzf/papers/gttse07.pdf  How to Write Fast Numerical Code ]&amp;lt;br /&amp;gt;Good introduction to thinking about performance and cache.&lt;br /&gt;
&lt;br /&gt;
==Debugging==&lt;br /&gt;
* [[Parallel Debugging with DDT]]&lt;br /&gt;
* [[Media:SS_Debug.pdf|Debugging with GDB and DDT, half-day session at the Ontario HPC Summerschool 2012 Central&amp;lt;br&amp;gt;Slides]], [[Media:SS_Debug.tgz|Code]].&lt;br /&gt;
* [[Media:Snugdebug.pdf|TechTalk: Debuggers &amp;amp; Parallel Debugging on SciNet - gdb, ddd, padb, SciNet User Group Meeting, Nov 2010]]&amp;lt;br/ &amp;gt; [http://support.scinet.utoronto.ca/CourseVideo/PPPcourse/Thursday_Morning_Debugging/Thursday_Morning_Debugging.mp4 Video]&lt;br /&gt;
* [http://www.allinea.com/downloads/userguide.pdf Allinea DDT (Distributed Debugging Tool) User Guide]&lt;br /&gt;
&lt;br /&gt;
==Math libraries (BLAS, LAPACK, FFT)==&lt;br /&gt;
* [[Media:MKLTechTalkMarch2012.pdf|Intel Math Kernel Library (MKL): An overview]] (TechTalk, March, 2012)&lt;br /&gt;
* [[Numerical Tools for Physical Scientists]], part 2 of SciNet's Scientific Computing Course; covers random numbers, BLAS, LAPACK, FFT, ...&lt;br /&gt;
* [[Media:FP_Consistency.pdf|Intel Compiler Floating Point Consistency]]&lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/hpc/mkl/lin/index.htm Math Kernel Library (MKL)] &lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/hpc/mkl/vsl/vslnotes.pdf Math Kernel Library's Vector Statistical Library]&lt;br /&gt;
* [http://software.intel.com/en-us/articles/intel-mkl-link-line-advisor Math Kernel Library link line advisor]&amp;lt;br/&amp;gt;($MKLPATH &amp;amp;rarr; ${MKLPATH} in makefiles)&lt;br /&gt;
* [http://publib.boulder.ibm.com/epubs/pdf/am501405.pdf ESSL high performance math library V4] ([http://publib.boulder.ibm.com/epubs/pdf/am601305.pdf V3])&lt;br /&gt;
* [http://publib.boulder.ibm.com/epubs/pdf/am601305.pdf Parallel ESSL high performance math library V3.3]&lt;br /&gt;
* [http://hal.inria.fr/inria-00576469 Linear Algebra Libraries] by Claire Mouton. 2009 INRIA Technical Report on existing linear algebra libraries for C++ (also here: [http://arxiv.org/abs/1103.3020])&lt;br /&gt;
&lt;br /&gt;
==I/O==&lt;br /&gt;
&lt;br /&gt;
* [[Media:NetCDF.pdf|Introduction to NetCDF4 binary files with Python, C++ and R (TechTalk March 2014)]]&lt;br /&gt;
* [[Using MySQL on the GPC]]&lt;br /&gt;
* [https://support.scinet.utoronto.ca/education/go.php/19/content.php/cid/59/ Relational Database Basics]&lt;br /&gt;
* [[Media:SCIENCEDATA.pdf|Sep 2012 SNUG TechTalk: Science=Data]]&lt;br /&gt;
* [[Data_Management|Data management]]&lt;br /&gt;
* Intro to Parallel I/O, SciNet, Oct 6th, 2010: &lt;br /&gt;
**[[Media:Parallel_io_course.pdf|Morning &amp;amp; MPI-IO Slides]]&lt;br /&gt;
**[[Media:Netcdfhdf5.pdf|NetCDF/HDF5 Slides]]&lt;br /&gt;
**[[Media:ParIO.tgz|Source Code]].  &lt;br /&gt;
* Intro to Parallel I/O, SciNet, Feb, 2013 &lt;br /&gt;
** [http://support.scinet.utoronto.ca/CourseVideo/ParallelIOcourse/pario-intro/pario-intro.mp4 Video of the Introduction]&lt;br /&gt;
** [http://support.scinet.utoronto.ca/CourseVideo/ParallelIOcourse/pario-netcdfhdf5/pario-netcdfhdf5.mp4 Video of the parallel hdf5 and netcdf part] [[Media:Netcdfhdf5.pdf|NetCDF/HDF5 Slides]]&lt;br /&gt;
** [http://support.scinet.utoronto.ca/CourseVideo/ParallelIOcourse/pario-mpiio/pario-mpiio.mp4 Video of the MPI-IO part]&lt;br /&gt;
* Half-day HPCS2012 Parallel I/O tutorial, covering MPI-IO, HDF5, NetCDF, based on the above:  [[Media:ParIO-HPCS2012.pdf|slides (pdf)]] and [[Media:ParIO-HPCS2012.tgz|source code]].&lt;br /&gt;
* [[Media:Snugio.pdf|Sept 2010 SNUG TechTalk: Parallel File System and IO]] &amp;lt;br/ &amp;gt;[http://support.scinet.utoronto.ca/CourseVideo/PPPcourse/Friday_Morning_IO/Friday_Morning_IO.mp4 Video]&lt;br /&gt;
* [[File System and I/O dos and don'ts]]&lt;br /&gt;
* [[Media:40TB.pdf|So you have 40TB of Data]] -- an overview of things to consider with large data sets.&lt;br /&gt;
* [[Media:Adios-techtalk-may2012.pdf|May 2012 SNUG TechTalk: ADIOS for Parallel IO slides]] and [[Media:Adios-techtalk-may2012-src.tgz|source code]]&lt;br /&gt;
* [[hdf5_table|Writing / Reading a table in HDF5]]&lt;br /&gt;
* [[NetCDF_table|Writing / Reading a table in NetCDF]]&lt;br /&gt;
* Intro to Parallel I/O, SciNet, Sept 2015&lt;br /&gt;
** MPI-IO [[Media:ParIO.MPIIO.2015.pdf |Slides]] and [[Media:ParIO.MPIIO.tar.gz | Sample codes]]&lt;br /&gt;
** [[Media:netcdf2015.pdf | NETCDF]] &amp;amp; [[Media:hfd5.pdf | HDF5]]&lt;br /&gt;
&lt;br /&gt;
==Infiniband Networking==&lt;br /&gt;
* [[Media:Snug_techtalk_Infiniband.pdf | TechTalk on SciNet's Infiniband Network &amp;amp; MPI options ]] &lt;br /&gt;
&lt;br /&gt;
==Visualization==&lt;br /&gt;
* [[Using Paraview]]&lt;br /&gt;
* [[VNC|VNC on the GPC]] (see also [[Media:Ttvnc.pdf|slides of the TechTalk on VNC]])&lt;br /&gt;
* [[Software_and_Libraries#anchor_viz|Visualization Software on the GPC]]&lt;br /&gt;
* [https://wiki.scinet.utoronto.ca/wiki/images/5/51/Remoteviz.pdf Remote visualization (X-forwarding and VNC)]&lt;br /&gt;
* [https://support.scinet.utoronto.ca/~mponce/ss2016/ss2016_visualization-I.pdf gnuplot, xmgrace, remote visualization tools (X-forwarding and VNC), python's matplotlib]&lt;br /&gt;
* [https://support.scinet.utoronto.ca/~mponce/ss2016/ss2016_visualization-II.pdf  Brief overview of ParaView &amp;amp; VisIt]&lt;br /&gt;
* [https://support.scinet.utoronto.ca/education/go.php/242/file_storage/index.php/download/1/files%5B%5D/6399/ VisIt Basics]&lt;br /&gt;
* [https://support.scinet.utoronto.ca/education/go.php/273/file_storage/index.php/download/1/files%5B%5D/7363/ Scientific Visualization using VisIt (Basics and Advanced)], M.Ponce and A.Razoumov&lt;br /&gt;
* [https://support.scinet.utoronto.ca/education/get.php/8_ComplexNetworks.pdf Intro to Complex Networks Visualization, with Python]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* [http://scienceillustrated.ca Science Illustrated:] Two-day symposium on Visualizing Science, Feb 2011&lt;br /&gt;
* [http://www.kmdi.utoronto.ca/story/2011/03/si-science-illustrated-symposium-success Videos of the talks given at Science Illustrated] (recorded by [http://www.kmdi.utoronto.ca KMDI] at [http://www.utoronto.ca UoT]):&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=94ff5cd5-be6e-4fc6-9be6-dd2222342bcd Opening remarks] by Paul Young&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=4255c34e-15e7-4b24-ba99-78f5c8fa4381 Information Visualization and the Myth of Information Overload] by Christopher Collins&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=adcf02bf-16cb-46cc-8cdc-1a65e9071d6b Beyond Basic Visualization] by Ramses Van Zon&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=47baf346-3599-4fa3-9b10-6a58faa6b33c Network Visualization &amp;amp;amp; Analysis] by Igor Jurisica&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=7e754a2e-7be5-476e-bb54-37def37bc07e Simulation and Visualization of Blood Flow] by David Steinman&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=d7622587-2c31-49c1-99d7-f0f16c078801 Scientific Visualizations: Does the Science Matter?] by Thomas Lucas&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=9963c637-6840-454f-a57f-9a1be6456616 How can visualization impact public perception of science?] Panelists: Jay Ingram, Peter Calamai, Reni Barlow, Hooley McLaughlin&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=88e50cff-db9b-4c71-b10d-781fec60a2c0 How Info Graphics are Created for the Mainstream Media] by Peter Calamai&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=3897e2a3-1fda-42be-ab78-edbab090fd9e Design Boot Camp] by Graham Huber&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=4c27ed76-7292-407e-83a6-814e1461eccd Visualizing Large Datasets] by Jonathan Dursi&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=7d49c845-3937-44e2-a300-9b8ffe57a857 Visualizing Colliding Black Holes] by Harald Pfeiffer&lt;br /&gt;
** [http://itube.ischool.utoronto.ca/Panopto/Pages/Viewer/Default.aspx?id=7d7e4803-39cd-4c8e-a443-bc2a7b1b3c28 Closing remarks] by Mubdi Rahman&lt;br /&gt;
&lt;br /&gt;
==Applications==&lt;br /&gt;
{{:Knowledge Base: Applications}}&lt;br /&gt;
* See also [[User Codes]]&lt;br /&gt;
&lt;br /&gt;
== Development Systems ==&lt;br /&gt;
&lt;br /&gt;
=== KNL ===&lt;br /&gt;
&lt;br /&gt;
* SNUG TechTalk October 2016 -  [[Media:Snug_techtalk_KNL.pdf | Intel Xeon Phi Knights Landing  ]]&lt;br /&gt;
&lt;br /&gt;
=Manuals=&lt;br /&gt;
&lt;br /&gt;
==Intel compilers and libraries (GPC)==&lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/doclib/stdxe/2013/composerxe/compiler/cpp-lin/index.htm C &amp;amp; C++ compiler]&lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/doclib/stdxe/2013/composerxe/compiler/fortran-lin/index.htm Fortran compiler]&lt;br /&gt;
* [[Media:FP_Consistency.pdf|Intel Compiler Floating Point Consistency]]&lt;br /&gt;
* [[Media:Compiler_qrg12.pdf|Intel Compiler Optimization Guide]]&lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/hpc/mkl/lin/index.htm Math Kernel Library (MKL)] &lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/hpc/mkl/vsl/vslnotes.pdf Math Kernel Library's Vector Statistical Library]&lt;br /&gt;
* [http://software.intel.com/sites/products/documentation/hpc/mpi/linux/reference_manual.pdf Intel MPI library]&lt;br /&gt;
* [http://software.intel.com/en-us/articles/intel-mkl-link-line-advisor Math Kernel Library link line advisor]&amp;lt;br/&amp;gt;($MKLPATH &amp;amp;rarr; ${MKLPATH} in makefiles)&lt;br /&gt;
&lt;br /&gt;
==IBM compilers and libraries (TCS/P7)==&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/xlc-compiler.pdf C compiler], [http://support.scinet.utoronto.ca/Manuals/xlc-langref.pdf language], [http://support.scinet.utoronto.ca/Manuals/xlc-proguide.pdf optimization]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/xlC++-compiler.pdf C++ compiler], [http://support.scinet.utoronto.ca/Manuals/xlC++-langref.pdf language], [http://support.scinet.utoronto.ca/Manuals/xlC++-proguide.pdf optimization]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/xlf-compiler.pdf Fortran compiler] [http://support.scinet.utoronto.ca/Manuals/xlf-langref.pdf language], [http://support.scinet.utoronto.ca/Manuals/xlf-proguide.pdf optimization]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/UPC/compiler.pdf UPC compiler], [http://support.scinet.utoronto.ca/Manuals/UPC/langref.pdf language], [http://support.scinet.utoronto.ca/Manuals/UPC/upcopt.pdf optimization], [http://support.scinet.utoronto.ca/Manuals/UPC/standlib.pdf library], [http://support.scinet.utoronto.ca/Manuals/UPC/upcusersguide.pdf user's guide], [http://support.scinet.utoronto.ca/Manuals/UPC/proguide.pdf programmer's guide]&lt;br /&gt;
* [http://publib.boulder.ibm.com/epubs/pdf/am501405.pdf ESSL high performance math library V4] ([http://publib.boulder.ibm.com/epubs/pdf/am601305.pdf V3])&lt;br /&gt;
* [[Media:essl51.pdf|ESSL high performance math library V5.1 for Linux on Power]]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/JUMP-AIX-POWER6-AppsPerformanceTuning-wp032008.pdf Performance tuning]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/PE5.1-operationanduse.pdf Parallel environment]&lt;br /&gt;
* [http://publib.boulder.ibm.com/infocenter/clresctr/vxrx/index.jsp Cluster information center] (with error codes)&lt;br /&gt;
* [http://support.scinet.utoronto.ca/Manuals/LL-usingandadministering.pdf LoadLeveler: using &amp;amp; administering]&lt;br /&gt;
&lt;br /&gt;
==PGI compilers (ARC)==&lt;br /&gt;
* [http://www.pgroup.com/doc/pgiug.pdf Compiler User's Guide]&lt;br /&gt;
* [http://www.pgroup.com/doc/pgiref.pdf Compiler Reference Manual]&lt;br /&gt;
* [http://www.pgroup.com/doc/pgifortref.pdf Fortran reference]&lt;br /&gt;
* [http://www.pgroup.com/doc/pgicudaforug.pdf CUDA Fortran Programming Guide and Reference]&lt;br /&gt;
* [http://www.pgroup.com/doc/openACC_gs.pdf OpenACC Getting Started Guide]&amp;lt;br&amp;gt;(Note: $PGI/linux86-64/12.5/doc contains a newer version.)&lt;br /&gt;
&lt;br /&gt;
==Scheduler (Adaptive Computing/Cluster Resources)==&lt;br /&gt;
* [http://docs.adaptivecomputing.com/mwm/archive/7-0/help.htm Moab workload manager]&lt;br /&gt;
* [http://docs.adaptivecomputing.com/mwm/archive/7-0/Content/a.gcommandoverview.html Moab commands]&lt;br /&gt;
* [http://docs.adaptivecomputing.com/torque/archive/4-0-1/help.htm Torque resource manager] &lt;br /&gt;
* [http://docs.adaptivecomputing.com/torque/archive/4-0-1/help.htm#topics/12-appendices/commandsOverview.htm Torque PBS commands]&lt;br /&gt;
&lt;br /&gt;
==DDT Debugger (Allinea)==&lt;br /&gt;
* [http://www.allinea.com/downloads/userguide.pdf Distributed Debugging Tool User Guide]&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Jupyter_Hub_on_SciNet&amp;diff=9380</id>
		<title>Jupyter Hub on SciNet</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Jupyter_Hub_on_SciNet&amp;diff=9380"/>
		<updated>2018-08-09T17:29:32Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
'''Disclaimer: the following describes an experimental setup at SciNet running on good, but out-of-warranty hardware.'''&lt;br /&gt;
&lt;br /&gt;
* Four jupyterhub servers, each with 128 GB of memory and 16 or 20 cores.&lt;br /&gt;
* Access using an ssh tunnel via login.scinet.utoronto.ca:&lt;br /&gt;
&lt;br /&gt;
 $ ssh USER@login.scinet.utoronto.ca -L8888:jupyterhub:8000 -N -f&lt;br /&gt;
&lt;br /&gt;
* This will select (round-robin) one of the four jupyterhub servers.&lt;br /&gt;
* Point your browser to 'localhost:8888' and log in with your SciNet account.&lt;br /&gt;
* The browser should now show the files in your $HOME on SciNet. (If not, try reloading the page; it may have timed out.)&lt;br /&gt;
* You can open or create Python 2, Python 3, and R notebooks.&lt;br /&gt;
* A large number of Python packages are preinstalled.&lt;br /&gt;
&lt;br /&gt;
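The tunnel command above runs ssh in the background (the -f flag). A minimal sketch for shutting the tunnel down later, assuming the port numbers match the command you used:&lt;br /&gt;
&lt;br /&gt;
 $ pkill -f 8888:jupyterhub:8000&lt;br /&gt;
&lt;br /&gt;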
[[File:jupyterscreen3.png]]&lt;br /&gt;
&lt;br /&gt;
===Tips to get started===&lt;br /&gt;
&lt;br /&gt;
* Jupyter can also browse your (SciNet) files and edit them.&lt;br /&gt;
* Use the 'new' button to create a new python notebook.&lt;br /&gt;
* Give your notebooks reasonable names.&lt;br /&gt;
* To execute a python input line, press `Shift-Enter`.&lt;br /&gt;
* Save your work periodically (even though there is autosave).&lt;br /&gt;
* To work similarly to `ipython --pylab`, do:&lt;br /&gt;
&lt;br /&gt;
 In [1]: from pylab import *&lt;br /&gt;
         %matplotlib notebook&lt;br /&gt;
&lt;br /&gt;
===Advantages and Disadvantages of a Notebook Environment===&lt;br /&gt;
&lt;br /&gt;
Drawbacks:&lt;br /&gt;
* Notebook files (.ipynb) are not scripts.&lt;br /&gt;
* Does not (always) work well with version control.&lt;br /&gt;
* Designed to run in a browser.&lt;br /&gt;
* Back-end runs on shared resources.&lt;br /&gt;
* Graphics are inline, which is great for quick exploration but makes tweaking a plot harder (IPython+X works better for this).&lt;br /&gt;
* You can jump around in the notebook, and execute different parts: hard to keep track of what you did.&lt;br /&gt;
&lt;br /&gt;
Advantages:&lt;br /&gt;
* You can jump around in the notebook, and execute different parts: Easier exploration, experimentation and debugging.&lt;br /&gt;
* Auto-save&lt;br /&gt;
* You can rerun parts of your code (while, e.g., keeping large data in memory)&lt;br /&gt;
* You can add text portions, making your notebook more like an article.&lt;br /&gt;
* This in turn can be useful for sharing, demos, teaching, ...&lt;br /&gt;
* You can still export as a script.&lt;br /&gt;
* Also has a terminal.&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Knights_Landing&amp;diff=9379</id>
		<title>Knights Landing</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Knights_Landing&amp;diff=9379"/>
		<updated>2018-08-09T17:29:19Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
{{Infobox Computer&lt;br /&gt;
|image=[[Image:KNL-DAP-Adams-Pass.jpg|center|250px|thumb]]&lt;br /&gt;
|name=Intel Xeon Phi (Knights Landing )&lt;br /&gt;
|installed=August 2016&lt;br /&gt;
|operatingsystem= Linux Centos 7.2&lt;br /&gt;
|loginnode= knl01&lt;br /&gt;
|nnodes=4&lt;br /&gt;
|rampernode=96GB DDR4 + 16GB MCDRAM&lt;br /&gt;
|corespernode=64&lt;br /&gt;
|interconnect=QDR Infiniband&lt;br /&gt;
|vendorcompilers=icc,ifort&lt;br /&gt;
|queuetype=none&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
This is a develop/test system of four x86_64 self-hosted 2nd Generation Intel Xeon Phi (Knights Landing, KNL) nodes, aka an Intel &amp;quot;[http://dap.xeonphi.com/#platformspecs Ninja]&amp;quot; platform.  Each node has one 64-core Intel(R) Xeon Phi(TM) CPU 7210 @ 1.30GHz with 4 threads per core.   These systems are not add-on accelerators, but instead act as full-fledged processors running a regular linux operating system.  They are configured with 96GB of DDR4 system RAM along with 16GB of very fast MCDRAM; see [https://software.intel.com/en-us/blogs/2016/01/20/an-intro-to-mcdram-high-bandwidth-memory-on-knights-landing here] for details.&lt;br /&gt;
The nodes are interconnected to the rest of the clusters with QDR Infiniband and share the regular SciNet GPFS filesystems.  &lt;br /&gt;
&lt;br /&gt;
== Login ==&lt;br /&gt;
&lt;br /&gt;
First log in via ssh with your SciNet account at '''&amp;lt;tt&amp;gt;login.scinet.utoronto.ca&amp;lt;/tt&amp;gt;''', and from there you can proceed to '''&amp;lt;tt&amp;gt;knl01,knl02,knl03,knl04&amp;lt;/tt&amp;gt;'''.&lt;br /&gt;
&lt;br /&gt;
== KNL Operational Modes ==&lt;br /&gt;
&lt;br /&gt;
The four nodes all have identical hardware; however, there are multiple options that control how the MCDRAM High Bandwidth Memory (HBM) is accessed. Mode changes are not dynamic and require the node to be rebooted to take effect.&lt;br /&gt;
&lt;br /&gt;
=== Clustering ===&lt;br /&gt;
&lt;br /&gt;
Currently all KNL nodes have the Cluster Mode configured to &amp;quot;Quadrant&amp;quot;. See [http://colfaxresearch.com/knl-numa/ this] article for more details about the clustering options that control how memory is accessed on the KNL. &lt;br /&gt;
&lt;br /&gt;
=== Memory ===&lt;br /&gt;
&lt;br /&gt;
Two nodes, '''&amp;lt;tt&amp;gt;knl01,knl02&amp;lt;/tt&amp;gt;''', have the MCDRAM configured in &amp;quot;Cache&amp;quot; mode, and the other two, '''&amp;lt;tt&amp;gt;knl03,knl04&amp;lt;/tt&amp;gt;''', are configured with the &amp;quot;Flat&amp;quot; memory mode.&lt;br /&gt;
See [http://colfaxresearch.com/knl-mcdram/  this ] article for more details about the MCDRAM memory modes.   &lt;br /&gt;
&lt;br /&gt;
When you first compile/port your code, use the Cache mode nodes. If you then wish to optimize memory performance by directly using the HBM memory with the [http://colfaxresearch.com/knl-mcdram/#sec-3-2 memkind] library or the [http://colfaxresearch.com/knl-mcdram/#sec-3-1 numactl] options, use the Flat nodes, e.g.:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
user@knl03$ numactl --membind 1 ./mycode&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
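&lt;br /&gt;
On the Flat nodes, the MCDRAM is exposed as a separate, CPU-less NUMA node (node 1 in the example above). To verify the node numbering before binding, you can inspect the NUMA layout:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
user@knl03$ numactl --hardware&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;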
&lt;br /&gt;
== Queue ==&lt;br /&gt;
&lt;br /&gt;
Currently there is no queue; be nice.&lt;br /&gt;
&lt;br /&gt;
== Software ==&lt;br /&gt;
&lt;br /&gt;
Software is available using the standard modules framework used on other SciNet systems; however, it is separate from the GPC modules, as the KNL runs a newer CentOS 7 based operating system. &lt;br /&gt;
&lt;br /&gt;
=== Compilers ===&lt;br /&gt;
The Xeon Phi uses the standard Intel compilers:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load intel/16.0.3&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
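&lt;br /&gt;
As a hedged sketch, a typical compile line targeting the KNL's AVX-512 instructions might look like this (the source file name is hypothetical):&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
icc -O3 -xMIC-AVX512 -qopenmp mycode.c -o mycode&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;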
&lt;br /&gt;
=== MPI ===&lt;br /&gt;
&lt;br /&gt;
Intel MPI is currently the default MPI:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load intelmpi/5.1.3.219&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
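&lt;br /&gt;
A minimal compile-and-run sketch with the Intel MPI wrappers (file names hypothetical; one MPI rank per physical core):&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
mpiicc mycode.c -o mycode&lt;br /&gt;
mpirun -np 64 ./mycode&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;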
&lt;br /&gt;
== References ==&lt;br /&gt;
&lt;br /&gt;
* SNUG TechTalk October 2016 -  [[Media:Snug_techtalk_KNL.pdf | Intel Xeon Phi Knights Landing  ]]&lt;br /&gt;
* http://dap.xeonphi.com/#implinks &lt;br /&gt;
* http://colfaxresearch.com/get-ready-for-intel-knights-landing-3-papers/&lt;br /&gt;
* https://software.intel.com/en-us/articles/getting-ready-for-KNL&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=P8&amp;diff=9378</id>
		<title>P8</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=P8&amp;diff=9378"/>
		<updated>2018-08-09T17:29:04Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
{{Infobox Computer&lt;br /&gt;
|image=[[Image:P8_s822.jpg|center|300px|thumb]]&lt;br /&gt;
|name=P8 &lt;br /&gt;
|installed=June 2016&lt;br /&gt;
|operatingsystem= Linux RHEL 7.2 le / Ubuntu 16.04 le &lt;br /&gt;
|loginnode= p8t0[1-2] / p8t0[3-4]&lt;br /&gt;
|nnodes= 2x Power8 with 2x NVIDIA K80, 2x Power8 with 4x NVIDIA P100&lt;br /&gt;
|rampernode=512 GB&lt;br /&gt;
|corespernode= 2 x 8core (16 physical, 128 SMT)&lt;br /&gt;
|interconnect=Infiniband EDR &lt;br /&gt;
|vendorcompilers=xlc/xlf, nvcc&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
== Specifications==&lt;br /&gt;
&lt;br /&gt;
The P8 Test System consists of 4 IBM Power 822LC Servers, each with 2x 8-core 3.25GHz Power8 CPUs and 512GB RAM. Similar to the Power 7, the Power 8 utilizes Simultaneous MultiThreading (SMT), but extends the design to 8 threads per core, allowing the 16 physical cores to support up to 128 threads. Two nodes have two NVIDIA Tesla K80 GPUs with CUDA Capability 3.7 (Kepler), consisting of 2x GK210 GPUs each with 12 GB of RAM connected using PCI-E, and two others have 4x NVIDIA Tesla P100 GPUs, each with 16GB of RAM, with CUDA Capability 6.0 (Pascal) connected using NVLink.&lt;br /&gt;
&lt;br /&gt;
== Compile/Devel/Test ==&lt;br /&gt;
&lt;br /&gt;
First log in via ssh with your SciNet account at '''&amp;lt;tt&amp;gt;login.scinet.utoronto.ca&amp;lt;/tt&amp;gt;''', and from there you can ssh to &amp;lt;tt&amp;gt;p8t01&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;p8t02&amp;lt;/tt&amp;gt; for the K80 GPUs and to &amp;lt;tt&amp;gt;p8t03&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;p8t04&amp;lt;/tt&amp;gt; for the Pascal GPUs.&lt;br /&gt;
&lt;br /&gt;
== Software ==&lt;br /&gt;
&lt;br /&gt;
==== GNU Compilers ====&lt;br /&gt;
&lt;br /&gt;
To load the newer Advance Toolchain version, use:&lt;br /&gt;
&lt;br /&gt;
For '''&amp;lt;tt&amp;gt;p8t0[1-2]&amp;lt;/tt&amp;gt;''' &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/5.3.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For '''&amp;lt;tt&amp;gt;p8t0[3-4]&amp;lt;/tt&amp;gt;''' &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/6.2.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== IBM Compilers ====&lt;br /&gt;
&lt;br /&gt;
To load the native IBM xlc/xlc++ compilers:&lt;br /&gt;
&lt;br /&gt;
For '''&amp;lt;tt&amp;gt;p8t0[1-2]&amp;lt;/tt&amp;gt;''' &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load xlc/13.1.4&lt;br /&gt;
module load xlf/13.1.4&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For '''&amp;lt;tt&amp;gt;p8t0[3-4]&amp;lt;/tt&amp;gt;''' &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load xlc/13.1.5_b2&lt;br /&gt;
module load xlf/13.1.5_b2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Driver Version ====&lt;br /&gt;
&lt;br /&gt;
The current NVIDIA driver version is 361.93.&lt;br /&gt;
&lt;br /&gt;
==== CUDA ====&lt;br /&gt;
&lt;br /&gt;
The currently installed CUDA Toolkit is 8.0:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load cuda/8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The CUDA driver is installed locally; however, the CUDA Toolkit is installed in:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
/usr/local/cuda-8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
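&lt;br /&gt;
When compiling, it may help to target the CUDA capability of the GPUs on the node you are using; a hedged sketch (file names hypothetical):&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
nvcc -arch=sm_37 mykernel.cu -o mykernel    # p8t0[1-2], K80&lt;br /&gt;
nvcc -arch=sm_60 mykernel.cu -o mykernel    # p8t0[3-4], P100&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;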
&lt;br /&gt;
==== OpenMPI ====&lt;br /&gt;
&lt;br /&gt;
Currently OpenMPI has been set up on the four nodes, connected over QDR Infiniband.&lt;br /&gt;
&lt;br /&gt;
For '''&amp;lt;tt&amp;gt;p8t0[1-2]&amp;lt;/tt&amp;gt;''' &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load openmpi/1.10.3-gcc-5.3.1&lt;br /&gt;
$ module load openmpi/1.10.3-XL-13_15.1.4&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For '''&amp;lt;tt&amp;gt;p8t0[3-4]&amp;lt;/tt&amp;gt;''' &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load openmpi/1.10.3-gcc-6.2.1&lt;br /&gt;
$ module load openmpi/1.10.3-XL-13_15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
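&lt;br /&gt;
The two modules in each block are alternative builds, one per compiler family; load the one matching your compiler. A minimal compile-and-run sketch with the GNU build (file name hypothetical):&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ mpicc hello.c -o hello&lt;br /&gt;
$ mpirun -np 4 ./hello&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;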
&lt;br /&gt;
==== PE ====&lt;br /&gt;
&lt;br /&gt;
IBM's Parallel Environment (PE) is available for use with the XL compilers by loading the following module:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load pe/xl.perf&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
after which MPI programs are launched with mpiexec:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
mpiexec -n 4 ./a.out&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Documentation is available [http://publib.boulder.ibm.com/epubs/pdf/c2372832.pdf here].&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=P7_Linux_Cluster&amp;diff=9377</id>
		<title>P7 Linux Cluster</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=P7_Linux_Cluster&amp;diff=9377"/>
		<updated>2018-08-09T17:28:50Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
{{Infobox Computer&lt;br /&gt;
|image=[[Image:IBM755.jpg|center|300px|thumb]]&lt;br /&gt;
|name=P7 Cluster (P7)&lt;br /&gt;
|installed=May 2011, March 2013&lt;br /&gt;
|operatingsystem= Linux (RHEL 6.3)&lt;br /&gt;
|loginnode= p701 (from &amp;lt;tt&amp;gt;login.scinet&amp;lt;/tt&amp;gt;)&lt;br /&gt;
|nnodes=8 (256 cores)&lt;br /&gt;
|rampernode=128 GB &lt;br /&gt;
|corespernode=32 (128 Threads)&lt;br /&gt;
|interconnect=Infiniband (2 DDR/node )&lt;br /&gt;
|vendorcompilers=xlc/xlf&lt;br /&gt;
|queuetype=LoadLeveler&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
== Specifications==&lt;br /&gt;
&lt;br /&gt;
The P7 Cluster consists of 8 IBM Power 755 Servers each with 4x 8core 3.3GHz Power7 CPUs and 128GB Ram. Similar to the Power 6, the Power 7 utilizes Simultaneous Multi Threading (SMT), but extends the design from 2 threads per core to 4.  This allows the 32 physical cores to support up to 128 threads which in many cases can lead to significant speedups.&lt;br /&gt;
&lt;br /&gt;
== Login ==&lt;br /&gt;
&lt;br /&gt;
First log in via ssh with your SciNet account at '''&amp;lt;tt&amp;gt;login.scinet.utoronto.ca&amp;lt;/tt&amp;gt;''', and from there you can proceed to '''&amp;lt;tt&amp;gt;p7n01-ib0&amp;lt;/tt&amp;gt;''', which &lt;br /&gt;
is currently the gateway/devel node for this cluster.  It is recommended that you modify your .bashrc files to distinguish between the TCS, P7,&lt;br /&gt;
and GPC to avoid module confusion; an example configuration is given [[Important_.bashrc_guidelines|here]].&lt;br /&gt;
&lt;br /&gt;
== Compiler/Devel Node ==&lt;br /&gt;
&lt;br /&gt;
From '''&amp;lt;tt&amp;gt;p7n01-ib0&amp;lt;/tt&amp;gt;''' you can compile, do short tests, and submit your jobs to the queue.&lt;br /&gt;
&lt;br /&gt;
=== Software ===&lt;br /&gt;
==== GNU Compilers ====&lt;br /&gt;
gcc/g++/gfortran version 4.4.4 ships with RHEL 6.3 and is available by default. Gcc 4.6.1 is available as a separate module. However, it is recommended to use the IBM compilers (see below).&lt;br /&gt;
&lt;br /&gt;
==== IBM Compilers ====&lt;br /&gt;
To use the IBM Power specific compilers xlc/xlc++/xlf you need to load the following modules &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load vacpp xlf&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
NOTE: Be sure to use &amp;quot;-q64&amp;quot; when using the IBM compilers.&lt;br /&gt;
&lt;br /&gt;
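For example, a minimal 64-bit compile line (source file name hypothetical):&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ xlc -q64 -O3 mycode.c -o mycode&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;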
==== MPI ====&lt;br /&gt;
&lt;br /&gt;
IBM's POE is available and will work with both the IBM and GNU compilers.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load pe&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
The mpi wrappers for C, C++ and Fortran 77/90 are mpicc, mpicxx, and mpif77/mpif90, respectively (but mpcc, mpCC and mpfort should also work).&lt;br /&gt;
&lt;br /&gt;
Note: To use the full C++ bindings of MPI (those in the MPI namespace) in C++ code, you need to add &amp;lt;tt&amp;gt;-cpp&amp;lt;/tt&amp;gt; to the compilation command, and you need to add &amp;lt;tt&amp;gt;-Wl,--allow-multiple-definition&amp;lt;/tt&amp;gt; to the link command if you are linking several object files that use the MPI C++ bindings.&lt;br /&gt;
&lt;br /&gt;
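A hedged sketch of the corresponding compile and link commands (file names hypothetical):&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ mpicxx -cpp -c first.cpp second.cpp&lt;br /&gt;
$ mpicxx -Wl,--allow-multiple-definition first.o second.o -o myapp&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;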
&amp;lt;!--&lt;br /&gt;
==== OpenMPI ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module openmpi/1.5.3-gcc-v4.4.4&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module openmpi/1.5.3-ibm-11.1+13.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
--&amp;gt;&lt;br /&gt;
==== Spark Standalone ====&lt;br /&gt;
To run Spark, you first need to load JRE 1.7.0 via the JDK module:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
p7n01-$ module load jdk/JRE1.7.0 &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
Then load Spark as follows:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
p7n01-$ module load spark/1.4.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
==== Spark SQL ====&lt;br /&gt;
The current build of spark/1.5.0 supports Spark SQL:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
p7n01-$ module load jdk/JRE1.7.0 &lt;br /&gt;
p7n01-$ module load spark/1.5.0&lt;br /&gt;
p7n01-$ module load hadoop/2.3.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Sample Spark script ==&lt;br /&gt;
We recommend reading the following blog post by Jonathan Dursi to build your first Spark script:&lt;br /&gt;
http://www.dursi.ca/spark-in-hpc-clusters/ &lt;br /&gt;
&lt;br /&gt;
Prior to submitting sparkscript.py, change the import line to&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
from pyspark.context import SparkContext&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Alternatively, instead of submitting sparkscript.py, you can try:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
spark-submit --master $sparkmaster --class org.apache.spark.examples.SparkPi $SPARK_HOME/examples/target/spark-examples_2.10-1.4.1.jar 256&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Submit a Job ==&lt;br /&gt;
&lt;br /&gt;
The current scheduler is IBM's LoadLeveler, similar to the implementation on the TCS. However, be sure to&lt;br /&gt;
include the @environment flags shown in the sample script below, as they differ from the TCS settings and are necessary to get full performance.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
##===================================&lt;br /&gt;
## P7 Load Leveler Submission Script&lt;br /&gt;
##===================================&lt;br /&gt;
##&lt;br /&gt;
## Don't change these parameters unless you really know what you are doing&lt;br /&gt;
##&lt;br /&gt;
##@ environment = MP_INFOLEVEL=0; MP_USE_BULK_XFER=yes; MP_BULK_MIN_MSG_SIZE=64K; \&lt;br /&gt;
##                MP_EAGER_LIMIT=64K; MP_DEBUG_ENABLE_AFFINITY=no&lt;br /&gt;
##&lt;br /&gt;
##===================================&lt;br /&gt;
## Avoid core dumps&lt;br /&gt;
## @ core_limit   = 0&lt;br /&gt;
##===================================&lt;br /&gt;
## Job specific&lt;br /&gt;
##===================================&lt;br /&gt;
#&lt;br /&gt;
# @ job_name = myjob&lt;br /&gt;
# @ job_type = parallel&lt;br /&gt;
# @ class = verylong&lt;br /&gt;
# @ output = $(jobid).out&lt;br /&gt;
# @ error = $(jobid).err&lt;br /&gt;
# @ wall_clock_limit = 2:00:00&lt;br /&gt;
# @ node = 2&lt;br /&gt;
# @ tasks_per_node = 128&lt;br /&gt;
# @ queue&lt;br /&gt;
#&lt;br /&gt;
#===================================&lt;br /&gt;
&lt;br /&gt;
#./my_script&lt;br /&gt;
./my_code &lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
llsubmit myjob.ll &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To show running jobs use&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
llq&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To cancel a job use&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
llcancel JOBID&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Split a Spark job ==&lt;br /&gt;
&lt;br /&gt;
For example, to split a job into 256 tasks among 2 workers, you must select 3 nodes (one master and 2 workers) and add the following job specifications:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#@node = 3&lt;br /&gt;
#@preferences = Machine == { &amp;quot;AvailableNode1&amp;quot; &amp;quot;AvailableNode2&amp;quot; &amp;quot;AvailableNode3&amp;quot;}&lt;br /&gt;
#@tasks_per_node = 128&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Monitor your (Spark) job from localhost ==&lt;br /&gt;
&lt;br /&gt;
Spark creates a web UI on each master and slave that you can access from your local web browser. Notably, you can &amp;quot;check your cluster UI to ensure that workers are registered and have sufficient resources&amp;quot;. To do so, you must log onto P7 (again) while forwarding the port of your cluster UI to a local port (e.g., 9999):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ssh -L 9999:masternode:4040 userid@login.scinet.utoronto.ca&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Then go to your web browser at http://localhost:9999&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!--  PBS &lt;br /&gt;
Currently a very basic Torque (PBS-based) queuing system has been set up with one 48-hour batch queue. &lt;br /&gt;
&lt;br /&gt;
Create a script as follows:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash &lt;br /&gt;
# p7script.sh&lt;br /&gt;
#PBS -l nodes=1:ppn=128,walltime=1:00:00&lt;br /&gt;
#PBS -N P7test&lt;br /&gt;
&lt;br /&gt;
cd $PBS_O_WORKDIR&lt;br /&gt;
&lt;br /&gt;
mpirun -np 128 ./a.out&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Then submit &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qsub p7script.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Interactive sessions can also be facilitated with&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qsub -I -l nodes=1:ppn=128,walltime=1:00:00&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To see running jobs use&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qstat &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
and to cancel a running or queued job&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qdel JOBID&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
--&amp;gt;&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=HPSS&amp;diff=9376</id>
		<title>HPSS</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=HPSS&amp;diff=9376"/>
		<updated>2018-08-09T17:28:31Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
{|align=right&lt;br /&gt;
|align=center|'''Topology Overview'''&lt;br /&gt;
|align=center|'''Submission Queue'''&lt;br /&gt;
|-&lt;br /&gt;
|[[Image:HPSS-overview.png|right|x200px]]&lt;br /&gt;
|[[Image:HPSS-queue2.png|right|x200px]]&lt;br /&gt;
|-&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|align=center|'''Servers Rack'''&lt;br /&gt;
|align=center|'''TS3500 Library'''&lt;br /&gt;
|-&lt;br /&gt;
|[[Image:HPSS-servers.png|right|x250px]]&lt;br /&gt;
|[[Image:HPSS-TS3500.png|right|x250px]]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
= '''High Performance Storage System''' =&lt;br /&gt;
&lt;br /&gt;
The High Performance Storage System ([http://www.hpss-collaboration.org/index.shtml HPSS] [http://en.wikipedia.org/wiki/High_Performance_Storage_System wikipedia]) is a tape-backed hierarchical storage system that provides a significant portion of the allocated storage space at SciNet. It is a repository for archiving data that is not being actively used. Data can be returned to the active GPFS filesystem when it is needed. &lt;br /&gt;
&lt;br /&gt;
Since this system is intended for large data storage, it is accessible only to groups who have been awarded storage space at SciNet beyond 5TB in the yearly RAC resource allocation round. However, upon request, any user may be granted access to HPSS, up to 2TB per group, to get familiar with the system (just email support@scinet.utoronto.ca).&lt;br /&gt;
&lt;br /&gt;
Access and transfer of data into and out of HPSS is done under the control of the user, whose interaction is expected to be scripted and submitted as a batch job, using one or more of the following utilities:&lt;br /&gt;
* [http://www.mgleicher.us/GEL/hsi HSI] is a client with an ftp-like functionality which can be used to archive and retrieve large files. It is also useful for browsing the contents of HPSS.&lt;br /&gt;
* [http://www.mgleicher.us/GEL/htar HTAR] is a utility that creates tar formatted archives directly into HPSS. It also creates a separate index file (.idx) that can be accessed and browsed quickly.&lt;br /&gt;
* [https://support.scinet.utoronto.ca/wiki/index.php/ISH ISH] is a TUI utility that can perform an inventory of the files and directories in your tarballs.&lt;br /&gt;
&lt;br /&gt;
We're currently running HPSS v 7.3.3 patch 6, and HSI/HTAR version 4.0.1.2&lt;br /&gt;
&lt;br /&gt;
== '''Why should I use and trust HPSS?''' ==&lt;br /&gt;
* HPSS is a 25-year-old collaboration between IBM and the DoE labs in the US, and is used by about 45 facilities in the [http://www.top500.org “Top 500”] HPC list (plus some black-sites).&lt;br /&gt;
* Over 2.5 ExaBytes of combined storage world-wide.&lt;br /&gt;
* The top 3 sites in the world reported (fall 2017) having 360PB, 220PB and 125PB in production (ECMWF, UKMO and BNL)&lt;br /&gt;
* Environment Canada also adopted HPSS in 2017 to store Nav Canada data as well as to serve as their own archive. It currently has 2 x 100PB of capacity installed. &lt;br /&gt;
* The SciNet HPSS system has been providing nearline capacity for important research data in Canada since early 2011, already at 10PB levels in 2018&lt;br /&gt;
* Very reliable, with data redundancy and data insurance built in (dual copies of everything are kept on tapes at SciNet)&lt;br /&gt;
* Data on cache and tapes can be geo-distributed for further resilience and HA.&lt;br /&gt;
* Highly scalable; current performance at SciNet - after a modest upgrade in 2017 - Ingest: ~150 TB/day, Recall: ~45 TB/day (aggregated).&lt;br /&gt;
* The HSI/HTAR clients are also very reliable and are used at several HPSS sites. ISH was written at SciNet.&lt;br /&gt;
* [[Media:HPSS_rationale_SNUG.pdf|HPSS fits well with the Storage Capacity Expansion Plan at SciNet]] (pdf presentation)&lt;br /&gt;
&lt;br /&gt;
== '''Guidelines''' ==&lt;br /&gt;
* Expanded storage capacity is provided on tape -- a medium that is not suited for storing small files. Files smaller than ~200MB should be grouped into tarballs with tar or htar (see the example after this list).&lt;br /&gt;
* Optimal performance for aggregated transfers and allocation on tapes is obtained with [[Why not tarballs too large |&amp;lt;font color=red&amp;gt;tarballs of size 500GB or less&amp;lt;/font&amp;gt;]], whether ingested by htar or hsi ([[Why not tarballs too large | &amp;lt;font color=red&amp;gt;WHY?&amp;lt;/font&amp;gt;]])&lt;br /&gt;
* We strongly urge that you use the sample scripts we are providing as the basis for your job submissions.&lt;br /&gt;
* Make sure to check the application's exit code and returned logs for errors after any data transfer or tarball creation process&lt;br /&gt;
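For example, to group a directory of small files into a single tarball on GPFS before offloading it (the paths are placeholders):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
    cd $SCRATCH/workarea&lt;br /&gt;
    tar -czf small-files.tar.gz small-files-dir/&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;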
&lt;br /&gt;
== '''New to the System?'''  ==&lt;br /&gt;
The first step is to email SciNet support and request an HPSS account (or else you will get &amp;quot;Error - authentication/initialization failed&amp;quot; messages and exit code 71). &lt;br /&gt;
&lt;br /&gt;
THIS set of instructions on the wiki is the best and most compressed &amp;quot;manual&amp;quot; we have. It may seem a bit overwhelming at first because of all the job script templates we make available below (they are there so you don't have to think &lt;br /&gt;
too much, just copy and paste), but if you approach the index at the top as a &amp;quot;case switch&amp;quot; for what you intend to do, everything falls into place.&lt;br /&gt;
&lt;br /&gt;
Try this sequence:&lt;br /&gt;
&lt;br /&gt;
1) [https://wiki.scinet.utoronto.ca/wiki/index.php/HPSS#Access_Through_an_Interactive_HSI_session take a look around HPSS using an interactive HSI session]&lt;br /&gt;
&lt;br /&gt;
(most Linux shell commands have an equivalent in HPSS)&lt;br /&gt;
&lt;br /&gt;
2) [https://support.scinet.utoronto.ca/wiki/index.php/HPSS#Sample_tarball_create archive a small test directory using HTAR]&lt;br /&gt;
&lt;br /&gt;
2a) use step 1) to see what happened&lt;br /&gt;
&lt;br /&gt;
3) [https://support.scinet.utoronto.ca/wiki/index.php/HPSS#Sample_data_offload archive a file using hsi]&lt;br /&gt;
&lt;br /&gt;
3a) use step 1) to see what happened&lt;br /&gt;
&lt;br /&gt;
4) [https://support.scinet.utoronto.ca/wiki/index.php/HPSS#Sample_transferring_directories archive a small test directory using HSI]&lt;br /&gt;
&lt;br /&gt;
4a) use step 1) to see what happened&lt;br /&gt;
&lt;br /&gt;
5) now try the other cases and so on. In a couple of hours you'll be in pretty good shape.&lt;br /&gt;
&lt;br /&gt;
== '''Bridge between BGQ and HPSS''' ==&lt;br /&gt;
&lt;br /&gt;
At this time BGQ users will have to migrate data to Niagara scratch prior to transferring it to HPSS. We are looking for ways to improve this workflow.&lt;br /&gt;
&lt;br /&gt;
== '''Access Through the Queue System'''  ==&lt;br /&gt;
All access to the archive system is done through the [https://docs.computecanada.ca/wiki/Niagara_Quickstart#Submitting_jobs NIA queue system].&lt;br /&gt;
&lt;br /&gt;
* Job submissions should be done to the 'archivelong' or the 'archiveshort' queue&lt;br /&gt;
* Short jobs are limited to 1H walltime by default. Long jobs (&amp;gt; 1H) are limited to 72H walltime.&lt;br /&gt;
* Users are limited to only 2 long jobs and 2 short jobs at the same time, and 10 jobs total on each queue.&lt;br /&gt;
* There can only be 5 long jobs running at any given time overall; remaining submissions will be placed on hold for the time being. So far we have not seen a need for an overall limit on short jobs.&lt;br /&gt;
&lt;br /&gt;
The status of pending jobs can be monitored with squeue specifying the archive queue:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
squeue -p archiveshort&lt;br /&gt;
  OR&lt;br /&gt;
squeue -p archivelong&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== '''Access Through an Interactive HSI session'''  ==&lt;br /&gt;
* You may want to acquire an interactive shell, start an HSI session and navigate the archive namespace. Keep in mind, you're restricted to 1H.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pinto@nia-login07:~$ salloc -p archiveshort -t 1:00:00&lt;br /&gt;
salloc: Granted job allocation 50918&lt;br /&gt;
salloc: Waiting for resource configuration&lt;br /&gt;
salloc: Nodes hpss-archive02-ib are ready for job&lt;br /&gt;
hpss-archive02-ib:~$&lt;br /&gt;
&lt;br /&gt;
hpss-archive02-ib:~$ hsi    (DON'T FORGET TO START HSI)&lt;br /&gt;
******************************************************************&lt;br /&gt;
*     Welcome to HPSS@SciNet - High Performance Storage System   *&lt;br /&gt;
*                                                                * &lt;br /&gt;
*            INFO: THIS IS THE NEW 7.5.1 HPSS SYSTEM!            *&lt;br /&gt;
*                                                                *&lt;br /&gt;
*        Contact Information: support@scinet.utoronto.ca         *&lt;br /&gt;
*  NOTE: do not transfer SMALL FILES with HSI. Use HTAR instead  *&lt;br /&gt;
*              CHECK THE INTEGRITY OF YOUR TARBALLS              *&lt;br /&gt;
****************************************************************** &lt;br /&gt;
[HSI]/archive/s/scinet/pinto-&amp;gt; ls&lt;br /&gt;
&lt;br /&gt;
[HSI]/archive/s/scinet/pinto-&amp;gt; cd &amp;lt;some directory&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Scripted File Transfers ===&lt;br /&gt;
File transfers in and out of HPSS should be scripted into jobs and submitted to the ''archivelong'' or ''archiveshort'' queue. See the generic example below:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash -l&lt;br /&gt;
#SBATCH -t 72:00:00&lt;br /&gt;
#SBATCH -p archivelong &lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J htar_create_tarball_in_hpss&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;Creating a htar of finished-job1/ directory tree into HPSS&amp;quot;&lt;br /&gt;
&lt;br /&gt;
trap &amp;quot;echo 'Job script not completed';exit 129&amp;quot; TERM INT&lt;br /&gt;
# Note that your initial directory in HPSS will be $ARCHIVE&lt;br /&gt;
 &lt;br /&gt;
DEST=$ARCHIVE/finished-job1.tar&lt;br /&gt;
 &lt;br /&gt;
# htar WILL overwrite an existing file with the same name so check beforehand.&lt;br /&gt;
&lt;br /&gt;
hsi ls $DEST &amp;amp;&amp;gt; /dev/null&lt;br /&gt;
status=$?&lt;br /&gt;
&lt;br /&gt;
if [ $status == 0 ]; then   &lt;br /&gt;
    echo &amp;quot;File $DEST already exists. Nothing has been done&amp;quot;&lt;br /&gt;
    exit 1&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
cd $SCRATCH/workarea/ &lt;br /&gt;
htar -Humask=0137 -cpf $ARCHIVE/finished-job1.tar finished-job1/ &lt;br /&gt;
status=$?&lt;br /&gt;
 &lt;br /&gt;
trap - TERM INT&lt;br /&gt;
 &lt;br /&gt;
if [ ! $status == 0 ]; then&lt;br /&gt;
   echo 'HTAR returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
'''Note:''' Always trap the execution of your jobs for abnormal terminations, and be sure to return the exit code&lt;br /&gt;
&lt;br /&gt;
=== Job Dependencies ===&lt;br /&gt;
&lt;br /&gt;
Typically data will be recalled to /scratch when it is needed for analysis. Job dependencies can be constructed so that analysis jobs wait in the queue for data recalls before starting. The sbatch flag is&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
--dependency=&amp;lt;type:JOBID&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
where JOBID is the job number of the archive recalling job that must finish successfully before the analysis job can start.&lt;br /&gt;
&lt;br /&gt;
Here is a short cut for generating the dependency (lookup [https://support.scinet.utoronto.ca/wiki/index.php/HPSS#Sample_data_recall data-recall.sh samples]):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
hpss-archive02-ib:~$ sbatch $(sbatch data-recall.sh | awk '{print &amp;quot;--dependency=afterany:&amp;quot;$4}') job-to-work-on-recalled-data.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
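&lt;br /&gt;
Equivalently, as a two-step sketch (this assumes sbatch's standard &amp;quot;Submitted batch job &amp;lt;id&amp;gt;&amp;quot; output, where the job ID is the fourth field):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
RECALL_JOBID=$(sbatch data-recall.sh | awk '{print $4}')&lt;br /&gt;
sbatch --dependency=afterany:$RECALL_JOBID job-to-work-on-recalled-data.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;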
&lt;br /&gt;
== '''HTAR''' ==&lt;br /&gt;
''' Please aggregate small files (&amp;lt;~200MB) into tarballs or htar files. '''&lt;br /&gt;
&lt;br /&gt;
''' [[Why not tarballs too large |&amp;lt;font color=red&amp;gt;Keep your tarballs to size 500GB or less&amp;lt;/font&amp;gt;]], whether ingested by htar or hsi ([[Why not tarballs too large | &amp;lt;font color=red&amp;gt;WHY?&amp;lt;/font&amp;gt;]])'''&lt;br /&gt;
&lt;br /&gt;
HTAR is a utility that is used for aggregating a set of files and directories, using a sophisticated multithreaded buffering scheme to write files directly from GPFS into HPSS and creating an archive file that conforms to the POSIX TAR specification, thereby achieving a high rate of performance. HTAR does not do gzip compression; however, it has a built-in checksum algorithm.&lt;br /&gt;
&lt;br /&gt;
'''Caution'''&lt;br /&gt;
* Files larger than 68 GB cannot be stored in an HTAR archive. If you attempt to start a transfer with any files larger than 68GB the whole HTAR session will fail, and you'll get a notification listing all those files, so that you can transfer them with HSI.&lt;br /&gt;
* Files with pathnames longer than 100 characters will be skipped, so as to conform with the TAR protocol (POSIX 1003.1 USTAR). Note that HTAR will erroneously indicate success but produce exit code 70. For now, you can check for this type of error with &amp;quot;grep Warning my.output&amp;quot; after the job has completed.&lt;br /&gt;
* Unlike cput/cget in HSI, &amp;quot;prompt before overwrite&amp;quot; is not the default with (h)tar. Be careful not to unintentionally overwrite a previous htar destination file in HPSS. A similar situation can occur when extracting material back into GPFS and overwriting the originals. Be sure to double-check the logic in your scripts.&lt;br /&gt;
* Check the HTAR exit code and log file before removing any files from the GPFS active filesystems.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== HTAR Usage ===&lt;br /&gt;
* To write the ''file1'' and ''file2'' files to a new archive called ''files.tar'' in the default HPSS home directory, and preserve mask attributes (-p), enter:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
    htar -cpf files.tar file1 file2&lt;br /&gt;
OR&lt;br /&gt;
    htar -cpf $ARCHIVE/files.tar file1 file2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* To write a directory ''subdirA'' to a new archive called ''subdirA.tar'' in the default HPSS home directory, enter:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
    htar -cpf subdirA.tar subdirA&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* To extract all files from the archive file called ''proj1.tar'' in HPSS into the ''project1/src'' directory in GPFS, and use the time of extraction as the modification time, enter:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
    cd  project1/src&lt;br /&gt;
    htar -xpmf proj1.tar&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* To display the names of the files in the ''out.tar'' archive file within the HPSS home directory, enter (the out.tar.idx file will be queried):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
    htar -vtf out.tar&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* To ensure that both the htar file and the .idx file are readable by other members of your group, use the umask option:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
    htar -Humask=0137 ....&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For more details please check the '''[http://www.mgleicher.us/GEL/htar/ HTAR - Introduction]''' or the '''[http://www.mgleicher.us/GEL/htar/htar_man_page.html HTAR Man Page]''' online&lt;br /&gt;
&lt;br /&gt;
 &lt;br /&gt;
==== Sample tarball create ====&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash -l&lt;br /&gt;
#SBATCH -t 72:00:00&lt;br /&gt;
#SBATCH -p archivelong &lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J htar_create_tarball_in_hpss&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
&lt;br /&gt;
trap &amp;quot;echo 'Job script not completed';exit 129&amp;quot; TERM INT&lt;br /&gt;
# Note that your initial directory in HPSS will be /archive/$(id -gn)/$(whoami)/&lt;br /&gt;
&lt;br /&gt;
DEST=$ARCHIVE/finished-job1.tar&lt;br /&gt;
&lt;br /&gt;
# htar WILL overwrite an existing file with the same name so check beforehand.&lt;br /&gt;
 &lt;br /&gt;
hsi ls $DEST &amp;amp;&amp;gt; /dev/null&lt;br /&gt;
status=$?&lt;br /&gt;
 &lt;br /&gt;
if [ $status == 0 ]; then   &lt;br /&gt;
    echo &amp;quot;File $DEST already exists. Nothing has been done&amp;quot;&lt;br /&gt;
    exit 1&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
cd $SCRATCH/workarea/ &lt;br /&gt;
htar -Humask=0137 -cpf $DEST finished-job1/ &lt;br /&gt;
status=$?&lt;br /&gt;
 &lt;br /&gt;
trap - TERM INT&lt;br /&gt;
 &lt;br /&gt;
if [ ! $status == 0 ]; then&lt;br /&gt;
   echo 'HTAR returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''Note:''' If you attempt to start a transfer with any files larger than 68GB the whole HTAR session will fail, and you'll get a notification listing all those files, so that you can transfer them with HSI. &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
----------------------------------------&lt;br /&gt;
INFO: File too large for htar to handle: finished-job1/file1 (86567185745 bytes)&lt;br /&gt;
INFO: File too large for htar to handle: finished-job1/file2 (71857244579 bytes)&lt;br /&gt;
ERROR: 2 oversize member files found - please correct and retry&lt;br /&gt;
ERROR: [FATAL] error(s) generating filename list &lt;br /&gt;
HTAR: HTAR FAILED&lt;br /&gt;
###WARNING  htar returned non-zero exit status&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Sample tarball list ====&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash -l&lt;br /&gt;
#SBATCH -t 72:00:00&lt;br /&gt;
#SBATCH -p archivelong &lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J htar_list_tarball_in_hpss&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
&lt;br /&gt;
trap &amp;quot;echo 'Job script not completed';exit 129&amp;quot; TERM INT&lt;br /&gt;
# Note that your initial directory in HPSS will be $ARCHIVE&lt;br /&gt;
&lt;br /&gt;
DEST=$ARCHIVE/finished-job1.tar&lt;br /&gt;
&lt;br /&gt;
htar -tvf $DEST&lt;br /&gt;
status=$?&lt;br /&gt;
&lt;br /&gt;
trap - TERM INT&lt;br /&gt;
 &lt;br /&gt;
if [ ! $status == 0 ]; then&lt;br /&gt;
   echo 'HTAR returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Sample tarball extract ====&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH -t 72:00:00&lt;br /&gt;
#SBATCH -p archivelong &lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J htar_extract_tarball_from_hpss&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
 &lt;br /&gt;
trap &amp;quot;echo 'Job script not completed';exit 129&amp;quot; TERM INT&lt;br /&gt;
# Note that your initial directory in HPSS will be $ARCHIVE&lt;br /&gt;
 &lt;br /&gt;
cd $SCRATCH/recalled-from-hpss&lt;br /&gt;
htar -xpmf $ARCHIVE/finished-job1.tar&lt;br /&gt;
status=$?&lt;br /&gt;
 &lt;br /&gt;
trap - TERM INT&lt;br /&gt;
 &lt;br /&gt;
if [ ! $status == 0 ]; then&lt;br /&gt;
   echo 'HTAR returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== '''HSI''' ==&lt;br /&gt;
&lt;br /&gt;
HSI is likely the primary client with which most users will interact with HPSS. It provides an ftp-like interface for archiving and retrieving tarballs or [https://support.scinet.utoronto.ca/wiki/index.php/HPSS#Sample_transferring_directories directory trees]. In addition, it provides a number of shell-like commands that are useful for examining and manipulating the contents of HPSS. The most commonly used commands will be:&lt;br /&gt;
{|border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;10&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
  | cput &lt;br /&gt;
  | Conditionally saves or replaces a GPFSpath file to HPSSpath if the GPFS version is new or has been updated&lt;br /&gt;
 cput [options] GPFSpath [: HPSSpath]&lt;br /&gt;
|-&lt;br /&gt;
  | cget &lt;br /&gt;
  | Conditionally retrieves a copy of a file from HPSS to GPFS only if a GPFS version does not already exist. &lt;br /&gt;
 cget [options] [GPFSpath :] HPSSpath&lt;br /&gt;
|-&lt;br /&gt;
  | cd,mkdir,ls,rm,mv&lt;br /&gt;
  | Operate as one would expect on the contents of HPSS.&lt;br /&gt;
|-&lt;br /&gt;
  | lcd,lls&lt;br /&gt;
  | ''Local'' commands to GPFS&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* There are 3 peculiarities of HSI that you should keep in mind, which can generate a bit of confusion when you're first learning how to use it:&lt;br /&gt;
** HSI doesn't currently support renaming directory paths on-the-fly during transfers, therefore the syntax for cput/cget may not work as one would expect in some scenarios, requiring some workarounds.&lt;br /&gt;
** HSI has an operator &amp;quot;:&amp;quot; which separates the GPFSpath and HPSSpath, and must be surrounded by whitespace (one or more space characters)&lt;br /&gt;
** The order for referring to files in HSI syntax is different from FTP. In HSI the general format is always the same, GPFS first, HPSS second, cput or cget:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
     GPFSfile : HPSSfile&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
For example, when using HSI to store the tarball file from GPFS into HPSS, then recall it to GPFS, the following commands could be used:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
    cput tarball-in-GPFS : tarball-in-HPSS&lt;br /&gt;
    cget tarball-recalled : tarball-in-HPSS&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
unlike with FTP, where the following syntax would be used:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
    put tarball-in-GPFS tarball-in-HPSS &lt;br /&gt;
    get tarball-in-HPSS tarball-recalled&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
* Simple commands can be executed on a single line.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
    hsi &amp;quot;mkdir LargeFilesDir; cd LargeFilesDir; cput tarball-in-GPFS : tarball-in-HPSS&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* More complex sequences can be performed using a here-document, such as this:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
    hsi &amp;lt;&amp;lt;EOF&lt;br /&gt;
      mkdir LargeFilesDir&lt;br /&gt;
      cd LargeFilesDir&lt;br /&gt;
      cput tarball-in-GPFS : tarball-in-HPSS&lt;br /&gt;
      lcd $SCRATCH/LargeFilesDir2/&lt;br /&gt;
      cput -Ruph *  &lt;br /&gt;
    end&lt;br /&gt;
    EOF&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* The commands below are equivalent, but we recommend that you always use full paths and organize the contents of HPSS; the default HSI directory placement is $ARCHIVE:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
    hsi cput tarball&lt;br /&gt;
    hsi cput tarball : tarball&lt;br /&gt;
    hsi cput $SCRATCH/tarball : $ARCHIVE/tarball&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* There are no known issues renaming files on-the-fly:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
    hsi cput $SCRATCH/tarball1 : $ARCHIVE/tarball2&lt;br /&gt;
    hsi cget $SCRATCH/tarball3 : $ARCHIVE/tarball2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* However, syntax forms such as the ones below will fail, since they rename directory paths.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
   hsi cput -Ruph $SCRATCH/LargeFilesDir : $ARCHIVE/LargeFilesDir     (FAILS)&lt;br /&gt;
OR&lt;br /&gt;
   hsi cget -Ruph $SCRATCH/LargeFilesDir : $ARCHIVE/LargeFilesDir2    (FAILS)&lt;br /&gt;
OR&lt;br /&gt;
   hsi cput -Ruph $SCRATCH/LargeFilesDir/* : $ARCHIVE/LargeFilesDir2  (FAILS)&lt;br /&gt;
OR&lt;br /&gt;
   hsi cget -Ruph $SCRATCH/LargeFilesDir : $ARCHIVE/LargeFilesDir     (FAILS)&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
One workaround is the following 2-step process, where you do an &amp;quot;lcd&amp;quot; in GPFS first, and recursively transfer the whole directory (-R), keeping the same name. You may use the '-u' option to resume a previously disrupted session, '-p' to preserve timestamps, and '-h' to keep the links.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
    hsi &amp;lt;&amp;lt;EOF&lt;br /&gt;
      lcd $SCRATCH&lt;br /&gt;
      cget -Ruph LargeFilesDir&lt;br /&gt;
    end&lt;br /&gt;
    EOF&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Another workaround is to do an &amp;quot;lcd&amp;quot; into the GPFSpath first and a &amp;quot;cd&amp;quot; into the HPSSpath, then transfer the files individually with the '*' wildcard character. This option lets you change the directory name:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
    hsi &amp;lt;&amp;lt;EOF&lt;br /&gt;
      lcd $SCRATCH/LargeFilesDir&lt;br /&gt;
      mkdir $ARCHIVE/LargeFilesDir2&lt;br /&gt;
      cd $ARCHIVE/LargeFilesDir2&lt;br /&gt;
      cput -Ruph *  &lt;br /&gt;
    end&lt;br /&gt;
    EOF&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Documentation === &lt;br /&gt;
Complete documentation on HSI is available from the Gleicher Enterprises links below. You may peruse those links and come up with alternative syntax forms. You may even already be familiar with HPSS/HSI from other HPC facilities, which may or may not have procedures similar to ours. HSI doesn't always work as expected when you go outside of our recommended syntax, so '''we strongly urge that you use the sample scripts we are providing as the basis''' for your job submissions.&lt;br /&gt;
* [http://www.mgleicher.us/hsi/hsi_reference_manual_2/introduction.html HSI Introduction]&lt;br /&gt;
* [http://www.mgleicher.us/hsi/hsi_man_page.html man hsi]&lt;br /&gt;
* [http://support.scinet.utoronto.ca/wiki/index.php/HSI_help hsi help]&lt;br /&gt;
* [http://www.mgleicher.us/hsi/hsi-exit-codes.html exit codes] &lt;br /&gt;
'''Note:''' HSI returns the highest-numbered exit code, in case of multiple operations in the same hsi session. You may use '/scinet/niagara/bin/exit2msg $status' to translate those codes into intelligible messages&lt;br /&gt;
&lt;br /&gt;
=== Typical Usage Scripts===&lt;br /&gt;
The most common interactions will be ''putting'' data into HPSS, examining the contents (ls,ish), and ''getting'' data back onto GPFS for inspection or analysis.&lt;br /&gt;
&lt;br /&gt;
==== Sample '''data offload''' ====&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# This script is named: data-offload.sh&lt;br /&gt;
#SBATCH -t 72:00:00&lt;br /&gt;
#SBATCH -p archivelong &lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J offload&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
&lt;br /&gt;
trap &amp;quot;echo 'Job script not completed';exit 129&amp;quot; TERM INT&lt;br /&gt;
# individual tarballs already exist&lt;br /&gt;
&lt;br /&gt;
/usr/local/bin/hsi  -v &amp;lt;&amp;lt;EOF1&lt;br /&gt;
mkdir put-away&lt;br /&gt;
cd put-away&lt;br /&gt;
cput $SCRATCH/workarea/finished-job1.tar.gz : finished-job1.tar.gz&lt;br /&gt;
end&lt;br /&gt;
EOF1&lt;br /&gt;
status=$?&lt;br /&gt;
if [ ! $status == 0 ];then&lt;br /&gt;
   echo 'HSI returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
/usr/local/bin/hsi  -v &amp;lt;&amp;lt;EOF2&lt;br /&gt;
mkdir put-away&lt;br /&gt;
cd put-away&lt;br /&gt;
cput $SCRATCH/workarea/finished-job2.tar.gz : finished-job2.tar.gz&lt;br /&gt;
end&lt;br /&gt;
EOF2&lt;br /&gt;
status=$?&lt;br /&gt;
if [ ! $status == 0 ];then&lt;br /&gt;
   echo 'HSI returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
trap - TERM INT&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Note:''' as in the above example, we recommend that you capture the (highest-numbered) exit code for each hsi session independently. And remember, you can make the reported exit code more informative by adding the excerpt below to your scripts:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
if [ ! $status == 0 ];then&lt;br /&gt;
   echo 'HSI returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Sample '''data list''' ====&lt;br /&gt;
A very trivial way to list the contents of HPSS would be to just submit the HSI 'ls' command.&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# This script is named: data-list.sh&lt;br /&gt;
#SBATCH -t 1:00:00&lt;br /&gt;
#SBATCH -p archiveshort&lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J hpss_ls&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
&lt;br /&gt;
/usr/local/bin/hsi -v &amp;lt;&amp;lt;EOF&lt;br /&gt;
cd put-away&lt;br /&gt;
ls -R&lt;br /&gt;
end&lt;br /&gt;
EOF&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
''Warning: if you have a lot of files, the ls command will take a long time to complete. For instance, about 400,000 files can be listed in about an hour. Adjust the walltime accordingly, and be on the safe side.''&lt;br /&gt;
&lt;br /&gt;
However, we provide a much more useful and convenient way to explore the contents of HPSS with the inventory shell [[ISH]]. This example creates an index of all the files in a user's portion of the namespace. The list is placed in the directory /home/$(whoami)/.ish_register that can be inspected from the login nodes.&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# This script is named: data-list.sh&lt;br /&gt;
#SBATCH -t 1:00:00&lt;br /&gt;
#SBATCH -p archiveshort&lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J hpss_index&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
&lt;br /&gt;
INDEX_DIR=$HOME/.ish_register&lt;br /&gt;
if ! [ -e &amp;quot;$INDEX_DIR&amp;quot; ]; then&lt;br /&gt;
  mkdir -p $INDEX_DIR&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
export ISHREGISTER=&amp;quot;$INDEX_DIR&amp;quot;&lt;br /&gt;
/scinet/niagara/bin/ish hindex&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
''Note: the above warning on collecting the listing for many files applies here too.''&lt;br /&gt;
&lt;br /&gt;
This index can be browsed or searched with ISH on the development nodes.&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
hpss-archive02-ib:~$  /scinet/niagara/bin/ish ~/.ish_register/hpss.igz &lt;br /&gt;
[ish]hpss.igz&amp;gt; help&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
ISH is a powerful tool that is also useful for creating and browsing indices of tar and htar archives, so please look at the [[ISH|documentation]] or built in help.&lt;br /&gt;
&lt;br /&gt;
==== Sample '''data recall''' ====&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# This script is named: data-recall.sh&lt;br /&gt;
#SBATCH -t 72:00:00&lt;br /&gt;
#SBATCH -p archivelong&lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J recall_files&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
&lt;br /&gt;
trap &amp;quot;echo 'Job script not completed';exit 129&amp;quot; TERM INT&lt;br /&gt;
&lt;br /&gt;
mkdir -p $SCRATCH/recalled-from-hpss&lt;br /&gt;
&lt;br /&gt;
# individual tarballs previously organized in HPSS inside the put-away-on-2010/ folder&lt;br /&gt;
hsi  -v &amp;lt;&amp;lt; EOF&lt;br /&gt;
cget $SCRATCH/recalled-from-hpss/Jan-2010-jobs.tar.gz : $ARCHIVE/put-away-on-2010/Jan-2010-jobs.tar.gz&lt;br /&gt;
cget $SCRATCH/recalled-from-hpss/Feb-2010-jobs.tar.gz : $ARCHIVE/put-away-on-2010/Feb-2010-jobs.tar.gz&lt;br /&gt;
end&lt;br /&gt;
EOF&lt;br /&gt;
status=$?&lt;br /&gt;
&lt;br /&gt;
trap - TERM INT&lt;br /&gt;
&lt;br /&gt;
if [ ! $status == 0 ]; then&lt;br /&gt;
   echo 'HSI returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
We should emphasize that a single ''cget'' of multiple files (rather than several separate gets) allows HSI to do optimization, as in the following example:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# This script is named: data-recall.sh&lt;br /&gt;
#SBATCH -t 72:00:00&lt;br /&gt;
#SBATCH -p archivelong&lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J recall_files_optimized&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
&lt;br /&gt;
trap &amp;quot;echo 'Job script not completed';exit 129&amp;quot; TERM INT&lt;br /&gt;
mkdir -p $SCRATCH/recalled-from-hpss&lt;br /&gt;
&lt;br /&gt;
# individual tarballs previously organized in HPSS inside the put-away-on-2010/ folder&lt;br /&gt;
hsi  -v &amp;lt;&amp;lt; EOF&lt;br /&gt;
lcd $SCRATCH/recalled-from-hpss/&lt;br /&gt;
cd $ARCHIVE/put-away-on-2010/&lt;br /&gt;
cget Jan-2010-jobs.tar.gz Feb-2010-jobs.tar.gz&lt;br /&gt;
end&lt;br /&gt;
EOF&lt;br /&gt;
status=$?&lt;br /&gt;
&lt;br /&gt;
trap - TERM INT&lt;br /&gt;
 &lt;br /&gt;
if [ ! $status == 0 ]; then&lt;br /&gt;
   echo 'HSI returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Sample '''transferring directories''' ====&lt;br /&gt;
Remember, it's not possible to rename directories on-the-fly:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
hsi cget -Ruph $SCRATCH/LargeFiles-recalled : $ARCHIVE/LargeFiles    (FAILS)&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
One workaround is to transfer the whole directory (and sub-directories) recursively:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# This script is named: data-recall.sh&lt;br /&gt;
#SBATCH -t 72:00:00&lt;br /&gt;
#SBATCH -p archivelong&lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J recall_directories&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
&lt;br /&gt;
trap &amp;quot;echo 'Job script not completed';exit 129&amp;quot; TERM INT&lt;br /&gt;
&lt;br /&gt;
mkdir -p $SCRATCH/recalled&lt;br /&gt;
&lt;br /&gt;
hsi  -v &amp;lt;&amp;lt; EOF&lt;br /&gt;
lcd $SCRATCH/recalled&lt;br /&gt;
cd $ARCHIVE/&lt;br /&gt;
cget -Ruph LargeFiles&lt;br /&gt;
end&lt;br /&gt;
EOF&lt;br /&gt;
status=$?&lt;br /&gt;
 &lt;br /&gt;
trap - TERM INT&lt;br /&gt;
 &lt;br /&gt;
if [ ! $status == 0 ]; then&lt;br /&gt;
   echo 'HSI returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Another workaround is to transfer files and subdirectories individually with the &amp;quot;*&amp;quot; wildcard character:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# This script is named: data-recall.sh&lt;br /&gt;
#SBATCH -t 72:00:00&lt;br /&gt;
#SBATCH -p archivelong&lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J recall_directories&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
&lt;br /&gt;
trap &amp;quot;echo 'Job script not completed';exit 129&amp;quot; TERM INT&lt;br /&gt;
&lt;br /&gt;
mkdir -p $SCRATCH/LargeFiles-recalled&lt;br /&gt;
&lt;br /&gt;
hsi  -v &amp;lt;&amp;lt; EOF&lt;br /&gt;
lcd $SCRATCH/LargeFiles-recalled&lt;br /&gt;
cd $ARCHIVE/LargeFiles&lt;br /&gt;
cget -Ruph *&lt;br /&gt;
end&lt;br /&gt;
EOF&lt;br /&gt;
status=$?&lt;br /&gt;
 &lt;br /&gt;
trap - TERM INT&lt;br /&gt;
 &lt;br /&gt;
if [ ! $status == 0 ]; then&lt;br /&gt;
   echo 'HSI returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* For more details please check the '''[http://www.mgleicher.us/GEL/hsi/ HSI Introduction]''', the '''[http://www.mgleicher.us/GEL/hsi/hsi_man_page.html HSI Man Page]''' or the [https://support.scinet.utoronto.ca/wiki/index.php/HSI_help '''hsi help''']&lt;br /&gt;
&lt;br /&gt;
== '''[[ISH|ISH]]''' ==&lt;br /&gt;
=== [[ISH|Documentation and Usage]] ===&lt;br /&gt;
 &lt;br /&gt;
== '''File and directory management''' ==&lt;br /&gt;
=== Moving/renaming ===&lt;br /&gt;
* You may use 'mv' or 'cp' in the same way as their Linux versions.&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH -t 72:00:00&lt;br /&gt;
#SBATCH -p archivelong&lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J deletion_script&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;HPSS file and directory management&amp;quot;&lt;br /&gt;
&lt;br /&gt;
trap &amp;quot;echo 'Job script not completed';exit 129&amp;quot; TERM INT&lt;br /&gt;
&lt;br /&gt;
/usr/local/bin/hsi -v &amp;lt;&amp;lt;EOF1&lt;br /&gt;
    mkdir $ARCHIVE/2011&lt;br /&gt;
    mv $ARCHIVE/oldjobs $ARCHIVE/2011&lt;br /&gt;
    cp -r $ARCHIVE/almostfinished/*done $ARCHIVE/2011&lt;br /&gt;
end&lt;br /&gt;
EOF1&lt;br /&gt;
status=$?&lt;br /&gt;
 &lt;br /&gt;
trap - TERM INT&lt;br /&gt;
 &lt;br /&gt;
if [ ! $status == 0 ]; then&lt;br /&gt;
   echo 'HSI returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Deletions ===&lt;br /&gt;
==== Recommendations ====&lt;br /&gt;
* Be careful with the use of 'cd' commands to non-existing directories before the 'rm' command; results may be unpredictable.&lt;br /&gt;
* Avoid the use of the standalone wildcard character '''*'''. If necessary, whenever possible have it bound to common patterns, such as '*.tmp', so as to limit unintentional mishaps.&lt;br /&gt;
* Avoid using relative paths, even the env variable $ARCHIVE. It is better to explicitly expand the full paths in your scripts.&lt;br /&gt;
* Avoid using recursive/looped deletion instructions on $SCRATCH contents from the archive job scripts. Even on $ARCHIVE contents, it may be better to do it as an independent job submission, after you have verified that the original ingestion into HPSS finished without any issues.&lt;br /&gt;
&lt;br /&gt;
==== Typical example ====&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH -t 72:00:00&lt;br /&gt;
#SBATCH -p archivelong&lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J deletion_script&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;Deletion of an outdated directory tree into HPSS&amp;quot;&lt;br /&gt;
&lt;br /&gt;
trap &amp;quot;echo 'Job script not completed';exit 129&amp;quot; TERM INT&lt;br /&gt;
# Note that the initial directory in HPSS ($ARCHIVE) has the path explicitly expanded&lt;br /&gt;
&lt;br /&gt;
/usr/local/bin/hsi -v &amp;lt;&amp;lt;EOF1&lt;br /&gt;
    rm /archive/s/scinet/pinto/*.tmp&lt;br /&gt;
    rm -R /archive/s/scinet/pinto/obsolete&lt;br /&gt;
end&lt;br /&gt;
EOF1&lt;br /&gt;
status=$?&lt;br /&gt;
 &lt;br /&gt;
trap - TERM INT&lt;br /&gt;
 &lt;br /&gt;
if [ ! $status == 0 ]; then&lt;br /&gt;
   echo 'HSI returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Deleting with an interactive HSI session ====&lt;br /&gt;
* You may feel more comfortable acquiring an interactive shell, starting an HSI session and proceeding with your deletions that way. Keep in mind, you're restricted to 1H.&lt;br /&gt;
&lt;br /&gt;
* After acquiring an interactive shell with the ''salloc -p archiveshort'' command (as shown below) you'll get a standard shell prompt on an archive execution node (hpss-archive02), as you would on any compute node. However, you will need to run '''HSI''' or '''HTAR''' to access resources on HPSS. &lt;br /&gt;
&lt;br /&gt;
* HSI will give you a prompt very similar to a standard shell, where you can navigate around using commands such as 'ls', 'cd', 'pwd', etc. NOTE: not every bash command has an equivalent in HSI - for instance, you cannot 'vi' or 'cat'.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pinto@nia-login07:~$ salloc -p archiveshort -t 1:00:00&lt;br /&gt;
salloc: Granted job allocation 50359&lt;br /&gt;
salloc: Waiting for resource configuration&lt;br /&gt;
salloc: Nodes hpss-archive02-ib are ready for job&lt;br /&gt;
&lt;br /&gt;
hpss-archive02-ib:~$ hsi&lt;br /&gt;
******************************************************************&lt;br /&gt;
*     Welcome to HPSS@SciNet - High Performance Storage System   *&lt;br /&gt;
*                                                                * &lt;br /&gt;
*            INFO: THIS IS THE NEW 7.5.1 HPSS SYSTEM!            *&lt;br /&gt;
*                                                                *&lt;br /&gt;
*        Contact Information: support@scinet.utoronto.ca         *&lt;br /&gt;
*  NOTE: do not transfer SMALL FILES with HSI. Use HTAR instead  *&lt;br /&gt;
*              CHECK THE INTEGRITY OF YOUR TARBALLS              *&lt;br /&gt;
******************************************************************&lt;br /&gt;
&lt;br /&gt;
[HSI]/archive/s/scinet/pinto-&amp;gt; rm -R junk&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== '''HPSS for the 'Watchmaker' ''' ==&lt;br /&gt;
=== Efficient alternative to htar ===&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH -t 72:00:00&lt;br /&gt;
#SBATCH -p archivelong&lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J tar_create_tarball_in_hpss_with_hsi_by_piping&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
&lt;br /&gt;
trap &amp;quot;echo 'Job script not completed';exit 129&amp;quot; TERM INT&lt;br /&gt;
# Note that your initial directory in HPSS will be $ARCHIVE&lt;br /&gt;
&lt;br /&gt;
# When using a pipeline like this&lt;br /&gt;
set -o pipefail &lt;br /&gt;
&lt;br /&gt;
# to put (cput will fail)&lt;br /&gt;
tar -c $SCRATCH/mydir | hsi put - : $ARCHIVE/mydir.tar&lt;br /&gt;
status=$?&lt;br /&gt;
if [ ! $status == 0 ]; then&lt;br /&gt;
   echo 'TAR+HSI+piping returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
# to immediately generate an index&lt;br /&gt;
ish hindex $ARCHIVE/mydir.tar&lt;br /&gt;
status=$?&lt;br /&gt;
if [ ! $status == 0 ]; then&lt;br /&gt;
   echo 'ISH returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
# to get&lt;br /&gt;
#cd $SCRATCH&lt;br /&gt;
#hsi get - : $ARCHIVE/mydir.tar | tar -xvf - &lt;br /&gt;
#status=$?&lt;br /&gt;
# if [ ! $status == 0 ]; then&lt;br /&gt;
#   echo 'TAR+HSI+piping returned non-zero code.'&lt;br /&gt;
#   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
#   exit $status&lt;br /&gt;
#else&lt;br /&gt;
#   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
#fi&lt;br /&gt;
&lt;br /&gt;
trap - TERM INT&lt;br /&gt;
 &lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
'''Notes:''' &lt;br /&gt;
* Combining commands in this fashion, besides being HPSS-friendly, should not be noticeably slower than the recursive put with HSI that stores each file one by one. However, reading the files back from tape in this format will be many times faster. It also overcomes the current 68GB limit on the size of stored files that we have with htar.&lt;br /&gt;
* To top things off, we recommend indexing with ish (in the same script) immediately after the tarball creation, while it still resides in the HPSS cache. It would be as if htar had been used.&lt;br /&gt;
* To ensure that an error at any stage of the pipeline shows up in the returned status use: ''set -o pipefail'' (The default is to return the status of the last command in the pipeline and this is not what you want.)&lt;br /&gt;
* Optimal performance for aggregated transfers and allocation on tapes is obtained with [[Why not tarballs too large |&amp;lt;font color=red&amp;gt;tarballs of size 500GB or less&amp;lt;/font&amp;gt;]], whether ingested by htar or hsi ([[Why not tarballs too large | &amp;lt;font color=red&amp;gt;WHY?&amp;lt;/font&amp;gt;]]). Be sure to check the directory tree with 'du' for the total amount of data before sending it to the tar+HSI pipeline, as shown below.&lt;br /&gt;
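For example (mydir being the directory tree used in the script above):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
    du -sh $SCRATCH/mydir&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;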
&lt;br /&gt;
=== Multi-threaded gzip'ed compression with pigz ===&lt;br /&gt;
We compiled a multi-threaded implementation of gzip called pigz (http://zlib.net/pigz/). It's now part of the &amp;quot;extras&amp;quot; module and can be used on any compute or devel node. This makes the execution of the previous version of the script much quicker than if you were to use 'tar -czf'. In addition, by piggy-backing ISH onto the end of the script, it will know what to do with the just-created mydir.tar.gz compressed tarball.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH -t 72:00:00&lt;br /&gt;
#SBATCH -p archivelong&lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J tar_create_compressed_tarball_in_hpss_with_hsi_by_piping&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
 &lt;br /&gt;
 &lt;br /&gt;
trap &amp;quot;echo 'Job script not completed';exit 129&amp;quot; TERM INT&lt;br /&gt;
# Note that your initial directory in HPSS will be $ARCHIVE&lt;br /&gt;
&lt;br /&gt;
# When using a pipeline like this&lt;br /&gt;
set -o pipefail &lt;br /&gt;
&lt;br /&gt;
module load extras&lt;br /&gt;
&lt;br /&gt;
# to put (cput will fail)&lt;br /&gt;
tar -c $SCRATCH/mydir | pigz | hsi put - : $ARCHIVE/mydir.tar.gz&lt;br /&gt;
status=$?&lt;br /&gt;
if [ ! $status == 0 ]; then&lt;br /&gt;
   echo 'TAR+PIGZ+HSI+piping returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Content Verification ===&lt;br /&gt;
&lt;br /&gt;
==== HTAR CRC checksums ====&lt;br /&gt;
The -Hcrc option specifies that HTAR should generate CRC checksums when creating the archive, and -Hverify=1 requests that they be verified.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH -t 72:00:00&lt;br /&gt;
#SBATCH -p archivelong&lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J htar_create_tarball_in_hpss_with_checksum_verification&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
&lt;br /&gt;
trap &amp;quot;echo 'Job script not completed';exit 129&amp;quot; TERM INT&lt;br /&gt;
# Note that your initial directory in HPSS will be $ARCHIVE&lt;br /&gt;
 &lt;br /&gt;
cd $SCRATCH/workarea&lt;br /&gt;
&lt;br /&gt;
# to put&lt;br /&gt;
htar -Humask=0137 -cpf $ARCHIVE/finished-job1.tar -Hcrc -Hverify=1 finished-job1/&lt;br /&gt;
&lt;br /&gt;
# to get&lt;br /&gt;
#mkdir $SCRATCH/verification&lt;br /&gt;
#cd $SCRATCH/verification&lt;br /&gt;
#htar -Hcrc -xvpmf $ARCHIVE/finished-job1.tar &lt;br /&gt;
&lt;br /&gt;
status=$?&lt;br /&gt;
 &lt;br /&gt;
trap - TERM INT&lt;br /&gt;
 &lt;br /&gt;
if [ ! $status == 0 ]; then&lt;br /&gt;
   echo 'HTAR returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Current HSI version - Checksum built-in ====&lt;br /&gt;
&lt;br /&gt;
MD5 is the standard hashing algorithm for the HSI build at SciNet. For hsi ingestions with the '-c on' option, you should be able to query the md5 hash with the hsi command 'lshash' (see the script below). That value is stored as a UDA (User Defined Attribute) for each file (a feature of HPSS starting with 7.4)&lt;br /&gt;
&lt;br /&gt;
[http://www.mgleicher.us/GEL/hsi/hsi/hsi_reference_manual_2/checksum-feature.html More usage details here]&lt;br /&gt;
&lt;br /&gt;
The checksum algorithm is very CPU-intensive. Although the checksum code is compiled with a high level of compiler optimization, transfer rates can be significantly reduced when checksum creation or verification is in effect. The amount of degradation in transfer rates depends on several factors, such as  processor speed, network transfer speed, and speed of the local filesystem (GPFS).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH -t 72:00:00&lt;br /&gt;
#SBATCH -p archivelong&lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J MD5_checksum_verified_transfer&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
&lt;br /&gt;
thefile=&amp;lt;GPFSpath&amp;gt;&lt;br /&gt;
storedfile=&amp;lt;HPSSpath&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# Generate checksum on fly (-c on)&lt;br /&gt;
hsi -q put -c on $thefile : $storedfile&lt;br /&gt;
&lt;br /&gt;
# Check the exit code of the HSI process  &lt;br /&gt;
status=$?&lt;br /&gt;
&lt;br /&gt;
if [ ! $status == 0 ]; then&lt;br /&gt;
   echo 'HSI returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
# verify checksum&lt;br /&gt;
hsi lshash $storedfile&lt;br /&gt;
status=$?&lt;br /&gt;
 &lt;br /&gt;
if [ ! $status == 0 ]; then&lt;br /&gt;
   echo 'HSI returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
# get the file back with checksum&lt;br /&gt;
hsi get -c on $storedfile&lt;br /&gt;
status=$?&lt;br /&gt;
 &lt;br /&gt;
if [ ! $status == 0 ]; then&lt;br /&gt;
   echo 'HSI returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Prior to HSI version 4.0.1.1 ====&lt;br /&gt;
&lt;br /&gt;
This will checksum the contents of the HPSSpath against the original GPFSpath after the transfer has finished.&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH -t 72:00:00&lt;br /&gt;
#SBATCH -p archivelong&lt;br /&gt;
#SBATCH -N 1&lt;br /&gt;
#SBATCH -J checksum_verified_transfer&lt;br /&gt;
#SBATCH --mail-type=ALL&lt;br /&gt;
&lt;br /&gt;
thefile=&amp;lt;GPFSpath&amp;gt;&lt;br /&gt;
storedfile=&amp;lt;HPSSpath&amp;gt;&lt;br /&gt;
fname=$(basename $thefile)   # used below to name the checksum file&lt;br /&gt;
&lt;br /&gt;
# Generate checksum on fly using a named pipe so that file is only read from GPFS once&lt;br /&gt;
mkfifo /tmp/NPIPE&lt;br /&gt;
cat $thefile | tee /tmp/NPIPE | hsi -q put - : $storedfile &amp;amp;&lt;br /&gt;
pid=$!&lt;br /&gt;
md5sum /tmp/NPIPE | tee /tmp/$fname.md5&lt;br /&gt;
rm -f /tmp/NPIPE&lt;br /&gt;
&lt;br /&gt;
# Check the exit code of the HSI process  &lt;br /&gt;
wait $pid&lt;br /&gt;
status=$?&lt;br /&gt;
&lt;br /&gt;
if [ $status -ne 0 ]; then&lt;br /&gt;
   echo 'HSI returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'TRANSFER SUCCESSFUL'&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
# change filename to stdin in checksum file&lt;br /&gt;
sed -i.1 &amp;quot;s+/tmp/NPIPE+-+&amp;quot; /tmp/$fname.md5&lt;br /&gt;
&lt;br /&gt;
# verify checksum&lt;br /&gt;
hsi -q get - : $storedfile | md5sum -c /tmp/$fname.md5&lt;br /&gt;
status=$?&lt;br /&gt;
 &lt;br /&gt;
if [ $status -ne 0 ]; then&lt;br /&gt;
   echo 'HSI returned non-zero code.'&lt;br /&gt;
   /scinet/niagara/bin/exit2msg $status&lt;br /&gt;
   exit $status&lt;br /&gt;
else&lt;br /&gt;
   echo 'CHECKSUM VERIFIED'&lt;br /&gt;
fi&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== '''Access to HPSS using Globus''' ==&lt;br /&gt;
* &amp;lt;font color=red&amp;gt; Please note that Globus access to HPSS is disabled until further notice, due to lack of version compatibility.&amp;lt;/font&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Globus lets you transfer data between SciNet's HPSS and an external source&lt;br /&gt;
* Follow the link below&lt;br /&gt;
  https://globus.computecanada.ca&lt;br /&gt;
: Enter your Compute Canada username and password.&lt;br /&gt;
* In the 'File Transfer' tab, enter '''Compute Canada HPSS''' as one of the Endpoints. To authenticate this endpoint, enter your SciNet username and password.&lt;br /&gt;
* You may read more about Compute Canada's Globus Portal here:&lt;br /&gt;
  https://docs.computecanada.ca/wiki/Globus&lt;br /&gt;
&lt;br /&gt;
== '''Access to HPSS using SME''' ==&lt;br /&gt;
* Storage Made Easy (SME) is an Enterprise Cloud Portal adopted by SciNet to give our users access to HPSS&lt;br /&gt;
* Best suited for light transfers to/from your personal computer and for navigating your content on HPSS&lt;br /&gt;
* Follow the link below in a web browser and log in with your SciNet UserID and password. Under File Manager you will find the &amp;quot;'''SciNet HPSS'''&amp;quot; folder.&lt;br /&gt;
  https://sme.scinet.utoronto.ca&lt;br /&gt;
* SME can be configured as a DropBox. To download the Free Cloud File Manager native to your OS (Windows, Mac, Linux, mobile), follow the link below:&lt;br /&gt;
  https://www.storagemadeeasy.com/clients_and_tools/&lt;br /&gt;
Once you have downloaded and installed the Cloud Manager App, fill in the following information:&lt;br /&gt;
  Server location&lt;br /&gt;
  https://sme.scinet.utoronto.ca/api&lt;br /&gt;
* You may learn more about SME capabilities and features here:&lt;br /&gt;
  https://www.storagemadeeasy.com/ownFileserver/&lt;br /&gt;
  https://www.storagemadeeasy.com/pricing/#features  (Enterprise)&lt;br /&gt;
  https://storagemadeeasy.com/faq/&lt;br /&gt;
&lt;br /&gt;
== '''User-provided Content/Suggestions''' ==&lt;br /&gt;
== '''[[HPSS-by-pomes|Packing up large data sets and putting them on HPSS]]''' ==&lt;br /&gt;
(Pomés group recommendations)&lt;br /&gt;
&lt;br /&gt;
[[Data Management|BACK TO Data Management]]&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=SOSCIP_GPU&amp;diff=9375</id>
		<title>SOSCIP GPU</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=SOSCIP_GPU&amp;diff=9375"/>
		<updated>2018-08-09T17:28:05Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;__NOTOC__&lt;br /&gt;
&lt;br /&gt;
{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
{{Infobox Computer&lt;br /&gt;
|image=[[Image:S882lc.png|center|300px|thumb]]&lt;br /&gt;
|name=SOSCIP GPU &lt;br /&gt;
|installed=September 2017&lt;br /&gt;
|operatingsystem= Ubuntu 16.04 le &lt;br /&gt;
|loginnode= sgc01 &lt;br /&gt;
|nnodes= 14x Power 8 with  4x NVIDIA P100&lt;br /&gt;
|rampernode=512 GB&lt;br /&gt;
|corespernode= 2 x 10core (20 physical, 160 SMT)&lt;br /&gt;
|interconnect=Infiniband EDR &lt;br /&gt;
|vendorcompilers=xlc/xlf, nvcc&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
== SOSCIP ==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU Cluster is a Southern Ontario Smart Computing Innovation Platform ([http://soscip.org/ SOSCIP]) resource located at the University of Toronto's SciNet HPC facility. The SOSCIP multi-university/industry consortium is funded by the Ontario Government and the Federal Economic Development Agency for Southern Ontario [http://www.research.utoronto.ca/about/our-research-partners/soscip/].&lt;br /&gt;
&lt;br /&gt;
== Support Email ==&lt;br /&gt;
&lt;br /&gt;
Please use [mailto:soscip-support@scinet.utoronto.ca &amp;lt;soscip-support@scinet.utoronto.ca&amp;gt;] for SOSCIP GPU specific inquiries.&lt;br /&gt;
&lt;br /&gt;
== Specifications==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU Cluster consists of 14 IBM Power 822LC &amp;quot;Minsky&amp;quot; servers, each with 2 x 10-core 3.25GHz POWER8 CPUs and 512 GB of RAM. Like the POWER7, the POWER8 utilizes Simultaneous MultiThreading (SMT), but extends the design to 8 threads per core, allowing the 20 physical cores to support up to 160 threads. Each node has 4 NVIDIA Tesla P100 GPUs, each with 16 GB of RAM and CUDA Capability 6.0 (Pascal), connected using NVLink.&lt;br /&gt;
&lt;br /&gt;
== Access and Login ==&lt;br /&gt;
&lt;br /&gt;
In order to obtain access to the system, you must request access to the SOSCIP GPU Platform. Instructions will have been sent to your sponsoring faculty member via E-mail at the beginning of your SOSCIP project.&lt;br /&gt;
&lt;br /&gt;
Access to the SOSCIP GPU Platform is provided through the BGQ login node, '''&amp;lt;tt&amp;gt; bgqdev.scinet.utoronto.ca &amp;lt;/tt&amp;gt;''' using ssh, and from there you can proceed to the GPU development node '''&amp;lt;tt&amp;gt;sgc01-ib0&amp;lt;/tt&amp;gt;''' via ssh. Your user name and password are the same as for other SciNet systems.&lt;br /&gt;
&lt;br /&gt;
== Filesystem ==&lt;br /&gt;
&lt;br /&gt;
The filesystem is shared with the BGQ system.  See [https://wiki.scinet.utoronto.ca/wiki/index.php/BGQ#Filesystem here ] for details.&lt;br /&gt;
&lt;br /&gt;
== Job Submission ==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU cluster uses [https://slurm.schedmd.com/ SLURM ] as its job scheduler, and jobs are scheduled by node, i.e., 20 cores and 4 GPUs each. Jobs are submitted from the development node '''&amp;lt;tt&amp;gt;sgc01&amp;lt;/tt&amp;gt;'''. The maximum walltime per job is 12 hours (except in the 'long' queue, see below), with up to 8 nodes.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ sbatch myjob.script&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
where myjob.script is:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH --nodes=1 &lt;br /&gt;
#SBATCH --ntasks=20  # MPI tasks (needed for srun) &lt;br /&gt;
#SBATCH --time=00:10:00  # H:M:S&lt;br /&gt;
#SBATCH --gres=gpu:4     # Ask for 4 GPUs per node&lt;br /&gt;
&lt;br /&gt;
cd $SLURM_SUBMIT_DIR&lt;br /&gt;
&lt;br /&gt;
hostname&lt;br /&gt;
nvidia-smi&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
More information about the &amp;lt;tt&amp;gt;sbatch&amp;lt;/tt&amp;gt; command is found [https://slurm.schedmd.com/sbatch.html here].&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
You can query job information using&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
squeue&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To see only your own jobs, run &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
squeue -u &amp;lt;userid&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Once your job is running, SLURM creates a file usually named &amp;lt;tt&amp;gt;slurm-&amp;lt;jobid&amp;gt;.out&amp;lt;/tt&amp;gt; in the directory from which you issued the &amp;lt;tt&amp;gt;sbatch&amp;lt;/tt&amp;gt; command. This contains the console output from your job. You can monitor the output of your job using the &amp;lt;tt&amp;gt;tail -f &amp;lt;file&amp;gt;&amp;lt;/tt&amp;gt; command.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
To cancel a job use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
scancel $JOBID&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Longer jobs ===&lt;br /&gt;
&lt;br /&gt;
If your job takes more than 12 hours, the sbatch command will not let you submit your job. There is, however, a way to run jobs up to 24 hours long, by specifying &amp;quot;-p long&amp;quot; as an option (i.e., add &amp;lt;tt&amp;gt;#SBATCH -p long&amp;lt;/tt&amp;gt; to your job script). The priority of such jobs may be throttled in the future if we see that the 'long' queue is having a negative effect on turnover time in the queue.&lt;br /&gt;
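&lt;br /&gt;
For example, the header of a 24-hour job script would look something like this (a sketch; adjust the resource requests to your job):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH --nodes=1&lt;br /&gt;
#SBATCH --ntasks=20&lt;br /&gt;
#SBATCH --gres=gpu:4&lt;br /&gt;
#SBATCH --time=24:00:00  # up to 24 hours in the 'long' queue&lt;br /&gt;
#SBATCH -p long          # required for jobs longer than 12 hours&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;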
&lt;br /&gt;
=== Interactive ===&lt;br /&gt;
&lt;br /&gt;
For an interactive session use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
salloc --gres=gpu:4&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
After executing this command, you may have to wait in the queue until a system is available.&lt;br /&gt;
&lt;br /&gt;
More information about the &amp;lt;tt&amp;gt;salloc&amp;lt;/tt&amp;gt; command is [https://slurm.schedmd.com/salloc.html here].&lt;br /&gt;
&lt;br /&gt;
=== Automatic Re-submission and Job Dependencies ===&lt;br /&gt;
&lt;br /&gt;
Commonly you may have a job that you know will take longer to run than what is permissible in the queue. As long as your program has checkpoint/restart capability, you can have one job automatically submit the next. In the following example it is assumed that the program finishes before the requested time limit and then resubmits itself by logging into the development node. Job dependencies and a maximum number of job re-submissions are used to ensure sequential operation.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash &lt;br /&gt;
&lt;br /&gt;
#SBATCH --nodes=1 &lt;br /&gt;
#SBATCH --ntasks=20  # MPI tasks (needed for srun) &lt;br /&gt;
#SBATCH --time=00:10:00  # H:M:S&lt;br /&gt;
#SBATCH --gres=gpu:4     # Ask for 4 GPUs per node&lt;br /&gt;
&lt;br /&gt;
cd $SLURM_SUBMIT_DIR&lt;br /&gt;
&lt;br /&gt;
: ${job_number:=&amp;quot;1&amp;quot;}           # set job_number to 1 if it is undefined&lt;br /&gt;
job_number_max=3&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;hi from ${SLURM_JOB_ID}&amp;quot;&lt;br /&gt;
&lt;br /&gt;
#RUN JOB HERE&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
# SUBMIT NEXT JOB&lt;br /&gt;
if [[ ${job_number} -lt ${job_number_max} ]]&lt;br /&gt;
then&lt;br /&gt;
  (( job_number++ ))&lt;br /&gt;
  next_jobid=$(ssh sgc01-ib0 &amp;quot;cd $SLURM_SUBMIT_DIR; /opt/slurm/bin/sbatch --export=job_number=${job_number} -d afterok:${SLURM_JOB_ID} thisscript.sh | awk '{print \$4}'&amp;quot;)  # \$4 is escaped so that awk, not the local shell, expands it&lt;br /&gt;
  echo &amp;quot;submitted ${next_jobid}&amp;quot;&lt;br /&gt;
fi&lt;br /&gt;
 &lt;br /&gt;
sleep 15&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;${SLURM_JOB_ID} done&amp;quot;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
===Packing single-GPU jobs within one SLURM job submission===&lt;br /&gt;
Jobs are scheduled by node (4 GPUs) on the SOSCIP GPU cluster. If your code cannot utilize all 4 GPUs, you can use the GNU Parallel tool to pack 4 or more single-GPU jobs into one SLURM job. Below is an example of submitting 4 single-GPU Python codes within one job. (When using GNU Parallel for a publication, please cite it as per '''''parallel --citation'''''.)&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH --nodes=1 &lt;br /&gt;
#SBATCH --ntasks=20  # MPI tasks (needed for srun) &lt;br /&gt;
#SBATCH --time=00:10:00  # H:M:S&lt;br /&gt;
#SBATCH --gres=gpu:4     # Ask for 4 GPUs per node&lt;br /&gt;
&lt;br /&gt;
module load gnu-parallel/20180422&lt;br /&gt;
cd $SLURM_SUBMIT_DIR&lt;br /&gt;
&lt;br /&gt;
parallel -a jobname-params.input --colsep ' ' -j 4 'CUDA_VISIBLE_DEVICES=$(( {%} - 1 )) numactl -N $(( ({%} -1) / 2 )) python {1} {2} {3} &amp;amp;&amp;gt; jobname-{#}.out'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
The jobname-params.input file contains:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
code-1.py --param1=a --param2=b&lt;br /&gt;
code-2.py --param1=c --param2=d&lt;br /&gt;
code-3.py --param1=e --param2=f&lt;br /&gt;
code-4.py --param1=g --param2=h&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
*In the above example, the GNU Parallel tool reads the '''jobname-params.input''' file and separates the parameters. Each row in the input file has to contain exactly 3 parameters to '''python'''; code-N.py is also counted as a parameter. You can change the number of parameters in the '''parallel''' command ({1} {2} {3}...).&lt;br /&gt;
*The '''&amp;quot;-j 4&amp;quot;''' flag limits the maximum number of simultaneous jobs to 4. You can have more rows in the input file, but GNU Parallel executes at most 4 at the same time.&lt;br /&gt;
*'''&amp;quot;CUDA_VISIBLE_DEVICES=$(( {%} - 1 ))&amp;quot;''' assigns one GPU to each job. '''&amp;quot;numactl -N $(( ({%} -1) / 2 ))&amp;quot;''' binds 2 jobs to CPU socket 0 and the other 2 jobs to socket 1. {%} is the job slot, which translates to 1, 2, 3, or 4 in this case.&lt;br /&gt;
*Outputs will be jobname-1.out, jobname-2.out, jobname-3.out, jobname-4.out, ... {#} is the job number, which translates to the row number in the input file.&lt;br /&gt;
&lt;br /&gt;
== Software Installed ==&lt;br /&gt;
&lt;br /&gt;
=== IBM PowerAI ===&lt;br /&gt;
&lt;br /&gt;
The PowerAI platform contains popular open machine learning frameworks such as '''Caffe, TensorFlow, and Torch'''. Run the &amp;lt;tt&amp;gt;module avail&amp;lt;/tt&amp;gt; command for a complete listing. More information is available at this link: https://developer.ibm.com/linuxonpower/deep-learning-powerai/releases/. Release 4.0 is currently installed.&lt;br /&gt;
&lt;br /&gt;
===GNU Compilers ===&lt;br /&gt;
&lt;br /&gt;
The system default compiler is GCC 5.4.0. More recent versions of the GNU Compiler Collection (C/C++/Fortran) are provided in the IBM Advance Toolchain, with enhancements for the POWER8 CPU. To load a newer Advance Toolchain version, use:&lt;br /&gt;
&lt;br /&gt;
Advance Toolchain V10.0&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/6.4.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Advance Toolchain V11.0&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/7.3.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
More information about the IBM Advance Toolchain can be found here: [https://developer.ibm.com/linuxonpower/advance-toolchain/ https://developer.ibm.com/linuxonpower/advance-toolchain/]&lt;br /&gt;
&lt;br /&gt;
=== IBM XL Compilers ===&lt;br /&gt;
&lt;br /&gt;
To load the native IBM xlc/xlc++ and xlf (Fortran) compilers, run&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load xlc/13.1.5&lt;br /&gt;
module load xlf/15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
IBM XL Compilers are enabled for use with NVIDIA GPUs, including support for OpenMP 4.5 GPU offloading and integration with NVIDIA's nvcc command to compile host-side code for the POWER8 CPU.&lt;br /&gt;
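&lt;br /&gt;
As a sketch of that nvcc integration (the source file name is hypothetical), the XL compiler can be selected as nvcc's host compiler via the -ccbin flag:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load xlc/13.1.5 cuda/9.2&lt;br /&gt;
# device code is compiled by nvcc, host-side code by IBM XL C++&lt;br /&gt;
nvcc -ccbin xlC -arch=sm_60 -O3 -o saxpy saxpy.cu&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;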
&lt;br /&gt;
Information about the IBM XL Compilers can be found at the following links:&lt;br /&gt;
&lt;br /&gt;
[https://www.ibm.com/support/knowledgecenter/SSXVZZ_13.1.5/com.ibm.compilers.linux.doc/welcome.html IBM XL C/C++]&lt;br /&gt;
&lt;br /&gt;
[https://www.ibm.com/support/knowledgecenter/SSAT4T_15.1.5/com.ibm.compilers.linux.doc/welcome.html IBM XL Fortran]&lt;br /&gt;
&lt;br /&gt;
=== NVIDIA GPU Driver ===&lt;br /&gt;
&lt;br /&gt;
The current NVIDIA driver version is 396.26.&lt;br /&gt;
&lt;br /&gt;
=== CUDA ===&lt;br /&gt;
&lt;br /&gt;
The currently installed CUDA Toolkits are versions 8.0, 9.0, 9.1, and 9.2.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load cuda/8.0&lt;br /&gt;
or &lt;br /&gt;
module load cuda/9.0&lt;br /&gt;
or &lt;br /&gt;
module load cuda/9.1&lt;br /&gt;
or&lt;br /&gt;
module load cuda/9.2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The CUDA driver is installed locally; the CUDA Toolkits are installed in:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
/usr/local/cuda-8.0&lt;br /&gt;
/usr/local/cuda-9.0&lt;br /&gt;
/usr/local/cuda-9.1&lt;br /&gt;
/usr/local/cuda-9.2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Note that the &amp;lt;tt&amp;gt;/usr/local/cuda&amp;lt;/tt&amp;gt; directory is linked to the &amp;lt;tt&amp;gt;/usr/local/cuda-9.2&amp;lt;/tt&amp;gt; directory.&lt;br /&gt;
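&lt;br /&gt;
A quick way to confirm which toolkit is active after loading a module (the output is indicative only):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load cuda/9.2&lt;br /&gt;
$ nvcc --version   # should report release 9.2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;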
&lt;br /&gt;
Documentation and API reference information for the CUDA Toolkit can be found here: [http://docs.nvidia.com/cuda/index.html http://docs.nvidia.com/cuda/index.html]&lt;br /&gt;
&lt;br /&gt;
=== OpenMPI ===&lt;br /&gt;
&lt;br /&gt;
Currently, OpenMPI is set up on the 14 nodes connected over EDR InfiniBand.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load openmpi/2.1.1-gcc-5.4.0&lt;br /&gt;
$ module load openmpi/2.1.1-XL-13_15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
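&lt;br /&gt;
A minimal sketch of compiling and launching an MPI program with one of these modules (the source file name and task count are placeholders):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load openmpi/2.1.1-gcc-5.4.0&lt;br /&gt;
$ mpicc -O2 -o hello hello.c&lt;br /&gt;
$ srun --ntasks=20 ./hello   # from within a job allocation&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;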
&lt;br /&gt;
== Other Software ==&lt;br /&gt;
&lt;br /&gt;
Other software packages can be installed onto the SOSCIP GPU Platform. It is best to try installing new software in your own home directory, which will give you control of the software (e.g. exact version, configuration, installing sub-packages, etc.).&lt;br /&gt;
&lt;br /&gt;
In the following subsections are instructions for installing several common software packages.&lt;br /&gt;
&lt;br /&gt;
=== Anaconda (Python) ===&lt;br /&gt;
&lt;br /&gt;
Anaconda is a popular distribution of the Python programming language. It contains several common Python libraries such as SciPy and NumPy as pre-built packages, which eases installation.&lt;br /&gt;
&lt;br /&gt;
Anaconda can be downloaded from here: [https://www.anaconda.com/download/#linux https://www.anaconda.com/download/#linux]&lt;br /&gt;
&lt;br /&gt;
NOTE: Be sure to download the '''Power8''' installer.&lt;br /&gt;
&lt;br /&gt;
TIP: If you plan to use TensorFlow within Anaconda, download the Python 2.7 version of Anaconda.&lt;br /&gt;
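&lt;br /&gt;
Installation from the command line then looks roughly as follows (the installer file name is illustrative; check the download page for the current ppc64le installer):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# run the downloaded Power8 (ppc64le) installer in your home directory&lt;br /&gt;
bash Anaconda2-5.2.0-Linux-ppc64le.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;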
&lt;br /&gt;
=== cuDNN ===&lt;br /&gt;
The NVIDIA CUDA Deep Neural Network library (cuDNN) is a GPU-accelerated library of primitives for deep neural networks. cuDNN accelerates widely used deep learning frameworks, including Caffe2, MATLAB, Microsoft Cognitive Toolkit, TensorFlow, Theano, and PyTorch. If a specific version of cuDNN is needed, you can download it from https://developer.nvidia.com/cudnn and choose '''&amp;quot;cuDNN [VERSION] Library for Linux (Power8/Power9)&amp;quot;'''.&lt;br /&gt;
&lt;br /&gt;
The default cuDNN installed on the system is version 6 with CUDA-8 from IBM PowerAI. More recent cuDNN versions are installed as modules:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
cudnn/cuda9.0/7.0.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Keras ===&lt;br /&gt;
&lt;br /&gt;
Keras ([https://keras.io/ https://keras.io/]) is a popular high-level deep learning software development framework. It runs on top of other deep-learning frameworks such as TensorFlow.&lt;br /&gt;
&lt;br /&gt;
*The easiest way to install Keras is to install Anaconda first, then install Keras using the pip command. Keras uses TensorFlow underneath to run neural network models. Before running code that uses Keras, be sure to load the PowerAI TensorFlow module and the cuda module.&lt;br /&gt;
&lt;br /&gt;
*Keras can also be installed into a Python virtual environment using '''pip'''. You can install the optimized SciPy (built with OpenBLAS) before installing Keras.&lt;br /&gt;
In a virtual environment (python2.7 as example):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pip install /scinet/sgc/Libraries/scipy/scipy-1.1.0-cp27-cp27mu-linux_ppc64le.whl&lt;br /&gt;
pip install keras&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== NumPy/SciPy (built with OpenBLAS) ===&lt;br /&gt;
&lt;br /&gt;
Optimized NumPy and SciPy builds are provided as Python wheels located in '''/scinet/sgc/Libraries/numpy''' and '''/scinet/sgc/Libraries/scipy''' and can be installed with '''pip'''. Please uninstall the old numpy/scipy before installing the new ones.&lt;br /&gt;
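&lt;br /&gt;
For example, for Python 2.7 (use the corresponding cp35 wheels for Python 3.5):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pip uninstall -y numpy scipy&lt;br /&gt;
pip install /scinet/sgc/Libraries/numpy/numpy-1.14.3-cp27-cp27mu-linux_ppc64le.whl&lt;br /&gt;
pip install /scinet/sgc/Libraries/scipy/scipy-1.1.0-cp27-cp27mu-linux_ppc64le.whl&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;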
&lt;br /&gt;
=== PyTorch ===&lt;br /&gt;
&lt;br /&gt;
PyTorch is the Python implementation of the Torch framework for deep learning. &lt;br /&gt;
&lt;br /&gt;
It is suggested that you use PyTorch within Anaconda.&lt;br /&gt;
&lt;br /&gt;
There is currently no build of PyTorch for POWER8-based systems. You will need to compile it from source.&lt;br /&gt;
&lt;br /&gt;
Obtain the source code from here: [http://pytorch.org/ http://pytorch.org/]&lt;br /&gt;
&lt;br /&gt;
Before building PyTorch, make sure to load cuda by running &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load cuda/8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
NOTE: Do not have the gcc modules loaded when building PyTorch. Use the default version of gcc (currently v5.4.0) included with the operating system; the build will fail with later versions of gcc.&lt;br /&gt;
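&lt;br /&gt;
A rough outline of such a source build (the commands are a sketch; consult the PyTorch build instructions for the version you are building):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module purge          # ensure no gcc module is loaded&lt;br /&gt;
module load cuda/8.0&lt;br /&gt;
git clone --recursive https://github.com/pytorch/pytorch&lt;br /&gt;
cd pytorch&lt;br /&gt;
python setup.py install&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;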
&lt;br /&gt;
=== TensorFlow (new versions and python3) ===&lt;br /&gt;
&lt;br /&gt;
The TensorFlow version included in PowerAI may not be the most recent. Newer versions of TensorFlow are provided as prebuilt Python wheels that users can install with '''pip''' in user space. The custom Python wheels are stored in '''/scinet/sgc/Applications/TensorFlow_wheels'''. It is highly recommended to install custom TensorFlow wheels into a Python virtual environment.&lt;br /&gt;
&lt;br /&gt;
====Installing with Python2.7:====&lt;br /&gt;
&amp;lt;div class=&amp;quot;toccolours mw-collapsible mw-collapsed&amp;quot; style=&amp;quot;overflow:auto;&amp;quot;&amp;gt;&lt;br /&gt;
* Create a virtual environment '''tensorflow-1.8-py2''' that includes the system-installed packages:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
virtualenv --python=python2.7 --system-site-packages tensorflow-1.8-py2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
* Activate virtual environment:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
source tensorflow-1.8-py2/bin/activate&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
* Install TensorFlow into the virtual environment (a custom NumPy built with the OpenBLAS library can optionally be installed first):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pip install --upgrade --force-reinstall /scinet/sgc/Libraries/numpy/numpy-1.14.3-cp27-cp27mu-linux_ppc64le.whl&lt;br /&gt;
pip install /scinet/sgc/Applications/TensorFlow_wheels/tensorflow-1.8.0-cp27-cp27mu-linux_ppc64le.whl&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Installing with Python3.5:====&lt;br /&gt;
&amp;lt;div class=&amp;quot;toccolours mw-collapsible mw-collapsed&amp;quot; style=&amp;quot;overflow:auto;&amp;quot;&amp;gt;&lt;br /&gt;
* Create a virtual environment '''tensorflow-1.8-py3''' that includes the system-installed packages:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
virtualenv --python=python3.5 --system-site-packages tensorflow-1.8-py3&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
* Activate virtual environment:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
source tensorflow-1.8-py3/bin/activate&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
* Install TensorFlow into the virtual environment (a custom NumPy built with the OpenBLAS library can optionally be installed first):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pip3 install --upgrade --force-reinstall /scinet/sgc/Libraries/numpy/numpy-1.14.3-cp35-cp35m-linux_ppc64le.whl&lt;br /&gt;
pip3 install /scinet/sgc/Applications/TensorFlow_wheels/tensorflow-1.8.0-cp35-cp35m-linux_ppc64le.whl&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Submitting jobs====&lt;br /&gt;
&amp;lt;div class=&amp;quot;toccolours mw-collapsible mw-collapsed&amp;quot; style=&amp;quot;overflow:auto;&amp;quot;&amp;gt;&lt;br /&gt;
The myjob.script file above needs to be modified to run the custom TensorFlow: the '''cuda/9.0''' and '''cudnn/cuda9.0/7.0.5''' modules need to be loaded, and the virtual environment needs to be activated.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH --nodes=1 &lt;br /&gt;
#SBATCH --ntasks=20  # MPI tasks (needed for srun) &lt;br /&gt;
#SBATCH --time=00:10:00  # H:M:S&lt;br /&gt;
#SBATCH --gres=gpu:4     # Ask for 4 GPUs per node&lt;br /&gt;
&lt;br /&gt;
module purge&lt;br /&gt;
module load cuda/9.0 cudnn/cuda9.0/7.0.5&lt;br /&gt;
source tensorflow-1.8-py2/bin/activate #change this to the location where virtual environment is created&lt;br /&gt;
&lt;br /&gt;
cd $SLURM_SUBMIT_DIR&lt;br /&gt;
python code.py&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== LINKS ==&lt;br /&gt;
&lt;br /&gt;
[https://www.olcf.ornl.gov/kb_articles/summitdev-quickstart/#System_Overview  Summit Dev System at ORNL]&lt;br /&gt;
&lt;br /&gt;
== DOCUMENTATION ==&lt;br /&gt;
&lt;br /&gt;
# GPU Cluster Introduction: [[Media:GPU_Training_01.pdf‎|SOSCIP GPU Platform]]&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=BGQ&amp;diff=9373</id>
		<title>BGQ</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=BGQ&amp;diff=9373"/>
		<updated>2018-08-09T17:27:41Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
{{Infobox Computer&lt;br /&gt;
|image=[[Image:Blue_Gene_Cabinet.jpeg|center|300px|thumb]]&lt;br /&gt;
|name=Blue Gene/Q (BGQ)&lt;br /&gt;
|installed=Aug 2012, Nov 2014&lt;br /&gt;
|operatingsystem= RH6.3, CNK (Linux) &lt;br /&gt;
|loginnode= bgqdev-fen1&lt;br /&gt;
|nnodes=  4096 nodes (65,536 cores)&lt;br /&gt;
|rampernode=16 GB &lt;br /&gt;
|corespernode=16 (64 threads)&lt;br /&gt;
|interconnect=5D Torus (jobs), QDR Infiniband (I/O) &lt;br /&gt;
|vendorcompilers= bgxlc, bgxlf&lt;br /&gt;
|queuetype=Loadleveler&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
==System Status==&lt;br /&gt;
&lt;br /&gt;
The current BGQ system status can be found on the wiki's [[Main Page]].&lt;br /&gt;
&lt;br /&gt;
==SOSCIP &amp;amp; LKSAVI==&lt;br /&gt;
&lt;br /&gt;
The BGQ is a Southern Ontario Smart Computing&lt;br /&gt;
Innovation Platform ([http://soscip.org/ SOSCIP]) BlueGene/Q supercomputer located at the&lt;br /&gt;
University of Toronto's SciNet HPC facility. The SOSCIP &lt;br /&gt;
multi-university/industry consortium is funded by the Ontario Government &lt;br /&gt;
and the Federal Economic Development Agency for Southern Ontario [http://www.research.utoronto.ca/about/our-research-partners/soscip/].&lt;br /&gt;
&lt;br /&gt;
A half-rack of BlueGene/Q (8,192 cores) was purchased by the [http://likashingvirology.med.ualberta.ca/ Li Ka Shing Institute of Virology] at the University of Alberta in late fall 2014 and integrated into the existing BGQ system.&lt;br /&gt;
&lt;br /&gt;
The combined 4-rack system is the fastest Canadian supercomputer on the [http://top500.org/ top 500], currently in 120th place (Nov 2015).&lt;br /&gt;
&lt;br /&gt;
== Support Email ==&lt;br /&gt;
&lt;br /&gt;
Please use [mailto:bgq-support@scinet.utoronto.ca &amp;lt;bgq-support@scinet.utoronto.ca&amp;gt;] for BGQ-specific inquiries.&lt;br /&gt;
&lt;br /&gt;
==Specifications==&lt;br /&gt;
&lt;br /&gt;
BGQ is an extremely dense and energy-efficient 3rd-generation Blue Gene IBM supercomputer built around a system-on-a-chip compute node that has a 16-core 1.6GHz PowerPC-based CPU (PowerPC A2) with 16GB of RAM. The nodes are bundled in groups of 32 into a node board (512 cores), and 16 boards make up a midplane (8192 cores), with 2 midplanes per rack, or 16,384 cores and 16 TB of RAM per rack. The compute nodes run a very lightweight Linux-based operating system called CNK ('''C'''ompute '''N'''ode '''K'''ernel). The compute nodes are all connected together using a custom 5D torus high-speed interconnect. Each rack has 16 I/O nodes that run a full RedHat Linux OS that manages the compute nodes and mounts the filesystem. SciNet's BGQ consists of 8 midplanes (four racks) totalling 65,536 cores and 64TB of RAM.&lt;br /&gt;
&lt;br /&gt;
[[Image:BlueGeneQHardware2.png‎ |center]]&lt;br /&gt;
&lt;br /&gt;
=== 5D Torus Network ===&lt;br /&gt;
&lt;br /&gt;
The network topology of BlueGene/Q is a five-dimensional (5D) torus, with direct links between the nearest neighbors in the ±A, ±B, ±C, ±D, and ±E directions.  As such there are only a few optimum block sizes that will use the network efficiently.&lt;br /&gt;
&lt;br /&gt;
{|border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;2&amp;quot;&lt;br /&gt;
| '''Node Boards '''&lt;br /&gt;
| '''Compute Nodes'''&lt;br /&gt;
| '''Cores'''&lt;br /&gt;
| '''Torus Dimensions'''&lt;br /&gt;
|-&lt;br /&gt;
| 1&lt;br /&gt;
| 32&lt;br /&gt;
| 512&lt;br /&gt;
| 2x2x2x2x2&lt;br /&gt;
|-&lt;br /&gt;
| 2 (adjacent pairs)&lt;br /&gt;
| 64&lt;br /&gt;
| 1024&lt;br /&gt;
| 2x2x4x2x2&lt;br /&gt;
|-&lt;br /&gt;
| 4 (quadrants)&lt;br /&gt;
| 128&lt;br /&gt;
| 2048&lt;br /&gt;
| 2x2x4x4x2&lt;br /&gt;
|-&lt;br /&gt;
| 8 (halves)&lt;br /&gt;
| 256&lt;br /&gt;
| 4096&lt;br /&gt;
| 4x2x4x4x2&lt;br /&gt;
|-&lt;br /&gt;
| 16 (midplane)&lt;br /&gt;
| 512&lt;br /&gt;
| 8192&lt;br /&gt;
| 4x4x4x4x2&lt;br /&gt;
|-&lt;br /&gt;
| 32 (1 rack)&lt;br /&gt;
| 1024&lt;br /&gt;
| 16384&lt;br /&gt;
| 4x4x4x8x2 &lt;br /&gt;
|-&lt;br /&gt;
| 64 (2 racks)&lt;br /&gt;
| 2048&lt;br /&gt;
| 32768&lt;br /&gt;
| 4x4x8x8x2&lt;br /&gt;
|-&lt;br /&gt;
| 96 (3 racks)&lt;br /&gt;
| 3072&lt;br /&gt;
| 49152&lt;br /&gt;
| 4x4x12x8x2&lt;br /&gt;
|-&lt;br /&gt;
| 128 (4 racks)&lt;br /&gt;
| 4096&lt;br /&gt;
| 65536&lt;br /&gt;
| 8x4x8x8x2&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
== Login/Devel Node ==&lt;br /&gt;
&lt;br /&gt;
The development node is '''bgqdev-fen1''', which one can log in to from the regular '''login.scinet.utoronto.ca''' login nodes or directly from outside using '''bgqdev.scinet.utoronto.ca''', e.g.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ ssh -l USERNAME bgqdev.scinet.utoronto.ca -X&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
where USERNAME is your username on the BGQ and the &amp;lt;tt&amp;gt;-X&amp;lt;/tt&amp;gt; flag is optional, needed only if you will use X graphics.&amp;lt;br/&amp;gt;&lt;br /&gt;
Note: To learn how to setup ssh keys for logging in please see [[Ssh keys]].&lt;br /&gt;
&lt;br /&gt;
This development node is a Power7 machine running Linux, which serves as the compilation and submission host for the BGQ. Programs are cross-compiled for the BGQ on this node and then submitted to the queue using LoadLeveler.&lt;br /&gt;
&lt;br /&gt;
===Modules and Environment Variables===&lt;br /&gt;
&lt;br /&gt;
To use most packages on the SciNet machines - including most of the compilers - you will have to use the `modules' command. The command &amp;lt;tt&amp;gt;module load some-package&amp;lt;/tt&amp;gt; will set your environment variables (&amp;lt;tt&amp;gt;PATH&amp;lt;/tt&amp;gt;, &amp;lt;tt&amp;gt;LD_LIBRARY_PATH&amp;lt;/tt&amp;gt;, etc.) to include the default version of that package. &amp;lt;tt&amp;gt;module load some-package/specific-version&amp;lt;/tt&amp;gt; will load a specific version of that package. This makes it very easy for different users to use different versions of compilers, MPI versions, libraries, etc.&lt;br /&gt;
&lt;br /&gt;
A list of the installed software can be seen on the system by typing &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module avail&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To load a module (for example, the default version of the IBM C/C++ compilers)&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load vacpp&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
To unload a module&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module unload vacpp&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
To unload all modules&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module purge&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
These commands can go in your .bashrc file to make sure you are using the correct packages.&lt;br /&gt;
&lt;br /&gt;
Modules that load libraries define environment variables pointing to the location of the library files and include files, for use in Makefiles. These environment variables follow the naming convention&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
 $SCINET_[short-module-name]_BASE&lt;br /&gt;
 $SCINET_[short-module-name]_LIB&lt;br /&gt;
 $SCINET_[short-module-name]_INC&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
for the base location of the module's files, the location of the library binaries, and the location of the header files, respectively.&lt;br /&gt;
&lt;br /&gt;
So to compile and link against such a library, you will have to add &amp;lt;tt&amp;gt;-I${SCINET_[module-basename]_INC}&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;-L${SCINET_[module-basename]_LIB}&amp;lt;/tt&amp;gt;, respectively, in addition to the usual &amp;lt;tt&amp;gt;-l[libname]&amp;lt;/tt&amp;gt;.&lt;br /&gt;
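&lt;br /&gt;
As an illustrative sketch, using a hypothetical fftw module as the library in question:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load fftw&lt;br /&gt;
bgxlc -I${SCINET_FFTW_INC} -L${SCINET_FFTW_LIB} -lfftw3 code.c -o code&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;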
&lt;br /&gt;
Note that a &amp;lt;tt&amp;gt;module load&amp;lt;/tt&amp;gt; command ''only'' sets the environment variables in your current shell (and any subprocesses that the shell launches).   It does ''not'' affect other shell environments.&lt;br /&gt;
&lt;br /&gt;
If you always require the same modules, it is easiest to load those modules in your &amp;lt;tt&amp;gt;.bashrc&amp;lt;/tt&amp;gt; and then they will always be present in your environment; if you routinely have to flip back and forth between modules, it is easiest to have almost no modules loaded in your &amp;lt;tt&amp;gt;.bashrc&amp;lt;/tt&amp;gt; and simply load them as you need them (and have the required &amp;lt;tt&amp;gt;module load&amp;lt;/tt&amp;gt; commands in your job submission scripts).&lt;br /&gt;
&lt;br /&gt;
=== Compilers ===&lt;br /&gt;
&lt;br /&gt;
The BGQ uses IBM XL compilers to cross-compile code for the BGQ.  Compilers are available for FORTRAN, C, and C++.  They are accessible by default, or by loading the '''xlf''' and '''vacpp''' modules. By default the compilers produce&lt;br /&gt;
static binaries, but on the BGQ it is now possible to use dynamic libraries as well.  The compilers follow the XL conventions with the prefix '''bg''',&lt;br /&gt;
so '''bgxlc''' and '''bgxlf90''' are the C and FORTRAN compilers, respectively.  &lt;br /&gt;
&lt;br /&gt;
Most users, however, will use the MPI variants, i.e. '''mpixlf90''' and '''mpixlc''', which are available by loading&lt;br /&gt;
the '''mpich2''' module. &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load mpich2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
It is recommended to use at least the following flags when compiling and linking&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
-O3 -qarch=qp -qtune=qp&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
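For example, a typical compile-and-link line for an MPI Fortran code with these flags might look like (a sketch; file names are placeholders):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ mpixlf90 -O3 -qarch=qp -qtune=qp -o mycode.exe mycode.f90&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;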
&lt;br /&gt;
If you want to build a package for which the configure script tries to run small test jobs, the cross-compiling nature of the BGQ can get in the way.  In that case, you should use the interactive [[BGQ#Interactive_Use_.2F_Debugging | &amp;lt;tt&amp;gt;'''debugjob'''&amp;lt;/tt&amp;gt;]] environment as described below.&lt;br /&gt;
&lt;br /&gt;
== ION/Devel Nodes ==&lt;br /&gt;
&lt;br /&gt;
There are also BGQ-native development nodes named '''bgqdev-ion[01-24]''' which one can log in to directly via ssh from '''bgqdev-fen1'''.  These nodes are extra I/O nodes that are essentially the same as the BGQ compute nodes, with the exception that they run a full RedHat Linux and have an infiniband interface providing direct network access.    Unlike the regular development node, '''bgqdev-fen1''', which is Power7, these nodes have the same BGQ A2 processor, so cross-compilation is not required, which can make building some software easier.    &lt;br /&gt;
&lt;br /&gt;
'''NOTE''': BGQ MPI jobs can be compiled on these nodes, but cannot be run locally, as mpich2 is set up for the BGQ network and will fail on these nodes.&lt;br /&gt;
&lt;br /&gt;
== Job Submission ==&lt;br /&gt;
&lt;br /&gt;
As the BlueGene/Q architecture is different from that of the development nodes, you cannot run applications intended/compiled for the BGQ on the devel nodes. The only way to run (or even test) your program is to submit a job to the BGQ.  Jobs are submitted as scripts through loadleveler. That script must then use '''runjob''' to start the job; runjob is in many ways similar to mpirun or mpiexec.  As shown above in the network topology overview, there are only a few optimal job-size configurations, further constrained by each block requiring a minimum of one I/O node.  In SciNet's configuration (with 8 I/O nodes per midplane), the smallest block size is 64 nodes (1024 cores). Normally the block size matches the job size, to offer fully dedicated resources to the job.  Smaller jobs can be run within the same block, but this results in shared resources (network and I/O); such jobs are referred to as sub-block jobs and are described in more detail below.  &lt;br /&gt;
&lt;br /&gt;
=== runjob ===&lt;br /&gt;
&lt;br /&gt;
All BGQ runs are launched using '''runjob''', which, for those familiar with MPI, is analogous to mpirun/mpiexec.  Jobs run on a block, which is a predefined group of nodes that have already been configured and booted.  There are two ways to get a block. One way is to use a 30-minute 'debugjob' session (more about that below). The other, more common, way is a job script submitted and run using loadleveler. Inside the job script, the block is set for you, and you do not have to specify the block name.  For example, if your loadleveler job script requests 64 nodes, each with 16 cores (for a total of 1024 cores), then from within that job script you can run a job with 16 processes per node and 1024 total processes with&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
runjob --np 1024 --ranks-per-node=16 --cwd=$PWD : $PWD/code -f file.in&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
Here, &amp;lt;tt&amp;gt;--np 1024&amp;lt;/tt&amp;gt; sets the total number of mpi tasks, while &amp;lt;tt&amp;gt;--ranks-per-node=16&amp;lt;/tt&amp;gt; specifies that 16 processes should run on each node.&lt;br /&gt;
For pure mpi jobs, it is advisable always to give the number of ranks per node, because the default value of 1 may leave 15 cores on the node idle. The argument to ranks-per-node may be 1, 2, 4, 8, 16, 32, or 64. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- (Note: If this were not a loadleveler job, and the block ID was R00-M0-N03-64, the command would be &amp;quot;&amp;lt;tt&amp;gt;runjob --block R00-M0-N03-64 --np 1024 --ranks-per-node=16 --cwd=$PWD : $PWD/code -f file.in&amp;lt;/tt&amp;gt;&amp;quot;) --&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The available runjob flags are shown with&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
runjob -h&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
A particularly useful one is&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
--verbose #&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
where # is a verbosity level from 1 to 7, which can be helpful in debugging an application.&lt;br /&gt;
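&lt;br /&gt;
For example (a sketch, reusing the earlier runjob invocation):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
runjob --verbose 2 --np 1024 --ranks-per-node=16 --cwd=$PWD : $PWD/code -f file.in&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;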
&lt;br /&gt;
=== How to set ranks-per-node ===&lt;br /&gt;
&lt;br /&gt;
There are 16 cores per node, but the argument to ranks-per-node may be 1, 2, 4, 8, 16, 32, or 64.  While it may seem natural to set ranks-per-node to 16, this is not generally recommended.  On the BGQ, one can efficiently run more than 1 process per core, because each core has four &amp;quot;hardware threads&amp;quot; (similar to HyperThreading on the GPC and Simultaneous Multi Threading on the TCS and P7), which can keep the different parts of each core busy at the same time. One would therefore ideally use 64 ranks per node.  There are two main reasons why one might not set ranks-per-node to 64:&lt;br /&gt;
# The memory requirements do not allow 64 ranks (each rank then only has 256MB of memory).&lt;br /&gt;
# The application is more efficient in a hybrid MPI/OpenMP mode (or MPI/pthreads). With fewer ranks per node, the hardware threads are used as OpenMP threads within each process.&lt;br /&gt;
Because threads can share memory, the memory requirements of hybrid runs are typically smaller than those of pure MPI runs.&lt;br /&gt;
&lt;br /&gt;
Note that the total number of mpi processes in a runjob (i.e., the --np argument) should be the ranks-per-node times the number of nodes (set by bg_size in the loadleveler script). So for the same number of nodes, if you change ranks-per-node by a factor of two, you should also multiply the total number of mpi processes by two.&lt;br /&gt;
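&lt;br /&gt;
As a concrete sketch (the executable name is a placeholder): on 64 nodes, 32 ranks per node with 2 OpenMP threads per rank fills all 64 hardware threads of each node, and requires --np 64x32=2048:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
runjob --np 2048 --ranks-per-node=32 --envs OMP_NUM_THREADS=2 --cwd=$SCRATCH/ : $HOME/mycode.exe&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;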
&lt;br /&gt;
=== Queue Limits ===&lt;br /&gt;
&lt;br /&gt;
The maximum wall_clock_limit is 24 hours.  Official SOSCIP project jobs are prioritized over all other jobs using a fairshare algorithm with a 14 day rolling window.&lt;br /&gt;
&lt;br /&gt;
A 64-node block is reserved for development and interactive testing for 16 hours, from 8AM to midnight, every day including weekends. While you can still reserve an interactive block from midnight to 8AM, priority is given to batch jobs during that interval in order to keep the machine usage as high as possible. This block is accessed by using the [[BGQ#Interactive_Use_.2F_Debugging | &amp;lt;tt&amp;gt;'''debugjob'''&amp;lt;/tt&amp;gt;]] command, which has a 30 minute maximum wall_clock_limit. The purpose of this reservation is to ensure short testing jobs run quickly without being held up by longer production-type jobs.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- We need to recover this functionality again. At the moment it doesn't work&lt;br /&gt;
=== BACKFILL scheduling ===&lt;br /&gt;
To optimize the cluster usage, we encourage users to submit jobs according to the available resources on BGQ. The command &amp;lt;span style=&amp;quot;color: red;font-weight: bold;&amp;quot;&amp;gt;llAvailableResources&amp;lt;/span&amp;gt; gives for example :&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
On the Devel system : only a debugjob can start immediately&lt;br /&gt;
&lt;br /&gt;
On the Prod. system : a job will start immediately if you use 512 nodes requesting a walltime T &amp;lt;= 21 hours and 11 min &lt;br /&gt;
On the Prod. system : a job will start immediately if you use 256 nodes requesting a walltime T &amp;lt;= 21 hours and 11 min &lt;br /&gt;
On the Prod. system : a job will start immediately if you use 128 nodes requesting a walltime T &amp;lt;= 24 hours and 0 min &lt;br /&gt;
On the Prod. system : a job will start immediately if you use 64 nodes requesting a walltime T &amp;lt;= 24 hours and 0 min&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
--&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Batch Jobs ===&lt;br /&gt;
&lt;br /&gt;
Job submission is done through loadleveler with a few BlueGene-specific keywords.  The keyword &amp;quot;bg_size&amp;quot; is in number of nodes, not cores, so bg_size=64 corresponds to 64x16=1024 cores.&lt;br /&gt;
&lt;br /&gt;
The parameter &amp;lt;span style=&amp;quot;font-weight: bold;&amp;quot;&amp;gt;bg_size&amp;lt;/span&amp;gt; can only be equal to 64, 128, 256, 512, 1024, or 2048.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;span style=&amp;quot;font-weight: bold;&amp;quot;&amp;gt;np&amp;lt;/span&amp;gt; &amp;amp;le; ranks-per-node * bg_size&lt;br /&gt;
&lt;br /&gt;
ranks-per-node &amp;amp;le; np&lt;br /&gt;
&lt;br /&gt;
(ranks-per-node * OMP_NUM_THREADS ) &amp;amp;le; 64 &lt;br /&gt;
&lt;br /&gt;
np : number of MPI processes&lt;br /&gt;
&lt;br /&gt;
ranks-per-node : number of MPI processes per node = 1 , 2 , 4 , 8 , 16 , 32 , 64&lt;br /&gt;
&lt;br /&gt;
OMP_NUM_THREADS : number of OpenMP threads per MPI process (for hybrid codes) = 1 , 2 , 4 , 8 , 16 , 32 , 64&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/sh&lt;br /&gt;
# @ job_name           = bgsample&lt;br /&gt;
# @ job_type           = bluegene&lt;br /&gt;
# @ comment            = &amp;quot;BGQ Job By Size&amp;quot;&lt;br /&gt;
# @ error              = $(job_name).$(Host).$(jobid).err&lt;br /&gt;
# @ output             = $(job_name).$(Host).$(jobid).out&lt;br /&gt;
# @ bg_size            = 64&lt;br /&gt;
# @ wall_clock_limit   = 30:00&lt;br /&gt;
# @ bg_connectivity    = Torus&lt;br /&gt;
# @ queue &lt;br /&gt;
&lt;br /&gt;
# Launch all BGQ jobs using runjob&lt;br /&gt;
runjob --np 1024 --ranks-per-node=16 --envs OMP_NUM_THREADS=1 --cwd=$SCRATCH/ : $HOME/mycode.exe myflags&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To submit to the queue use &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
llsubmit myscript.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
=== Steps (Job Dependency) ===&lt;br /&gt;
LoadLeveler has many advanced features to control job submission and execution. One of these features is called steps, which allows a series of jobs to be submitted using one script, with dependencies defined between the jobs: each job, called a step, waits for the previous step to finish before it starts. The following example uses the same LoadLeveler script as previously shown, but adds the #@ step_name and #@ dependency directives to rerun the same case three times in a row, waiting until each job is finished before starting the next.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/sh&lt;br /&gt;
# @ job_name           = bgsample&lt;br /&gt;
# @ job_type           = bluegene&lt;br /&gt;
# @ comment            = &amp;quot;BGQ Job By Size&amp;quot;&lt;br /&gt;
# @ error              = $(job_name).$(Host).$(jobid).err&lt;br /&gt;
# @ output             = $(job_name).$(Host).$(jobid).out&lt;br /&gt;
# @ bg_size            = 64&lt;br /&gt;
# @ wall_clock_limit   = 30:00&lt;br /&gt;
# @ bg_connectivity    = Torus&lt;br /&gt;
# @ step_name = step1&lt;br /&gt;
# @ queue&lt;br /&gt;
# Launch the first step:&lt;br /&gt;
if [ $LOADL_STEP_NAME = &amp;quot;step1&amp;quot; ]; then&lt;br /&gt;
    runjob --np 1024 --ranks-per-node=16 --envs OMP_NUM_THREADS=1 --cwd=$SCRATCH/ : $HOME/mycode.exe myflags&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
# @ job_name           = bgsample&lt;br /&gt;
# @ job_type           = bluegene&lt;br /&gt;
# @ comment            = &amp;quot;BGQ Job By Size&amp;quot;&lt;br /&gt;
# @ error              = $(job_name).$(Host).$(jobid).err&lt;br /&gt;
# @ output             = $(job_name).$(Host).$(jobid).out&lt;br /&gt;
# @ bg_size            = 64&lt;br /&gt;
# @ wall_clock_limit   = 30:00&lt;br /&gt;
# @ bg_connectivity    = Torus&lt;br /&gt;
# @ step_name = step2&lt;br /&gt;
# @ dependency = step1 == 0&lt;br /&gt;
# @ queue&lt;br /&gt;
# Launch the second step if the first one has returned 0 (done successfully):&lt;br /&gt;
if [ $LOADL_STEP_NAME = &amp;quot;step2&amp;quot; ]; then&lt;br /&gt;
    runjob --np 1024 --ranks-per-node=16 --envs OMP_NUM_THREADS=1 --cwd=$SCRATCH/ : $HOME/mycode.exe myflags&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
# @ job_name           = bgsample&lt;br /&gt;
# @ job_type           = bluegene&lt;br /&gt;
# @ comment            = &amp;quot;BGQ Job By Size&amp;quot;&lt;br /&gt;
# @ error              = $(job_name).$(Host).$(jobid).err&lt;br /&gt;
# @ output             = $(job_name).$(Host).$(jobid).out&lt;br /&gt;
# @ bg_size            = 64&lt;br /&gt;
# @ wall_clock_limit   = 30:00&lt;br /&gt;
# @ bg_connectivity    = Torus&lt;br /&gt;
# @ step_name = step3&lt;br /&gt;
# @ dependency = step2 == 0&lt;br /&gt;
# @ queue&lt;br /&gt;
# Launch the third step if the second one has returned 0 (done successfully):&lt;br /&gt;
if [ $LOADL_STEP_NAME = &amp;quot;step3&amp;quot; ]; then&lt;br /&gt;
    runjob --np 1024 --ranks-per-node=16 --envs OMP_NUM_THREADS=1 --cwd=$SCRATCH/ : $HOME/mycode.exe myflags&lt;br /&gt;
fi&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Monitoring Jobs ===&lt;br /&gt;
&lt;br /&gt;
To see running jobs&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
llq2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
or&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
llq -b&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
to cancel a job use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
llcancel JOBID&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
and to look at details of the BlueGene resources use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
llbgstatus -M all&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''Note: the loadleveler script commands are not run on a BGQ compute node but on the front-end node. Only programs started with runjob run on the BGQ compute nodes. You should therefore keep scripting in the submission script to a bare minimum.'''&lt;br /&gt;
&lt;br /&gt;
=== Monitoring Stats ===&lt;br /&gt;
&lt;br /&gt;
Use llbgstats to monitor your own stats and/or your group's stats. PIs can also print their (current) monthly report.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
llbgstats -h&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Interactive Use / Debugging ===&lt;br /&gt;
&lt;br /&gt;
As BGQ codes are cross-compiled, they cannot be run directly on the front-end nodes.  &lt;br /&gt;
Users, however, only have access to the BGQ through loadleveler, which is appropriate for batch jobs, &lt;br /&gt;
whereas an interactive session is typically more convenient when debugging and developing.   A &lt;br /&gt;
script has therefore been written to allow a session in which runjob can be run interactively.  The script&lt;br /&gt;
uses loadleveler to set up a block, sets all the correct environment variables, and then launches a shell on&lt;br /&gt;
the front-end node. The '''debugjob''' session currently allows a 30 minute session on 64 nodes and, when run on &lt;br /&gt;
'''&amp;lt;tt&amp;gt;bgqdev&amp;lt;/tt&amp;gt;''', runs in a dedicated reservation as described previously in the [[BGQ#Queue_Limits | queue limits]] section. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
[user@bgqdev-fen1]$ debugjob&lt;br /&gt;
&lt;br /&gt;
[user@bgqdev-fen1]$ runjob --np 64 --ranks-per-node=16 --cwd=$PWD : $PWD/my_code -f myflags&lt;br /&gt;
&lt;br /&gt;
[user@bgqdev-fen1]$ exit&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For debugging, gdb and Allinea DDT are available. The latter is recommended, as it automatically attaches to all the processes of a job (instead of attaching a gdb tool to each by hand, as explained in the BGQ Application Development guide linked below). Simply compile with &amp;lt;tt&amp;gt;-g&amp;lt;/tt&amp;gt;, load the &amp;lt;tt&amp;gt;ddt/4.1&amp;lt;/tt&amp;gt; module, type &amp;lt;tt&amp;gt;ddt&amp;lt;/tt&amp;gt; and follow the graphical user interface.  The DDT user guide can be found below.&lt;br /&gt;
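&lt;br /&gt;
A minimal sketch of that workflow (file names are placeholders):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ mpixlc -g -o mycode.exe mycode.c&lt;br /&gt;
$ module load ddt/4.1&lt;br /&gt;
$ ddt&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;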
&lt;br /&gt;
Note: when running a job under ddt, you'll need to add &amp;quot;&amp;lt;tt&amp;gt;--ranks-per-node=X&amp;lt;/tt&amp;gt;&amp;quot; to the &amp;quot;runjob arguments&amp;quot; field.&lt;br /&gt;
&lt;br /&gt;
Apart from debugging, this environment is also useful for building libraries and applications that need to run small tests as part of their 'configure' step.   Within the debugjob session, applications compiled with the bgxl compilers or the mpcc/mpCC/mpfort wrappers will automatically run on the BGQ, skipping the need for the runjob command, provided you set the following environment variables: &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ export BG_PGM_LAUNCHER=yes&lt;br /&gt;
$ export RUNJOB_NP=1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
The latter sets the number of MPI processes to run.  Most configure scripts expect only one MPI process; thus &amp;lt;tt&amp;gt;RUNJOB_NP=1&amp;lt;/tt&amp;gt; is appropriate.&lt;br /&gt;
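&lt;br /&gt;
A minimal sketch of a configure-and-build session inside debugjob (the compiler choices are illustrative placeholders; your package may need different settings):&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ export BG_PGM_LAUNCHER=yes&lt;br /&gt;
$ export RUNJOB_NP=1&lt;br /&gt;
$ ./configure CC=mpixlc FC=mpixlf90&lt;br /&gt;
$ make&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;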
&lt;br /&gt;
&lt;br /&gt;
A debugjob session started with the &amp;lt;tt&amp;gt;-i&amp;lt;/tt&amp;gt; flag runs in implicit mode, in which running an executable implicitly calls runjob with 1 MPI task:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
debugjob -i&lt;br /&gt;
**********************************************************&lt;br /&gt;
 Interactive BGQ runjob shell using bgq-fen1-ib0.10295.0 and           &lt;br /&gt;
 LL14040718574824 for 30 minutes with 64 NODES (1024 cores). &lt;br /&gt;
 IMPLICIT MODE: running an executable implicitly calls runjob&lt;br /&gt;
                with 1 mpi task&lt;br /&gt;
 Exit shell when finished.                                &lt;br /&gt;
**********************************************************&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Sub-block jobs ===&lt;br /&gt;
&lt;br /&gt;
The BGQ allows multiple applications to share the same block; these are referred to as sub-block jobs. This needs to be done from within the same loadleveler submission script, using multiple calls to runjob.  To run a sub-block job, you need to specify a &amp;quot;--corner&amp;quot; within the block at which to start each job, and a 5D torus AxBxCxDxE &amp;quot;--shape&amp;quot;.  The starting corner will depend on the specific block details provided by loadleveler and on the shape and size of the sub-block jobs.  &lt;br /&gt;
&lt;br /&gt;
Figuring out what the corners and shapes should be is very tricky (especially since it depends on the block you get allocated).  For that reason, we've created a script called &amp;lt;tt&amp;gt;subblocks&amp;lt;/tt&amp;gt; that determines the corners and shape of the sub-blocks.  It only handles the (presumably common) case in which you want to subdivide the block into n equally sized sub-blocks, where n may be 1, 2, 4, 8, 16, or 32.&lt;br /&gt;
&lt;br /&gt;
Here is an example script calling &amp;lt;tt&amp;gt;subblocks&amp;lt;/tt&amp;gt; with a size of 4, which sets the appropriate $SHAPE argument and an array of 16 starting corners, ${CORNER[n]}. &lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# @ job_name           = bgsubblock&lt;br /&gt;
# @ job_type           = bluegene&lt;br /&gt;
# @ comment            = &amp;quot;BGQ Job SUBBLOCK &amp;quot;&lt;br /&gt;
# @ error              = $(job_name).$(Host).$(jobid).err&lt;br /&gt;
# @ output             = $(job_name).$(Host).$(jobid).out&lt;br /&gt;
# @ bg_size            = 64&lt;br /&gt;
# @ wall_clock_limit   = 30:00&lt;br /&gt;
# @ bg_connectivity    = Torus&lt;br /&gt;
# @ queue&lt;br /&gt;
&lt;br /&gt;
# Using the subblocks script to set $SHAPE and the array ${CORNER[n]},&lt;br /&gt;
# with the size of the sub-blocks in nodes (i.e. similar to bg_size)&lt;br /&gt;
&lt;br /&gt;
# In this case: 16 sub-blocks of 4 cnodes each (64 total, i.e. bg_size)&lt;br /&gt;
source subblocks 4&lt;br /&gt;
&lt;br /&gt;
# 16 jobs of 4 each&lt;br /&gt;
for (( i=0; i &amp;lt;  16 ; i++)); do&lt;br /&gt;
   runjob --corner ${CORNER[$i]} --shape $SHAPE --np 64 --ranks-per-node=16 :  your_code_here &amp;gt; $i.out &amp;amp;&lt;br /&gt;
done&lt;br /&gt;
wait&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
Remember that sub-block jobs are not the ideal way to run on the BlueGene/Q. These sub-blocks all have to share the same I/O nodes, so for I/O-intensive jobs this will be an inefficient setup.  Also consider that if your jobs are so small that you have to run them in sub-blocks, it may be more efficient to use other clusters such as the GPC.&lt;br /&gt;
&lt;br /&gt;
If you run into any issues with this technique, please contact bgq-support for help.&lt;br /&gt;
&lt;br /&gt;
== Filesystem ==&lt;br /&gt;
&lt;br /&gt;
The BGQ has its own dedicated 500TB file system based on GPFS (General Parallel File System). There are two main systems for user data: /home, a small, backed-up space where user home directories are located, and /scratch, a large system for input and output data for jobs; data on /scratch is not backed up. The path to your home directory is in the environment variable $HOME, and will look like /home/G/GROUP/USER.  The path to your scratch directory is in the environment variable $SCRATCH, and will look like /scratch/G/GROUP/USER (following the conventions of the rest of the SciNet systems).  &lt;br /&gt;
&lt;br /&gt;
{|border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;10&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! {{Hl2}} | file system &lt;br /&gt;
! {{Hl2}} | purpose &lt;br /&gt;
! {{Hl2}} | user quota &lt;br /&gt;
! {{Hl2}} | backed up&lt;br /&gt;
! {{Hl2}} | purged&lt;br /&gt;
|- &lt;br /&gt;
| /home&lt;br /&gt;
| development&lt;br /&gt;
| 50 GB&lt;br /&gt;
| yes&lt;br /&gt;
| never&lt;br /&gt;
|-&lt;br /&gt;
| /scratch&lt;br /&gt;
| computation&lt;br /&gt;
| 20 TB or 1 million files, whichever is reached first&lt;br /&gt;
| no&lt;br /&gt;
| not currently&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Transferring files===&lt;br /&gt;
Except for HPSS, the BGQ GPFS file system is '''not''' shared with the other SciNet systems (gpc, tcs, p7, arc), nor are their file systems mounted on the BGQ.  &lt;br /&gt;
Use scp to copy files from one file system to the other, e.g., from bgqdev-fen1, you could do&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
  $ scp -c arcfour login.scinet.utoronto.ca:code.tgz .&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
or from a login node you could do&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
  $ scp -c arcfour code.tgz bgqdev.scinet.utoronto.ca:&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
The flag &amp;lt;tt&amp;gt;-c arcfour&amp;lt;/tt&amp;gt; is optional. It tells scp (or really, ssh) to use a non-default encryption cipher. The one chosen here, arcfour, has been found to speed up the transfer by a factor of two (you may expect around 85MB/s).  This encryption method is only recommended for copying between the BGQ file system and the regular SciNet GPFS file system. &lt;br /&gt;
 &lt;br /&gt;
Note that although these transfers are within the same data center, you have to use the full names of the systems, login.scinet.utoronto.ca and bgqdev.scinet.utoronto.ca, respectively, and that you will be asked for your password.&lt;br /&gt;
&lt;br /&gt;
===How much Disk Space Do I have left?===&lt;br /&gt;
&lt;br /&gt;
The &amp;lt;tt&amp;gt;'''diskUsage'''&amp;lt;/tt&amp;gt; command, available on the bgqdev nodes, provides information in a number of ways on the home and scratch file systems: for instance, how much disk space is being used by yourself and your group (with the -a option), or how much your usage has changed over a certain period (&amp;quot;delta information&amp;quot;); you may also generate plots of your usage over time. Please see the usage help below for more details.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
Usage: diskUsage [-h|-?] [-a] [-u &amp;lt;user&amp;gt;] [-de|-plot]&lt;br /&gt;
       -h|-?: help&lt;br /&gt;
       -a: list usages of all members on the group&lt;br /&gt;
       -u &amp;lt;user&amp;gt;: as another user on your group&lt;br /&gt;
       -de: include delta information&lt;br /&gt;
       -plot: create plots of disk usages&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
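For example, to list the usage of all group members together with the delta information:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ diskUsage -a -de&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;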
Note that the information on usage and quota is only updated hourly!&lt;br /&gt;
&lt;br /&gt;
===Bridge to HPSS===&lt;br /&gt;
&lt;br /&gt;
BGQ users may transfer material to/from HPSS via the GPC archive queue. On the HPSS gateway node (gpc-archive01), the BGQ GPFS file systems are mounted under a single mounting point /bgq (/bgq/scratch and /bgq/home). For detailed information on the use of HPSS [https://support.scinet.utoronto.ca/wiki/index.php/HPSS please read the HPSS wiki section.]&lt;br /&gt;
&lt;br /&gt;
== Software modules installed on the BGQ ==&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;10&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
!{{Hl2}} |Software  &lt;br /&gt;
!{{Hl2}}| Version&lt;br /&gt;
!{{Hl2}}| Comments&lt;br /&gt;
!{{Hl2}}| Command/Library&lt;br /&gt;
!{{Hl2}}| Module Name&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Compilers &amp;amp; Development Tools'''''&lt;br /&gt;
|-&lt;br /&gt;
|IBM fortran compiler&lt;br /&gt;
|14.1&lt;br /&gt;
|These are cross compilers&lt;br /&gt;
|&amp;lt;tt&amp;gt;bgxlf,bgxlf_r,bgxlf90,...&amp;lt;/tt&amp;gt;&lt;br /&gt;
|xlf&lt;br /&gt;
|-&lt;br /&gt;
|IBM c/c++ compilers&lt;br /&gt;
|12.1&lt;br /&gt;
|These are cross compilers&lt;br /&gt;
|&amp;lt;tt&amp;gt;bgxlc,bgxlC,bgxlc_r,bgxlC_r,...&amp;lt;/tt&amp;gt;&lt;br /&gt;
|vacpp&lt;br /&gt;
|-&lt;br /&gt;
|MPICH2 MPI library&lt;br /&gt;
|1.4.1&lt;br /&gt;
|There are 4 versions (see BGQ Applications Development document).&lt;br /&gt;
|&amp;lt;tt&amp;gt;mpicc,mpicxx,mpif77,mpif90&amp;lt;/tt&amp;gt;&lt;br /&gt;
|mpich2&lt;br /&gt;
|- &lt;br /&gt;
| GCC Compiler&lt;br /&gt;
| 4.4.6, 4.8.1&lt;br /&gt;
| GNU Compiler Collection for BGQ&amp;lt;br&amp;gt;(4.8.1 requires binutils/2.23 to be loaded)&lt;br /&gt;
| &amp;lt;tt&amp;gt;powerpc64-bgq-linux-gcc, powerpc64-bgq-linux-g++, powerpc64-bgq-linux-gfortran&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;bgqgcc&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Clang Compiler&lt;br /&gt;
| r217688-20140912, r263698-20160317&lt;br /&gt;
| Clang cross-compilers for bgq&lt;br /&gt;
| &amp;lt;tt&amp;gt;powerpc64-bgq-linux-clang, powerpc64-bgq-linux-clang++&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;bgclang&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Binutils&lt;br /&gt;
| 2.21.1, 2.23&lt;br /&gt;
| Cross-compilation utilities&lt;br /&gt;
| &amp;lt;tt&amp;gt;addr2line, ar, ld, ...&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;binutils&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| CMake	&lt;br /&gt;
| 2.8.8, 2.8.12.1&lt;br /&gt;
| cross-platform, open-source build system&lt;br /&gt;
| &amp;lt;tt&amp;gt;cmake&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;cmake&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Git&lt;br /&gt;
| 1.9.5&lt;br /&gt;
| Revision control system&lt;br /&gt;
| &amp;lt;tt&amp;gt;git, gitk&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;git&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Debug/performance tools'''''&lt;br /&gt;
|-&lt;br /&gt;
| [https://www.gnu.org/software/gdb/ gdb]&lt;br /&gt;
| 7.2&lt;br /&gt;
| GNU Debugger&lt;br /&gt;
| &amp;lt;tt&amp;gt;gdb&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;gdb&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [https://www.gnu.org/software/ddd/ ddd]&lt;br /&gt;
| 3.3.12&lt;br /&gt;
| GNU Data Display Debugger&lt;br /&gt;
| &amp;lt;tt&amp;gt;ddd&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;ddd&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| [http://www.allinea.com/products/ddt/ DDT]&lt;br /&gt;
| 4.1, 4.2, 5.0.1&lt;br /&gt;
| Allinea's Distributed Debugging Tool&lt;br /&gt;
| &amp;lt;tt&amp;gt;ddt&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;ddt&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| [[HPCTW]]&lt;br /&gt;
| 1.0&lt;br /&gt;
| BGQ MPI and Hardware Counters&lt;br /&gt;
| &amp;lt;tt&amp;gt;libmpihpm.a, libmpihpm_smp.a, libmpitrace.a &amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;hptibm&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| [[MemP]]&lt;br /&gt;
| 1.0.3&lt;br /&gt;
| BGQ Memory Stats&lt;br /&gt;
| &amp;lt;tt&amp;gt;libmemP.a &amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;memP&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Storage tools/libraries'''''&lt;br /&gt;
|-&lt;br /&gt;
| HDF5&lt;br /&gt;
| 1.8.9-v18&lt;br /&gt;
| Scientific data storage and retrieval&lt;br /&gt;
| &amp;lt;tt&amp;gt;h5ls, h5diff, ..., libhdf5&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;hdf5/189-v18-serial-xlc*&amp;lt;br/&amp;gt;hdf5/189-v18-mpich2-xlc&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| HDF5&lt;br /&gt;
| 1.8.12-v18&lt;br /&gt;
| Scientific data storage and retrieval&lt;br /&gt;
| &amp;lt;tt&amp;gt;h5ls, h5diff, ..., libhdf5&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;hdf5/1812-v18-serial-gcc&amp;lt;br/&amp;gt;hdf5/1812-v18-mpich2-gcc&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| NetCDF&lt;br /&gt;
| 4.2.1.1&lt;br /&gt;
| Scientific data storage and retrieval&lt;br /&gt;
| &amp;lt;tt&amp;gt;ncdump,ncgen,libnetcdf&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;netcdf/4.2.1.1-serial-xlc*&amp;lt;br/&amp;gt;netcdf/4.2.1.1-mpich2-xlc&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Parallel NetCDF&lt;br /&gt;
| 1.3.1&lt;br /&gt;
| Parallel scientific data storage and retrieval using MPI-IO&lt;br /&gt;
| &amp;lt;tt&amp;gt;libpnetcdf.a&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;parallel-netcdf&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Libraries'''''&lt;br /&gt;
|-&lt;br /&gt;
| ESSL&lt;br /&gt;
| 5.1&lt;br /&gt;
| IBM Engineering and Scientific Subroutine Library (manual below)&lt;br /&gt;
| &amp;lt;tt&amp;gt;libesslbg,libesslsmpbg&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;essl&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| WSMP&lt;br /&gt;
| 15.06.01&lt;br /&gt;
| Watson Sparse Matrix Package&lt;br /&gt;
| &amp;lt;tt&amp;gt;libpwsmpBGQ.a&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;WSMP&amp;lt;/tt&amp;gt;&lt;br /&gt;
|- &lt;br /&gt;
| FFTW&lt;br /&gt;
| 2.1.5, 3.3.2, 3.1.2-esslwrapper&lt;br /&gt;
| Fast Fourier transform &lt;br /&gt;
| &amp;lt;tt&amp;gt;libsfftw,libdfftw,libfftw3, libfftw3f&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;fftw/2.1.5, fftw/3.3.2, fftw/3.1.2-esslwrapper&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| LAPACK + ScaLAPACK&lt;br /&gt;
| 3.4.2 + 2.0.2&lt;br /&gt;
| Linear algebra routines. A subset of Lapack may be found in ESSL as well.&lt;br /&gt;
| &amp;lt;tt&amp;gt;liblapack, libscalapack&amp;lt;/tt&amp;gt;&lt;br /&gt;
| lapack&lt;br /&gt;
|-&lt;br /&gt;
| GSL&lt;br /&gt;
| 1.15&lt;br /&gt;
| GNU Scientific Library&lt;br /&gt;
| &amp;lt;tt&amp;gt;libgsl, libgslcblas&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;gsl&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| BOOST&lt;br /&gt;
| 1.47.0, 1.54, 1.57&lt;br /&gt;
| C++ Boost libraries&lt;br /&gt;
| &amp;lt;tt&amp;gt;libboost...&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;cxxlibraries/boost&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| bzip2 + szip + zlib&lt;br /&gt;
| 1.0.6 + 2.1 + 1.2.7&lt;br /&gt;
| compression libraries&lt;br /&gt;
| &amp;lt;tt&amp;gt;libbz2,libz,libsz&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;compression&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| METIS&lt;br /&gt;
| 5.0.2&lt;br /&gt;
| Serial Graph Partitioning and Fill-reducing Matrix Ordering&lt;br /&gt;
| &amp;lt;tt&amp;gt;libmetis&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;metis&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| ParMETIS&lt;br /&gt;
| 4.0.2&lt;br /&gt;
| Parallel graph partitioning and fill-reducing matrix ordering&lt;br /&gt;
| &amp;lt;tt&amp;gt;libparmetis&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;parmetis&amp;lt;/tt&amp;gt; &lt;br /&gt;
|-&lt;br /&gt;
| OpenSSL&lt;br /&gt;
| 1.0.2 &lt;br /&gt;
| General-purpose cryptography library&lt;br /&gt;
| &amp;lt;tt&amp;gt;libcrypto, libssl&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;openssl&amp;lt;/tt&amp;gt; &lt;br /&gt;
|-&lt;br /&gt;
| FILTLAN&lt;br /&gt;
| 1.0&lt;br /&gt;
| The Filtered Lanczos Package &lt;br /&gt;
| &amp;lt;tt&amp;gt;libdfiltlan,libdmatkit,libsfiltlan,libsmatkit&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;FILTLAN&amp;lt;/tt&amp;gt; &lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Scripting/interpreted languages'''''&lt;br /&gt;
|-&lt;br /&gt;
| [[Python]]&lt;br /&gt;
| 2.6.6&lt;br /&gt;
| Python programming language&lt;br /&gt;
| &amp;lt;tt&amp;gt;/bgsys/tools/Python-2.6/bin/python&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;python&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [[Python]]&lt;br /&gt;
| 2.7.3&lt;br /&gt;
| Python programming language. Modules included: numpy-1.8.0, pyFFTW-0.9.2, astropy-0.3, scipy-0.13.3, mpi4py-1.3.1, h5py-2.2.1&lt;br /&gt;
| &amp;lt;tt&amp;gt;/scinet/bgq/tools/Python/python2.7.3-20131205/bin/python&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;python&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [[Python]]&lt;br /&gt;
| 3.2.2&lt;br /&gt;
| Python programming language&lt;br /&gt;
| &amp;lt;tt&amp;gt;/bgsys/tools/Python-3.2/bin/python3&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;python&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Applications'''''&lt;br /&gt;
|-&lt;br /&gt;
| [http://www.abinit.org/ ABINIT]&lt;br /&gt;
| 7.10.4&lt;br /&gt;
| An atomic-scale simulation software suite&lt;br /&gt;
| &amp;lt;tt&amp;gt;abinit&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;abinit&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [http://www.berkeleygw.org/ BerkeleyGW library]&lt;br /&gt;
| 1.0.4-2.0.0436&lt;br /&gt;
| Computes quasiparticle properties and the optical responses of a large variety of materials&lt;br /&gt;
| &amp;lt;tt&amp;gt;libBGW_wfn.a, wfn_rho_vxc_io_m.mod&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;BGW-paratec&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [https://www.cp2k.org/ CP2K]&lt;br /&gt;
| 2.3, 2.4, 2.5.1, 2.6.1&lt;br /&gt;
| DFT molecular dynamics, MPI &lt;br /&gt;
| &amp;lt;tt&amp;gt;cp2k.psmp&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;cp2k&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [http://www.cpmd.org/ CPMD]&lt;br /&gt;
| 3.15.3, 3.17.1&lt;br /&gt;
| Car-Parrinello molecular dynamics, MPI&lt;br /&gt;
| &amp;lt;tt&amp;gt;cpmd.x&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;cpmd&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| gnuplot&lt;br /&gt;
| 4.6.1&lt;br /&gt;
| interactive plotting program to be run on front-end nodes&lt;br /&gt;
| &amp;lt;tt&amp;gt;gnuplot&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;gnuplot&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| LAMMPS&lt;br /&gt;
| Nov 2012/7Dec15/7Dec15-mpi&lt;br /&gt;
| Molecular Dynamics &lt;br /&gt;
| &amp;lt;tt&amp;gt;lmp_bgq&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;lammps&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| NAMD&lt;br /&gt;
| 2.9&lt;br /&gt;
| Molecular Dynamics &lt;br /&gt;
| &amp;lt;tt&amp;gt;namd2&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;namd/2.9-smp&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [http://www.quantum-espresso.org/index.php Quantum Espresso]&lt;br /&gt;
| 5.0.3/5.2.1&lt;br /&gt;
| Molecular Structure / Quantum Chemistry &lt;br /&gt;
| &amp;lt;tt&amp;gt;qe_pw.x, etc&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;espresso&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| [[BGQ_OpenFOAM | OpenFOAM]]&lt;br /&gt;
| 2.2.0, 2.3.0, 2.4.0, 3.0.1&lt;br /&gt;
| Computational Fluid Dynamics&lt;br /&gt;
| &amp;lt;tt&amp;gt;icofoam,etc. &amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;openfoam/2.2.0, openfoam/2.3.0, openfoam/2.4.0&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=5 style='background: #E0E0E0'|'''''Beta Tests'''''&lt;br /&gt;
|-&lt;br /&gt;
| WATSON API&lt;br /&gt;
| beta&lt;br /&gt;
| Natural Language Processing&lt;br /&gt;
| &amp;lt;tt&amp;gt;watson_beta&amp;lt;/tt&amp;gt;&lt;br /&gt;
| &amp;lt;tt&amp;gt;FEN/WATSON&amp;lt;/tt&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=== OpenFOAM on BGQ ===&lt;br /&gt;
[https://wiki.scinet.utoronto.ca/wiki/index.php/BGQ_OpenFOAM How to use OpenFOAM on BGQ]&lt;br /&gt;
&lt;br /&gt;
== Python on BlueGene ==&lt;br /&gt;
Python 2.7.3 has been installed on BlueGene. To use &amp;lt;span style=&amp;quot;color: red;font-weight: bold;&amp;quot;&amp;gt;Numpy&amp;lt;/span&amp;gt; and &amp;lt;span style=&amp;quot;color: red;font-weight: bold;&amp;quot;&amp;gt;Scipy&amp;lt;/span&amp;gt;, the module &amp;lt;span style=&amp;quot;color: red;font-weight: bold;&amp;quot;&amp;gt;essl/5.1&amp;lt;/span&amp;gt; has to be loaded.&lt;br /&gt;
The full python path has to be provided (otherwise the default version is used).&lt;br /&gt;
&lt;br /&gt;
To use python on BlueGene (from within a job script or a debugjob session):&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
module load python/2.7.3&lt;br /&gt;
##Only if you need numpy/scipy :&lt;br /&gt;
module load xlf/14.1 essl/5.1&lt;br /&gt;
runjob --np 1 --ranks-per-node=1 --envs HOME=$HOME LD_LIBRARY_PATH=$LD_LIBRARY_PATH PYTHONPATH=/scinet/bgq/tools/Python/python2.7.3-20131205/lib/python2.7/site-packages/ : /scinet/bgq/tools/Python/python2.7.3-20131205/bin/python2.7 /PATHOFYOURSCRIPT.py &lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
If you want to use the mmap python API, you must use it in PRIVATE mode, as shown in the example below:&lt;br /&gt;
&amp;lt;source lang=&amp;quot;python&amp;quot;&amp;gt;&lt;br /&gt;
import mmap&lt;br /&gt;
mm=mmap.mmap(-1,256,mmap.MAP_PRIVATE)&lt;br /&gt;
mm.close()&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Alternatively, you can use the mpi4py and h5py modules.&lt;br /&gt;
&lt;br /&gt;
Also, please read the Cython documentation.&lt;br /&gt;
&lt;br /&gt;
== Documentation ==&lt;br /&gt;
#BGQ Day: Intro to Using the BGQ&amp;lt;br/&amp;gt;[[File:BgqIntro-FirstFrame.png|180px|link=http://support.scinet.utoronto.ca/CourseVideo/BGQ/bgqintro/bgqintro.html]]&amp;lt;br/&amp;gt;[[Media:BgqintroUpdatedMarch2015.pdf|Slides (updated in 2015) ]] &amp;amp;nbsp;/ &amp;amp;nbsp; [http://support.scinet.utoronto.ca/CourseVideo/BGQ/bgqintro/bgqintro.html Video recording] [http://support.scinet.utoronto.ca/CourseVideo/BGQ/bgqintro/bgqintro.mp4 (direct link)]&lt;br /&gt;
#BGQ Day: BGQ Hardware Overview&amp;lt;br/&amp;gt;[[File:Bgqhardware-FirstFrame.png|180px|link=http://support.scinet.utoronto.ca/CourseVideo/BGQ/bgqhardware/bgqhardware.html]]&amp;lt;br/&amp;gt;[https://support.scinet.utoronto.ca/~northrup/bgqhardware.pdf Slides] &amp;amp;nbsp;/ &amp;amp;nbsp; [http://support.scinet.utoronto.ca/CourseVideo/BGQ/bgqhardware/bgqhardware.html Video recording] [http://support.scinet.utoronto.ca/CourseVideo/BGQ/bgqhardware/bgqhardware.mp4 (direct link)]&lt;br /&gt;
# [http://www.fz-juelich.de/ias/jsc/EN/Expertise/Supercomputers/JUQUEEN/Documentation/Documention_node.html Julich BGQ Documentation]&lt;br /&gt;
# [https://wiki.alcf.anl.gov/parts/index.php/Blue_Gene/Q Argonne Mira BGQ Wiki]&lt;br /&gt;
# [https://computing.llnl.gov/tutorials/bgq/ LLNL Sequoia BGQ Info]&lt;br /&gt;
# [https://www.alcf.anl.gov/presentations Argonne MiraCon Presentations]&lt;br /&gt;
# [http://www.redbooks.ibm.com/redbooks/SG247869/wwhelp/wwhimpl/js/html/wwhelp.htm BGQ System Administration Guide]&lt;br /&gt;
# [http://www.redbooks.ibm.com/redbooks/SG247948/wwhelp/wwhimpl/js/html/wwhelp.htm BGQ Application Development ]&lt;br /&gt;
# IBM XL C/C++ for Blue Gene/Q: [[Media:bgqcgetstart.pdf|Getting started]]&lt;br /&gt;
# IBM XL C/C++ for Blue Gene/Q: [[Media:bgqccompiler.pdf|Compiler reference]]&lt;br /&gt;
# IBM XL C/C++ for Blue Gene/Q: [[Media:bgqclangref.pdf|Language reference]]&lt;br /&gt;
# IBM XL C/C++ for Blue Gene/Q: [[Media:bgqcproguide.pdf|Optimization and Programming Guide]]&lt;br /&gt;
# IBM XL Fortran for Blue Gene/Q: [[Media:bgqfgetstart.pdf|Getting started]]&lt;br /&gt;
# IBM XL Fortran for Blue Gene/Q: [[Media:bgqfcompiler.pdf|Compiler reference]]&lt;br /&gt;
# IBM XL Fortran for Blue Gene/Q: [[Media:bgqflangref.pdf|Language reference]]&lt;br /&gt;
# IBM XL Fortran for Blue Gene/Q: [[Media:bgqfproguide.pdf|Optimization and Programming Guide]]&lt;br /&gt;
# [[Media:essl51.pdf|IBM ESSL (Engineering and Scientific Subroutine Library) 5.1 for Linux on Power]]&lt;br /&gt;
# [http://content.allinea.com/downloads/userguide.pdf Allinea DDT 4.1 User Guide]&lt;br /&gt;
# [https://www.ibm.com/support/knowledgecenter/en/SSFJTW_5.1.0/loadl.v5r1_welcome.html IBM LoadLeveler 5.1]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!--  PUT IN TRAC !!!&lt;br /&gt;
&lt;br /&gt;
=== *Manual Block Creation* ===&lt;br /&gt;
&lt;br /&gt;
To reconfigure the BGQ nodes you can use the bg_console or the web based navigator from the service node &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
bg_console&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
There are various options to create block types (section 3.2 in the BGQ admin manual), but the smallest is created using the&lt;br /&gt;
following command:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
gen_small_block &amp;lt;blockid&amp;gt; &amp;lt;midplane&amp;gt; &amp;lt;cnodes&amp;gt; &amp;lt;nodeboard&amp;gt; &lt;br /&gt;
gen_small_block  R00-M0-N03-32 R00-M0 32 N03&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The block then needs to be booted using:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
allocate R00-M0-N03-32&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
If those resources are already booted into another block, that block must be freed before the new block can be &lt;br /&gt;
allocated.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
free R00-M0-N03&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
There are many other functions in bg_console:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
help all&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The BGQ default nomenclature for hardware is as follows:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
(R)ack - (M)idplane - (N)ode board or block - (J)node - (C)ore&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
So R00-M01-N03-J00-C02 would correspond to the first rack, second midplane, 3rd block, 1st node, and second core.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
--&amp;gt;&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_User_Support_Library&amp;diff=9365</id>
		<title>SciNet User Support Library</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_User_Support_Library&amp;diff=9365"/>
		<updated>2018-05-15T18:59:32Z</updated>

		<summary type="html">&lt;p&gt;Rzon: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;__NOTOC__&lt;br /&gt;
{| style=&amp;quot;border-spacing: 8px; width:100%&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:2px solid; background-color:#f6f674; border-radius:5px&amp;quot;|&lt;br /&gt;
'''WARNING: SciNet is in the process of replacing this wiki with a new documentation site. For current information, please go to [https://docs.scinet.utoronto.ca https://docs.scinet.utoronto.ca]'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{|  style=&amp;quot;border-spacing: 8px;&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:3px solid #0645ad; background-color:#f6f6f6; border-radius:7px&amp;quot;|&lt;br /&gt;
{{SciNetWiki:System_Alerts}}&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:3px solid #0645ad; background-color:#f6f6f6; border-radius:7px&amp;quot;|&lt;br /&gt;
==System News==&lt;br /&gt;
* April 23, 2018: GPC &amp;amp; Sandy decommissioned.&lt;br /&gt;
* April 10, 2018: Niagara commissioned.&lt;br /&gt;
* March 20th, 2018: Gravity decommissioned.&lt;br /&gt;
* Dec 4, 2017 : scratchtcs decommissioned.&lt;br /&gt;
* Nov 28, 2017: The GPC has been reduced from 30,912 to 16,800 cores to make room for the installation of Niagara.&lt;br /&gt;
* Sept 29, 2017: The TCS has been decommissioned.&lt;br /&gt;
&lt;br /&gt;
([[Previous System News]])&lt;br /&gt;
|-&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; width=&amp;quot;50%&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#f6e8e8; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== QuickStart Guides ==&lt;br /&gt;
* [[SciNet Command Line Utilities]]&lt;br /&gt;
* [[Media:SciNet_Tutorial.pdf|SciNet User Tutorial]]&lt;br /&gt;
* [[P7_Linux_Cluster|P7: Power 7 Linux cluster]]&lt;br /&gt;
* [[BGQ|BGQ: BlueGene/Q clusters]]&lt;br /&gt;
* [[Software_and_Libraries|Software and libraries]]&lt;br /&gt;
* [[Data_Management|Data management]]&lt;br /&gt;
* [[FAQ | FAQ (frequently asked questions)]]&lt;br /&gt;
* [[Acknowledging_SciNet | Acknowledging SciNet]]&lt;br /&gt;
* [https://courses.scinet.utoronto.ca SciNet education]&lt;br /&gt;
* [https://www.youtube.com/channel/UC42CaO-AAQhwqa8RGzE3daQ SciNet YouTube Channel]&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#e8f6e8; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== Tutorials and Manuals ==&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#SciNet_Basics|SciNet basics]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Linux|Linux]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Batch_job_management|Batch job management]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Programming|Programming]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Parallel_Programming|Parallel programming]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#GPU_Computing|GPU computing]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Performance_Tuning|Performance tuning]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Debugging|Debugging]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Math_libraries_.28BLAS.2C_LAPACK.2C_FFT.29|Math libraries (BLAS, LAPACK, FFT)]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#I/O|I/O and databases]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Visualization|Visualization]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Applications|Applications]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Manuals|Manuals (compilers etc)]]&lt;br /&gt;
* [[2016_Ontario_Summer_School_for_High_Performance_Computing_Central]]&lt;br /&gt;
* [[2015_Ontario_Summer_School_for_High_Performance_Computing_Central]]&lt;br /&gt;
|-&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#e8e8f6; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== What's New On The Wiki ==&lt;br /&gt;
&amp;lt;div style='text-align:left;'&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: Updated [[GPC Quickstart]] with info on email notifications from the scheduler.&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: [[Hdf5]] compilation page updated.&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: [[Research Computing with Python 2014]] lectures 5-8&lt;br /&gt;
&lt;br /&gt;
* Nov 2014: &amp;quot;Modern CUDA Features&amp;quot; TechTalk slides in [[SciNet TechTalks and Seminars]].&lt;br /&gt;
&lt;br /&gt;
* Nov 2014: [[Research Computing with Python 2014]] lectures 1-4&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: [[User_Tips#Reducing_virtual_memory_consumption_for_multithreaded_programs| Tip on reducing virtual memory consumption for multithreaded programs]]&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Improved information on the [[Python]] versions installed on the GPC, and which modules are included in each version.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Description on using job arrays on the GPC on the [[Scheduler]] page.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: [[Intro to Tkinter|Tkinter instructions]], [[Media:Tkinter.pdf|slides]] and [[Media:Tkinter_code.tgz|code]] for the TkInter workshop held in September.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Instructions on using [[Hadoop for HPCers|Hadoop]] (for the Hadoop workshop held in September).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
Previous new stuff can be found in the [[What's new archive]].&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#f4f4fa; border-radius:7px&amp;quot; |&lt;br /&gt;
&amp;lt;div style='text-align:left;'&amp;gt;&lt;br /&gt;
{{SciNetWiki:Community_Portal}}&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
{{#Widget:Twitter|shell.background=#9fb1c2|tweets.links=#4775c1|tweets.color=#000000|tweets.background=#ffffff|user=SciNetHPC}}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [[Old Main Page]] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Rzon</name></author>
	</entry>
</feed>