<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en-GB">
	<id>https://oldwiki.scinet.utoronto.ca/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Northrup</id>
	<title>oldwiki.scinet.utoronto.ca - User contributions [en-gb]</title>
	<link rel="self" type="application/atom+xml" href="https://oldwiki.scinet.utoronto.ca/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Northrup"/>
	<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php/Special:Contributions/Northrup"/>
	<updated>2026-06-25T16:53:33Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.35.12</generator>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9320</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9320"/>
		<updated>2018-05-06T21:23:06Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=https://docs.scinet.utoronto.ca/index.php/Main_Page]][https://docs.scinet.utoronto.ca Niagara]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:down.png|up|link=HPSS]][https://docs.scinet.utoronto.ca/index.php/HPSS HPSS]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon 23 Apr 2018 &amp;lt;/b&amp;gt; GPC-compute is decommissioned, GPC-storage available until &amp;lt;font color=red&amp;gt;&amp;lt;b&amp;gt;30 May 2018&amp;lt;/b&amp;gt;&amp;lt;/font&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu 18 Apr 2018 &amp;lt;/b&amp;gt;  Niagara system will undergo an upgrade to its Infiniband network between 9am and 12pm, should be transparent to users, however there is a chance of network interruption.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Fri 13 Apr 2018 &amp;lt;/b&amp;gt; HPSS system will be down for a few hours on &amp;lt;b&amp;gt;Mon, Apr/16, 9AM&amp;lt;/b&amp;gt;, for hardware upgrades, in preparation for the eventual move to the Niagara side.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Tue 10 Apr 2018 &amp;lt;/b&amp;gt; Niagara is open to users.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed 4 Apr 2018 &amp;lt;/b&amp;gt; We are very close to the production launch of Niagara, the new system installed at SciNet.&lt;br /&gt;
While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users on next week.&lt;br /&gt;
&lt;br /&gt;
All active GPC users will have their accounts, $HOME, and $PROJECT, transferred to the new&lt;br /&gt;
Niagara system.  Those of you who are new to SciNet, but got RAC allocations on Niagara,&lt;br /&gt;
will have your accounts created and ready for you to login.&lt;br /&gt;
&lt;br /&gt;
We are planning an extended [https://support.scinet.utoronto.ca/education/go.php/370/index.php Intro to SciNet/Niagara session], available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_User_Support_Library&amp;diff=9245</id>
		<title>SciNet User Support Library</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_User_Support_Library&amp;diff=9245"/>
		<updated>2018-04-26T16:54:44Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System News */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;__NOTOC__&lt;br /&gt;
{|  style=&amp;quot;border-spacing: 8px;&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:3px solid #0645ad; background-color:#f6f6f6; border-radius:7px&amp;quot;|&lt;br /&gt;
{{SciNetWiki:System_Alerts}}&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:3px solid #0645ad; background-color:#f6f6f6; border-radius:7px&amp;quot;|&lt;br /&gt;
==System News==&lt;br /&gt;
* April 23, 2018: GPC &amp;amp; Sandy decommissioned.&lt;br /&gt;
* April 10, 2018: Niagara commissioned.&lt;br /&gt;
* March 20th, 2018: Gravity decommissioned.&lt;br /&gt;
* Dec 4, 2017 : scratchtcs decommissioned.&lt;br /&gt;
* Nov 28, 2017: The GPC has been reduced from 30,912 to 16,800 cores to make room for the installation of Niagara.&lt;br /&gt;
* Sept 29, 2017: The TCS has been decommissioned.&lt;br /&gt;
&lt;br /&gt;
([[Previous System News]])&lt;br /&gt;
|-&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; width=&amp;quot;50%&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#f6e8e8; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== QuickStart Guides ==&lt;br /&gt;
* [[SciNet Command Line Utilities]]&lt;br /&gt;
* [[Media:SciNet_Tutorial.pdf|SciNet User Tutorial]]&lt;br /&gt;
* [[GPC_Quickstart|GPC: General Purpose Cluster]]&lt;br /&gt;
* [[Sandy| Sandy: Intel Sandybridge nodes ]]&lt;br /&gt;
* [[Gravity| Gravity: GPU nodes]]&lt;br /&gt;
* [[P7_Linux_Cluster|P7: Power 7 Linux cluster]]&lt;br /&gt;
* [[BGQ|BGQ: BlueGene/Q clusters]]&lt;br /&gt;
* [[Software_and_Libraries|Software and libraries]]&lt;br /&gt;
* [[Data_Management|Data management]]&lt;br /&gt;
* [[FAQ | FAQ (frequently asked questions)]]&lt;br /&gt;
* [[Acknowledging_SciNet | Acknowledging SciNet]]&lt;br /&gt;
* [https://courses.scinet.utoronto.ca SciNet education]&lt;br /&gt;
* [https://www.youtube.com/channel/UC42CaO-AAQhwqa8RGzE3daQ SciNet YouTube Channel]&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#e8f6e8; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== Tutorials and Manuals ==&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#SciNet_Basics|SciNet basics]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Linux|Linux]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Batch_job_management|Batch job management]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Programming|Programming]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Parallel_Programming|Parallel programming]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#GPU_Computing|GPU computing]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Performance_Tuning|Performance tuning]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Debugging|Debugging]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Math_libraries_.28BLAS.2C_LAPACK.2C_FFT.29|Math libraries (BLAS, LAPACK, FFT)]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#I/O|I/O and databases]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Visualization|Visualization]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Applications|Applications]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Manuals|Manuals (compilers etc)]]&lt;br /&gt;
* [[2016_Ontario_Summer_School_for_High_Performance_Computing_Central]]&lt;br /&gt;
* [[2015_Ontario_Summer_School_for_High_Performance_Computing_Central]]&lt;br /&gt;
|-&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#e8e8f6; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== What's New On The Wiki ==&lt;br /&gt;
&amp;lt;div style='text-align:left;'&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: Updated [[GPC Quickstart]] with info on email notifications from the scheduler.&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: [[Hdf5]] compilation page updated.&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: [[Research Computing with Python 2014]] lectures 5-8&lt;br /&gt;
&lt;br /&gt;
* Nov 2014: &amp;quot;Modern CUDA Features&amp;quot; TechTalk slides in [[SciNet TechTalks and Seminars]].&lt;br /&gt;
&lt;br /&gt;
* Nov 2014: [[Research Computing with Python 2014]] lectures 1-4&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: [[User_Tips#Reducing_virtual_memory_consumption_for_multithreaded_programs| Tip on reducing virtual memory consumption for multithreaded programs]]&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Improved information on the [[Python]] versions installed on the GPC, and which modules are included in each version.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Description on using job arrays on the GPC on the [[Scheduler]] page.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: [[Intro to Tkinter|Tkinter instructions]], [[Media:Tkinter.pdf|slides]] and [[Media:Tkinter_code.tgz|code]] for the TkInter workshop held in September.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Instructions on using [[Hadoop for HPCers|Hadoop]] (for the Hadoop workshop held in September).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
Previous new stuff can be found in the [[What's new archive]].&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#f4f4fa; border-radius:7px&amp;quot; |&lt;br /&gt;
&amp;lt;div style='text-align:left;'&amp;gt;&lt;br /&gt;
{{SciNetWiki:Community_Portal}}&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
{{#Widget:Twitter|shell.background=#9fb1c2|tweets.links=#4775c1|tweets.color=#000000|tweets.background=#ffffff|user=SciNetHPC}}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [[Old Main Page]] --!&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9244</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9244"/>
		<updated>2018-04-26T16:44:57Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=https://docs.scinet.utoronto.ca/index.php/Main_Page]][https://docs.scinet.utoronto.ca Niagara]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:down.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon 23 Apr 2018 &amp;lt;/b&amp;gt; GPC-compute is decommissioned, GPC-storage available until &amp;lt;font color=red&amp;gt;&amp;lt;b&amp;gt;9 May 2018&amp;lt;/b&amp;gt;&amp;lt;/font&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu 18 Apr 2018 &amp;lt;/b&amp;gt;  Niagara system will undergo an upgrade to its Infiniband network between 9am and 12pm, should be transparent to users, however there is a chance of network interruption.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Fri 13 Apr 2018 &amp;lt;/b&amp;gt; HPSS system will be down for a few hours on &amp;lt;b&amp;gt;Mon, Apr/16, 9AM&amp;lt;/b&amp;gt;, for hardware upgrades, in preparation for the eventual move to the Niagara side.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Tue 10 Apr 2018 &amp;lt;/b&amp;gt; Niagara is open to users.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed 4 Apr 2018 &amp;lt;/b&amp;gt; We are very close to the production launch of Niagara, the new system installed at SciNet.&lt;br /&gt;
While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users on next week.&lt;br /&gt;
&lt;br /&gt;
All active GPC users will have their accounts, $HOME, and $PROJECT, transferred to the new&lt;br /&gt;
Niagara system.  Those of you who are new to SciNet, but got RAC allocations on Niagara,&lt;br /&gt;
will have your accounts created and ready for you to login.&lt;br /&gt;
&lt;br /&gt;
We are planning an extended [https://support.scinet.utoronto.ca/education/go.php/370/index.php Intro to SciNet/Niagara session], available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9243</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9243"/>
		<updated>2018-04-26T16:44:39Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=https://docs.scinet.utoronto.ca/index.php/Main_Page]][https://docs.scinet.utoronto.ca Niagara]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:down.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon 23 Apr 2018 &amp;lt;/b&amp;gt; GPC-compute is decommissioned, GPC-storage available until &amp;lt;font color=red&amp;gt;&amp;lt;b&amp;gt;9 May 2018&amp;lt;/b&amp;gt;&amp;lt;/font&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu 18 Apr 2018 &amp;lt;/b&amp;gt;  Niagara system will undergo an upgrade to its Infiniband network between 9am and 12pm, should be transparent to users, however there is a chance of network interruption.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Fri 13 Apr 2018 &amp;lt;/b&amp;gt; HPSS system will be down for a few hours on &amp;lt;b&amp;gt;Mon, Apr/16, 9AM&amp;lt;/b&amp;gt;, for hardware upgrades, in preparation for the eventual move to the Niagara side.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Tue 10 Apr 2018 &amp;lt;/b&amp;gt; Niagara is open to users.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed 4 Apr 2018 &amp;lt;/b&amp;gt; We are very close to the production launch of Niagara, the new system installed at SciNet.&lt;br /&gt;
While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users on next week.&lt;br /&gt;
&lt;br /&gt;
All active GPC users will have their accounts, $HOME, and $PROJECT, transferred to the new&lt;br /&gt;
Niagara system.  Those of you who are new to SciNet, but got RAC allocations on Niagara,&lt;br /&gt;
will have your accounts created and ready for you to login.&lt;br /&gt;
&lt;br /&gt;
We are planning an extended [https://support.scinet.utoronto.ca/education/go.php/370/index.php Intro to SciNet/Niagara session], available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9242</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9242"/>
		<updated>2018-04-26T16:43:40Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=https://docs.scinet.utoronto.ca/index.php/Main_Page]][https://docs.scinet.utoronto.ca Niagara]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:down.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon 23 Apr 2018 &amp;lt;/b&amp;gt; GPC-compute is decommissioned, GPC-storage available until &amp;lt;font color=red&amp;gt;&amp;lt;b&amp;gt;9 May 2018&amp;lt;/b&amp;gt;&amp;lt;/font&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu 18 Apr 2018 &amp;lt;/b&amp;gt;  Niagara system will undergo an upgrade to its Infiniband network between 9am and 12pm, should be transparent to users, however there is a chance of network interruption.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Fri 13 Apr 2018 &amp;lt;/b&amp;gt; HPSS system will be down for a few hours on &amp;lt;b&amp;gt;Mon, Apr/16, 9AM&amp;lt;/b&amp;gt;, for hardware upgrades, in preparation for the eventual move to the Niagara side.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Tue 10 Apr 2018 &amp;lt;/b&amp;gt; Niagara is open to users.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed 4 Apr 2018 &amp;lt;/b&amp;gt; We are very close to the production launch of Niagara, the new system installed at SciNet.&lt;br /&gt;
While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users on next week.&lt;br /&gt;
&lt;br /&gt;
All active GPC users will have their accounts, $HOME, and $PROJECT, transferred to the new&lt;br /&gt;
Niagara system.  Those of you who are new to SciNet, but got RAC allocations on Niagara,&lt;br /&gt;
will have your accounts created and ready for you to login.&lt;br /&gt;
&lt;br /&gt;
We are planning an extended [https://support.scinet.utoronto.ca/education/go.php/370/index.php Intro to SciNet/Niagara session], available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9239</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9239"/>
		<updated>2018-04-24T14:42:30Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=https://docs.scinet.utoronto.ca/index.php/Main_Page]][https://docs.scinet.utoronto.ca Niagara]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:down.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:down.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:down.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon 23 Apr 2018 &amp;lt;/b&amp;gt; GPC-compute is decommissioned, GPC-storage available until &amp;lt;font color=red&amp;gt;&amp;lt;b&amp;gt;9 May 2018&amp;lt;/b&amp;gt;&amp;lt;/font&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu 18 Apr 2018 &amp;lt;/b&amp;gt;  Niagara system will undergo an upgrade to its Infiniband network between 9am and 12pm, should be transparent to users, however there is a chance of network interruption.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Fri 13 Apr 2018 &amp;lt;/b&amp;gt; HPSS system will be down for a few hours on &amp;lt;b&amp;gt;Mon, Apr/16, 9AM&amp;lt;/b&amp;gt;, for hardware upgrades, in preparation for the eventual move to the Niagara side.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Tue 10 Apr 2018 &amp;lt;/b&amp;gt; Niagara is open to users.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed 4 Apr 2018 &amp;lt;/b&amp;gt; We are very close to the production launch of Niagara, the new system installed at SciNet.&lt;br /&gt;
While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users on next week.&lt;br /&gt;
&lt;br /&gt;
All active GPC users will have their accounts, $HOME, and $PROJECT, transferred to the new&lt;br /&gt;
Niagara system.  Those of you who are new to SciNet, but got RAC allocations on Niagara,&lt;br /&gt;
will have your accounts created and ready for you to login.&lt;br /&gt;
&lt;br /&gt;
We are planning an extended [https://support.scinet.utoronto.ca/education/go.php/370/index.php Intro to SciNet/Niagara session], available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9238</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9238"/>
		<updated>2018-04-23T16:00:20Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=https://docs.scinet.utoronto.ca/index.php/Main_Page]][https://docs.scinet.utoronto.ca Niagara]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:down.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:down.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up50.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:down.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon 23 Apr 2018 &amp;lt;/b&amp;gt; BGQ front-end node is down, hopefully fixed later today. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon 23 Apr 2018 &amp;lt;/b&amp;gt; GPC-compute is decommissioned, GPC-storage available until &amp;lt;font color=red&amp;gt;&amp;lt;b&amp;gt;9 May 2018&amp;lt;/b&amp;gt;&amp;lt;/font&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu 18 Apr 2018 &amp;lt;/b&amp;gt;  Niagara system will undergo an upgrade to its Infiniband network between 9am and 12pm, should be transparent to users, however there is a chance of network interruption.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Fri 13 Apr 2018 &amp;lt;/b&amp;gt; HPSS system will be down for a few hours on &amp;lt;b&amp;gt;Mon, Apr/16, 9AM&amp;lt;/b&amp;gt;, for hardware upgrades, in preparation for the eventual move to the Niagara side.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Tue 10 Apr 2018 &amp;lt;/b&amp;gt; Niagara is open to users.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed 4 Apr 2018 &amp;lt;/b&amp;gt; We are very close to the production launch of Niagara, the new system installed at SciNet.&lt;br /&gt;
While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users on next week.&lt;br /&gt;
&lt;br /&gt;
All active GPC users will have their accounts, $HOME, and $PROJECT, transferred to the new&lt;br /&gt;
Niagara system.  Those of you who are new to SciNet, but got RAC allocations on Niagara,&lt;br /&gt;
will have your accounts created and ready for you to login.&lt;br /&gt;
&lt;br /&gt;
We are planning an extended [https://support.scinet.utoronto.ca/education/go.php/370/index.php Intro to SciNet/Niagara session], available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9237</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9237"/>
		<updated>2018-04-23T16:00:03Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=https://docs.scinet.utoronto.ca/index.php/Main_Page]][https://docs.scinet.utoronto.ca Niagara]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:down.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:down.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up50.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:down.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon 23 Apr 2018 &amp;lt;/b&amp;gt; BGQ front-end node is down, hopefully fixed late today. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon 23 Apr 2018 &amp;lt;/b&amp;gt; GPC-compute is decommissioned, GPC-storage available until &amp;lt;font color=red&amp;gt;&amp;lt;b&amp;gt;9 May 2018&amp;lt;/b&amp;gt;&amp;lt;/font&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu 18 Apr 2018 &amp;lt;/b&amp;gt;  Niagara system will undergo an upgrade to its Infiniband network between 9am and 12pm, should be transparent to users, however there is a chance of network interruption.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Fri 13 Apr 2018 &amp;lt;/b&amp;gt; HPSS system will be down for a few hours on &amp;lt;b&amp;gt;Mon, Apr/16, 9AM&amp;lt;/b&amp;gt;, for hardware upgrades, in preparation for the eventual move to the Niagara side.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Tue 10 Apr 2018 &amp;lt;/b&amp;gt; Niagara is open to users.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed 4 Apr 2018 &amp;lt;/b&amp;gt; We are very close to the production launch of Niagara, the new system installed at SciNet.&lt;br /&gt;
While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users on next week.&lt;br /&gt;
&lt;br /&gt;
All active GPC users will have their accounts, $HOME, and $PROJECT, transferred to the new&lt;br /&gt;
Niagara system.  Those of you who are new to SciNet, but got RAC allocations on Niagara,&lt;br /&gt;
will have your accounts created and ready for you to login.&lt;br /&gt;
&lt;br /&gt;
We are planning an extended [https://support.scinet.utoronto.ca/education/go.php/370/index.php Intro to SciNet/Niagara session], available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9236</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9236"/>
		<updated>2018-04-23T15:19:05Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=https://docs.scinet.utoronto.ca/index.php/Main_Page]][https://docs.scinet.utoronto.ca Niagara]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:down.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:down.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:down.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon 23 Apr 2018 &amp;lt;/b&amp;gt; GPC-compute is decommissioned, GPC-storage available until &amp;lt;font color=red&amp;gt;&amp;lt;b&amp;gt;9 May 2018&amp;lt;/b&amp;gt;&amp;lt;/font&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu 18 Apr 2018 &amp;lt;/b&amp;gt;  Niagara system will undergo an upgrade to its Infiniband network between 9am and 12pm, should be transparent to users, however there is a chance of network interruption.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Fri 13 Apr 2018 &amp;lt;/b&amp;gt; HPSS system will be down for a few hours on &amp;lt;b&amp;gt;Mon, Apr/16, 9AM&amp;lt;/b&amp;gt;, for hardware upgrades, in preparation for the eventual move to the Niagara side.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Tue 10 Apr 2018 &amp;lt;/b&amp;gt; Niagara is open to users.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed 4 Apr 2018 &amp;lt;/b&amp;gt; We are very close to the production launch of Niagara, the new system installed at SciNet.&lt;br /&gt;
While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users on next week.&lt;br /&gt;
&lt;br /&gt;
All active GPC users will have their accounts, $HOME, and $PROJECT, transferred to the new&lt;br /&gt;
Niagara system.  Those of you who are new to SciNet, but got RAC allocations on Niagara,&lt;br /&gt;
will have your accounts created and ready for you to login.&lt;br /&gt;
&lt;br /&gt;
We are planning an extended [https://support.scinet.utoronto.ca/education/go.php/370/index.php Intro to SciNet/Niagara session], available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_User_Support_Library&amp;diff=9235</id>
		<title>SciNet User Support Library</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_User_Support_Library&amp;diff=9235"/>
		<updated>2018-04-23T15:18:51Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System News */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;__NOTOC__&lt;br /&gt;
{|  style=&amp;quot;border-spacing: 8px;&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:3px solid #0645ad; background-color:#f6f6f6; border-radius:7px&amp;quot;|&lt;br /&gt;
{{SciNetWiki:System_Alerts}}&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:3px solid #0645ad; background-color:#f6f6f6; border-radius:7px&amp;quot;|&lt;br /&gt;
==System News==&lt;br /&gt;
* April 23, 2018: GPC decommissioned.&lt;br /&gt;
* April 10, 2018: Niagara commissioned.&lt;br /&gt;
* Dec 4, 2017 : scratchtcs decommissioned.&lt;br /&gt;
* Nov 28, 2017: The GPC has been reduced from 30,912 to 16,800 cores to make room for the installation of Niagara.&lt;br /&gt;
* Sept 29, 2017: The TCS has been decommissioned.&lt;br /&gt;
&lt;br /&gt;
([[Previous System News]])&lt;br /&gt;
|-&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; width=&amp;quot;50%&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#f6e8e8; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== QuickStart Guides ==&lt;br /&gt;
* [[SciNet Command Line Utilities]]&lt;br /&gt;
* [[Media:SciNet_Tutorial.pdf|SciNet User Tutorial]]&lt;br /&gt;
* [[GPC_Quickstart|GPC: General Purpose Cluster]]&lt;br /&gt;
* [[Sandy| Sandy: Intel Sandybridge nodes ]]&lt;br /&gt;
* [[Gravity| Gravity: GPU nodes]]&lt;br /&gt;
* [[P7_Linux_Cluster|P7: Power 7 Linux cluster]]&lt;br /&gt;
* [[BGQ|BGQ: BlueGene/Q clusters]]&lt;br /&gt;
* [[Software_and_Libraries|Software and libraries]]&lt;br /&gt;
* [[Data_Management|Data management]]&lt;br /&gt;
* [[FAQ | FAQ (frequently asked questions)]]&lt;br /&gt;
* [[Acknowledging_SciNet | Acknowledging SciNet]]&lt;br /&gt;
* [https://courses.scinet.utoronto.ca SciNet education]&lt;br /&gt;
* [https://www.youtube.com/channel/UC42CaO-AAQhwqa8RGzE3daQ SciNet YouTube Channel]&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#e8f6e8; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== Tutorials and Manuals ==&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#SciNet_Basics|SciNet basics]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Linux|Linux]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Batch_job_management|Batch job management]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Programming|Programming]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Parallel_Programming|Parallel programming]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#GPU_Computing|GPU computing]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Performance_Tuning|Performance tuning]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Debugging|Debugging]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Math_libraries_.28BLAS.2C_LAPACK.2C_FFT.29|Math libraries (BLAS, LAPACK, FFT)]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#I/O|I/O and databases]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Visualization|Visualization]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Applications|Applications]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Manuals|Manuals (compilers etc)]]&lt;br /&gt;
* [[2016_Ontario_Summer_School_for_High_Performance_Computing_Central]]&lt;br /&gt;
* [[2015_Ontario_Summer_School_for_High_Performance_Computing_Central]]&lt;br /&gt;
|-&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#e8e8f6; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== What's New On The Wiki ==&lt;br /&gt;
&amp;lt;div style='text-align:left;'&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: Updated [[GPC Quickstart]] with info on email notifications from the scheduler.&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: [[Hdf5]] compilation page updated.&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: [[Research Computing with Python 2014]] lectures 5-8&lt;br /&gt;
&lt;br /&gt;
* Nov 2014: &amp;quot;Modern CUDA Features&amp;quot; TechTalk slides in [[SciNet TechTalks and Seminars]].&lt;br /&gt;
&lt;br /&gt;
* Nov 2014: [[Research Computing with Python 2014]] lectures 1-4&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: [[User_Tips#Reducing_virtual_memory_consumption_for_multithreaded_programs| Tip on reducing virtual memory consumption for multithreaded programs]]&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Improved information on the [[Python]] versions installed on the GPC, and which modules are included in each version.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Description on using job arrays on the GPC on the [[Scheduler]] page.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: [[Intro to Tkinter|Tkinter instructions]], [[Media:Tkinter.pdf|slides]] and [[Media:Tkinter_code.tgz|code]] for the TkInter workshop held in September.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Instructions on using [[Hadoop for HPCers|Hadoop]] (for the Hadoop workshop held in September).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
Previous new stuff can be found in the [[What's new archive]].&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#f4f4fa; border-radius:7px&amp;quot; |&lt;br /&gt;
&amp;lt;div style='text-align:left;'&amp;gt;&lt;br /&gt;
{{SciNetWiki:Community_Portal}}&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
{{#Widget:Twitter|shell.background=#9fb1c2|tweets.links=#4775c1|tweets.color=#000000|tweets.background=#ffffff|user=SciNetHPC}}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [[Old Main Page]] --!&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9234</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9234"/>
		<updated>2018-04-23T15:18:16Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=https://docs.scinet.utoronto.ca/index.php/Main_Page]][https://docs.scinet.utoronto.ca Niagara]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:down.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:down.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon 23 Apr 2018 &amp;lt;/b&amp;gt; GPC-compute is decommissioned, GPC-storage available until &amp;lt;font color=red&amp;gt;&amp;lt;b&amp;gt;9 May 2018&amp;lt;/b&amp;gt;&amp;lt;/font&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu 18 Apr 2018 &amp;lt;/b&amp;gt;  Niagara system will undergo an upgrade to its Infiniband network between 9am and 12pm, should be transparent to users, however there is a chance of network interruption.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Fri 13 Apr 2018 &amp;lt;/b&amp;gt; HPSS system will be down for a few hours on &amp;lt;b&amp;gt;Mon, Apr/16, 9AM&amp;lt;/b&amp;gt;, for hardware upgrades, in preparation for the eventual move to the Niagara side.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Tue 10 Apr 2018 &amp;lt;/b&amp;gt; Niagara is open to users.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed 4 Apr 2018 &amp;lt;/b&amp;gt; We are very close to the production launch of Niagara, the new system installed at SciNet.&lt;br /&gt;
While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users on next week.&lt;br /&gt;
&lt;br /&gt;
All active GPC users will have their accounts, $HOME, and $PROJECT, transferred to the new&lt;br /&gt;
Niagara system.  Those of you who are new to SciNet, but got RAC allocations on Niagara,&lt;br /&gt;
will have your accounts created and ready for you to login.&lt;br /&gt;
&lt;br /&gt;
We are planning an extended [https://support.scinet.utoronto.ca/education/go.php/370/index.php Intro to SciNet/Niagara session], available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9233</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9233"/>
		<updated>2018-04-23T15:17:23Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=https://docs.scinet.utoronto.ca/index.php/Main_Page]][https://docs.scinet.utoronto.ca Niagara]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:down.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:down.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
GPC-compute will be decommissioned on &amp;lt;font color=red&amp;gt;&amp;lt;b&amp;gt;Sat 21 Apr 2018&amp;lt;/b&amp;gt;&amp;lt;/font&amp;gt;, GPC-storage available until &amp;lt;font color=red&amp;gt;&amp;lt;b&amp;gt;9 May 2018&amp;lt;/b&amp;gt;&amp;lt;/font&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu 18 Apr 2018 &amp;lt;/b&amp;gt;  Niagara system will undergo an upgrade to its Infiniband network between 9am and 12pm, should be transparent to users, however there is a chance of network interruption.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Fri 13 Apr 2018 &amp;lt;/b&amp;gt; HPSS system will be down for a few hours on &amp;lt;b&amp;gt;Mon, Apr/16, 9AM&amp;lt;/b&amp;gt;, for hardware upgrades, in preparation for the eventual move to the Niagara side.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Tue 10 Apr 2018 &amp;lt;/b&amp;gt; Niagara is open to users.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed 4 Apr 2018 &amp;lt;/b&amp;gt; We are very close to the production launch of Niagara, the new system installed at SciNet.&lt;br /&gt;
While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users on next week.&lt;br /&gt;
&lt;br /&gt;
All active GPC users will have their accounts, $HOME, and $PROJECT, transferred to the new&lt;br /&gt;
Niagara system.  Those of you who are new to SciNet, but got RAC allocations on Niagara,&lt;br /&gt;
will have your accounts created and ready for you to login.&lt;br /&gt;
&lt;br /&gt;
We are planning an extended [https://support.scinet.utoronto.ca/education/go.php/370/index.php Intro to SciNet/Niagara session], available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9214</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9214"/>
		<updated>2018-04-19T01:11:16Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=https://docs.scinet.utoronto.ca/index.php/Main_Page]][https://docs.scinet.utoronto.ca Niagara]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
GPC-compute will be decommissioned on &amp;lt;font color=red&amp;gt;&amp;lt;b&amp;gt;Sat 21 Apr 2018&amp;lt;/b&amp;gt;&amp;lt;/font&amp;gt;, GPC-storage available until &amp;lt;font color=red&amp;gt;&amp;lt;b&amp;gt;9 May 2018&amp;lt;/b&amp;gt;&amp;lt;/font&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu 18 Apr 2018 &amp;lt;/b&amp;gt;  Niagara system will undergo an upgrade to its Infiniband network between 9am and 12pm, should be transparent to users, however there is a chance of network interruption.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Fri 13 Apr 2018 &amp;lt;/b&amp;gt; HPSS system will be down for a few hours on &amp;lt;b&amp;gt;Mon, Apr/16, 9AM&amp;lt;/b&amp;gt;, for hardware upgrades, in preparation for the eventual move to the Niagara side.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Tue 10 Apr 2018 &amp;lt;/b&amp;gt; Niagara is open to users.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed 4 Apr 2018 &amp;lt;/b&amp;gt; We are very close to the production launch of Niagara, the new system installed at SciNet.&lt;br /&gt;
While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users on next week.&lt;br /&gt;
&lt;br /&gt;
All active GPC users will have their accounts, $HOME, and $PROJECT, transferred to the new&lt;br /&gt;
Niagara system.  Those of you who are new to SciNet, but got RAC allocations on Niagara,&lt;br /&gt;
will have your accounts created and ready for you to login.&lt;br /&gt;
&lt;br /&gt;
We are planning an extended [https://support.scinet.utoronto.ca/education/go.php/370/index.php Intro to SciNet/Niagara session], available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_User_Support_Library&amp;diff=9206</id>
		<title>SciNet User Support Library</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_User_Support_Library&amp;diff=9206"/>
		<updated>2018-04-13T01:22:16Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System News */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;__NOTOC__&lt;br /&gt;
{|  style=&amp;quot;border-spacing: 8px;&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:3px solid #0645ad; background-color:#f6f6f6; border-radius:7px&amp;quot;|&lt;br /&gt;
{{SciNetWiki:System_Alerts}}&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:3px solid #0645ad; background-color:#f6f6f6; border-radius:7px&amp;quot;|&lt;br /&gt;
==System News==&lt;br /&gt;
* April 10, 2018: Niagara commissioned.&lt;br /&gt;
* Dec 4, 2017 : scratchtcs decommissioned.&lt;br /&gt;
* Nov 28, 2017: The GPC has been reduced from 30,912 to 16,800 cores to make room for the installation of Niagara.&lt;br /&gt;
* Sept 29, 2017: The TCS has been decommissioned.&lt;br /&gt;
&lt;br /&gt;
([[Previous System News]])&lt;br /&gt;
|-&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; width=&amp;quot;50%&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#f6e8e8; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== QuickStart Guides ==&lt;br /&gt;
* [[SciNet Command Line Utilities]]&lt;br /&gt;
* [[Media:SciNet_Tutorial.pdf|SciNet User Tutorial]]&lt;br /&gt;
* [[GPC_Quickstart|GPC: General Purpose Cluster]]&lt;br /&gt;
* [[Sandy| Sandy: Intel Sandybridge nodes ]]&lt;br /&gt;
* [[Gravity| Gravity: GPU nodes]]&lt;br /&gt;
* [[P7_Linux_Cluster|P7: Power 7 Linux cluster]]&lt;br /&gt;
* [[BGQ|BGQ: BlueGene/Q clusters]]&lt;br /&gt;
* [[Software_and_Libraries|Software and libraries]]&lt;br /&gt;
* [[Data_Management|Data management]]&lt;br /&gt;
* [[FAQ | FAQ (frequently asked questions)]]&lt;br /&gt;
* [[Acknowledging_SciNet | Acknowledging SciNet]]&lt;br /&gt;
* [https://courses.scinet.utoronto.ca SciNet education]&lt;br /&gt;
* [https://www.youtube.com/channel/UC42CaO-AAQhwqa8RGzE3daQ SciNet YouTube Channel]&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#e8f6e8; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== Tutorials and Manuals ==&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#SciNet_Basics|SciNet basics]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Linux|Linux]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Batch_job_management|Batch job management]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Programming|Programming]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Parallel_Programming|Parallel programming]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#GPU_Computing|GPU computing]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Performance_Tuning|Performance tuning]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Debugging|Debugging]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Math_libraries_.28BLAS.2C_LAPACK.2C_FFT.29|Math libraries (BLAS, LAPACK, FFT)]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#I/O|I/O and databases]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Visualization|Visualization]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Applications|Applications]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Manuals|Manuals (compilers etc)]]&lt;br /&gt;
* [[2016_Ontario_Summer_School_for_High_Performance_Computing_Central]]&lt;br /&gt;
* [[2015_Ontario_Summer_School_for_High_Performance_Computing_Central]]&lt;br /&gt;
|-&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#e8e8f6; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== What's New On The Wiki ==&lt;br /&gt;
&amp;lt;div style='text-align:left;'&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: Updated [[GPC Quickstart]] with info on email notifications from the scheduler.&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: [[Hdf5]] compilation page updated.&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: [[Research Computing with Python 2014]] lectures 5-8&lt;br /&gt;
&lt;br /&gt;
* Nov 2014: &amp;quot;Modern CUDA Features&amp;quot; TechTalk slides in [[SciNet TechTalks and Seminars]].&lt;br /&gt;
&lt;br /&gt;
* Nov 2014: [[Research Computing with Python 2014]] lectures 1-4&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: [[User_Tips#Reducing_virtual_memory_consumption_for_multithreaded_programs| Tip on reducing virtual memory consumption for multithreaded programs]]&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Improved information on the [[Python]] versions installed on the GPC, and which modules are included in each version.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Description on using job arrays on the GPC on the [[Scheduler]] page.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: [[Intro to Tkinter|Tkinter instructions]], [[Media:Tkinter.pdf|slides]] and [[Media:Tkinter_code.tgz|code]] for the TkInter workshop held in September.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Instructions on using [[Hadoop for HPCers|Hadoop]] (for the Hadoop workshop held in September).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
Previous new stuff can be found in the [[What's new archive]].&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#f4f4fa; border-radius:7px&amp;quot; |&lt;br /&gt;
&amp;lt;div style='text-align:left;'&amp;gt;&lt;br /&gt;
{{SciNetWiki:Community_Portal}}&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
{{#Widget:Twitter|shell.background=#9fb1c2|tweets.links=#4775c1|tweets.color=#000000|tweets.background=#ffffff|user=SciNetHPC}}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [[Old Main Page]] --!&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9205</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9205"/>
		<updated>2018-04-13T01:22:04Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=https://docs.scinet.utoronto.ca/index.php/Main_Page]][https://docs.scinet.utoronto.ca Niagara]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;b&amp;gt; Sat 21 Apr 2018 &amp;lt;/b&amp;gt; GPC-compute will be decommissioned, GPC-storage available until &amp;lt;b&amp;gt; 9 May 2018 &amp;lt;/b&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Tue 10 Apr 2018 &amp;lt;/b&amp;gt; Niagara is open to users.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed 4 Apr 2018 &amp;lt;/b&amp;gt; We are very close to the production launch of Niagara, the new system installed at SciNet.&lt;br /&gt;
While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users on next week.&lt;br /&gt;
&lt;br /&gt;
All active GPC users will have their accounts, $HOME, and $PROJECT, transferred to the new&lt;br /&gt;
Niagara system.  Those of you who are new to SciNet, but got RAC allocations on Niagara,&lt;br /&gt;
will have your accounts created and ready for you to login.&lt;br /&gt;
&lt;br /&gt;
We are planning an extended [https://support.scinet.utoronto.ca/education/go.php/370/index.php Intro to SciNet/Niagara session], available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9204</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9204"/>
		<updated>2018-04-13T01:21:36Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=https://docs.scinet.utoronto.ca/index.php/Main_Page]][https://docs.scinet.utoronto.ca Niagara]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;b&amp;gt; Sat 21 Apr 2018 &amp;lt;/b&amp;gt; GPC-compute will be decommissioned, GPC-storage available until &amp;lt;b&amp;gt; 9 May 2018 &amp;lt;/b&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Tue 10 Apr 2018 &amp;lt;/b&amp;gt; Niagara is open for general Users&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed 4 Apr 2018 &amp;lt;/b&amp;gt; We are very close to the production launch of Niagara, the new system installed at SciNet.&lt;br /&gt;
While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users on next week.&lt;br /&gt;
&lt;br /&gt;
All active GPC users will have their accounts, $HOME, and $PROJECT, transferred to the new&lt;br /&gt;
Niagara system.  Those of you who are new to SciNet, but got RAC allocations on Niagara,&lt;br /&gt;
will have your accounts created and ready for you to login.&lt;br /&gt;
&lt;br /&gt;
We are planning an extended [https://support.scinet.utoronto.ca/education/go.php/370/index.php Intro to SciNet/Niagara session], available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9203</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9203"/>
		<updated>2018-04-13T01:21:19Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=https://docs.scinet.utoronto.ca/index.php/Main_Page]][https://docs.scinet.utoronto.ca Niagara]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;b&amp;gt; Sat 21 Apr 2018 &amp;lt;/b&amp;gt; GPC-compute will be decommissioned, GPC-storage available until &amp;lt;b&amp;gt; 9 Ma , 2018 &amp;lt;/b&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Tue 10 Apr 2018 &amp;lt;/b&amp;gt; Niagara is open for general Users&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed 4 Apr 2018 &amp;lt;/b&amp;gt; We are very close to the production launch of Niagara, the new system installed at SciNet.&lt;br /&gt;
While the RAC allocation year officially starts today, April 4/18, the Niagara system is still undergoing some final tuning and software updates, so the plan is to officially open it to users on next week.&lt;br /&gt;
&lt;br /&gt;
All active GPC users will have their accounts, $HOME, and $PROJECT, transferred to the new&lt;br /&gt;
Niagara system.  Those of you who are new to SciNet, but got RAC allocations on Niagara,&lt;br /&gt;
will have your accounts created and ready for you to login.&lt;br /&gt;
&lt;br /&gt;
We are planning an extended [https://support.scinet.utoronto.ca/education/go.php/370/index.php Intro to SciNet/Niagara session], available in person at our office, and webcast on Vidyo and possibly other means, on Wednesday April 11 at noon EST.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_User_Support_Library&amp;diff=9202</id>
		<title>SciNet User Support Library</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_User_Support_Library&amp;diff=9202"/>
		<updated>2018-04-13T01:18:41Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System News */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;__NOTOC__&lt;br /&gt;
{|  style=&amp;quot;border-spacing: 8px;&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:3px solid #0645ad; background-color:#f6f6f6; border-radius:7px&amp;quot;|&lt;br /&gt;
{{SciNetWiki:System_Alerts}}&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:3px solid #0645ad; background-color:#f6f6f6; border-radius:7px&amp;quot;|&lt;br /&gt;
==System News==&lt;br /&gt;
* April 9, 2018: Niagara commissioned.&lt;br /&gt;
* Dec 4, 2017 : scratchtcs decommissioned.&lt;br /&gt;
* Nov 28, 2017: The GPC has been reduced from 30,912 to 16,800 cores to make room for the installation of Niagara.&lt;br /&gt;
* Sept 29, 2017: The TCS has been decommissioned.&lt;br /&gt;
&lt;br /&gt;
([[Previous System News]])&lt;br /&gt;
|-&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; width=&amp;quot;50%&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#f6e8e8; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== QuickStart Guides ==&lt;br /&gt;
* [[SciNet Command Line Utilities]]&lt;br /&gt;
* [[Media:SciNet_Tutorial.pdf|SciNet User Tutorial]]&lt;br /&gt;
* [[GPC_Quickstart|GPC: General Purpose Cluster]]&lt;br /&gt;
* [[Sandy| Sandy: Intel Sandybridge nodes ]]&lt;br /&gt;
* [[Gravity| Gravity: GPU nodes]]&lt;br /&gt;
* [[P7_Linux_Cluster|P7: Power 7 Linux cluster]]&lt;br /&gt;
* [[BGQ|BGQ: BlueGene/Q clusters]]&lt;br /&gt;
* [[Software_and_Libraries|Software and libraries]]&lt;br /&gt;
* [[Data_Management|Data management]]&lt;br /&gt;
* [[FAQ | FAQ (frequently asked questions)]]&lt;br /&gt;
* [[Acknowledging_SciNet | Acknowledging SciNet]]&lt;br /&gt;
* [https://courses.scinet.utoronto.ca SciNet education]&lt;br /&gt;
* [https://www.youtube.com/channel/UC42CaO-AAQhwqa8RGzE3daQ SciNet YouTube Channel]&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#e8f6e8; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== Tutorials and Manuals ==&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#SciNet_Basics|SciNet basics]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Linux|Linux]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Batch_job_management|Batch job management]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Programming|Programming]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Parallel_Programming|Parallel programming]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#GPU_Computing|GPU computing]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Performance_Tuning|Performance tuning]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Debugging|Debugging]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Math_libraries_.28BLAS.2C_LAPACK.2C_FFT.29|Math libraries (BLAS, LAPACK, FFT)]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#I/O|I/O and databases]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Visualization|Visualization]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Applications|Applications]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Manuals|Manuals (compilers etc)]]&lt;br /&gt;
* [[2016_Ontario_Summer_School_for_High_Performance_Computing_Central]]&lt;br /&gt;
* [[2015_Ontario_Summer_School_for_High_Performance_Computing_Central]]&lt;br /&gt;
|-&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#e8e8f6; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== What's New On The Wiki ==&lt;br /&gt;
&amp;lt;div style='text-align:left;'&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: Updated [[GPC Quickstart]] with info on email notifications from the scheduler.&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: [[Hdf5]] compilation page updated.&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: [[Research Computing with Python 2014]] lectures 5-8&lt;br /&gt;
&lt;br /&gt;
* Nov 2014: &amp;quot;Modern CUDA Features&amp;quot; TechTalk slides in [[SciNet TechTalks and Seminars]].&lt;br /&gt;
&lt;br /&gt;
* Nov 2014: [[Research Computing with Python 2014]] lectures 1-4&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: [[User_Tips#Reducing_virtual_memory_consumption_for_multithreaded_programs| Tip on reducing virtual memory consumption for multithreaded programs]]&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Improved information on the [[Python]] versions installed on the GPC, and which modules are included in each version.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Description on using job arrays on the GPC on the [[Scheduler]] page.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: [[Intro to Tkinter|Tkinter instructions]], [[Media:Tkinter.pdf|slides]] and [[Media:Tkinter_code.tgz|code]] for the TkInter workshop held in September.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Instructions on using [[Hadoop for HPCers|Hadoop]] (for the Hadoop workshop held in September).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
Previous new stuff can be found in the [[What's new archive]].&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#f4f4fa; border-radius:7px&amp;quot; |&lt;br /&gt;
&amp;lt;div style='text-align:left;'&amp;gt;&lt;br /&gt;
{{SciNetWiki:Community_Portal}}&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
{{#Widget:Twitter|shell.background=#9fb1c2|tweets.links=#4775c1|tweets.color=#000000|tweets.background=#ffffff|user=SciNetHPC}}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [[Old Main Page]] --!&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_User_Support_Library&amp;diff=9201</id>
		<title>SciNet User Support Library</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_User_Support_Library&amp;diff=9201"/>
		<updated>2018-04-13T01:18:29Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System News */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;__NOTOC__&lt;br /&gt;
{|  style=&amp;quot;border-spacing: 8px;&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:3px solid #0645ad; background-color:#f6f6f6; border-radius:7px&amp;quot;|&lt;br /&gt;
{{SciNetWiki:System_Alerts}}&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:3px solid #0645ad; background-color:#f6f6f6; border-radius:7px&amp;quot;|&lt;br /&gt;
==System News==&lt;br /&gt;
* April 9, 2018: Niagara commissioned for general users.&lt;br /&gt;
* Feb 27, 2018: gravity decommissioned.&lt;br /&gt;
* Dec 4, 2017 : scratchtcs decommissioned.&lt;br /&gt;
* Nov 28, 2017: The GPC has been reduced from 30,912 to 16,800 cores to make room for the installation of Niagara.&lt;br /&gt;
* Sept 29, 2017: The TCS has been decommissioned.&lt;br /&gt;
&lt;br /&gt;
([[Previous System News]])&lt;br /&gt;
|-&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; width=&amp;quot;50%&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#f6e8e8; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== QuickStart Guides ==&lt;br /&gt;
* [[SciNet Command Line Utilities]]&lt;br /&gt;
* [[Media:SciNet_Tutorial.pdf|SciNet User Tutorial]]&lt;br /&gt;
* [[GPC_Quickstart|GPC: General Purpose Cluster]]&lt;br /&gt;
* [[Sandy| Sandy: Intel Sandybridge nodes ]]&lt;br /&gt;
* [[Gravity| Gravity: GPU nodes]]&lt;br /&gt;
* [[P7_Linux_Cluster|P7: Power 7 Linux cluster]]&lt;br /&gt;
* [[BGQ|BGQ: BlueGene/Q clusters]]&lt;br /&gt;
* [[Software_and_Libraries|Software and libraries]]&lt;br /&gt;
* [[Data_Management|Data management]]&lt;br /&gt;
* [[FAQ | FAQ (frequently asked questions)]]&lt;br /&gt;
* [[Acknowledging_SciNet | Acknowledging SciNet]]&lt;br /&gt;
* [https://courses.scinet.utoronto.ca SciNet education]&lt;br /&gt;
* [https://www.youtube.com/channel/UC42CaO-AAQhwqa8RGzE3daQ SciNet YouTube Channel]&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#e8f6e8; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== Tutorials and Manuals ==&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#SciNet_Basics|SciNet basics]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Linux|Linux]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Batch_job_management|Batch job management]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Programming|Programming]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Parallel_Programming|Parallel programming]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#GPU_Computing|GPU computing]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Performance_Tuning|Performance tuning]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Debugging|Debugging]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Math_libraries_.28BLAS.2C_LAPACK.2C_FFT.29|Math libraries (BLAS, LAPACK, FFT)]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#I/O|I/O and databases]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Visualization|Visualization]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Applications|Applications]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Manuals|Manuals (compilers etc)]]&lt;br /&gt;
* [[2016_Ontario_Summer_School_for_High_Performance_Computing_Central]]&lt;br /&gt;
* [[2015_Ontario_Summer_School_for_High_Performance_Computing_Central]]&lt;br /&gt;
|-&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#e8e8f6; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== What's New On The Wiki ==&lt;br /&gt;
&amp;lt;div style='text-align:left;'&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: Updated [[GPC Quickstart]] with info on email notifications from the scheduler.&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: [[Hdf5]] compilation page updated.&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: [[Research Computing with Python 2014]] lectures 5-8&lt;br /&gt;
&lt;br /&gt;
* Nov 2014: &amp;quot;Modern CUDA Features&amp;quot; TechTalk slides in [[SciNet TechTalks and Seminars]].&lt;br /&gt;
&lt;br /&gt;
* Nov 2014: [[Research Computing with Python 2014]] lectures 1-4&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: [[User_Tips#Reducing_virtual_memory_consumption_for_multithreaded_programs| Tip on reducing virtual memory consumption for multithreaded programs]]&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Improved information on the [[Python]] versions installed on the GPC, and which modules are included in each version.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Description on using job arrays on the GPC on the [[Scheduler]] page.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: [[Intro to Tkinter|Tkinter instructions]], [[Media:Tkinter.pdf|slides]] and [[Media:Tkinter_code.tgz|code]] for the TkInter workshop held in September.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Instructions on using [[Hadoop for HPCers|Hadoop]] (for the Hadoop workshop held in September).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
Previous new stuff can be found in the [[What's new archive]].&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#f4f4fa; border-radius:7px&amp;quot; |&lt;br /&gt;
&amp;lt;div style='text-align:left;'&amp;gt;&lt;br /&gt;
{{SciNetWiki:Community_Portal}}&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
{{#Widget:Twitter|shell.background=#9fb1c2|tweets.links=#4775c1|tweets.color=#000000|tweets.background=#ffffff|user=SciNetHPC}}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [[Old Main Page]] --!&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Gravity&amp;diff=9175</id>
		<title>Gravity</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Gravity&amp;diff=9175"/>
		<updated>2018-02-27T18:04:15Z</updated>

		<summary type="html">&lt;p&gt;Northrup: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Infobox Computer&lt;br /&gt;
|image=[[Image:Ibm_idataplex_dx360_m4.jpg|center|300px|thumb]]&lt;br /&gt;
|name=Gravity &lt;br /&gt;
|installed=December 2012&lt;br /&gt;
|operatingsystem= Linux Centos 6.4&lt;br /&gt;
|loginnode= gravity01 (from &amp;lt;tt&amp;gt;login.scinet&amp;lt;/tt&amp;gt;)&lt;br /&gt;
|nnodes=49 (588 cpu cores, 50176 gpu cores)&lt;br /&gt;
|rampernode=32 Gb &lt;br /&gt;
|corespernode=12 with 2x M2090 GPUs&lt;br /&gt;
|interconnect=QDR Infiniband&lt;br /&gt;
|vendorcompilers=nvcc,pgcc,icc,gcc&lt;br /&gt;
|queuetype=Torque&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
'''&amp;lt;font color=red&amp;gt;The Gravity cluster has been decommissioned on February 26, 2018. A new system for large parallel jobs, Niagara, will be replacing the [[GPC_Quickstart|GPC]] and [[TCS_Quickstart|TCS]], and is expected to be in production in early 2018. Contributed systems like [[Sandy]] and [[Gravity]] will also reach their end-of-life in this transition. The aim is to keep at least 50% of the GPC available during the installation of the new system. Users will be informed about further details of the transition as they become available. &amp;lt;/font&amp;gt;'''&lt;br /&gt;
&lt;br /&gt;
The Gravity cluster, consists of 49 x86_64 nodes each with two hex core Intel Xeon (Sandybridge) E5-2620 2.0GHz CPUs with 32GB of RAM per node.  Each node has two NVIDIA Tesla M2090 GPUs&lt;br /&gt;
with CUDA Capability 2.0 (Fermi) each with 512 CUDA Cores and 6 GB of RAM.  The nodes are interconnected with 3:1 blocking QDR Infiniband for MPI communications&lt;br /&gt;
and disk I/O to the SciNet GPFS filesystems.  In total this cluster contains 588 x86_64 cores with 1,568 GB of system RAM and 98 GPUs with 588 GB GPU RAM total.&lt;br /&gt;
&lt;br /&gt;
'''NB -''' gravity is a user-contributed system acquired through a CFI LOF to a specific PI. Policies regarding use by other groups are under development and subject to change at any time. &lt;br /&gt;
&lt;br /&gt;
Note that SciNet has a mailing lists for people interested in GPGPU computing. To receive information on courses, workshop and other GPGPU related events, sign up at https://support.scinet.utoronto.ca/mailman/listinfo/scinet-gpgpu.&lt;br /&gt;
&lt;br /&gt;
== Nodes ==&lt;br /&gt;
&lt;br /&gt;
=== Login ===&lt;br /&gt;
First login via ssh with your scinet account at &amp;lt;tt&amp;gt;login.scinet.utoronto.ca&amp;lt;/tt&amp;gt;, and from there you can proceed to '''&amp;lt;tt&amp;gt;gravity01&amp;lt;/tt&amp;gt;''' which &lt;br /&gt;
is the GPU development node.&lt;br /&gt;
&lt;br /&gt;
=== Devel ===&lt;br /&gt;
&lt;br /&gt;
As mentioned '''&amp;lt;tt&amp;gt;gravity01&amp;lt;/tt&amp;gt;''' is the head/develop node for interactive use.  This node is for compiling, short testing, and submitting&lt;br /&gt;
batch jobs to the compute nodes.  It is a shared resource so treat it accordingly and use the queue and compute nodes for long are large&lt;br /&gt;
computations.&lt;br /&gt;
&lt;br /&gt;
=== ARC Experimental (ARCX) Xeon Phi/ Tesla K20 ===&lt;br /&gt;
&lt;br /&gt;
A separate devel node, '''&amp;lt;tt&amp;gt;arcX&amp;lt;/tt&amp;gt;''', with a single Intel Xeon Phi and a NVIDIA Tesla K20 is also available for testing these newer technologies.&lt;br /&gt;
For full details see the '''[[phi| Xeon Phi / Tesla K20 ]]''' wiki page.&lt;br /&gt;
&lt;br /&gt;
=== Compute ===&lt;br /&gt;
&lt;br /&gt;
To access the other 48 compute nodes with GPU's you need to use the queue, similar to the standard GPC compute nodes.&lt;br /&gt;
Currently the nodes are scheduled by complete node, 12 cores and 2 GPUs, and a maximum walltime of 12 hours.&lt;br /&gt;
&lt;br /&gt;
For an interactive job use&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qsub -l nodes=1:ppn=12:gpus=2,walltime=12:00:00 -q gravity -I&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
or for a batch job use&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qsub script.sh &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
where &amp;lt;tt&amp;gt;script.sh&amp;lt;/tt&amp;gt; is&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# Torque submission script for Gravity&lt;br /&gt;
#&lt;br /&gt;
#PBS -l nodes=2:ppn=12:gpus=2,walltime=1:00:00&lt;br /&gt;
#PBS -N GPUtest&lt;br /&gt;
#PBS -q gravity&lt;br /&gt;
cd $PBS_O_WORKDIR&lt;br /&gt;
&lt;br /&gt;
# EXECUTION COMMAND; -np = nodes*ppn&lt;br /&gt;
mpirun -np 24 ./a.out&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To check running jobs on the gpu nodes only use&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
showq -w class=gravity&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''Important note:'''&lt;br /&gt;
&lt;br /&gt;
A bug in the torque scheduler currently sets the environment variable CUDA_VISIBLE_DEVICES to an incorrect value.  Loading any one of the cuda modules will correct this, so be to do this in your job script or in your interactive jobs.&lt;br /&gt;
&lt;br /&gt;
== Software ==&lt;br /&gt;
&lt;br /&gt;
The same software installed on the GPC is available on Gravity using the same modules framework. &lt;br /&gt;
See [[GPC_Quickstart#Modules_and_Environment_Variables | here]] for full details.&lt;br /&gt;
&lt;br /&gt;
==Programming Frameworks==&lt;br /&gt;
&lt;br /&gt;
Currently there are four programming frameworks to use: NVIDIA's CUDA framework, PGI's CUDA Fortran, PGI's implementation of OpenACC, or OpenCL.&lt;br /&gt;
&lt;br /&gt;
=== NVIDIA toolkit ===&lt;br /&gt;
&lt;br /&gt;
==== Driver Version ====&lt;br /&gt;
&lt;br /&gt;
The current NVIDIA driver version for gravity is 331.67.&lt;br /&gt;
&lt;br /&gt;
==== CUDA ====&lt;br /&gt;
&lt;br /&gt;
The current installed CUDA Toolkits for gravity are 3.2, 4.0, 4.1 (default), 4.2, 5.0, 5.5, and 6.0.  A cuda/6.5 module is installed as well, but only for the K20 GPU on the arcX node; that version will not work on the gravity nodes.  To use CUDA version 6.0 (recommended for the gravity nodes), just use the following module command&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/4.8.1 cuda/6.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
(gcc is a prerequisite of the cuda module; using earlier versions of gcc likely will also work.)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- Note that to use the full 6GB or memory per GPU, CUDA 3.2 or newer must be used. --&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The CUDA driver is installed locally, however the CUDA Toolkits are installed in:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
/scinet/arc/cuda-$VERSION/&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The environment variable $SCINET_CUDA_INSTALL is set when a cuda module is loaded and it points to the&lt;br /&gt;
install location.  This is useful when setting up makefiles and if you use the NVIDIA_SDK&lt;br /&gt;
build evironment, modify the NVIDIA_SDK/C/common/common.mk file accordingly.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
CUDA_INSTALL_PATH = $SCINET_CUDA_INSTALL &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The Nvidia cuda compiler (which uses gcc/4.4.6 by default for CUDA &amp;lt; 4.1, while cuda/4.2 uses gcc/4.6.1), is called &amp;lt;tt&amp;gt;nvcc&amp;lt;/tt&amp;gt;,&lt;br /&gt;
&lt;br /&gt;
You'll have to let the cuda compiler know about the capabilities of the Fermi graphics card by supplying the flag &lt;br /&gt;
&amp;lt;pre&amp;gt;-arch=sm_13&amp;lt;/pre&amp;gt; or &amp;lt;pre&amp;gt;-arch=sm_20&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== NVIDIA Toolkit ====&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For cuda 5.0 and 5.5, the sdk code samples can be copied from the directory&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$SCINET_CUDA_INSTALL/samples/&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
NOTE: Not all of the CUDA and OpenCL examples will compile as many require OpenGL graphic libraries not installed on the nodes.&lt;br /&gt;
&lt;br /&gt;
==== OpenCL ====&lt;br /&gt;
 &lt;br /&gt;
As of 3.0, OpenCL 1.1 is included in the CUDA Toolkit so loading the CUDA module is all that is required.&lt;br /&gt;
&lt;br /&gt;
===PGI compilers===&lt;br /&gt;
&lt;br /&gt;
As of July 2012, The PGI suite of compilers is installed.  These can be accessed by &lt;br /&gt;
&amp;lt;pre&amp;gt;$  module load gcc/4.6.1 pgi/12.6&amp;lt;/pre&amp;gt;&lt;br /&gt;
(if you use the older pgi/12.5, gcc/4.4.6 is a requirement, and is used, for instance, in the CUDA parts of the PGI compilers). These compilers use their own cuda installation, so you do not need to load an additional cuda module. By default, they use a cuda 4.1 installation, but you can request cuda 4.2 as well using the &amp;lt;tt&amp;gt;-Mcuda=4.2&amp;lt;/tt&amp;gt; option.&lt;br /&gt;
&lt;br /&gt;
The compilation commands are pgcc, pgcpp and pgfortran for c, c++ and fortran, respectively. As usual, we advice to compiler with optimization using the flags&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
-O4 -fastsse&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
The compilers will then optimize for the specific machine that you are compiling on.&lt;br /&gt;
&lt;br /&gt;
The PGI compilers support OpenMP as well through the compile and link flags&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
-mp&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== CUDA Fortran ====&lt;br /&gt;
&lt;br /&gt;
The PGI fortran compiler (&amp;lt;tt&amp;gt;pgfortran&amp;lt;/tt&amp;gt;, also &amp;lt;tt&amp;gt;pgf77&amp;lt;/tt&amp;gt; and &amp;lt;tt&amp;gt;pgf90&amp;lt;/tt&amp;gt;) understands CUDA extensions to fortran. &lt;br /&gt;
This compiler will automatically understand these extension for source files with the file extension &amp;lt;tt&amp;gt;.cuf&amp;lt;/tt&amp;gt;  Otherwise, you have to specify &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
-Mcuda=4.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== OpenACC ====&lt;br /&gt;
&lt;br /&gt;
OpenACC is a compiler-directive approach to GPGPU programming. The PGI compilers (c, c++ and fortran) have a partial implementation of this open specification. To switch this on, use the options&lt;br /&gt;
&amp;lt;pre&amp;gt;-acc -ta=nvidia -Mcuda=4.1&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====More documentation====&lt;br /&gt;
&lt;br /&gt;
Manuals are on the [[Knowledge_Base:_Tutorials_and_Manuals| Tutorials and Manuals]] page.&lt;br /&gt;
&lt;br /&gt;
===Other compilers===&lt;br /&gt;
&lt;br /&gt;
* '''gcc,g++,gfortran''' - GNU compiler (nvcc need to have either gcc-4.4 or gcc-4.6 module loaded to work correctly)&lt;br /&gt;
* '''icc,icpc,ifort''' - Intel compiler&lt;br /&gt;
&lt;br /&gt;
===Debuggers===&lt;br /&gt;
&lt;br /&gt;
* '''ddt''' - Allinea's graphical DDT debugger, in the &amp;lt;tt&amp;gt;ddt&amp;lt;/tt&amp;gt; module. The most recent version, ddt 4.0 supports cuda 4.0, 4.1, 4.2 and 5.0.&lt;br /&gt;
* '''cuda-gdb''' - Nvidia text based gdb variant, part of the cuda module.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Note that to debug both host and cuda device code, you have to give the &amp;lt;pre&amp;gt;-g -G&amp;lt;/pre&amp;gt; pair of flags to nvcc.&lt;br /&gt;
&lt;br /&gt;
===MPI===&lt;br /&gt;
&lt;br /&gt;
The GPC MPI packages can be used on this system. See the GPC section on [[ GPC_Quickstart#MPI |MPI ]] for more details.&lt;br /&gt;
&lt;br /&gt;
While these mpi packages should work with the PGI compilers as well, this has not been tested and standard wrappers like mpif90 may not work.&lt;br /&gt;
&lt;br /&gt;
Alternatively, for mpi compilations with the PGI compilers, you can load the mpich1 mpi implementation with &amp;lt;pre&amp;gt;module load mpich1/pgi&amp;lt;/pre&amp;gt;after which you can use the option &amp;lt;pre&amp;gt;-Mmpi&amp;lt;/pre&amp;gt; or the wrapper scripts &amp;lt;tt&amp;gt;mpicc, mpiCC and mpif90&amp;lt;/tt&amp;gt;, as well as &amp;lt;tt&amp;gt;mpirun&amp;lt;/tt&amp;gt;.  &lt;br /&gt;
&lt;br /&gt;
=== Driver Version ===&lt;br /&gt;
&lt;br /&gt;
The current NVIDIA driver version installed is 331.67&lt;br /&gt;
&lt;br /&gt;
== Documentation ==&lt;br /&gt;
* CUDA&lt;br /&gt;
** [http://lmgtfy.com/?q=cuda google &amp;quot;CUDA&amp;quot;]&lt;br /&gt;
&lt;br /&gt;
* OpenCL&lt;br /&gt;
** see above&lt;br /&gt;
&lt;br /&gt;
== User Codes ==&lt;br /&gt;
&lt;br /&gt;
Please discuss and put any relevant information/problems/best practices you have encountered when using/developing for CUDA and/or OpenCL&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Sandy&amp;diff=9174</id>
		<title>Sandy</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Sandy&amp;diff=9174"/>
		<updated>2018-02-27T18:03:45Z</updated>

		<summary type="html">&lt;p&gt;Northrup: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Infobox Computer&lt;br /&gt;
|image=[[Image:Ibm_idataplex_dx360_m4.jpg|center|300px|thumb]]&lt;br /&gt;
|name=Sandy&lt;br /&gt;
|installed=February 2013&lt;br /&gt;
|operatingsystem= Linux Centos 6.4&lt;br /&gt;
|loginnode= gpc0[1-4] (from &amp;lt;tt&amp;gt;login.scinet&amp;lt;/tt&amp;gt;)&lt;br /&gt;
|nnodes=44 (704 cores)&lt;br /&gt;
|rampernode=64 Gb &lt;br /&gt;
|corespernode=16 (32 threads)&lt;br /&gt;
|interconnect=QDR Infiniband&lt;br /&gt;
|vendorcompilers=icc,gcc&lt;br /&gt;
|queuetype=Torque&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''&amp;lt;font color=red&amp;gt;The Sandy cluster will be decommissioned by the end of 2017. A new system for large parallel jobs, Niagara, will be replacing the [[TCS_Quickstart|TCS]], the [[GPC_Quickstart|GPC]] and contributed systems like Sandy, and is expected to be in production in early 2018. The aim is to keep at least 50% of the GPC available during the installation of the new system. Users will be informed about further details of the transition as they become available. &amp;lt;/font&amp;gt;'''&lt;br /&gt;
&lt;br /&gt;
The Sandybrdige (Sandy) cluster, consists of 76 x86_64 nodes each with two octal core Intel Xeon (Sandybridge) E5-2650 2.0GHz CPUs with 64GB of RAM per node. &lt;br /&gt;
The nodes are interconnected with 2.6:1 blocking QDR Infiniband for MPI communications&lt;br /&gt;
and disk I/O to the SciNet GPFS filesystems.  In total this cluster contains 1216 x86_64 cores with 4,864 GB of total RAM. &lt;br /&gt;
&lt;br /&gt;
'''NB -''' Sandy is a user-contributed system acquired through a CFI LOF to a specific PI. Policies regarding use by other groups are under development and subject to change at any time. &lt;br /&gt;
&lt;br /&gt;
== Nodes ==&lt;br /&gt;
&lt;br /&gt;
=== Login ===&lt;br /&gt;
First login via ssh with your scinet account at &amp;lt;tt&amp;gt;login.scinet.utoronto.ca&amp;lt;/tt&amp;gt;, and from there you can proceed to the normal GPC devel nodes '''&amp;lt;tt&amp;gt;gpc0[1-8]&amp;lt;/tt&amp;gt;'''.&lt;br /&gt;
&lt;br /&gt;
===Compilers===&lt;br /&gt;
&lt;br /&gt;
The Sandy nodes are fully compatible with all software/modules built for the standard GPC nodes, see [[GPC_Quickstart#Compilers | GPC Quickstart Compilers ]]; however as they a newer architecture they also have added CPU instructions that your program may benefit from. To ensure that you are using these sandy specific optimizations use the following Intel compiler flags with the latest Intel compiler when you compile specifically for the sandy nodes.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load intel/14.0.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Optimize your code for the SANDY nodes using the following compiler flags: &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
   -O3 -march=corei7-avx &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
*More questions about compiling? See the [[FAQ#Compiling_your_Code|FAQ]].&lt;br /&gt;
*NOTE: Code compiled using these option may not be backwards compatible with the regular GPC nodes.&lt;br /&gt;
&lt;br /&gt;
=== Compute ===&lt;br /&gt;
&lt;br /&gt;
To access the sandybridge compute nodes you need to use the queue, similar to the standard GPC compute nodes.&lt;br /&gt;
Currently the nodes are scheduled by complete node, 16 cores and a maximum walltime of 48 hours.&lt;br /&gt;
&lt;br /&gt;
For an interactive job use&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qsub -l nodes=1:ppn=16,walltime=12:00:00 -q sandy -I&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
or for a batch job use&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qsub script.sh &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
where &amp;lt;tt&amp;gt;script.sh&amp;lt;/tt&amp;gt; is&lt;br /&gt;
&amp;lt;source lang=&amp;quot;bash&amp;quot;&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
# Torque submission script for Sandy&lt;br /&gt;
#&lt;br /&gt;
#PBS -l nodes=2:ppn=16,walltime=1:00:00&lt;br /&gt;
#PBS -N sandytest&lt;br /&gt;
#PBS -q sandy&lt;br /&gt;
cd $PBS_O_WORKDIR&lt;br /&gt;
&lt;br /&gt;
# EXECUTION COMMAND; -np = nodes*ppn&lt;br /&gt;
mpirun -np 32 ./a.out&lt;br /&gt;
&amp;lt;/source&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To check running jobs on the sandy nodes only use&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
showq -w class=sandy&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Software ==&lt;br /&gt;
&lt;br /&gt;
The same software installed on the GPC is available on Sandy using the same modules framework. &lt;br /&gt;
See [[GPC_Quickstart#Modules_and_Environment_Variables | here]] for full details.&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9173</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9173"/>
		<updated>2018-02-27T17:44:07Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon 27Feb 2018 09:31:00 EST &amp;lt;/b&amp;gt; Gravity decommissioned Tuesday Feb 27th.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Feb 15 14:39:57 EST 2018 &amp;lt;/b&amp;gt; Electrical work is finished, systems back in normal operation. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Feb 15 8:15:00 EST 2018 &amp;lt;/b&amp;gt; Connectivity to the data centre has been restored. Only the P7 system still seems to be unreachable, all other systems are up (within the limitations due to the scheduled partial outage today).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Feb 14 7:30:00 EST 2018 &amp;lt;/b&amp;gt; Connectivity to the data centre failed, so none of the systems can be reached. We are investigating if the connectivity can be restored before today's partial outage is over.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Sun Feb 11 13:14:44 EST 2018 &amp;lt;/b&amp;gt; Scheduled partial system outage February 15th, 8am-5pm, for electrical work.  HPSS, atlas and some GPC nodes will be unavailable.    &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Sun Feb 11 13:14:44 EST 2018 &amp;lt;/b&amp;gt; Gravity GPU Cluster to be decommissioned February 26th.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed Feb 09 10:22:40 EST 2018 &amp;lt;/b&amp;gt; HPSS is back in service&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed Jan 31 10:16:35 EST 2018 &amp;lt;/b&amp;gt; The HPSS core node had a hardware failure overnight. Vendor has been contacted for support and part replacement.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon Jan 29 17:39:03 EST 2018 &amp;lt;/b&amp;gt; Maintenance is finished. Most systems will come back online in less than one hour.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Thu Jan 18 21:03:04 EST 2018&amp;lt;/b&amp;gt; '''All systems will be unavailable January 29th 7am thru January 30th for annual scheduled maintenance.''' &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 13:35:49 EST 2017&amp;lt;/b&amp;gt; All systems have been restored, and jobs are running.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 7:00:00 EST 2017&amp;lt;/b&amp;gt; Power outage at the data centre.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_User_Support_Library&amp;diff=9172</id>
		<title>SciNet User Support Library</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_User_Support_Library&amp;diff=9172"/>
		<updated>2018-02-27T17:43:21Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System News */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;__NOTOC__&lt;br /&gt;
{|  style=&amp;quot;border-spacing: 8px;&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:3px solid #0645ad; background-color:#f6f6f6; border-radius:7px&amp;quot;|&lt;br /&gt;
{{SciNetWiki:System_Alerts}}&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:3px solid #0645ad; background-color:#f6f6f6; border-radius:7px&amp;quot;|&lt;br /&gt;
==System News==&lt;br /&gt;
&lt;br /&gt;
* Feb 27, 2017 : gravity decommissioned.&lt;br /&gt;
&lt;br /&gt;
* Dec 4, 2017 : scratchtcs decommissioned. &lt;br /&gt;
&lt;br /&gt;
* Dec 1, 2017, 00:00 am to 06:00 am:  The connection to the data center will be down for a scheduled network maintenance.  Jobs will continue to run, but login sessions will be terminated at midnight.&lt;br /&gt;
&lt;br /&gt;
* Nov 28, 2017, 12:00 noon: The [[GPC_Quickstart | GPC]] will be reduced from 30,912 to 16,800 cores to make room for the installation of Niagara.&lt;br /&gt;
&lt;br /&gt;
* Sept 29, 2017: The [[TCS_Quickstart | TCS]] was decommissioned on Sept. 29, 2017&lt;br /&gt;
&lt;br /&gt;
* Mar 3:  GPC:  Version 7.0 of Allinea Forge (DDT Debugger, MAP, Performance Reports) installed as a module.&lt;br /&gt;
&lt;br /&gt;
* Jan 26:  New larger (1.8PB) $SCRATCH storage brought online.   &lt;br /&gt;
&lt;br /&gt;
* Oct 24: P8: 2 new Power 8 Development Nodes, [[ P8 ]], with 4x Nvidia P100 (Pascal) GPUs, available for users.&lt;br /&gt;
&lt;br /&gt;
* Sept 19: KNL: Intel Knights Landing Development Nodes, [[Knights_Landing | KNL ]], available for users.&lt;br /&gt;
&lt;br /&gt;
* Sept 13: GPC:  Version 6.1 of Allinea Forge (DDT Debugger, MAP, Performance Reports) installed as a module.&lt;br /&gt;
&lt;br /&gt;
* Sept 13: GPC: Version 17.0.0 of the Intel Compiler and Tools are installed as modules.&lt;br /&gt;
&lt;br /&gt;
* Aug 20:  P8: Power 8 Development Nodes, [[ P8 ]], with 2x Nvidia K80, GPUs available for users.&lt;br /&gt;
&lt;br /&gt;
([[Previous System News]])&lt;br /&gt;
|-&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; width=&amp;quot;50%&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#f6e8e8; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== QuickStart Guides ==&lt;br /&gt;
* [[SciNet Command Line Utilities]]&lt;br /&gt;
* [[Media:SciNet_Tutorial.pdf|SciNet User Tutorial]]&lt;br /&gt;
* [[GPC_Quickstart|GPC: General Purpose Cluster]]&lt;br /&gt;
* [[Sandy| Sandy: Intel Sandybridge nodes ]]&lt;br /&gt;
* [[Gravity| Gravity: GPU nodes]]&lt;br /&gt;
* [[P7_Linux_Cluster|P7: Power 7 Linux cluster]]&lt;br /&gt;
* [[BGQ|BGQ: BlueGene/Q clusters]]&lt;br /&gt;
* [[Software_and_Libraries|Software and libraries]]&lt;br /&gt;
* [[Data_Management|Data management]]&lt;br /&gt;
* [[FAQ | FAQ (frequently asked questions)]]&lt;br /&gt;
* [[Acknowledging_SciNet | Acknowledging SciNet]]&lt;br /&gt;
* [https://courses.scinet.utoronto.ca SciNet education]&lt;br /&gt;
* [https://www.youtube.com/channel/UC42CaO-AAQhwqa8RGzE3daQ SciNet YouTube Channel]&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#e8f6e8; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== Tutorials and Manuals ==&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#SciNet_Basics|SciNet basics]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Linux|Linux]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Batch_job_management|Batch job management]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Programming|Programming]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Parallel_Programming|Parallel programming]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#GPU_Computing|GPU computing]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Performance_Tuning|Performance tuning]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Debugging|Debugging]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Math_libraries_.28BLAS.2C_LAPACK.2C_FFT.29|Math libraries (BLAS, LAPACK, FFT)]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#I/O|I/O and databases]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Visualization|Visualization]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Applications|Applications]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Manuals|Manuals (compilers etc)]]&lt;br /&gt;
* [[2016_Ontario_Summer_School_for_High_Performance_Computing_Central]]&lt;br /&gt;
* [[2015_Ontario_Summer_School_for_High_Performance_Computing_Central]]&lt;br /&gt;
|-&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#e8e8f6; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== What's New On The Wiki ==&lt;br /&gt;
&amp;lt;div style='text-align:left;'&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: Updated [[GPC Quickstart]] with info on email notifications from the scheduler.&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: [[Hdf5]] compilation page updated.&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: [[Research Computing with Python 2014]] lectures 5-8&lt;br /&gt;
&lt;br /&gt;
* Nov 2014: &amp;quot;Modern CUDA Features&amp;quot; TechTalk slides in [[SciNet TechTalks and Seminars]].&lt;br /&gt;
&lt;br /&gt;
* Nov 2014: [[Research Computing with Python 2014]] lectures 1-4&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: [[User_Tips#Reducing_virtual_memory_consumption_for_multithreaded_programs| Tip on reducing virtual memory consumption for multithreaded programs]]&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Improved information on the [[Python]] versions installed on the GPC, and which modules are included in each version.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Description on using job arrays on the GPC on the [[Scheduler]] page.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: [[Intro to Tkinter|Tkinter instructions]], [[Media:Tkinter.pdf|slides]] and [[Media:Tkinter_code.tgz|code]] for the TkInter workshop held in September.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Instructions on using [[Hadoop for HPCers|Hadoop]] (for the Hadoop workshop held in September).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
Previous new stuff can be found in the [[What's new archive]].&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#f4f4fa; border-radius:7px&amp;quot; |&lt;br /&gt;
&amp;lt;div style='text-align:left;'&amp;gt;&lt;br /&gt;
{{SciNetWiki:Community_Portal}}&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
{{#Widget:Twitter|shell.background=#9fb1c2|tweets.links=#4775c1|tweets.color=#000000|tweets.background=#ffffff|user=SciNetHPC}}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [[Old Main Page]] --!&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9171</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9171"/>
		<updated>2018-02-27T17:42:46Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Gravity]][[Gravity]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon 27Feb 2018 09:31:00 EST &amp;lt;/b&amp;gt; Gravity decommissioned Tuesday Feb 27th.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Feb 15 14:39:57 EST 2018 &amp;lt;/b&amp;gt; Electrical work is finished, systems back in normal operation. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Feb 15 8:15:00 EST 2018 &amp;lt;/b&amp;gt; Connectivity to the data centre has been restored. Only the P7 system still seems to be unreachable, all other systems are up (within the limitations due to the scheduled partial outage today).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Feb 14 7:30:00 EST 2018 &amp;lt;/b&amp;gt; Connectivity to the data centre failed, so none of the systems can be reached. We are investigating if the connectivity can be restored before today's partial outage is over.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Sun Feb 11 13:14:44 EST 2018 &amp;lt;/b&amp;gt; Scheduled partial system outage February 15th, 8am-5pm, for electrical work.  HPSS, atlas and some GPC nodes will be unavailable.    &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Sun Feb 11 13:14:44 EST 2018 &amp;lt;/b&amp;gt; Gravity GPU Cluster to be decommissioned February 26th.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed Feb 09 10:22:40 EST 2018 &amp;lt;/b&amp;gt; HPSS is back in service&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed Jan 31 10:16:35 EST 2018 &amp;lt;/b&amp;gt; The HPSS core node had a hardware failure overnight. Vendor has been contacted for support and part replacement.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon Jan 29 17:39:03 EST 2018 &amp;lt;/b&amp;gt; Maintenance is finished. Most systems will come back online in less than one hour.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Thu Jan 18 21:03:04 EST 2018&amp;lt;/b&amp;gt; '''All systems will be unavailable January 29th 7am thru January 30th for annual scheduled maintenance.''' &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 13:35:49 EST 2017&amp;lt;/b&amp;gt; All systems have been restored, and jobs are running.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 7:00:00 EST 2017&amp;lt;/b&amp;gt; Power outage at the data centre.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9170</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9170"/>
		<updated>2018-02-27T03:33:09Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Gravity]][[Gravity]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon 26 Feb 2018 09:31:00 EST &amp;lt;/b&amp;gt; Gravity will be decommissioned Tuesday Feb 27th.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Feb 15 14:39:57 EST 2018 &amp;lt;/b&amp;gt; Electrical work is finished, systems back in normal operation. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Feb 15 8:15:00 EST 2018 &amp;lt;/b&amp;gt; Connectivity to the data centre has been restored. Only the P7 system still seems to be unreachable, all other systems are up (within the limitations due to the scheduled partial outage today).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Feb 14 7:30:00 EST 2018 &amp;lt;/b&amp;gt; Connectivity to the data centre failed, so none of the systems can be reached. We are investigating if the connectivity can be restored before today's partial outage is over.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Sun Feb 11 13:14:44 EST 2018 &amp;lt;/b&amp;gt; Scheduled partial system outage February 15th, 8am-5pm, for electrical work.  HPSS, atlas and some GPC nodes will be unavailable.    &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Sun Feb 11 13:14:44 EST 2018 &amp;lt;/b&amp;gt; Gravity GPU Cluster to be decommissioned February 26th.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed Feb 09 10:22:40 EST 2018 &amp;lt;/b&amp;gt; HPSS is back in service&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed Jan 31 10:16:35 EST 2018 &amp;lt;/b&amp;gt; The HPSS core node had a hardware failure overnight. Vendor has been contacted for support and part replacement.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon Jan 29 17:39:03 EST 2018 &amp;lt;/b&amp;gt; Maintenance is finished. Most systems will come back online in less than one hour.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Thu Jan 18 21:03:04 EST 2018&amp;lt;/b&amp;gt; '''All systems will be unavailable January 29th 7am thru January 30th for annual scheduled maintenance.''' &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 13:35:49 EST 2017&amp;lt;/b&amp;gt; All systems have been restored, and jobs are running.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 7:00:00 EST 2017&amp;lt;/b&amp;gt; Power outage at the data centre.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9169</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9169"/>
		<updated>2018-02-26T14:32:21Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Gravity]][[Gravity]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon 26 Feb 2018 09:31:00 EST &amp;lt;/b&amp;gt; Gravity will be decommissioned Tuesday Feb 27th.  &amp;lt;/b&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Feb 15 14:39:57 EST 2018 &amp;lt;/b&amp;gt; Electrical work is finished, systems back in normal operation. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Feb 15 8:15:00 EST 2018 &amp;lt;/b&amp;gt; Connectivity to the data centre has been restored. Only the P7 system still seems to be unreachable, all other systems are up (within the limitations due to the scheduled partial outage today).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Feb 14 7:30:00 EST 2018 &amp;lt;/b&amp;gt; Connectivity to the data centre failed, so none of the systems can be reached. We are investigating if the connectivity can be restored before today's partial outage is over.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Sun Feb 11 13:14:44 EST 2018 &amp;lt;/b&amp;gt; Scheduled partial system outage February 15th, 8am-5pm, for electrical work.  HPSS, atlas and some GPC nodes will be unavailable.    &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Sun Feb 11 13:14:44 EST 2018 &amp;lt;/b&amp;gt; Gravity GPU Cluster to be decommissioned February 26th.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed Feb 09 10:22:40 EST 2018 &amp;lt;/b&amp;gt; HPSS is back in service&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed Jan 31 10:16:35 EST 2018 &amp;lt;/b&amp;gt; The HPSS core node had a hardware failure overnight. Vendor has been contacted for support and part replacement.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon Jan 29 17:39:03 EST 2018 &amp;lt;/b&amp;gt; Maintenance is finished. Most systems will come back online in less than one hour.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Thu Jan 18 21:03:04 EST 2018&amp;lt;/b&amp;gt; '''All systems will be unavailable January 29th 7am thru January 30th for annual scheduled maintenance.''' &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 13:35:49 EST 2017&amp;lt;/b&amp;gt; All systems have been restored, and jobs are running.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 7:00:00 EST 2017&amp;lt;/b&amp;gt; Power outage at the data centre.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9165</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9165"/>
		<updated>2018-02-15T19:41:33Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Gravity]][[Gravity]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:down.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Feb 15 14:39:57 EST 2018 &amp;lt;/b&amp;gt; Electrical work is finished, systems back in normal operation. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Feb 15 8:15:00 EST 2018 &amp;lt;/b&amp;gt; Connectivity to the data centre has been restored. Only the P7 system still seems to be unreachable, all other systems are up (within the limitations due to the scheduled partial outage today).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Feb 14 7:30:00 EST 2018 &amp;lt;/b&amp;gt; Connectivity to the data centre failed, so none of the systems can be reached. We are investigating if the connectivity can be restored before today's partial outage is over.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Sun Feb 11 13:14:44 EST 2018 &amp;lt;/b&amp;gt; Scheduled partial system outage February 15th, 8am-5pm, for electrical work.  HPSS, atlas and some GPC nodes will be unavailable.    &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Sun Feb 11 13:14:44 EST 2018 &amp;lt;/b&amp;gt; Gravity GPU Cluster to be decommissioned February 26th.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed Feb 09 10:22:40 EST 2018 &amp;lt;/b&amp;gt; HPSS is back in service&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed Jan 31 10:16:35 EST 2018 &amp;lt;/b&amp;gt; The HPSS core node had a hardware failure overnight. Vendor has been contacted for support and part replacement.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon Jan 29 17:39:03 EST 2018 &amp;lt;/b&amp;gt; Maintenance is finished. Most systems will come back online in less than one hour.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Thu Jan 18 21:03:04 EST 2018&amp;lt;/b&amp;gt; '''All systems will be unavailable January 29th 7am thru January 30th for annual scheduled maintenance.''' &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 13:35:49 EST 2017&amp;lt;/b&amp;gt; All systems have been restored, and jobs are running.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 7:00:00 EST 2017&amp;lt;/b&amp;gt; Power outage at the data centre.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9161</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9161"/>
		<updated>2018-02-15T14:09:05Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Gravity]][[Gravity]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:down.png|down|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:down.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Feb 15 8:15:00 EST 2018 &amp;lt;/b&amp;gt; Connectivity to the data centre has been restored. Only the P7 system still seems to be unreachable, all other systems are up (within the limitations due to the scheduled partial outage today).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Feb 14 7:30:00 EST 2018 &amp;lt;/b&amp;gt; Connectivity to the data centre failed, so none of the systems can be reached. We are investigating if the connectivity can be restored before today's partial outage is over.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Sun Feb 11 13:14:44 EST 2018 &amp;lt;/b&amp;gt; Partial system outage February 15th, 8am-5pm, for electrical work.  HPSS, atlas and some GPC nodes will be unavailable.    &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Sun Feb 11 13:14:44 EST 2018 &amp;lt;/b&amp;gt; Gravity GPU Cluster to be decommissioned February 26th.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed Feb 09 10:22:40 EST 2018 &amp;lt;/b&amp;gt; HPSS is back in service&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed Jan 31 10:16:35 EST 2018 &amp;lt;/b&amp;gt; The HPSS core node had a hardware failure overnight. Vendor has been contacted for support and part replacement.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon Jan 29 17:39:03 EST 2018 &amp;lt;/b&amp;gt; Maintenance is finished. Most systems will come back online in less than one hour.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Thu Jan 18 21:03:04 EST 2018&amp;lt;/b&amp;gt; '''All systems will be unavailable January 29th 7am thru January 30th for annual scheduled maintenance.''' &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 13:35:49 EST 2017&amp;lt;/b&amp;gt; All systems have been restored, and jobs are running.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 7:00:00 EST 2017&amp;lt;/b&amp;gt; Power outage at the data centre.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9160</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9160"/>
		<updated>2018-02-15T13:44:47Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Gravity]][[Gravity]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:down.png|down|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:down.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Feb 14 8:15:00 EST 2018 &amp;lt;/b&amp;gt; Connectivity to the data centre has been restored. Only the P7 system still seems to be unreachable, all other systems are up (within the limitations due to the scheduled partial outage today).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Feb 14 7:30:00 EST 2018 &amp;lt;/b&amp;gt; Connectivity to the data centre failed, so none of the systems can be reached. We are investigating if the connectivity can be restored before today's partial outage is over.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Sun Feb 11 13:14:44 EST 2018 &amp;lt;/b&amp;gt; Partial system outage February 15th, 8am-5pm, for electrical work.  HPSS, atlas and some GPC nodes will be unavailable.    &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Sun Feb 11 13:14:44 EST 2018 &amp;lt;/b&amp;gt; Gravity GPU Cluster to be decommissioned February 26th.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed Feb 09 10:22:40 EST 2018 &amp;lt;/b&amp;gt; HPSS is back in service&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed Jan 31 10:16:35 EST 2018 &amp;lt;/b&amp;gt; The HPSS core node had a hardware failure overnight. Vendor has been contacted for support and part replacement.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon Jan 29 17:39:03 EST 2018 &amp;lt;/b&amp;gt; Maintenance is finished. Most systems will come back online in less than one hour.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Thu Jan 18 21:03:04 EST 2018&amp;lt;/b&amp;gt; '''All systems will be unavailable January 29th 7am thru January 30th for annual scheduled maintenance.''' &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 13:35:49 EST 2017&amp;lt;/b&amp;gt; All systems have been restored, and jobs are running.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 7:00:00 EST 2017&amp;lt;/b&amp;gt; Power outage at the data centre.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9150</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9150"/>
		<updated>2018-02-11T18:18:19Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Gravity]][[Gravity]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Sun Feb 11 13:14:44 EST 2018 &amp;lt;/b&amp;gt; Partial system outage February 15th, 8am-5pm, for electrical work.  HPSS, atlas and some GPC nodes will be unavailable.    &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Sun Feb 11 13:14:44 EST 2018 &amp;lt;/b&amp;gt; Gravity GPU Cluster to be decommissioned February 26th.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed Feb 09 10:22:40 EST 2018 &amp;lt;/b&amp;gt; HPSS is back in service&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Wed Jan 31 10:16:35 EST 2018 &amp;lt;/b&amp;gt; The HPSS core node had a hardware failure overnight. Vendor has been contacted for support and part replacement.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Mon Jan 29 17:39:03 EST 2018 &amp;lt;/b&amp;gt; Maintenance is finished. Most systems will come back online in less than one hour.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Thu Jan 18 21:03:04 EST 2018&amp;lt;/b&amp;gt; '''All systems will be unavailable January 29th 7am thru January 30th for annual scheduled maintenance.''' &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 13:35:49 EST 2017&amp;lt;/b&amp;gt; All systems have been restored, and jobs are running.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 7:00:00 EST 2017&amp;lt;/b&amp;gt; Power outage at the data centre.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=P8&amp;diff=9139</id>
		<title>P8</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=P8&amp;diff=9139"/>
		<updated>2018-02-06T01:17:57Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* IBM Compilers */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Infobox Computer&lt;br /&gt;
|image=[[Image:P8_s822.jpg|center|300px|thumb]]&lt;br /&gt;
|name=P8 &lt;br /&gt;
|installed=June 2016&lt;br /&gt;
|operatingsystem= Linux RHEL 7.2 le / Ubuntu 16.04 le &lt;br /&gt;
|loginnode= p8t0[1-2] / p8t0[3-4]&lt;br /&gt;
|nnodes= 2x  Power8 with 2x NVIDIA K80,       2x Power 8 with  4x NVIDIA P100&lt;br /&gt;
|rampernode=512 GB&lt;br /&gt;
|corespernode= 2 x 8core (16 physical, 128 SMT)&lt;br /&gt;
|interconnect=Infiniband EDR &lt;br /&gt;
|vendorcompilers=xlc/xlf, nvcc&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
== Specifications==&lt;br /&gt;
&lt;br /&gt;
The P8 Test System consists of  of 4 IBM Power 822LC Servers each with 2x8core 3.25GHz Power8 CPUs and 512GB Ram. Similar to Power 7, the Power 8 utilizes Simultaneous MultiThreading (SMT), but extends the design to 8 threads per core allowing the 16 physical cores to support up to 128 threads.  2 nodes have two NVIDIA Tesla K80 GPUs with CUDA Capability 3.7 (Kepler), consisting of 2xGK210 GPUs each with 12 GB of RAM connected using PCI-E, and 2 others have 4x NVIDIA Tesla P100 GPUs each wit h 16GB of RAM with CUDA Capability 6.0 (Pascal) connected using NVlink.&lt;br /&gt;
&lt;br /&gt;
== Compile/Devel/Test ==&lt;br /&gt;
&lt;br /&gt;
First login via ssh with your scinet account at '''&amp;lt;tt&amp;gt;login.scinet.utoronto.ca&amp;lt;/tt&amp;gt;''', and from there you can ssh to &amp;lt;tt&amp;gt;p8t01&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;p8t02&amp;lt;/tt&amp;gt; for the K80 GPUs and to &amp;lt;tt&amp;gt;p8t03&amp;lt;/tt&amp;gt; or &amp;lt;tt&amp;gt;p8t04&amp;lt;/tt&amp;gt; for the Pascal GPUs.&lt;br /&gt;
&lt;br /&gt;
== Software for  ==&lt;br /&gt;
&lt;br /&gt;
==== GNU Compilers ====&lt;br /&gt;
&lt;br /&gt;
To load the newer advance toolchain version use:&lt;br /&gt;
&lt;br /&gt;
For '''&amp;lt;tt&amp;gt;p8t0[1-2]&amp;lt;/tt&amp;gt;''' &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/5.3.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For '''&amp;lt;tt&amp;gt;p8t0[3-4]&amp;lt;/tt&amp;gt;''' &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/6.2.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== IBM Compilers ====&lt;br /&gt;
&lt;br /&gt;
To load the native IBM xlc/xlc++ compilers&lt;br /&gt;
&lt;br /&gt;
For '''&amp;lt;tt&amp;gt;p8t0[1-2]&amp;lt;/tt&amp;gt;''' &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load xlc/13.1.4&lt;br /&gt;
module load xlf/13.1.4&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For '''&amp;lt;tt&amp;gt;p8t0[3-4]&amp;lt;/tt&amp;gt;''' &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load xlc/13.1.5_b2&lt;br /&gt;
module load xlf/13.1.5_b2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Driver Version ====&lt;br /&gt;
&lt;br /&gt;
The current NVIDIA driver version is 361.93&lt;br /&gt;
&lt;br /&gt;
==== CUDA ====&lt;br /&gt;
&lt;br /&gt;
The current installed CUDA Tookit is 8.0&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load cuda/8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The CUDA driver is installed locally, however the CUDA Toolkit is installed in:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
/usr/local/cuda-8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== OpenMPI ====&lt;br /&gt;
&lt;br /&gt;
Currently OpenMPI has been setup on the four nodes connected over QDR Infiniband.&lt;br /&gt;
&lt;br /&gt;
For '''&amp;lt;tt&amp;gt;p8t0[1-2]&amp;lt;/tt&amp;gt;''' &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load openmpi/1.10.3-gcc-5.3.1&lt;br /&gt;
$ module load openmpi/1.10.3-XL-13_15.1.4&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
For '''&amp;lt;tt&amp;gt;p8t0[3-4]&amp;lt;/tt&amp;gt;''' &lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load openmpi/1.10.3-gcc-6.2.1&lt;br /&gt;
$ module load openmpi/1.10.3-XL-13_15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== PE ====&lt;br /&gt;
&lt;br /&gt;
IBM's Parallel Environment (PE), is available for use with XL compilers using the following&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module pe/xl.perf&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
mpiexec -n 4 ./a.out&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
documentation is [http://publib.boulder.ibm.com/epubs/pdf/c2372832.pdf here]&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=File:Intro_Niagara.pdf&amp;diff=9121</id>
		<title>File:Intro Niagara.pdf</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=File:Intro_Niagara.pdf&amp;diff=9121"/>
		<updated>2018-01-25T14:17:27Z</updated>

		<summary type="html">&lt;p&gt;Northrup: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=File:Snug_techtalk_Niagara.pdf&amp;diff=9120</id>
		<title>File:Snug techtalk Niagara.pdf</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=File:Snug_techtalk_Niagara.pdf&amp;diff=9120"/>
		<updated>2018-01-25T14:15:18Z</updated>

		<summary type="html">&lt;p&gt;Northrup: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9119</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9119"/>
		<updated>2018-01-19T02:05:40Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Gravity]][[Gravity]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Thu Jan 18 21:03:04 EST 2018&amp;lt;/b&amp;gt; '''All systems will be unavailable January 29th 7am thru January 30th for annual scheduled maintenance.''' &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 13:35:49 EST 2017&amp;lt;/b&amp;gt; All systems have been restored, and jobs are running.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 7:00:00 EST 2017&amp;lt;/b&amp;gt; Power outage at the data centre.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Wed Dec 13 13:51:43 EST 2017&amp;lt;/b&amp;gt; The new and improved HPSS system is back in service. Please report any problems you may encounter. Note that Globus access to HPSS is disabled until further notice, due to lack of version compatibility.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9118</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9118"/>
		<updated>2018-01-19T02:04:53Z</updated>

		<summary type="html">&lt;p&gt;Northrup: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Gravity]][[Gravity]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Thu Jan 18 21:03:04 EST 2018&amp;lt;/b&amp;gt; All systems will be unavailable January 29th 7am thru January 30th for annual scheduled maintenance. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 13:35:49 EST 2017&amp;lt;/b&amp;gt; All systems have been restored, and jobs are running.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 7:00:00 EST 2017&amp;lt;/b&amp;gt; Power outage at the data centre.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Wed Dec 13 13:51:43 EST 2017&amp;lt;/b&amp;gt; The new and improved HPSS system is back in service. Please report any problems you may encounter. Note that Globus access to HPSS is disabled until further notice, due to lack of version compatibility.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9117</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9117"/>
		<updated>2018-01-19T02:02:18Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Gravity]][[Gravity]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Mon Jan 29 7:00:00 EST 2017 &amp;lt;/b&amp;gt; Scheduled Downtime for Annual Maintenance.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 13:35:49 EST 2017&amp;lt;/b&amp;gt; All systems have been restored, and jobs are running.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 7:00:00 EST 2017&amp;lt;/b&amp;gt; Power outage at the data centre.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Wed Dec 13 13:51:43 EST 2017&amp;lt;/b&amp;gt; The new and improved HPSS system is back in service. Please report any problems you may encounter. Note that Globus access to HPSS is disabled until further notice, due to lack of version compatibility.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9116</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9116"/>
		<updated>2018-01-19T01:59:44Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Gravity]][[Gravity]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|[[File:up.png|up|]]Scheduler&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 13:35:49 EST 2017&amp;lt;/b&amp;gt; All systems have been restored, and jobs are running.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Sat Dec 23 7:00:00 EST 2017&amp;lt;/b&amp;gt; Power outage at the data centre.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;Wed Dec 13 13:51:43 EST 2017&amp;lt;/b&amp;gt; The new and improved HPSS system is back in service. Please report any problems you may encounter. Note that Globus access to HPSS is disabled until further notice, due to lack of version compatibility.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=SOSCIP_GPU&amp;diff=9114</id>
		<title>SOSCIP GPU</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=SOSCIP_GPU&amp;diff=9114"/>
		<updated>2018-01-04T13:34:13Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* LINKS */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Infobox Computer&lt;br /&gt;
|image=[[Image:S882lc.png|center|300px|thumb]]&lt;br /&gt;
|name=SOSCIP GPU &lt;br /&gt;
|installed=September 2017&lt;br /&gt;
|operatingsystem= Ubuntu 16.04 le &lt;br /&gt;
|loginnode= sgc01 &lt;br /&gt;
|nnodes= 14x Power 8 with  4x NVIDIA P100&lt;br /&gt;
|rampernode=512 GB&lt;br /&gt;
|corespernode= 2 x 10core (20 physical, 160 SMT)&lt;br /&gt;
|interconnect=Infiniband EDR &lt;br /&gt;
|vendorcompilers=xlc/xlf, nvcc&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
== SOSCIP ==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU Cluster is a Southern Ontario Smart Computing Innovation Platform ([http://soscip.org/ SOSCIP]) resource located at theUniversity of Toronto's SciNet HPC facility. The SOSCIP  multi-university/industry consortium is funded by the Ontario Government and the Federal Economic Development Agency for Southern Ontario [http://www.research.utoronto.ca/about/our-research-partners/soscip/].&lt;br /&gt;
&lt;br /&gt;
== Support Email ==&lt;br /&gt;
&lt;br /&gt;
Please use [mailto:soscip-support@scinet.utoronto.ca &amp;lt;soscip-support@scinet.utoronto.ca&amp;gt;] for SOSCIP GPU specific inquiries.&lt;br /&gt;
&lt;br /&gt;
== Specifications==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU Cluster consists of  of 14 IBM Power 822LC &amp;quot;Minsky&amp;quot; Servers each with 2x10core 3.25GHz Power8 CPUs and 512GB Ram. Similar to Power 7, the Power 8 utilizes Simultaneous MultiThreading (SMT), but extends the design to 8 threads per core allowing the 20 physical cores to support up to 160 threads.  Each node has 4x NVIDIA Tesla P100 GPUs each with 16GB of RAM with CUDA Capability 6.0 (Pascal) connected using NVlink.&lt;br /&gt;
&lt;br /&gt;
== Compile/Devel/Test ==&lt;br /&gt;
&lt;br /&gt;
Access is provided through the BGQ login node, '''&amp;lt;tt&amp;gt; bgqdev.scinet.utoronto.ca &amp;lt;/tt&amp;gt;''' using ssh, and from there you can proceed to the GPU development node '''&amp;lt;tt&amp;gt;sgc01-ib0&amp;lt;/tt&amp;gt;'''.&lt;br /&gt;
&lt;br /&gt;
== Filesystem ==&lt;br /&gt;
&lt;br /&gt;
The filesystem is shared with the BGQ system.  See [https://wiki.scinet.utoronto.ca/wiki/index.php/BGQ#Filesystem here ] for details.&lt;br /&gt;
&lt;br /&gt;
== Job Submission ==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU cluster uses [https://slurm.schedmd.com/ SLURM ] as a job scheduler and jobs are scheduled by node, ie 20 cores and 4 GPUs each. Jobs are submitted from the development node '''&amp;lt;tt&amp;gt;sgc01&amp;lt;/tt&amp;gt;'''. The maximum walltime per job is 12 hours (except in the 'long' queue, see below) with up to 8 nodes.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ sbatch myjob.script&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Where myjob.script is &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH --nodes=1 &lt;br /&gt;
#SBATCH --ntasks=20  # MPI tasks (needed for srun) &lt;br /&gt;
#SBATCH --time=00:10:00  # H:M:S&lt;br /&gt;
#SBATCH --gres=gpu:4     # Ask for 4 GPUs per node&lt;br /&gt;
&lt;br /&gt;
cd $SLURM_SUBMIT_DIR&lt;br /&gt;
&lt;br /&gt;
hostname&lt;br /&gt;
nvidia-smi&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can queury job information using&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
squeue&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To cancel a job use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
scancel $JOBID&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Longer jobs ===&lt;br /&gt;
&lt;br /&gt;
If your job takes more than 12 hours, the sbatch command will not let you submit your job.  There is, however, a way to have jobs up to 24 hours long, by specifying &amp;quot;-p long&amp;quot; as an option (i.e., add &amp;lt;tt&amp;gt;#SBATCH -p long&amp;lt;/tt&amp;gt; to your job script).  The priority of such jobs may be throttled in the future if we see that the 'long' queue is having a negative efffect on turnover time in the queue.&lt;br /&gt;
&lt;br /&gt;
=== Interactive ===&lt;br /&gt;
&lt;br /&gt;
For an interactive session use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
salloc --gres=gpu:4&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Automatic Re-submission and Job Dependencies ===&lt;br /&gt;
&lt;br /&gt;
Commonly you may have a job that you know will take longer to run than what is permissible in the queue. As long as your program contains checkpoint or restart capability, you can have one job automatically submit the next. In the following example it is assumed that the program finishes before the time limit requested and then resubmits itself by logging into the development nodes.   Job dependencies and a maximum number of job re-submissions are used to ensure sequential operation.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash &lt;br /&gt;
&lt;br /&gt;
#SBATCH --nodes=1 &lt;br /&gt;
#SBATCH --ntasks=20  # MPI tasks (needed for srun) &lt;br /&gt;
#SBATCH --time=00:10:00  # H:M:S&lt;br /&gt;
#SBATCH --gres=gpu:4     # Ask for 4 GPUs per node&lt;br /&gt;
&lt;br /&gt;
cd $SLURM_SUBMIT_DIR&lt;br /&gt;
&lt;br /&gt;
: ${job_number:=&amp;quot;1&amp;quot;}           # set job_nubmer to 1 if it is undefined&lt;br /&gt;
job_number_max=3&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;hi from ${SLURM_JOB_ID}&amp;quot;&lt;br /&gt;
&lt;br /&gt;
#RUN JOB HERE&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
# SUBMIT NEXT JOB&lt;br /&gt;
if [[ ${job_number} -lt ${job_number_max} ]]&lt;br /&gt;
then&lt;br /&gt;
  (( job_number++ ))&lt;br /&gt;
  next_jobid=$(ssh sgc01-ib0 &amp;quot;cd $SLURM_SUBMIT_DIR; /opt/slurm/bin/sbatch --export=job_number=${job_number} -d afterok:${SLURM_JOB_ID} thisscript.sh | awk '{print $4}'&amp;quot;)&lt;br /&gt;
  echo &amp;quot;submitted ${next_jobid}&amp;quot;&lt;br /&gt;
fi&lt;br /&gt;
 &lt;br /&gt;
sleep 15&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;${SLURM_JOB_ID} done&amp;quot;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Software Installed ==&lt;br /&gt;
&lt;br /&gt;
=== IBM PowerAI ===&lt;br /&gt;
&lt;br /&gt;
The PowerAI platform contains popular open machine learning frameworks such as '''Caffe, TensorFlow, and Torch'''. Run the &amp;lt;tt&amp;gt;module avail&amp;lt;/tt&amp;gt; command for a complete listing. More information is available at this link: https://developer.ibm.com/linuxonpower/deep-learning-powerai/releases/. Release 4.0 is currently installed.&lt;br /&gt;
&lt;br /&gt;
==== GNU Compilers ====&lt;br /&gt;
&lt;br /&gt;
More recent versions of the GNU Compiler Collection (C/C++/Fortran) are provided in the IBM Advanced Toolchain with enhancements for the POWER8 CPU. To load the newer advance toolchain version use:&lt;br /&gt;
&lt;br /&gt;
Advanced Toolchain V10.0&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/6.3.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Advanced Toolchain V11.0&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/7.2.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
More information about the IBM Advanced Toolchain can be found here: [https://developer.ibm.com/linuxonpower/advance-toolchain/ https://developer.ibm.com/linuxonpower/advance-toolchain/]&lt;br /&gt;
&lt;br /&gt;
==== IBM XL Compilers ====&lt;br /&gt;
&lt;br /&gt;
To load the native IBM xlc/xlc++ and xlf (Fortran) compilers, run&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load xlc/13.1.5&lt;br /&gt;
module load xlf/15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
IBM XL Compilers are enabled for use with NVIDIA GPUs, including support for OpenMP 4.5 GPU offloading and integration with NVIDIA's nvcc command to compile host-side code for the POWER8 CPU.&lt;br /&gt;
&lt;br /&gt;
Information about the IBM XL Compilers can be found at the following links:&lt;br /&gt;
&lt;br /&gt;
[https://www.ibm.com/support/knowledgecenter/SSXVZZ_13.1.5/com.ibm.compilers.linux.doc/welcome.html IBM XL C/C++]&lt;br /&gt;
&lt;br /&gt;
[https://www.ibm.com/support/knowledgecenter/SSAT4T_15.1.5/com.ibm.compilers.linux.doc/welcome.html IBM XL Fortran]&lt;br /&gt;
&lt;br /&gt;
==== NVIDIA GPU Driver Version ====&lt;br /&gt;
&lt;br /&gt;
The current NVIDIA driver version is 384.66&lt;br /&gt;
&lt;br /&gt;
==== CUDA ====&lt;br /&gt;
&lt;br /&gt;
The current installed CUDA Tookits is are version 8.0 and version 9.0&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load cuda/8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
or &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load cuda/9.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The CUDA driver is installed locally, however the CUDA Toolkit is installed in:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
/usr/local/cuda-8.0&lt;br /&gt;
/usr/local/cuda-9.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Note that the &amp;lt;tt&amp;gt;/usr/local/cuda&amp;lt;/tt&amp;gt; directory is linked to the &amp;lt;tt&amp;gt;/usr/local/cuda-9.0&amp;lt;/tt&amp;gt; directory.&lt;br /&gt;
&lt;br /&gt;
Documentation and API reference information for the CUDA Toolkit can be found here: [http://docs.nvidia.com/cuda/index.html http://docs.nvidia.com/cuda/index.html]&lt;br /&gt;
&lt;br /&gt;
==== OpenMPI ====&lt;br /&gt;
&lt;br /&gt;
Currently OpenMPI has been setup on the 14 nodes connected over EDR Infiniband.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load openmpi/2.1.1-gcc-5.4.0&lt;br /&gt;
$ module load openmpi/2.1.1-XL-13_15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Other Software ==&lt;br /&gt;
&lt;br /&gt;
Other software packages can be installed onto the SOSCIP GPU Platform. It is best to try installing new software in your own home directory, which will give you control of the software (e.g. exact version, configuration, installing sub-packages, etc.).&lt;br /&gt;
&lt;br /&gt;
In the following subsections are instructions for installing several common software packages.&lt;br /&gt;
&lt;br /&gt;
=== Anaconda (Python) ===&lt;br /&gt;
&lt;br /&gt;
Anaconda is a popular distribution of the Python programming language. It contains several common Python libraries such as SciPy and NumPy as pre-built packages, which eases installation.&lt;br /&gt;
&lt;br /&gt;
Anaconda can be downloaded from here: [https://www.anaconda.com/download/#linux https://www.anaconda.com/download/#linux]&lt;br /&gt;
&lt;br /&gt;
NOTE: Be sure to download the '''Power8''' installer.&lt;br /&gt;
&lt;br /&gt;
TIP: If you plan to use Tensorflow within Anaconda, download the Python 2.7 version of Anaconda&lt;br /&gt;
&lt;br /&gt;
=== Keras ===&lt;br /&gt;
&lt;br /&gt;
Keras ([https://keras.io/ https://keras.io/]) is a popular high-level deep learning software development framework. It runs on top of other deep-learning frameworks such as TensorFlow.&lt;br /&gt;
&lt;br /&gt;
The easiest way to install Keras is to install Anaconda first, then install Keras by using using the pip command.&lt;br /&gt;
&lt;br /&gt;
Keras uses TensorFlow underneath to run neural network models. Before running code using Keras, be sure to load the PowerAI TensorFlow module and the cuda module.&lt;br /&gt;
&lt;br /&gt;
=== PyTorch ===&lt;br /&gt;
&lt;br /&gt;
PyTorch is the Python implementation of the Torch framework for deep learning. &lt;br /&gt;
&lt;br /&gt;
It is suggested that you use PyTorch within Anaconda.&lt;br /&gt;
&lt;br /&gt;
There is currently no build of PyTorch for POWER8-based systems. You will need to compile it from source.&lt;br /&gt;
&lt;br /&gt;
Obtain the source code from here: [http://pytorch.org/ http://pytorch.org/]&lt;br /&gt;
&lt;br /&gt;
Before building PyTorch, make sure to load cuda by running &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load cuda/8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
NOTE: Do not have the gcc modules loaded when building PyTorch. Use the default version of gcc (currently v5.4.0) included with the operating system. Build will fail with later versions of gcc.&lt;br /&gt;
&lt;br /&gt;
== LINKS ==&lt;br /&gt;
&lt;br /&gt;
[https://www.olcf.ornl.gov/kb_articles/summitdev-quickstart/#System_Overview  Summit Dev System at ORNL]&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=SOSCIP_GPU&amp;diff=9113</id>
		<title>SOSCIP GPU</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=SOSCIP_GPU&amp;diff=9113"/>
		<updated>2018-01-04T13:34:02Z</updated>

		<summary type="html">&lt;p&gt;Northrup: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Infobox Computer&lt;br /&gt;
|image=[[Image:S882lc.png|center|300px|thumb]]&lt;br /&gt;
|name=SOSCIP GPU &lt;br /&gt;
|installed=September 2017&lt;br /&gt;
|operatingsystem= Ubuntu 16.04 le &lt;br /&gt;
|loginnode= sgc01 &lt;br /&gt;
|nnodes= 14x Power 8 with  4x NVIDIA P100&lt;br /&gt;
|rampernode=512 GB&lt;br /&gt;
|corespernode= 2 x 10core (20 physical, 160 SMT)&lt;br /&gt;
|interconnect=Infiniband EDR &lt;br /&gt;
|vendorcompilers=xlc/xlf, nvcc&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
== SOSCIP ==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU Cluster is a Southern Ontario Smart Computing Innovation Platform ([http://soscip.org/ SOSCIP]) resource located at theUniversity of Toronto's SciNet HPC facility. The SOSCIP  multi-university/industry consortium is funded by the Ontario Government and the Federal Economic Development Agency for Southern Ontario [http://www.research.utoronto.ca/about/our-research-partners/soscip/].&lt;br /&gt;
&lt;br /&gt;
== Support Email ==&lt;br /&gt;
&lt;br /&gt;
Please use [mailto:soscip-support@scinet.utoronto.ca &amp;lt;soscip-support@scinet.utoronto.ca&amp;gt;] for SOSCIP GPU specific inquiries.&lt;br /&gt;
&lt;br /&gt;
== Specifications==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU Cluster consists of  of 14 IBM Power 822LC &amp;quot;Minsky&amp;quot; Servers each with 2x10core 3.25GHz Power8 CPUs and 512GB Ram. Similar to Power 7, the Power 8 utilizes Simultaneous MultiThreading (SMT), but extends the design to 8 threads per core allowing the 20 physical cores to support up to 160 threads.  Each node has 4x NVIDIA Tesla P100 GPUs each with 16GB of RAM with CUDA Capability 6.0 (Pascal) connected using NVlink.&lt;br /&gt;
&lt;br /&gt;
== Compile/Devel/Test ==&lt;br /&gt;
&lt;br /&gt;
Access is provided through the BGQ login node, '''&amp;lt;tt&amp;gt; bgqdev.scinet.utoronto.ca &amp;lt;/tt&amp;gt;''' using ssh, and from there you can proceed to the GPU development node '''&amp;lt;tt&amp;gt;sgc01-ib0&amp;lt;/tt&amp;gt;'''.&lt;br /&gt;
&lt;br /&gt;
== Filesystem ==&lt;br /&gt;
&lt;br /&gt;
The filesystem is shared with the BGQ system.  See [https://wiki.scinet.utoronto.ca/wiki/index.php/BGQ#Filesystem here ] for details.&lt;br /&gt;
&lt;br /&gt;
== Job Submission ==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU cluster uses [https://slurm.schedmd.com/ SLURM ] as a job scheduler and jobs are scheduled by node, ie 20 cores and 4 GPUs each. Jobs are submitted from the development node '''&amp;lt;tt&amp;gt;sgc01&amp;lt;/tt&amp;gt;'''. The maximum walltime per job is 12 hours (except in the 'long' queue, see below) with up to 8 nodes.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ sbatch myjob.script&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Where myjob.script is &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH --nodes=1 &lt;br /&gt;
#SBATCH --ntasks=20  # MPI tasks (needed for srun) &lt;br /&gt;
#SBATCH --time=00:10:00  # H:M:S&lt;br /&gt;
#SBATCH --gres=gpu:4     # Ask for 4 GPUs per node&lt;br /&gt;
&lt;br /&gt;
cd $SLURM_SUBMIT_DIR&lt;br /&gt;
&lt;br /&gt;
hostname&lt;br /&gt;
nvidia-smi&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can queury job information using&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
squeue&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To cancel a job use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
scancel $JOBID&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Longer jobs ===&lt;br /&gt;
&lt;br /&gt;
If your job takes more than 12 hours, the sbatch command will not let you submit your job.  There is, however, a way to have jobs up to 24 hours long, by specifying &amp;quot;-p long&amp;quot; as an option (i.e., add &amp;lt;tt&amp;gt;#SBATCH -p long&amp;lt;/tt&amp;gt; to your job script).  The priority of such jobs may be throttled in the future if we see that the 'long' queue is having a negative efffect on turnover time in the queue.&lt;br /&gt;
&lt;br /&gt;
=== Interactive ===&lt;br /&gt;
&lt;br /&gt;
For an interactive session use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
salloc --gres=gpu:4&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Automatic Re-submission and Job Dependencies ===&lt;br /&gt;
&lt;br /&gt;
Commonly you may have a job that you know will take longer to run than what is permissible in the queue. As long as your program contains checkpoint or restart capability, you can have one job automatically submit the next. In the following example it is assumed that the program finishes before the time limit requested and then resubmits itself by logging into the development nodes.   Job dependencies and a maximum number of job re-submissions are used to ensure sequential operation.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash &lt;br /&gt;
&lt;br /&gt;
#SBATCH --nodes=1 &lt;br /&gt;
#SBATCH --ntasks=20  # MPI tasks (needed for srun) &lt;br /&gt;
#SBATCH --time=00:10:00  # H:M:S&lt;br /&gt;
#SBATCH --gres=gpu:4     # Ask for 4 GPUs per node&lt;br /&gt;
&lt;br /&gt;
cd $SLURM_SUBMIT_DIR&lt;br /&gt;
&lt;br /&gt;
: ${job_number:=&amp;quot;1&amp;quot;}           # set job_nubmer to 1 if it is undefined&lt;br /&gt;
job_number_max=3&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;hi from ${SLURM_JOB_ID}&amp;quot;&lt;br /&gt;
&lt;br /&gt;
#RUN JOB HERE&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
# SUBMIT NEXT JOB&lt;br /&gt;
if [[ ${job_number} -lt ${job_number_max} ]]&lt;br /&gt;
then&lt;br /&gt;
  (( job_number++ ))&lt;br /&gt;
  next_jobid=$(ssh sgc01-ib0 &amp;quot;cd $SLURM_SUBMIT_DIR; /opt/slurm/bin/sbatch --export=job_number=${job_number} -d afterok:${SLURM_JOB_ID} thisscript.sh | awk '{print $4}'&amp;quot;)&lt;br /&gt;
  echo &amp;quot;submitted ${next_jobid}&amp;quot;&lt;br /&gt;
fi&lt;br /&gt;
 &lt;br /&gt;
sleep 15&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;${SLURM_JOB_ID} done&amp;quot;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Software Installed ==&lt;br /&gt;
&lt;br /&gt;
=== IBM PowerAI ===&lt;br /&gt;
&lt;br /&gt;
The PowerAI platform contains popular open machine learning frameworks such as '''Caffe, TensorFlow, and Torch'''. Run the &amp;lt;tt&amp;gt;module avail&amp;lt;/tt&amp;gt; command for a complete listing. More information is available at this link: https://developer.ibm.com/linuxonpower/deep-learning-powerai/releases/. Release 4.0 is currently installed.&lt;br /&gt;
&lt;br /&gt;
==== GNU Compilers ====&lt;br /&gt;
&lt;br /&gt;
More recent versions of the GNU Compiler Collection (C/C++/Fortran) are provided in the IBM Advanced Toolchain with enhancements for the POWER8 CPU. To load the newer advance toolchain version use:&lt;br /&gt;
&lt;br /&gt;
Advanced Toolchain V10.0&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/6.3.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Advanced Toolchain V11.0&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/7.2.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
More information about the IBM Advanced Toolchain can be found here: [https://developer.ibm.com/linuxonpower/advance-toolchain/ https://developer.ibm.com/linuxonpower/advance-toolchain/]&lt;br /&gt;
&lt;br /&gt;
==== IBM XL Compilers ====&lt;br /&gt;
&lt;br /&gt;
To load the native IBM xlc/xlc++ and xlf (Fortran) compilers, run&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load xlc/13.1.5&lt;br /&gt;
module load xlf/15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
IBM XL Compilers are enabled for use with NVIDIA GPUs, including support for OpenMP 4.5 GPU offloading and integration with NVIDIA's nvcc command to compile host-side code for the POWER8 CPU.&lt;br /&gt;
&lt;br /&gt;
Information about the IBM XL Compilers can be found at the following links:&lt;br /&gt;
&lt;br /&gt;
[https://www.ibm.com/support/knowledgecenter/SSXVZZ_13.1.5/com.ibm.compilers.linux.doc/welcome.html IBM XL C/C++]&lt;br /&gt;
&lt;br /&gt;
[https://www.ibm.com/support/knowledgecenter/SSAT4T_15.1.5/com.ibm.compilers.linux.doc/welcome.html IBM XL Fortran]&lt;br /&gt;
&lt;br /&gt;
==== NVIDIA GPU Driver Version ====&lt;br /&gt;
&lt;br /&gt;
The current NVIDIA driver version is 384.66&lt;br /&gt;
&lt;br /&gt;
==== CUDA ====&lt;br /&gt;
&lt;br /&gt;
The current installed CUDA Tookits is are version 8.0 and version 9.0&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load cuda/8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
or &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load cuda/9.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The CUDA driver is installed locally, however the CUDA Toolkit is installed in:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
/usr/local/cuda-8.0&lt;br /&gt;
/usr/local/cuda-9.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Note that the &amp;lt;tt&amp;gt;/usr/local/cuda&amp;lt;/tt&amp;gt; directory is linked to the &amp;lt;tt&amp;gt;/usr/local/cuda-9.0&amp;lt;/tt&amp;gt; directory.&lt;br /&gt;
&lt;br /&gt;
Documentation and API reference information for the CUDA Toolkit can be found here: [http://docs.nvidia.com/cuda/index.html http://docs.nvidia.com/cuda/index.html]&lt;br /&gt;
&lt;br /&gt;
==== OpenMPI ====&lt;br /&gt;
&lt;br /&gt;
Currently OpenMPI has been setup on the 14 nodes connected over EDR Infiniband.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load openmpi/2.1.1-gcc-5.4.0&lt;br /&gt;
$ module load openmpi/2.1.1-XL-13_15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Other Software ==&lt;br /&gt;
&lt;br /&gt;
Other software packages can be installed onto the SOSCIP GPU Platform. It is best to try installing new software in your own home directory, which will give you control of the software (e.g. exact version, configuration, installing sub-packages, etc.).&lt;br /&gt;
&lt;br /&gt;
In the following subsections are instructions for installing several common software packages.&lt;br /&gt;
&lt;br /&gt;
=== Anaconda (Python) ===&lt;br /&gt;
&lt;br /&gt;
Anaconda is a popular distribution of the Python programming language. It contains several common Python libraries such as SciPy and NumPy as pre-built packages, which eases installation.&lt;br /&gt;
&lt;br /&gt;
Anaconda can be downloaded from here: [https://www.anaconda.com/download/#linux https://www.anaconda.com/download/#linux]&lt;br /&gt;
&lt;br /&gt;
NOTE: Be sure to download the '''Power8''' installer.&lt;br /&gt;
&lt;br /&gt;
TIP: If you plan to use Tensorflow within Anaconda, download the Python 2.7 version of Anaconda&lt;br /&gt;
&lt;br /&gt;
=== Keras ===&lt;br /&gt;
&lt;br /&gt;
Keras ([https://keras.io/ https://keras.io/]) is a popular high-level deep learning software development framework. It runs on top of other deep-learning frameworks such as TensorFlow.&lt;br /&gt;
&lt;br /&gt;
The easiest way to install Keras is to install Anaconda first, then install Keras by using using the pip command.&lt;br /&gt;
&lt;br /&gt;
Keras uses TensorFlow underneath to run neural network models. Before running code using Keras, be sure to load the PowerAI TensorFlow module and the cuda module.&lt;br /&gt;
&lt;br /&gt;
=== PyTorch ===&lt;br /&gt;
&lt;br /&gt;
PyTorch is the Python implementation of the Torch framework for deep learning. &lt;br /&gt;
&lt;br /&gt;
It is suggested that you use PyTorch within Anaconda.&lt;br /&gt;
&lt;br /&gt;
There is currently no build of PyTorch for POWER8-based systems. You will need to compile it from source.&lt;br /&gt;
&lt;br /&gt;
Obtain the source code from here: [http://pytorch.org/ http://pytorch.org/]&lt;br /&gt;
&lt;br /&gt;
Before building PyTorch, make sure to load cuda by running &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load cuda/8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
NOTE: Do not have the gcc modules loaded when building PyTorch. Use the default version of gcc (currently v5.4.0) included with the operating system. Build will fail with later versions of gcc.&lt;br /&gt;
&lt;br /&gt;
== LINKS ==&lt;br /&gt;
&lt;br /&gt;
[https://www.olcf.ornl.gov/kb_articles/summitdev-quickstart/#System_Overview | Summit Dev System at ORNL]&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_User_Support_Library&amp;diff=9083</id>
		<title>SciNet User Support Library</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=SciNet_User_Support_Library&amp;diff=9083"/>
		<updated>2017-12-04T15:21:06Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System News */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;__NOTOC__&lt;br /&gt;
{|  style=&amp;quot;border-spacing: 8px;&amp;quot;&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:3px solid #0645ad; background-color:#f6f6f6; border-radius:7px&amp;quot;|&lt;br /&gt;
{{SciNetWiki:System_Alerts}}&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;cellpadding:1em; padding:1em; border:3px solid #0645ad; background-color:#f6f6f6; border-radius:7px&amp;quot;|&lt;br /&gt;
==System News==&lt;br /&gt;
&lt;br /&gt;
* Dec 4, 2017 : scratchtcs decommissioned. &lt;br /&gt;
&lt;br /&gt;
* Dec 1, 2017, 00:00 am to 06:00 am:  The connection to the data center will be down for a scheduled network maintenance.  Jobs will continue to run, but login sessions will be terminated at midnight.&lt;br /&gt;
&lt;br /&gt;
* Nov 28, 2017, 12:00 noon: The [[GPC_Quickstart | GPC]] will be reduced from 30,912 to 16,800 cores to make room for the installation of Niagara.&lt;br /&gt;
&lt;br /&gt;
* Sept 29, 2017: The [[TCS_Quickstart | TCS]] was decommissioned on Sept. 29, 2017&lt;br /&gt;
&lt;br /&gt;
* Mar 3:  GPC:  Version 7.0 of Allinea Forge (DDT Debugger, MAP, Performance Reports) installed as a module.&lt;br /&gt;
&lt;br /&gt;
* Jan 26:  New larger (1.8PB) $SCRATCH storage brought online.   &lt;br /&gt;
&lt;br /&gt;
* Oct 24: P8: 2 new Power 8 Development Nodes, [[ P8 ]], with 4x Nvidia P100 (Pascal) GPUs, available for users.&lt;br /&gt;
&lt;br /&gt;
* Sept 19: KNL: Intel Knights Landing Development Nodes, [[Knights_Landing | KNL ]], available for users.&lt;br /&gt;
&lt;br /&gt;
* Sept 13: GPC:  Version 6.1 of Allinea Forge (DDT Debugger, MAP, Performance Reports) installed as a module.&lt;br /&gt;
&lt;br /&gt;
* Sept 13: GPC: Version 17.0.0 of the Intel Compiler and Tools are installed as modules.&lt;br /&gt;
&lt;br /&gt;
* Aug 20:  P8: Power 8 Development Nodes, [[ P8 ]], with 2x Nvidia K80, GPUs available for users.&lt;br /&gt;
&lt;br /&gt;
([[Previous System News]])&lt;br /&gt;
|-&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; width=&amp;quot;50%&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#f6e8e8; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== QuickStart Guides ==&lt;br /&gt;
* [[SciNet Command Line Utilities]]&lt;br /&gt;
* [[Media:SciNet_Tutorial.pdf|SciNet User Tutorial]]&lt;br /&gt;
* [[GPC_Quickstart|GPC: General Purpose Cluster]]&lt;br /&gt;
* [[Sandy| Sandy: Intel Sandybridge nodes ]]&lt;br /&gt;
* [[Gravity| Gravity: GPU nodes]]&lt;br /&gt;
* [[P7_Linux_Cluster|P7: Power 7 Linux cluster]]&lt;br /&gt;
* [[BGQ|BGQ: BlueGene/Q clusters]]&lt;br /&gt;
* [[Software_and_Libraries|Software and libraries]]&lt;br /&gt;
* [[Data_Management|Data management]]&lt;br /&gt;
* [[FAQ | FAQ (frequently asked questions)]]&lt;br /&gt;
* [[Acknowledging_SciNet | Acknowledging SciNet]]&lt;br /&gt;
* [https://courses.scinet.utoronto.ca SciNet education]&lt;br /&gt;
* [https://www.youtube.com/channel/UC42CaO-AAQhwqa8RGzE3daQ SciNet YouTube Channel]&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#e8f6e8; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== Tutorials and Manuals ==&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#SciNet_Basics|SciNet basics]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Linux|Linux]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Batch_job_management|Batch job management]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Programming|Programming]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Parallel_Programming|Parallel programming]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#GPU_Computing|GPU computing]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Performance_Tuning|Performance tuning]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Debugging|Debugging]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Math_libraries_.28BLAS.2C_LAPACK.2C_FFT.29|Math libraries (BLAS, LAPACK, FFT)]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#I/O|I/O and databases]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Visualization|Visualization]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Applications|Applications]]&lt;br /&gt;
* [[Knowledge_Base:_Tutorials_and_Manuals#Manuals|Manuals (compilers etc)]]&lt;br /&gt;
* [[2016_Ontario_Summer_School_for_High_Performance_Computing_Central]]&lt;br /&gt;
* [[2015_Ontario_Summer_School_for_High_Performance_Computing_Central]]&lt;br /&gt;
|-&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#e8e8f6; border-radius:7px&amp;quot; |&lt;br /&gt;
&lt;br /&gt;
== What's New On The Wiki ==&lt;br /&gt;
&amp;lt;div style='text-align:left;'&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: Updated [[GPC Quickstart]] with info on email notifications from the scheduler.&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: [[Hdf5]] compilation page updated.&lt;br /&gt;
&lt;br /&gt;
* Dec 2014: [[Research Computing with Python 2014]] lectures 5-8&lt;br /&gt;
&lt;br /&gt;
* Nov 2014: &amp;quot;Modern CUDA Features&amp;quot; TechTalk slides in [[SciNet TechTalks and Seminars]].&lt;br /&gt;
&lt;br /&gt;
* Nov 2014: [[Research Computing with Python 2014]] lectures 1-4&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: [[User_Tips#Reducing_virtual_memory_consumption_for_multithreaded_programs| Tip on reducing virtual memory consumption for multithreaded programs]]&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Improved information on the [[Python]] versions installed on the GPC, and which modules are included in each version.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Description on using job arrays on the GPC on the [[Scheduler]] page.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: [[Intro to Tkinter|Tkinter instructions]], [[Media:Tkinter.pdf|slides]] and [[Media:Tkinter_code.tgz|code]] for the TkInter workshop held in September.&lt;br /&gt;
&lt;br /&gt;
* Sept 2014: Instructions on using [[Hadoop for HPCers|Hadoop]] (for the Hadoop workshop held in September).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
Previous new stuff can be found in the [[What's new archive]].&lt;br /&gt;
| valign=&amp;quot;top&amp;quot; style=&amp;quot;padding:1em; border:1px solid #aaaaaa; background-color:#f4f4fa; border-radius:7px&amp;quot; |&lt;br /&gt;
&amp;lt;div style='text-align:left;'&amp;gt;&lt;br /&gt;
{{SciNetWiki:Community_Portal}}&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
{{#Widget:Twitter|shell.background=#9fb1c2|tweets.links=#4775c1|tweets.color=#000000|tweets.background=#ffffff|user=SciNetHPC}}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [[Old Main Page]] --!&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=SOSCIP_GPU&amp;diff=9071</id>
		<title>SOSCIP GPU</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=SOSCIP_GPU&amp;diff=9071"/>
		<updated>2017-11-23T16:08:19Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* GNU Compilers */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Infobox Computer&lt;br /&gt;
|image=[[Image:S882lc.png|center|300px|thumb]]&lt;br /&gt;
|name=SOSCIP GPU &lt;br /&gt;
|installed=September 2017&lt;br /&gt;
|operatingsystem= Ubuntu 16.04 le &lt;br /&gt;
|loginnode= sgc01 &lt;br /&gt;
|nnodes= 14x Power 8 with  4x NVIDIA P100&lt;br /&gt;
|rampernode=512 GB&lt;br /&gt;
|corespernode= 2 x 10core (20 physical, 160 SMT)&lt;br /&gt;
|interconnect=Infiniband EDR &lt;br /&gt;
|vendorcompilers=xlc/xlf, nvcc&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
== SOSCIP ==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU Cluster is a Southern Ontario Smart Computing Innovation Platform ([http://soscip.org/ SOSCIP]) resource located at theUniversity of Toronto's SciNet HPC facility. The SOSCIP  multi-university/industry consortium is funded by the Ontario Government and the Federal Economic Development Agency for Southern Ontario [http://www.research.utoronto.ca/about/our-research-partners/soscip/].&lt;br /&gt;
&lt;br /&gt;
== Support Email ==&lt;br /&gt;
&lt;br /&gt;
Please use [mailto:soscip-support@scinet.utoronto.ca &amp;lt;soscip-support@scinet.utoronto.ca&amp;gt;] for SOSCIP GPU specific inquiries.&lt;br /&gt;
&lt;br /&gt;
== Specifications==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU Cluster consists of  of 14 IBM Power 822LC &amp;quot;Minsky&amp;quot; Servers each with 2x10core 3.25GHz Power8 CPUs and 512GB Ram. Similar to Power 7, the Power 8 utilizes Simultaneous MultiThreading (SMT), but extends the design to 8 threads per core allowing the 20 physical cores to support up to 160 threads.  Each node has 4x NVIDIA Tesla P100 GPUs each with 16GB of RAM with CUDA Capability 6.0 (Pascal) connected using NVlink.&lt;br /&gt;
&lt;br /&gt;
== Compile/Devel/Test ==&lt;br /&gt;
&lt;br /&gt;
Access is provided through the BGQ login node, '''&amp;lt;tt&amp;gt; bgqdev.scinet.utoronto.ca &amp;lt;/tt&amp;gt;''' using ssh, and from there you can proceed to the GPU development node '''&amp;lt;tt&amp;gt;sgc01-ib0&amp;lt;/tt&amp;gt;'''.&lt;br /&gt;
&lt;br /&gt;
== Filesystem ==&lt;br /&gt;
&lt;br /&gt;
The filesystem is shared with the BGQ system.  See [https://wiki.scinet.utoronto.ca/wiki/index.php/BGQ#Filesystem here ] for details.&lt;br /&gt;
&lt;br /&gt;
== Job Submission ==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU cluster uses [https://slurm.schedmd.com/ SLURM ] as a job scheduler and jobs are scheduled by node, ie 20 cores and 4 GPUs each. Jobs are submitted from the development node '''&amp;lt;tt&amp;gt;sgc01&amp;lt;/tt&amp;gt;'''. The maximum walltime per job is 12 hours (except in the 'long' queue, see below) with up to 8 nodes.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ sbatch myjob.script&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Where myjob.script is &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH --nodes=1 &lt;br /&gt;
#SBATCH --ntasks=20  # MPI tasks (needed for srun) &lt;br /&gt;
#SBATCH --time=00:10:00  # H:M:S&lt;br /&gt;
#SBATCH --gres=gpu:4     # Ask for 4 GPUs per node&lt;br /&gt;
&lt;br /&gt;
cd $SLURM_SUBMIT_DIR&lt;br /&gt;
&lt;br /&gt;
hostname&lt;br /&gt;
nvidia-smi&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can queury job information using&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
squeue&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To cancel a job use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
scancel $JOBID&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Longer jobs ===&lt;br /&gt;
&lt;br /&gt;
If your job takes more than 12 hours, the sbatch command will not let you submit your job.  There is, however, a way to have jobs up to 24 hours long, by specifying &amp;quot;-p long&amp;quot; as an option (i.e., add &amp;lt;tt&amp;gt;#SBATCH -p long&amp;lt;/tt&amp;gt; to your job script).  The priority of such jobs may be throttled in the future if we see that the 'long' queue is having a negative efffect on turnover time in the queue.&lt;br /&gt;
&lt;br /&gt;
=== Interactive ===&lt;br /&gt;
&lt;br /&gt;
For an interactive session use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
salloc --gres=gpu:4&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Automatic Re-submission and Job Dependencies ===&lt;br /&gt;
&lt;br /&gt;
Commonly you may have a job that you know will take longer to run than what is permissible in the queue. As long as your program contains checkpoint or restart capability, you can have one job automatically submit the next. In the following example it is assumed that the program finishes before the time limit requested and then resubmits itself by logging into the development nodes.   Job dependencies and a maximum number of job re-submissions are used to ensure sequential operation.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash &lt;br /&gt;
&lt;br /&gt;
#SBATCH --nodes=1 &lt;br /&gt;
#SBATCH --ntasks=20  # MPI tasks (needed for srun) &lt;br /&gt;
#SBATCH --time=00:10:00  # H:M:S&lt;br /&gt;
#SBATCH --gres=gpu:4     # Ask for 4 GPUs per node&lt;br /&gt;
&lt;br /&gt;
cd $SLURM_SUBMIT_DIR&lt;br /&gt;
&lt;br /&gt;
: ${job_number:=&amp;quot;1&amp;quot;}           # set job_nubmer to 1 if it is undefined&lt;br /&gt;
job_number_max=3&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;hi from ${SLURM_JOB_ID}&amp;quot;&lt;br /&gt;
&lt;br /&gt;
#RUN JOB HERE&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
# SUBMIT NEXT JOB&lt;br /&gt;
if [[ ${job_number} -lt ${job_number_max} ]]&lt;br /&gt;
then&lt;br /&gt;
  (( job_number++ ))&lt;br /&gt;
  next_jobid=$(ssh sgc01-ib0 &amp;quot;cd $SLURM_SUBMIT_DIR; /opt/slurm/bin/sbatch --export=job_number=${job_number} -d afterok:${SLURM_JOB_ID} thisscript.sh | awk '{print $4}'&amp;quot;)&lt;br /&gt;
  echo &amp;quot;submitted ${next_jobid}&amp;quot;&lt;br /&gt;
fi&lt;br /&gt;
 &lt;br /&gt;
sleep 15&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;${SLURM_JOB_ID} done&amp;quot;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Software ==&lt;br /&gt;
&lt;br /&gt;
==== GNU Compilers ====&lt;br /&gt;
&lt;br /&gt;
More recent versions of the GNU Compiler Collection (C/C++/Fortran) are provided in the IBM Advanced Toolchain with enhancements for the POWER8 CPU. To load the newer advance toolchain version use:&lt;br /&gt;
&lt;br /&gt;
Advanced Toolchain V10.0&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/6.3.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Advanced Toolchain V11.0&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/7.2.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
More information about the IBM Advanced Toolchain can be found here: [https://developer.ibm.com/linuxonpower/advance-toolchain/ https://developer.ibm.com/linuxonpower/advance-toolchain/]&lt;br /&gt;
&lt;br /&gt;
==== IBM XL Compilers ====&lt;br /&gt;
&lt;br /&gt;
To load the native IBM xlc/xlc++ and xlf (Fortran) compilers, run&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load xlc/13.1.5&lt;br /&gt;
module load xlf/15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Information about the IBM XL Compilers can be found at the following links:&lt;br /&gt;
&lt;br /&gt;
[https://www.ibm.com/support/knowledgecenter/SSXVZZ_13.1.5/com.ibm.compilers.linux.doc/welcome.html IBM XL C/C++]&lt;br /&gt;
&lt;br /&gt;
[https://www.ibm.com/support/knowledgecenter/SSAT4T_15.1.5/com.ibm.compilers.linux.doc/welcome.html IBM XL Fortran]&lt;br /&gt;
&lt;br /&gt;
==== NVIDIA GPU Driver Version ====&lt;br /&gt;
&lt;br /&gt;
The current NVIDIA driver version is 384.66&lt;br /&gt;
&lt;br /&gt;
==== CUDA ====&lt;br /&gt;
&lt;br /&gt;
The current installed CUDA Tookit is 8.0&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load cuda/8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The CUDA driver is installed locally, however the CUDA Toolkit is installed in:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
/usr/local/cuda-8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== OpenMPI ====&lt;br /&gt;
&lt;br /&gt;
Currently OpenMPI has been setup on the 14 nodes connected over EDR Infiniband.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load openmpi/2.1.1-gcc-5.4.0&lt;br /&gt;
$ module load openmpi/2.1.1-XL-13_15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== IBM PowerAI ===&lt;br /&gt;
&lt;br /&gt;
The PowerAI platform contains popular open machine learning frameworks such as Caffe, Tensorflow, and Torch. Run the &amp;lt;tt&amp;gt;module avail&amp;lt;/tt&amp;gt; command for a complete listing. More information is available at this link: https://developer.ibm.com/linuxonpower/deep-learning-powerai/releases/. Release 4.0 is currently installed.&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=SOSCIP_GPU&amp;diff=9070</id>
		<title>SOSCIP GPU</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=SOSCIP_GPU&amp;diff=9070"/>
		<updated>2017-11-23T16:07:47Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* GNU Compilers */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Infobox Computer&lt;br /&gt;
|image=[[Image:S882lc.png|center|300px|thumb]]&lt;br /&gt;
|name=SOSCIP GPU &lt;br /&gt;
|installed=September 2017&lt;br /&gt;
|operatingsystem= Ubuntu 16.04 le &lt;br /&gt;
|loginnode= sgc01 &lt;br /&gt;
|nnodes= 14x Power 8 with  4x NVIDIA P100&lt;br /&gt;
|rampernode=512 GB&lt;br /&gt;
|corespernode= 2 x 10core (20 physical, 160 SMT)&lt;br /&gt;
|interconnect=Infiniband EDR &lt;br /&gt;
|vendorcompilers=xlc/xlf, nvcc&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
== SOSCIP ==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU Cluster is a Southern Ontario Smart Computing Innovation Platform ([http://soscip.org/ SOSCIP]) resource located at theUniversity of Toronto's SciNet HPC facility. The SOSCIP  multi-university/industry consortium is funded by the Ontario Government and the Federal Economic Development Agency for Southern Ontario [http://www.research.utoronto.ca/about/our-research-partners/soscip/].&lt;br /&gt;
&lt;br /&gt;
== Support Email ==&lt;br /&gt;
&lt;br /&gt;
Please use [mailto:soscip-support@scinet.utoronto.ca &amp;lt;soscip-support@scinet.utoronto.ca&amp;gt;] for SOSCIP GPU specific inquiries.&lt;br /&gt;
&lt;br /&gt;
== Specifications==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU Cluster consists of  of 14 IBM Power 822LC &amp;quot;Minsky&amp;quot; Servers each with 2x10core 3.25GHz Power8 CPUs and 512GB Ram. Similar to Power 7, the Power 8 utilizes Simultaneous MultiThreading (SMT), but extends the design to 8 threads per core allowing the 20 physical cores to support up to 160 threads.  Each node has 4x NVIDIA Tesla P100 GPUs each with 16GB of RAM with CUDA Capability 6.0 (Pascal) connected using NVlink.&lt;br /&gt;
&lt;br /&gt;
== Compile/Devel/Test ==&lt;br /&gt;
&lt;br /&gt;
Access is provided through the BGQ login node, '''&amp;lt;tt&amp;gt; bgqdev.scinet.utoronto.ca &amp;lt;/tt&amp;gt;''' using ssh, and from there you can proceed to the GPU development node '''&amp;lt;tt&amp;gt;sgc01-ib0&amp;lt;/tt&amp;gt;'''.&lt;br /&gt;
&lt;br /&gt;
== Filesystem ==&lt;br /&gt;
&lt;br /&gt;
The filesystem is shared with the BGQ system.  See [https://wiki.scinet.utoronto.ca/wiki/index.php/BGQ#Filesystem here ] for details.&lt;br /&gt;
&lt;br /&gt;
== Job Submission ==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU cluster uses [https://slurm.schedmd.com/ SLURM ] as a job scheduler and jobs are scheduled by node, ie 20 cores and 4 GPUs each. Jobs are submitted from the development node '''&amp;lt;tt&amp;gt;sgc01&amp;lt;/tt&amp;gt;'''. The maximum walltime per job is 12 hours (except in the 'long' queue, see below) with up to 8 nodes.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ sbatch myjob.script&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Where myjob.script is &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH --nodes=1 &lt;br /&gt;
#SBATCH --ntasks=20  # MPI tasks (needed for srun) &lt;br /&gt;
#SBATCH --time=00:10:00  # H:M:S&lt;br /&gt;
#SBATCH --gres=gpu:4     # Ask for 4 GPUs per node&lt;br /&gt;
&lt;br /&gt;
cd $SLURM_SUBMIT_DIR&lt;br /&gt;
&lt;br /&gt;
hostname&lt;br /&gt;
nvidia-smi&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can queury job information using&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
squeue&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To cancel a job use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
scancel $JOBID&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Longer jobs ===&lt;br /&gt;
&lt;br /&gt;
If your job takes more than 12 hours, the sbatch command will not let you submit your job.  There is, however, a way to have jobs up to 24 hours long, by specifying &amp;quot;-p long&amp;quot; as an option (i.e., add &amp;lt;tt&amp;gt;#SBATCH -p long&amp;lt;/tt&amp;gt; to your job script).  The priority of such jobs may be throttled in the future if we see that the 'long' queue is having a negative efffect on turnover time in the queue.&lt;br /&gt;
&lt;br /&gt;
=== Interactive ===&lt;br /&gt;
&lt;br /&gt;
For an interactive session use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
salloc --gres=gpu:4&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Automatic Re-submission and Job Dependencies ===&lt;br /&gt;
&lt;br /&gt;
Commonly you may have a job that you know will take longer to run than what is permissible in the queue. As long as your program contains checkpoint or restart capability, you can have one job automatically submit the next. In the following example it is assumed that the program finishes before the time limit requested and then resubmits itself by logging into the development nodes.   Job dependencies and a maximum number of job re-submissions are used to ensure sequential operation.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash &lt;br /&gt;
&lt;br /&gt;
#SBATCH --nodes=1 &lt;br /&gt;
#SBATCH --ntasks=20  # MPI tasks (needed for srun) &lt;br /&gt;
#SBATCH --time=00:10:00  # H:M:S&lt;br /&gt;
#SBATCH --gres=gpu:4     # Ask for 4 GPUs per node&lt;br /&gt;
&lt;br /&gt;
cd $SLURM_SUBMIT_DIR&lt;br /&gt;
&lt;br /&gt;
: ${job_number:=&amp;quot;1&amp;quot;}           # set job_nubmer to 1 if it is undefined&lt;br /&gt;
job_number_max=3&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;hi from ${SLURM_JOB_ID}&amp;quot;&lt;br /&gt;
&lt;br /&gt;
#RUN JOB HERE&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
# SUBMIT NEXT JOB&lt;br /&gt;
if [[ ${job_number} -lt ${job_number_max} ]]&lt;br /&gt;
then&lt;br /&gt;
  (( job_number++ ))&lt;br /&gt;
  next_jobid=$(ssh sgc01-ib0 &amp;quot;cd $SLURM_SUBMIT_DIR; /opt/slurm/bin/sbatch --export=job_number=${job_number} -d afterok:${SLURM_JOB_ID} thisscript.sh | awk '{print $4}'&amp;quot;)&lt;br /&gt;
  echo &amp;quot;submitted ${next_jobid}&amp;quot;&lt;br /&gt;
fi&lt;br /&gt;
 &lt;br /&gt;
sleep 15&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;${SLURM_JOB_ID} done&amp;quot;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Software ==&lt;br /&gt;
&lt;br /&gt;
==== GNU Compilers ====&lt;br /&gt;
&lt;br /&gt;
More recent versions of the GNU Compiler Collection (C/C++/Fortran) are provided in the IBM Advanced Toolchain with enhancements for the POWER8 CPU. To load the newer advance toolchain version use:&lt;br /&gt;
&lt;br /&gt;
AT10.0&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/6.3.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
AT11.0&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/7.2.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
More information about the IBM Advanced Toolchain can be found here: [https://developer.ibm.com/linuxonpower/advance-toolchain/ https://developer.ibm.com/linuxonpower/advance-toolchain/]&lt;br /&gt;
&lt;br /&gt;
==== IBM XL Compilers ====&lt;br /&gt;
&lt;br /&gt;
To load the native IBM xlc/xlc++ and xlf (Fortran) compilers, run&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load xlc/13.1.5&lt;br /&gt;
module load xlf/15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Information about the IBM XL Compilers can be found at the following links:&lt;br /&gt;
&lt;br /&gt;
[https://www.ibm.com/support/knowledgecenter/SSXVZZ_13.1.5/com.ibm.compilers.linux.doc/welcome.html IBM XL C/C++]&lt;br /&gt;
&lt;br /&gt;
[https://www.ibm.com/support/knowledgecenter/SSAT4T_15.1.5/com.ibm.compilers.linux.doc/welcome.html IBM XL Fortran]&lt;br /&gt;
&lt;br /&gt;
==== NVIDIA GPU Driver Version ====&lt;br /&gt;
&lt;br /&gt;
The current NVIDIA driver version is 384.66&lt;br /&gt;
&lt;br /&gt;
==== CUDA ====&lt;br /&gt;
&lt;br /&gt;
The current installed CUDA Tookit is 8.0&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load cuda/8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The CUDA driver is installed locally, however the CUDA Toolkit is installed in:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
/usr/local/cuda-8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== OpenMPI ====&lt;br /&gt;
&lt;br /&gt;
Currently OpenMPI has been setup on the 14 nodes connected over EDR Infiniband.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load openmpi/2.1.1-gcc-5.4.0&lt;br /&gt;
$ module load openmpi/2.1.1-XL-13_15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== IBM PowerAI ===&lt;br /&gt;
&lt;br /&gt;
The PowerAI platform contains popular open machine learning frameworks such as Caffe, Tensorflow, and Torch. Run the &amp;lt;tt&amp;gt;module avail&amp;lt;/tt&amp;gt; command for a complete listing. More information is available at this link: https://developer.ibm.com/linuxonpower/deep-learning-powerai/releases/. Release 4.0 is currently installed.&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=SOSCIP_GPU&amp;diff=9069</id>
		<title>SOSCIP GPU</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=SOSCIP_GPU&amp;diff=9069"/>
		<updated>2017-11-23T16:07:15Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* GNU Compilers */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Infobox Computer&lt;br /&gt;
|image=[[Image:S882lc.png|center|300px|thumb]]&lt;br /&gt;
|name=SOSCIP GPU &lt;br /&gt;
|installed=September 2017&lt;br /&gt;
|operatingsystem= Ubuntu 16.04 le &lt;br /&gt;
|loginnode= sgc01 &lt;br /&gt;
|nnodes= 14x Power 8 with  4x NVIDIA P100&lt;br /&gt;
|rampernode=512 GB&lt;br /&gt;
|corespernode= 2 x 10core (20 physical, 160 SMT)&lt;br /&gt;
|interconnect=Infiniband EDR &lt;br /&gt;
|vendorcompilers=xlc/xlf, nvcc&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
== SOSCIP ==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU Cluster is a Southern Ontario Smart Computing Innovation Platform ([http://soscip.org/ SOSCIP]) resource located at theUniversity of Toronto's SciNet HPC facility. The SOSCIP  multi-university/industry consortium is funded by the Ontario Government and the Federal Economic Development Agency for Southern Ontario [http://www.research.utoronto.ca/about/our-research-partners/soscip/].&lt;br /&gt;
&lt;br /&gt;
== Support Email ==&lt;br /&gt;
&lt;br /&gt;
Please use [mailto:soscip-support@scinet.utoronto.ca &amp;lt;soscip-support@scinet.utoronto.ca&amp;gt;] for SOSCIP GPU specific inquiries.&lt;br /&gt;
&lt;br /&gt;
== Specifications==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU Cluster consists of  of 14 IBM Power 822LC &amp;quot;Minsky&amp;quot; Servers each with 2x10core 3.25GHz Power8 CPUs and 512GB Ram. Similar to Power 7, the Power 8 utilizes Simultaneous MultiThreading (SMT), but extends the design to 8 threads per core allowing the 20 physical cores to support up to 160 threads.  Each node has 4x NVIDIA Tesla P100 GPUs each with 16GB of RAM with CUDA Capability 6.0 (Pascal) connected using NVlink.&lt;br /&gt;
&lt;br /&gt;
== Compile/Devel/Test ==&lt;br /&gt;
&lt;br /&gt;
Access is provided through the BGQ login node, '''&amp;lt;tt&amp;gt; bgqdev.scinet.utoronto.ca &amp;lt;/tt&amp;gt;''' using ssh, and from there you can proceed to the GPU development node '''&amp;lt;tt&amp;gt;sgc01-ib0&amp;lt;/tt&amp;gt;'''.&lt;br /&gt;
&lt;br /&gt;
== Filesystem ==&lt;br /&gt;
&lt;br /&gt;
The filesystem is shared with the BGQ system.  See [https://wiki.scinet.utoronto.ca/wiki/index.php/BGQ#Filesystem here ] for details.&lt;br /&gt;
&lt;br /&gt;
== Job Submission ==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU cluster uses [https://slurm.schedmd.com/ SLURM ] as a job scheduler and jobs are scheduled by node, ie 20 cores and 4 GPUs each. Jobs are submitted from the development node '''&amp;lt;tt&amp;gt;sgc01&amp;lt;/tt&amp;gt;'''. The maximum walltime per job is 12 hours (except in the 'long' queue, see below) with up to 8 nodes.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ sbatch myjob.script&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Where myjob.script is &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH --nodes=1 &lt;br /&gt;
#SBATCH --ntasks=20  # MPI tasks (needed for srun) &lt;br /&gt;
#SBATCH --time=00:10:00  # H:M:S&lt;br /&gt;
#SBATCH --gres=gpu:4     # Ask for 4 GPUs per node&lt;br /&gt;
&lt;br /&gt;
cd $SLURM_SUBMIT_DIR&lt;br /&gt;
&lt;br /&gt;
hostname&lt;br /&gt;
nvidia-smi&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can queury job information using&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
squeue&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To cancel a job use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
scancel $JOBID&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Longer jobs ===&lt;br /&gt;
&lt;br /&gt;
If your job takes more than 12 hours, the sbatch command will not let you submit your job.  There is, however, a way to have jobs up to 24 hours long, by specifying &amp;quot;-p long&amp;quot; as an option (i.e., add &amp;lt;tt&amp;gt;#SBATCH -p long&amp;lt;/tt&amp;gt; to your job script).  The priority of such jobs may be throttled in the future if we see that the 'long' queue is having a negative efffect on turnover time in the queue.&lt;br /&gt;
&lt;br /&gt;
=== Interactive ===&lt;br /&gt;
&lt;br /&gt;
For an interactive session use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
salloc --gres=gpu:4&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Automatic Re-submission and Job Dependencies ===&lt;br /&gt;
&lt;br /&gt;
Commonly you may have a job that you know will take longer to run than what is permissible in the queue. As long as your program contains checkpoint or restart capability, you can have one job automatically submit the next. In the following example it is assumed that the program finishes before the time limit requested and then resubmits itself by logging into the development nodes.   Job dependencies and a maximum number of job re-submissions are used to ensure sequential operation.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash &lt;br /&gt;
&lt;br /&gt;
#SBATCH --nodes=1 &lt;br /&gt;
#SBATCH --ntasks=20  # MPI tasks (needed for srun) &lt;br /&gt;
#SBATCH --time=00:10:00  # H:M:S&lt;br /&gt;
#SBATCH --gres=gpu:4     # Ask for 4 GPUs per node&lt;br /&gt;
&lt;br /&gt;
cd $SLURM_SUBMIT_DIR&lt;br /&gt;
&lt;br /&gt;
: ${job_number:=&amp;quot;1&amp;quot;}           # set job_nubmer to 1 if it is undefined&lt;br /&gt;
job_number_max=3&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;hi from ${SLURM_JOB_ID}&amp;quot;&lt;br /&gt;
&lt;br /&gt;
#RUN JOB HERE&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
# SUBMIT NEXT JOB&lt;br /&gt;
if [[ ${job_number} -lt ${job_number_max} ]]&lt;br /&gt;
then&lt;br /&gt;
  (( job_number++ ))&lt;br /&gt;
  next_jobid=$(ssh sgc01-ib0 &amp;quot;cd $SLURM_SUBMIT_DIR; /opt/slurm/bin/sbatch --export=job_number=${job_number} -d afterok:${SLURM_JOB_ID} thisscript.sh | awk '{print $4}'&amp;quot;)&lt;br /&gt;
  echo &amp;quot;submitted ${next_jobid}&amp;quot;&lt;br /&gt;
fi&lt;br /&gt;
 &lt;br /&gt;
sleep 15&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;${SLURM_JOB_ID} done&amp;quot;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Software ==&lt;br /&gt;
&lt;br /&gt;
==== GNU Compilers ====&lt;br /&gt;
&lt;br /&gt;
More recent versions of the GNU Compiler Collection (C/C++/Fortran) are provided in the IBM Advanced Toolchain with enhancements for the POWER8 CPU. To load the newer advance toolchain version use:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/6.3.1&lt;br /&gt;
module load gcc/7.2.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
More information about the IBM Advanced Toolchain can be found here: [https://developer.ibm.com/linuxonpower/advance-toolchain/ https://developer.ibm.com/linuxonpower/advance-toolchain/]&lt;br /&gt;
&lt;br /&gt;
==== IBM XL Compilers ====&lt;br /&gt;
&lt;br /&gt;
To load the native IBM xlc/xlc++ and xlf (Fortran) compilers, run&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load xlc/13.1.5&lt;br /&gt;
module load xlf/15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Information about the IBM XL Compilers can be found at the following links:&lt;br /&gt;
&lt;br /&gt;
[https://www.ibm.com/support/knowledgecenter/SSXVZZ_13.1.5/com.ibm.compilers.linux.doc/welcome.html IBM XL C/C++]&lt;br /&gt;
&lt;br /&gt;
[https://www.ibm.com/support/knowledgecenter/SSAT4T_15.1.5/com.ibm.compilers.linux.doc/welcome.html IBM XL Fortran]&lt;br /&gt;
&lt;br /&gt;
==== NVIDIA GPU Driver Version ====&lt;br /&gt;
&lt;br /&gt;
The current NVIDIA driver version is 384.66&lt;br /&gt;
&lt;br /&gt;
==== CUDA ====&lt;br /&gt;
&lt;br /&gt;
The current installed CUDA Tookit is 8.0&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load cuda/8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The CUDA driver is installed locally, however the CUDA Toolkit is installed in:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
/usr/local/cuda-8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== OpenMPI ====&lt;br /&gt;
&lt;br /&gt;
Currently OpenMPI has been setup on the 14 nodes connected over EDR Infiniband.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load openmpi/2.1.1-gcc-5.4.0&lt;br /&gt;
$ module load openmpi/2.1.1-XL-13_15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== IBM PowerAI ===&lt;br /&gt;
&lt;br /&gt;
The PowerAI platform contains popular open machine learning frameworks such as Caffe, Tensorflow, and Torch. Run the &amp;lt;tt&amp;gt;module avail&amp;lt;/tt&amp;gt; command for a complete listing. More information is available at this link: https://developer.ibm.com/linuxonpower/deep-learning-powerai/releases/. Release 4.0 is currently installed.&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9061</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9061"/>
		<updated>2017-11-13T04:33:36Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Gravity]][[Gravity]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
&lt;br /&gt;
|[[File:up.png|up|]]Login Nodes&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Sun Nov 12 23:33:04 EST 2017 &amp;lt;/b&amp;gt; File system issues cleared.  System should be back to normal.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Sat Nov 11, 13:43 EDT 2017&amp;lt;/b&amp;gt; File system issues. You may experience problems logging in. We are investigating.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Sat Nov 11, 12:19 EDT 2017&amp;lt;/b&amp;gt; If you are experiencing issues when logging in, try ssh to login-ha.scinet.utoronto.ca.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Oct 19, 18:30 EDT 2017&amp;lt;/b&amp;gt; Connectivity to the datacentre has been restored.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Oct 19, 17:00 EDT 2017&amp;lt;/b&amp;gt; [http://www.systemstatus.utoronto.ca/incidents/zlj7rd5z2mg4 It appears that the optical fibre connection to the datacentre has been cut, some 3 km away from the University.  Bell has been contacted.]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Oct 19, 14:30 EDT 2017&amp;lt;/b&amp;gt; [http://www.systemstatus.utoronto.ca/incidents/zlj7rd5z2mg4 Network issues to the data centre. We are investigating.]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=SOSCIP_GPU&amp;diff=9053</id>
		<title>SOSCIP GPU</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=SOSCIP_GPU&amp;diff=9053"/>
		<updated>2017-11-10T06:30:28Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* = Automatic Re-submission and Job Dependencies */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Infobox Computer&lt;br /&gt;
|image=[[Image:S882lc.png|center|300px|thumb]]&lt;br /&gt;
|name=SOSCIP GPU &lt;br /&gt;
|installed=September 2017&lt;br /&gt;
|operatingsystem= Ubuntu 16.04 le &lt;br /&gt;
|loginnode= sgc01 &lt;br /&gt;
|nnodes= 14x Power 8 with  4x NVIDIA P100&lt;br /&gt;
|rampernode=512 GB&lt;br /&gt;
|corespernode= 2 x 10core (20 physical, 160 SMT)&lt;br /&gt;
|interconnect=Infiniband EDR &lt;br /&gt;
|vendorcompilers=xlc/xlf, nvcc&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
== SOSCIP ==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU Cluster is a Southern Ontario Smart Computing Innovation Platform ([http://soscip.org/ SOSCIP]) resource located at theUniversity of Toronto's SciNet HPC facility. The SOSCIP  multi-university/industry consortium is funded by the Ontario Government and the Federal Economic Development Agency for Southern Ontario [http://www.research.utoronto.ca/about/our-research-partners/soscip/].&lt;br /&gt;
&lt;br /&gt;
== Support Email ==&lt;br /&gt;
&lt;br /&gt;
Please use [mailto:soscip-support@scinet.utoronto.ca &amp;lt;soscip-support@scinet.utoronto.ca&amp;gt;] for SOSCIP GPU specific inquiries.&lt;br /&gt;
&lt;br /&gt;
== Specifications==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU Cluster consists of  of 14 IBM Power 822LC &amp;quot;Minsky&amp;quot; Servers each with 2x10core 3.25GHz Power8 CPUs and 512GB Ram. Similar to Power 7, the Power 8 utilizes Simultaneous MultiThreading (SMT), but extends the design to 8 threads per core allowing the 20 physical cores to support up to 160 threads.  Each node has 4x NVIDIA Tesla P100 GPUs each with 16GB of RAM with CUDA Capability 6.0 (Pascal) connected using NVlink.&lt;br /&gt;
&lt;br /&gt;
== Compile/Devel/Test ==&lt;br /&gt;
&lt;br /&gt;
Access is provided through the BGQ login node, '''&amp;lt;tt&amp;gt; bgqdev.scinet.utoronto.ca &amp;lt;/tt&amp;gt;''' using ssh, and from there you can proceed to the GPU development node '''&amp;lt;tt&amp;gt;sgc01-ib0&amp;lt;/tt&amp;gt;'''.&lt;br /&gt;
&lt;br /&gt;
== Filesystem ==&lt;br /&gt;
&lt;br /&gt;
The filesystem is shared with the BGQ system.  See [https://wiki.scinet.utoronto.ca/wiki/index.php/BGQ#Filesystem here ] for details.&lt;br /&gt;
&lt;br /&gt;
== Job Submission ==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU cluster uses [https://slurm.schedmd.com/ SLURM ] as a job scheduler and jobs are scheduled by node, ie 20 cores and 4 GPUs each. Jobs are submitted from the development node '''&amp;lt;tt&amp;gt;sgc01&amp;lt;/tt&amp;gt;'''. The maximum walltime per job is 12 hours (except in the 'long' queue, see below) with up to 8 nodes.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ sbatch myjob.script&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Where myjob.script is &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH --nodes=1 &lt;br /&gt;
#SBATCH --ntasks=20  # MPI tasks (needed for srun) &lt;br /&gt;
#SBATCH --time=00:10:00  # H:M:S&lt;br /&gt;
#SBATCH --gres=gpu:4     # Ask for 4 GPUs per node&lt;br /&gt;
&lt;br /&gt;
cd $SLURM_SUBMIT_DIR&lt;br /&gt;
&lt;br /&gt;
hostname&lt;br /&gt;
nvidia-smi&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can queury job information using&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
squeue&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To cancel a job use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
scancel $JOBID&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Longer jobs ===&lt;br /&gt;
&lt;br /&gt;
If your job takes more than 12 hours, the sbatch command will not let you submit your job.  There is, however, a way to have jobs up to 24 hours long, by specifying &amp;quot;-p long&amp;quot; as an option (i.e., add &amp;lt;tt&amp;gt;#SBATCH -p long&amp;lt;/tt&amp;gt; to your job script).  The priority of such jobs may be throttled in the future if we see that the 'long' queue is having a negative efffect on turnover time in the queue.&lt;br /&gt;
&lt;br /&gt;
=== Interactive ===&lt;br /&gt;
&lt;br /&gt;
For an interactive session use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
salloc --gres=gpu:4&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Automatic Re-submission and Job Dependencies ===&lt;br /&gt;
&lt;br /&gt;
Commonly you may have a job that you know will take longer to run than what is permissible in the queue. As long as your program contains checkpoint or restart capability, you can have one job automatically submit the next. In the following example it is assumed that the program finishes before the time limit requested and then resubmits itself by logging into the development nodes.   Job dependencies and a maximum number of job re-submissions are used to ensure sequential operation.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash &lt;br /&gt;
&lt;br /&gt;
#SBATCH --nodes=1 &lt;br /&gt;
#SBATCH --ntasks=20  # MPI tasks (needed for srun) &lt;br /&gt;
#SBATCH --time=00:10:00  # H:M:S&lt;br /&gt;
#SBATCH --gres=gpu:4     # Ask for 4 GPUs per node&lt;br /&gt;
&lt;br /&gt;
cd $SLURM_SUBMIT_DIR&lt;br /&gt;
&lt;br /&gt;
: ${job_number:=&amp;quot;1&amp;quot;}           # set job_nubmer to 1 if it is undefined&lt;br /&gt;
job_number_max=3&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;hi from ${SLURM_JOB_ID}&amp;quot;&lt;br /&gt;
&lt;br /&gt;
#RUN JOB HERE&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
# SUBMIT NEXT JOB&lt;br /&gt;
if [[ ${job_number} -lt ${job_number_max} ]]&lt;br /&gt;
then&lt;br /&gt;
  (( job_number++ ))&lt;br /&gt;
  next_jobid=$(ssh sgc01-ib0 &amp;quot;cd $SLURM_SUBMIT_DIR; /opt/slurm/bin/sbatch --export=job_number=${job_number} -d afterok:${SLURM_JOB_ID} thisscript.sh | awk '{print $4}'&amp;quot;)&lt;br /&gt;
  echo &amp;quot;submitted ${next_jobid}&amp;quot;&lt;br /&gt;
fi&lt;br /&gt;
 &lt;br /&gt;
sleep 15&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;${SLURM_JOB_ID} done&amp;quot;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Software ==&lt;br /&gt;
&lt;br /&gt;
==== GNU Compilers ====&lt;br /&gt;
&lt;br /&gt;
To load the newer advance toolchain version use:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/6.3.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== IBM Compilers ====&lt;br /&gt;
&lt;br /&gt;
To load the native IBM xlc/xlc++ compilers&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load xlc/13.1.5&lt;br /&gt;
module load xlf/15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Driver Version ====&lt;br /&gt;
&lt;br /&gt;
The current NVIDIA driver version is 384.66&lt;br /&gt;
&lt;br /&gt;
==== CUDA ====&lt;br /&gt;
&lt;br /&gt;
The current installed CUDA Tookit is 8.0&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load cuda/8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The CUDA driver is installed locally, however the CUDA Toolkit is installed in:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
/usr/local/cuda-8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== OpenMPI ====&lt;br /&gt;
&lt;br /&gt;
Currently OpenMPI has been setup on the 14 nodes connected over EDR Infiniband.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load openmpi/2.1.1-gcc-5.4.0&lt;br /&gt;
$ module load openmpi/2.1.1-XL-13_15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== IBM PowerAI ===&lt;br /&gt;
&lt;br /&gt;
The PowerAI platform contains popular open machine learning frameworks such as Caffe, Tensorflow, and Torch. Run the &amp;lt;tt&amp;gt;module avail&amp;lt;/tt&amp;gt; command for a complete listing. More information is available at this link: https://developer.ibm.com/linuxonpower/deep-learning-powerai/releases/. Release 4.0 is currently installed.&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=SOSCIP_GPU&amp;diff=9052</id>
		<title>SOSCIP GPU</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=SOSCIP_GPU&amp;diff=9052"/>
		<updated>2017-11-10T06:30:04Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* Job Submission */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Infobox Computer&lt;br /&gt;
|image=[[Image:S882lc.png|center|300px|thumb]]&lt;br /&gt;
|name=SOSCIP GPU &lt;br /&gt;
|installed=September 2017&lt;br /&gt;
|operatingsystem= Ubuntu 16.04 le &lt;br /&gt;
|loginnode= sgc01 &lt;br /&gt;
|nnodes= 14x Power 8 with  4x NVIDIA P100&lt;br /&gt;
|rampernode=512 GB&lt;br /&gt;
|corespernode= 2 x 10core (20 physical, 160 SMT)&lt;br /&gt;
|interconnect=Infiniband EDR &lt;br /&gt;
|vendorcompilers=xlc/xlf, nvcc&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
== SOSCIP ==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU Cluster is a Southern Ontario Smart Computing Innovation Platform ([http://soscip.org/ SOSCIP]) resource located at theUniversity of Toronto's SciNet HPC facility. The SOSCIP  multi-university/industry consortium is funded by the Ontario Government and the Federal Economic Development Agency for Southern Ontario [http://www.research.utoronto.ca/about/our-research-partners/soscip/].&lt;br /&gt;
&lt;br /&gt;
== Support Email ==&lt;br /&gt;
&lt;br /&gt;
Please use [mailto:soscip-support@scinet.utoronto.ca &amp;lt;soscip-support@scinet.utoronto.ca&amp;gt;] for SOSCIP GPU specific inquiries.&lt;br /&gt;
&lt;br /&gt;
== Specifications==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU Cluster consists of  of 14 IBM Power 822LC &amp;quot;Minsky&amp;quot; Servers each with 2x10core 3.25GHz Power8 CPUs and 512GB Ram. Similar to Power 7, the Power 8 utilizes Simultaneous MultiThreading (SMT), but extends the design to 8 threads per core allowing the 20 physical cores to support up to 160 threads.  Each node has 4x NVIDIA Tesla P100 GPUs each with 16GB of RAM with CUDA Capability 6.0 (Pascal) connected using NVlink.&lt;br /&gt;
&lt;br /&gt;
== Compile/Devel/Test ==&lt;br /&gt;
&lt;br /&gt;
Access is provided through the BGQ login node, '''&amp;lt;tt&amp;gt; bgqdev.scinet.utoronto.ca &amp;lt;/tt&amp;gt;''' using ssh, and from there you can proceed to the GPU development node '''&amp;lt;tt&amp;gt;sgc01-ib0&amp;lt;/tt&amp;gt;'''.&lt;br /&gt;
&lt;br /&gt;
== Filesystem ==&lt;br /&gt;
&lt;br /&gt;
The filesystem is shared with the BGQ system.  See [https://wiki.scinet.utoronto.ca/wiki/index.php/BGQ#Filesystem here ] for details.&lt;br /&gt;
&lt;br /&gt;
== Job Submission ==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU cluster uses [https://slurm.schedmd.com/ SLURM ] as a job scheduler and jobs are scheduled by node, ie 20 cores and 4 GPUs each. Jobs are submitted from the development node '''&amp;lt;tt&amp;gt;sgc01&amp;lt;/tt&amp;gt;'''. The maximum walltime per job is 12 hours (except in the 'long' queue, see below) with up to 8 nodes.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ sbatch myjob.script&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Where myjob.script is &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH --nodes=1 &lt;br /&gt;
#SBATCH --ntasks=20  # MPI tasks (needed for srun) &lt;br /&gt;
#SBATCH --time=00:10:00  # H:M:S&lt;br /&gt;
#SBATCH --gres=gpu:4     # Ask for 4 GPUs per node&lt;br /&gt;
&lt;br /&gt;
cd $SLURM_SUBMIT_DIR&lt;br /&gt;
&lt;br /&gt;
hostname&lt;br /&gt;
nvidia-smi&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can queury job information using&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
squeue&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To cancel a job use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
scancel $JOBID&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Longer jobs ===&lt;br /&gt;
&lt;br /&gt;
If your job takes more than 12 hours, the sbatch command will not let you submit your job.  There is, however, a way to have jobs up to 24 hours long, by specifying &amp;quot;-p long&amp;quot; as an option (i.e., add &amp;lt;tt&amp;gt;#SBATCH -p long&amp;lt;/tt&amp;gt; to your job script).  The priority of such jobs may be throttled in the future if we see that the 'long' queue is having a negative efffect on turnover time in the queue.&lt;br /&gt;
&lt;br /&gt;
=== Interactive ===&lt;br /&gt;
&lt;br /&gt;
For an interactive session use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
salloc --gres=gpu:4&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Automatic Re-submission and Job Dependencies == &lt;br /&gt;
&lt;br /&gt;
Commonly you may have a job that you know will take longer to run than what is permissible in the queue. As long as your program contains checkpoint or restart capability, you can have one job automatically submit the next. In the following example it is assumed that the program finishes before the time limit requested and then resubmits itself by logging into the development nodes.   Job dependencies and a maximum number of job re-submissions are used to ensure sequential operation.  &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash &lt;br /&gt;
&lt;br /&gt;
#SBATCH --nodes=1 &lt;br /&gt;
#SBATCH --ntasks=20  # MPI tasks (needed for srun) &lt;br /&gt;
#SBATCH --time=00:10:00  # H:M:S&lt;br /&gt;
#SBATCH --gres=gpu:4     # Ask for 4 GPUs per node&lt;br /&gt;
&lt;br /&gt;
cd $SLURM_SUBMIT_DIR&lt;br /&gt;
&lt;br /&gt;
: ${job_number:=&amp;quot;1&amp;quot;}           # set job_nubmer to 1 if it is undefined&lt;br /&gt;
job_number_max=3&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;hi from ${SLURM_JOB_ID}&amp;quot;&lt;br /&gt;
&lt;br /&gt;
#RUN JOB HERE&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
# SUBMIT NEXT JOB&lt;br /&gt;
if [[ ${job_number} -lt ${job_number_max} ]]&lt;br /&gt;
then&lt;br /&gt;
  (( job_number++ ))&lt;br /&gt;
  next_jobid=$(ssh sgc01-ib0 &amp;quot;cd $SLURM_SUBMIT_DIR; /opt/slurm/bin/sbatch --export=job_number=${job_number} -d afterok:${SLURM_JOB_ID} thisscript.sh | awk '{print $4}'&amp;quot;)&lt;br /&gt;
  echo &amp;quot;submitted ${next_jobid}&amp;quot;&lt;br /&gt;
fi&lt;br /&gt;
 &lt;br /&gt;
sleep 15&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;${SLURM_JOB_ID} done&amp;quot;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Software ==&lt;br /&gt;
&lt;br /&gt;
==== GNU Compilers ====&lt;br /&gt;
&lt;br /&gt;
To load the newer advance toolchain version use:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/6.3.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== IBM Compilers ====&lt;br /&gt;
&lt;br /&gt;
To load the native IBM xlc/xlc++ compilers&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load xlc/13.1.5&lt;br /&gt;
module load xlf/15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Driver Version ====&lt;br /&gt;
&lt;br /&gt;
The current NVIDIA driver version is 384.66&lt;br /&gt;
&lt;br /&gt;
==== CUDA ====&lt;br /&gt;
&lt;br /&gt;
The current installed CUDA Tookit is 8.0&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load cuda/8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The CUDA driver is installed locally, however the CUDA Toolkit is installed in:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
/usr/local/cuda-8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== OpenMPI ====&lt;br /&gt;
&lt;br /&gt;
Currently OpenMPI has been setup on the 14 nodes connected over EDR Infiniband.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load openmpi/2.1.1-gcc-5.4.0&lt;br /&gt;
$ module load openmpi/2.1.1-XL-13_15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== IBM PowerAI ===&lt;br /&gt;
&lt;br /&gt;
The PowerAI platform contains popular open machine learning frameworks such as Caffe, Tensorflow, and Torch. Run the &amp;lt;tt&amp;gt;module avail&amp;lt;/tt&amp;gt; command for a complete listing. More information is available at this link: https://developer.ibm.com/linuxonpower/deep-learning-powerai/releases/. Release 4.0 is currently installed.&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9047</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9047"/>
		<updated>2017-10-30T14:14:20Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Gravity]][[Gravity]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Oct 19, 18:30 EDT 2017&amp;lt;/b&amp;gt; Connectivity to the datacentre has been restored.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Oct 19, 17:00 EDT 2017&amp;lt;/b&amp;gt; [http://www.systemstatus.utoronto.ca/incidents/zlj7rd5z2mg4 It appears that the optical fibre connection to the datacentre has been cut, some 3 km away from the University.  Bell has been contacted.]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Oct 19, 14:30 EDT 2017&amp;lt;/b&amp;gt; [http://www.systemstatus.utoronto.ca/incidents/zlj7rd5z2mg4 Network issues to the data centre. We are investigating.]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9046</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9046"/>
		<updated>2017-10-30T13:37:32Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Gravity]][[Gravity]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:down.png|down|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Oct 19, 18:30 EDT 2017&amp;lt;/b&amp;gt; Connectivity to the datacentre has been restored.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Oct 19, 17:00 EDT 2017&amp;lt;/b&amp;gt; [http://www.systemstatus.utoronto.ca/incidents/zlj7rd5z2mg4 It appears that the optical fibre connection to the datacentre has been cut, some 3 km away from the University.  Bell has been contacted.]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Oct 19, 14:30 EDT 2017&amp;lt;/b&amp;gt; [http://www.systemstatus.utoronto.ca/incidents/zlj7rd5z2mg4 Network issues to the data centre. We are investigating.]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9045</id>
		<title>Oldwiki.scinet.utoronto.ca:System Alerts</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=Oldwiki.scinet.utoronto.ca:System_Alerts&amp;diff=9045"/>
		<updated>2017-10-28T17:52:32Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* System Status */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== System Status==&lt;br /&gt;
&amp;lt;!-- &lt;br /&gt;
  Notes for updating the system status:&lt;br /&gt;
&lt;br /&gt;
  -  When removing system status entries, please archive them to:&lt;br /&gt;
&lt;br /&gt;
     http://wiki.scinethpc.ca/wiki/index.php/Previous_messages:&lt;br /&gt;
&lt;br /&gt;
     (yes, the trailing colon is part of the url)&lt;br /&gt;
&lt;br /&gt;
  -  The 'status circles' can be one of the following files: &lt;br /&gt;
&lt;br /&gt;
     down.png   for down&lt;br /&gt;
     up25.png   for 25% up&lt;br /&gt;
     up50.png   for 50% up&lt;br /&gt;
     up75.png   for 75% up&lt;br /&gt;
     up.png     for 100% up&lt;br /&gt;
&lt;br /&gt;
 --&amp;gt;&lt;br /&gt;
{| &lt;br /&gt;
|[[File:up.png|up|link=GPC Quickstart]][[GPC Quickstart|GPC]]&lt;br /&gt;
|[[File:up.png|up|link=Sandy]][[Sandy]]&lt;br /&gt;
|[[File:up.png|up|link=Gravity]][[Gravity]]&lt;br /&gt;
|[[File:up.png|up|link=Visualization Nodes]][[Visualization Nodes|Viz]]&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=BGQ]][[BGQ]]&lt;br /&gt;
|[[File:up.png|up|link=P7 Linux Cluster]][[P7 Linux Cluster|P7]]&lt;br /&gt;
|[[File:up.png|up|link=P8]][[P8]]&lt;br /&gt;
|[[File:up.png|up|]]File System&lt;br /&gt;
|-&lt;br /&gt;
|[[File:up.png|up|link=SOSCIP_GPU]][[SOSCIP_GPU|SGC]]&lt;br /&gt;
|[[File:up.png|up|link=Knights Landing]][[Knights Landing|KNL]]&lt;br /&gt;
|[[File:up.png|up|link=HPSS]][[HPSS]]&lt;br /&gt;
|[[File:up.png|up|]]External Network&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Oct 19, 18:30 EDT 2017&amp;lt;/b&amp;gt; Connectivity to the datacentre has been restored.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Oct 19, 17:00 EDT 2017&amp;lt;/b&amp;gt; [http://www.systemstatus.utoronto.ca/incidents/zlj7rd5z2mg4 It appears that the optical fibre connection to the datacentre has been cut, some 3 km away from the University.  Bell has been contacted.]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt; Thu Oct 19, 14:30 EDT 2017&amp;lt;/b&amp;gt; [http://www.systemstatus.utoronto.ca/incidents/zlj7rd5z2mg4 Network issues to the data centre. We are investigating.]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://support.scinet.utoronto.ca/wiki/index.php/Previous_messages:] --&amp;gt;&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
	<entry>
		<id>https://oldwiki.scinet.utoronto.ca/index.php?title=SOSCIP_GPU&amp;diff=9036</id>
		<title>SOSCIP GPU</title>
		<link rel="alternate" type="text/html" href="https://oldwiki.scinet.utoronto.ca/index.php?title=SOSCIP_GPU&amp;diff=9036"/>
		<updated>2017-10-10T18:09:15Z</updated>

		<summary type="html">&lt;p&gt;Northrup: /* OpenAI */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Infobox Computer&lt;br /&gt;
|image=[[Image:S882lc.png|center|300px|thumb]]&lt;br /&gt;
|name=SOSCIP GPU &lt;br /&gt;
|installed=September 2017&lt;br /&gt;
|operatingsystem= Ubuntu 16.04 le &lt;br /&gt;
|loginnode= sgc01 &lt;br /&gt;
|nnodes= 14x Power 8 with  4x NVIDIA P100&lt;br /&gt;
|rampernode=512 GB&lt;br /&gt;
|corespernode= 2 x 10core (20 physical, 160 SMT)&lt;br /&gt;
|interconnect=Infiniband EDR &lt;br /&gt;
|vendorcompilers=xlc/xlf, nvcc&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
== SOSCIP ==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU Cluster is a Southern Ontario Smart Computing Innovation Platform ([http://soscip.org/ SOSCIP]) resource located at theUniversity of Toronto's SciNet HPC facility. The SOSCIP  multi-university/industry consortium is funded by the Ontario Government and the Federal Economic Development Agency for Southern Ontario [http://www.research.utoronto.ca/about/our-research-partners/soscip/].&lt;br /&gt;
&lt;br /&gt;
== Support Email ==&lt;br /&gt;
&lt;br /&gt;
Please use [mailto:soscip-support@scinet.utoronto.ca &amp;lt;soscip-support@scinet.utoronto.ca&amp;gt;] for SOSCIP GPU specific inquiries.&lt;br /&gt;
&lt;br /&gt;
== Specifications==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU Cluster consists of  of 14 IBM Power 822LC &amp;quot;Minsky&amp;quot; Servers each with 2x10core 3.25GHz Power8 CPUs and 512GB Ram. Similar to Power 7, the Power 8 utilizes Simultaneous MultiThreading (SMT), but extends the design to 8 threads per core allowing the 20 physical cores to support up to 160 threads.  Each node has 4x NVIDIA Tesla P100 GPUs each with 16GB of RAM with CUDA Capability 6.0 (Pascal) connected using NVlink.&lt;br /&gt;
&lt;br /&gt;
== Compile/Devel/Test ==&lt;br /&gt;
&lt;br /&gt;
Access is provided through the BGQ login node, '''&amp;lt;tt&amp;gt; bgqdev.scinet.utoronto.ca &amp;lt;/tt&amp;gt;''' using ssh, and from there you can proceed to the GPU development node '''&amp;lt;tt&amp;gt;sgc01-ib0&amp;lt;/tt&amp;gt;'''.&lt;br /&gt;
&lt;br /&gt;
== Filesystem ==&lt;br /&gt;
&lt;br /&gt;
The filesystem is shared with the BGQ system.  See [https://wiki.scinet.utoronto.ca/wiki/index.php/BGQ#Filesystem here ] for details.&lt;br /&gt;
&lt;br /&gt;
== Job Submission ==&lt;br /&gt;
&lt;br /&gt;
The SOSCIP GPU cluster uses [https://slurm.schedmd.com/ SLURM ] as a job scheduler and jobs are scheduled by node, ie 20 cores and 4 GPUs each. Jobs are submitted from the development node '''&amp;lt;tt&amp;gt;sgc01&amp;lt;/tt&amp;gt;'''. The maximum walltime per job is 12 hours (except in the 'long' queue, see below) with up to 8 nodes.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ sbatch myjob.script&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Where myjob.script is &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash&lt;br /&gt;
#SBATCH --nodes=1 &lt;br /&gt;
#SBATCH --ntasks=20  # MPI tasks (needed for srun) &lt;br /&gt;
#SBATCH --time=00:10:00  # H:M:S&lt;br /&gt;
#SBATCH --gres=gpu:4     # Ask for 4 GPUs per node&lt;br /&gt;
&lt;br /&gt;
cd $SLURM_SUBMIT_DIR&lt;br /&gt;
&lt;br /&gt;
hostname&lt;br /&gt;
nvidia-smi&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can queury job information using&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
squeue&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To cancel a job use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
scancel $JOBID&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Longer jobs ===&lt;br /&gt;
&lt;br /&gt;
If your job takes more than 12 hours, the sbatch command will not let you submit your job.  There is, however, a way to have jobs up to 24 hours long, by specifying &amp;quot;-p long&amp;quot; as an option (i.e., add &amp;lt;tt&amp;gt;#SBATCH -p long&amp;lt;/tt&amp;gt; to your job script).  The priority of such jobs may be throttled in the future if we see that the 'long' queue is having a negative efffect on turnover time in the queue.&lt;br /&gt;
&lt;br /&gt;
=== Interactive ===&lt;br /&gt;
&lt;br /&gt;
For an interactive session use&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
salloc --gres=gpu:4&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Software ==&lt;br /&gt;
&lt;br /&gt;
==== GNU Compilers ====&lt;br /&gt;
&lt;br /&gt;
To load the newer advance toolchain version use:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load gcc/6.3.1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== IBM Compilers ====&lt;br /&gt;
&lt;br /&gt;
To load the native IBM xlc/xlc++ compilers&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load xlc/13.1.5&lt;br /&gt;
module load xlf/15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Driver Version ====&lt;br /&gt;
&lt;br /&gt;
The current NVIDIA driver version is 384.66&lt;br /&gt;
&lt;br /&gt;
==== CUDA ====&lt;br /&gt;
&lt;br /&gt;
The current installed CUDA Tookit is 8.0&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
module load cuda/8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
The CUDA driver is installed locally, however the CUDA Toolkit is installed in:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
/usr/local/cuda-8.0&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== OpenMPI ====&lt;br /&gt;
&lt;br /&gt;
Currently OpenMPI has been setup on the 14 nodes connected over EDR Infiniband.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
$ module load openmpi/2.1.1-gcc-5.4.0&lt;br /&gt;
$ module load openmpi/2.1.1-XL-13_15.1.5&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== IBM PowerAI ===&lt;br /&gt;
&lt;br /&gt;
The PowerAI platform contains popular open machine learning frameworks such as Caffe, Tensorflow, and Torch. Run the module avail command for a complete listing. More information is available at this link: https://developer.ibm.com/linuxonpower/deep-learning-powerai/releases/. Release 4.0 is currently installed.&lt;/div&gt;</summary>
		<author><name>Northrup</name></author>
	</entry>
</feed>