Difference between revisions of "FAQ"
Revision as of 10:25, 4 May 2009
- MPI development and interactive testing: I am experimenting with the MPI calls in my code to get it working. I run a lot of tests, and each of them takes only a couple of seconds. Sometimes (like right now, as I'm sending this email) all the machines are full and my job is put in the queue. Since I need just a couple of SECONDS, is there any way I can test it on the login nodes? I can't do it with the llsubmit command, and if I use mpiexec I need a host file. Can I use a host file to run my 2-second test jobs on the login nodes? If so, could you please send me an example host file?
You can run small MPI jobs on the tcs-f11n06 node, which is meant for development use. Please don't run them on the main login node, tcs-f11n05. As for the hostfile, it simply lists the node name once per MPI task, so it looks like:
tcs-f11n06
tcs-f11n06
tcs-f11n06
tcs-f11n06
for a 4-task run. When you invoke "poe" or "mpirun", you can pass a runtime argument that points to this file. You can also name the file in the environment variable MP_HOSTFILE. For example, if your file is /scratch/amashaye/hostfile, you would do
export MP_HOSTFILE=/scratch/amashaye/hostfile
in your shell. After that you can simply run your program, either with the poe command (do a "man poe" for details) or by running the executable directly. By default, the number of MPI processes is the number of entries in your hostfile.
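If you change the task count often, writing the hostfile by hand gets tedious. The following is a minimal sketch, not an official tool: the function name make_hostfile and the temporary file location are assumptions made for illustration, and only the node name tcs-f11n06 and the MP_HOSTFILE variable come from the answer above.

```shell
# Hypothetical helper: write a hostfile with one line per MPI task.
# Arguments: node name, number of tasks, output path.
make_hostfile() {
    node="$1"
    ntasks="$2"
    out="$3"
    : > "$out"                      # create or truncate the file
    i=0
    while [ "$i" -lt "$ntasks" ]; do
        echo "$node" >> "$out"      # one entry per task
        i=$((i + 1))
    done
}

# Example: a 4-task hostfile for the development node.
# The path below is an assumption; use any location you can write to.
hostfile="${TMPDIR:-/tmp}/hostfile.$$"
make_hostfile tcs-f11n06 4 "$hostfile"
export MP_HOSTFILE="$hostfile"
```

With MP_HOSTFILE exported this way, running the program as described above should start as many tasks as there are lines in the generated file.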
- Next question, please
We'll answer it as soon as possible!