Notes for the first slide!
Using more than your laptop or desktop (or high powered workstation)
Designed with various hardware architectures that fit different problems
Using more than your laptop or desktop (or high powered workstation)
Designed with various hardware architectures that fit different problems
Allows you to run analyses not possible with a typical computer
Using more than your laptop or desktop (or high powered workstation)
Designed with various hardware architectures that fit different problems
Allows you to run analyses not possible with a typical computer
High memory, high core count, bigdata
$ ssh <netID>@hpc-class.its.iastate.edu
????
$ ssh <netID>@hpc-class.its.iastate.edu
????
cd
into your root directory and try: ls -a
$ ssh <netID>@hpc-class.its.iastate.edu
????
cd
into your root directory and try: ls -a
You should see a hidden folder named .ssh
$ ssh <netID>@hpc-class.its.iastate.edu
????
cd
into your root directory and try: ls -a
You should see a hidden folder named .ssh
cd
into .ssh
and create a file named "config":
$ touch config
config
, add the following contents, and save the file:Host hpc_classHostName hpc-class.its.iastate.eduUser <net ID>
config
, add the following contents, and save the file:Host hpc_classHostName hpc-class.its.iastate.eduUser <net ID>
$ ssh hpc_class
config
, add the following contents, and save the file:Host hpc_classHostName hpc-class.its.iastate.eduUser <net ID>
$ ssh hpc_class
$ hostname$ whoami
There are other tools, but these will get you the basic info you need and are installed on most systems by default
Nodes: compute node, head node
Processors (a cpu chip)
Nodes: compute node, head node
Processors (a cpu chip)
Cores (physical cpus embedded on a single chip)
Nodes: compute node, head node
Processors (a cpu chip)
Cores (physical cpus embedded on a single chip)
Threads: hyper-threading (software 'trick' to let two processes share a core)
Nodes: compute node, head node
Processors (a cpu chip)
Cores (physical cpus embedded on a single chip)
Threads: hyper-threading (software 'trick' to let two processes share a core)
Scheduler: Allocates access to resources to users (ISU uses Slurm)
Modules are used to allow multiple people to use the same software with consistent, reproducible results
Modules keep your environment clean and free of conflicts (e.g. python2/3 or Java7/8)
Modules are used to allow multiple people to use the same software with consistent, reproducible results
Modules keep your environment clean and free of conflicts (e.g. python2/3 or Java7/8)
Think of modules as a software library. You need to check out the software before you can use it.
Modules are used to allow multiple people to use the same software with consistent, reproducible results
Modules keep your environment clean and free of conflicts (e.g. python2/3 or Java7/8)
Think of modules as a software library. You need to check out the software before you can use it.
Modules can be searched:
Modules are used to allow multiple people to use the same software with consistent, reproducible results
Modules keep your environment clean and free of conflicts (e.g. python2/3 or Java7/8)
Think of modules as a software library. You need to check out the software before you can use it.
Modules can be searched:
$ module avail-------------------- /opt/rit/modules ---------------------------------abyss/1.5.2 freesurfer/5.3.0 lib/htslib/1.2.1 abyss/1.9.0 gapcloser/1.12-r6 lib/htslib/1.3afni/17.0.10 gapfiller/1-10 lib/htslib/1.3.2albert/20170105 gatk/3.4-46 lib/ICE/1.0.9...
$ module load <name of software>
$ module load <name of software>
$ module load <name of software>
$ module unload <name of software>
$ module load <name of software>
$ module unload <name of software>
$ module load <name of software>
$ module unload <name of software>
Default behavior is to load the latest version if not specified
Use 'module purge' to clear your environment before loading something different
Storage on the clusters and servers should be treated as temporary
Become familiar with the disk architecture on each machine
Storage on the clusters and servers should be treated as temporary
Become familiar with the disk architecture on each machine
Keep backups
Scheduler assigns jobs from the queue to the compute nodes
ISU uses SLURM
Think about the scheduler like a hotel reservation - you're charged for the room whether you use it or not, and if you ask for an especially long or large reservation, the room may not be available when you want it.
Scheduler assigns jobs from the queue to the compute nodes
ISU uses SLURM
Think about the scheduler like a hotel reservation - you're charged for the room whether you use it or not, and if you ask for an especially long or large reservation, the room may not be available when you want it.
Basic info required: how many nodes, how long
Scheduler assigns jobs from the queue to the compute nodes
ISU uses SLURM
Think about the scheduler like a hotel reservation - you're charged for the room whether you use it or not, and if you ask for an especially long or large reservation, the room may not be available when you want it.
Basic info required: how many nodes, how long
Script writer can get you started
#!/bin/bash#SBATCH --time=1:00:00 # walltime#SBATCH --nodes=2 # number of nodes in this job#SBATCH --ntasks-per-node=16 # total number of processor cores in this job#SBATCH --output=myout_%J.log#SBATCH --error=myerr_%J.errmodule load RRscript MyThing.R
#!/bin/bash#SBATCH --time=1:00:00 # walltime#SBATCH --nodes=2 # number of nodes in this job#SBATCH --ntasks-per-node=16 # total number of processor cores in this job#SBATCH --output=myout_%J.log#SBATCH --error=myerr_%J.errmodule load RRscript MyThing.R
Then submit:
$ sbatch mysbatch.sh
Over or under using resources
Not taking advantage of the right machine for the problem
Over or under using resources
Not taking advantage of the right machine for the problem
Moving data through slow links
Over or under using resources
Not taking advantage of the right machine for the problem
Moving data through slow links
Trying to scale up programs that weren't designed for large datasets
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
Esc | Back to slideshow |