Virginia Tech provides computing resources for computing with is suitable for simulations, machine learning model training, which called Advanced Research Computing. It provides a user friendly guide on how to use its system, but I think it would be nicer to try to summarize the most essential ones when you want to use it in a adequate manner. I also refer to Ramashish Gaurav‘s blog post about how to submit jobs as well as interactive mode for using the resources.
This link contains all the workshops regarding ARC that is available for the internet.
- Advanced Research Computing (ARC) Overview, 22 Spring: Slides, Video
- Connect to ARC systems and run your first jobs, 22 Spring: Slides, Video
- Get your software/code to run on ARC, 22 Spring: Slides, Video
- Monitoring Resource Utilization and Job Efficiency
- Getting the Best Data Storage Performance on ARC Filesystems
- Launching in Parallel
First you need to create your account here and then you will be asked to agree terms and conditions for requesting access. You will receive an email saying you’ve created the account, for me I could use infer, TinkerCliffs clusters, each name refers to a type of clusters.
| Name | specifications |
| TinkerCliffs | 316 Nodes*128 Cores(AMD EPYC ROME)+16 Nodes*96 Cores (Intel Cascade Lake-AP) |
| infer | 16 Nodes*32 cores(Skylake)+ 1 NVIDIA T4(2560 CUDA+320 tensor cores) 40 Nodes*28 cores(Broadwell)+ 2 NVIDIA P100(3560 CUDA) 40 Nodes*24 cores(Skylake)+ 2 NVIDIA V100(5120 CUDA+640 tensor cores) |
Access
You can use the following line to login to the arc server, please be reminded that you don’t have access to the server off campus unless you connect to the school’s proxy.
user@infer1.arc.vt.edu;
user@tinkercliffs1.arc.vt.edu;
user@tinkercliffs2.arc.vt.edu;When you input your pid as well as password which is the credential for logging in your vt services, you will be called on your phone for 2 factor authentication, you only need to pick up the call and then press a key, here’s the output after logged in.
+---------------------------------------------------------------------------+
| This computer is the property of Virginia Polytechnic Institute and State |
| University. Use of this equipment implies agreement to the university’s |
| Acceptable Use Policy (Policy 7000). For more information, please visit: |
| https://vt.edu/acceptable-use.html |
+---------------------------------------------------------------------------+
+---------------------------------------------------------------------------+
| NOTE: VT Enterprise Directory Password authentication requires a DUO |
| second factor challenge. After your password is provided, you |
| will receive a DUO challenge. |
+---------------------------------------------------------------------------+
use “screen” to detach and attach shell for running the server.
SLURM
The clusters uses SLURM which is short for scheduler and cluster resource manager.
batch job
Accounting
quota, scontrol show part, squeue, showusage, tcgetusage <accountname>, showqos
Leave a Reply