New User Quick Start
Accounts and Projects
Request a New Allocation
What are the differences between project types?
What happens after a project request is approved?
Guidance on Summit Allocation Requests
Applying for a user account
Checking the status of your application
Get access to additional projects
Frequently Asked Questions
How do I apply for an account?
What is the status of my application?
How should I acknowledge the OLCF in my publications and presentations?
What is a subproject?
I no longer need my account. Who should I inform and what should I do with my OLCF issued RSA SecurID token?
My SecurID token is broken/expired. What should I do?
Getting Help
Additional Resources
Documents and Forms
Forms for Requesting a Project Allocation
Forms for Requesting an Account
Forms to Request Changes to Computers, Jobs, or Accounts
Report Templates
Miscellaneous Forms
OLCF Policy Guides
OLCF Acknowledgement
Computing Policy
Computer Use
Data Use
Software Use
User Accountability
Data Management Policy
Introduction
Data Retention, Purge, & Quotas
Data Prohibitions & Safeguards
Software
Security Policy
Scope
Personal Use
Accessing OLCF Computational Resources
Data Management
Sensitive Data
Data Transfer
INCITE Allocation Under-utilization Policy
Project Reporting Policy
Non-proprietary Institutional User Agreement Policy
Access
Rules and Regulations
Safety and Health
Intent to Publish
Export Control
Intellectual Property
Special Requests and Policy Exemptions
Glossary
Additional Resources
Connecting
Connecting for the first time
Activating a new SecurID fob
PINs, Passcodes, and Tokencodes
X11 Forwarding
Systems Available to All Projects
OLCF System Hostnames
Checking System Availability
Systems
Summit User Guide
Summit Documentation Resources
System Overview
Summit Nodes
Node Types
System Interconnect
File Systems
Operating System
Hardware Threads
GPUs
Connecting
Data and Storage
Software
Shell & Programming Environments
Default Shell
Environment Management with Lmod
Compiling
Compilers
Linking in Libraries
Running Jobs
Login, Launch, and Compute Nodes
Batch Scripts
Interactive Jobs
Common bsub Options
Batch Environment Variables
Job States
Scheduling Policy
Job Dependencies
Job Launcher (jsrun)
CUDA-Aware MPI
Monitoring Jobs
Interacting With Jobs
Other LSF Commands
PBS/Torque/MOAB-to-LSF Translation
Easy Mode vs. Expert Mode
System Service Core Isolation
Resource Accounting
Other Notes
Debugging
Arm DDT
GDB
Valgrind
Optimizing and Profiling
Profiling GPU Code with NVIDIA Developer Tools
Score-P
Vampir
NVIDIA V100 GPUs
NVIDIA V100 SM
HBM2
NVIDIA NVLink
Volta Multi-Process Service
Unified Memory
Independent Thread Scheduling
Tensor Cores
Tesla V100 Specifications
Further Reading
Burst Buffer
NVMe (XFS)
Current NVMe Usage
Interactive Jobs Using the NVMe
NVMe Usage Example
Spectral Library
Known Issues
Open Issues
Resolved Issues
CUDA 10.1 Known Issues
Training System (Ascent)
File Systems
Obtaining Access to Ascent
Logging In to Ascent
Preparing For Frontier
HIP
Using HIP on Summit
Learning to Program with HIP
Previous Frontier Training Events
Rhea User Guide
System Overview
Compute nodes
Login nodes
File systems
Shell and programming environments
Default shell
Environment management with lmod
Installed Software
Compiling
Available compilers
Changing compilers
Compiler wrappers
Compiling threaded codes
Running Jobs
Login vs Compute Nodes on Commodity Clusters
Slurm
Interactive Batch Jobs on Commodity Clusters
Common Batch Options to Slurm
Batch Environment Variables
Modifying Batch Jobs
Monitoring Batch Jobs
Job Execution
Batch Queues on Rhea
Job Accounting on Rhea
Visualization tools
ParaView
VisIt
Remote Visualization using VNC (non-GPU)
Remote Visualization using VNC (GPU nodes)
Remote Visualization using Nice DCV (GPU nodes only)
Home
System Overview
Access & Connecting
Usage
Acceptable Tasks
Unacceptable Tasks
Data Transfer Nodes (DTNs)
System Overview
Access & Connecting
High Performance Storage System
System Overview
Ascent
System Overview
Andes
System Overview
Compute nodes
Login nodes
File systems
Shell and programming environments
Default shell
Environment management with lmod
Installed Software
Compiling
Available compilers
Changing compilers
Compiler wrappers
Compiling threaded codes
Running Jobs
Login vs Compute Nodes on Commodity Clusters
Slurm
Interactive Batch Jobs on Commodity Clusters
Common Batch Options to Slurm
Batch Environment Variables
Modifying Batch Jobs
Monitoring Batch Jobs
Job Execution
Batch Queues on Andes
Job Accounting on Andes
Visualization tools
ParaView
VisIt
Remote Visualization using VNC (non-GPU)
Remote Visualization using VNC (GPU nodes)
Remote Visualization using Nice DCV (GPU nodes only)
Services and Applications
Slate
Overview
What is Slate?
What is Kubernetes?
What is OpenShift?
Getting Started
Requesting A Slate Project Allocation
Logging in
Slate Namespaces
Install the OC tool
Test login with OC Tool
Guided Tutorial
Creating your project
Guided Web GUI Tutorial
Guided Tutorial: CLI
Adding a Pod to your Project
Image Building
Build Types
Examples
Logging into the registry externally
Workloads
Pods
Deployments
Networking
Services
NodePorts
Routes
Network Policies
Quick Access from Outside Slate
Persistent Storage
Creating A Persistent Volume Claim
Adding PVC To Pod
Backups
Application Deployment Examples
Build and Deploy Simple Website
Deploy MongoDB
Deploy NGINX with Hello World
Deploy Packages with Helm
MinIO Object Store (On an NCCS Filesystem)
Access OLCF Resources From Containers
Batch Job Submission
Mount OLCF Filesystems
Schedule Other Slate Resources
GPUs
OLCF-Provided Applications on Slate
Troubleshooting
Fix Container Image Permissions
Debugging
YAML Object Quick Reference
CronJobs
Deployments and Stateful Sets
Pods
Roles and Rolebindings
Routes, Services and Nodeports
Persistent Volume Claims
Glossary
myOLCF
Overview
What is myOLCF?
What can it do?
Can I suggest a feature?
Authenticating
OLCF Moderate Accounts
OLCF Open Accounts
Project Pages
Project Context
Switching Project Contexts
Available Pages
Account Pages
Account Context
Available Pages
Processing Project Membership Requests
Jupyter
Overview
Jupyter at OLCF
Access
CPU vs. GPU JupyterLab (Available Resources)
Working within GPFS and NFS (Launching a notebook)
Conda environments and custom notebooks
Manually stopping your JupyterLab session
Things to be aware of
Example Jupyter Notebooks
Data Storage and Transfers
Storage Overview
Storage Areas
Alpine IBM Spectrum Scale Filesystem
Performance under not ideal workload
Tips
Major difference between Lustre and IBM Spectrum Scale
Policy
Information
Purge
Data Retention
User-Centric Data Storage
User Home Directories (NFS)
User Home Quotas
User Home Permissions
User Home Backups
User Website Directory
User Archive Directories (HPSS)
User Archive Access
User Archive Accounting
Project-Centric Data Storage
Project Home Directories (NFS)
Project Home Path
Project Home Quotas
Project Home Permissions
Project Home Backups
Project Work Areas
Three Project Work Areas to Facilitate Collaboration
Backups
Project Archive Directories
Three Project Archive Areas Facilitae Collaboration on Archival Data
Project Archive Access
Transferring Data
Using common terminal tools
Using Globus from your local machine
HPSS Data Archival System
Using Globus
Using HSI
Additional HSI Documentation
Using HTAR
HTAR Limitations
Additional HTAR Documentation
Burst Buffer and Spectral Library
Software
Software News
ML/DL, Analytics
IBM Watson Machine Learning CE -> Open CE
Getting Started
Running Distributed Deep Learning Jobs
Setting up Custom Environments
Best Distributed Deep Learning Performance
Example
Programming with Big Data in R (pbdR)
Python on OLCF Systems
Profiling Tools
Tuning and Analysis Utilities (TAU)
Run-Time Environment Variables
Compile-Time Environment Variables
MiniWeather Example Application
CUDA Profiling Tools Interface
Tracing
Selective Instrumentation
Dynamic Phase
Static Phase
OpenMP Offload
User-Managed Software
Flux
Training
OLCF Training Calendar
OLCF Tutorials
OLCF Training Archive
OLCF GPU Hackathons
OLCF Vimeo Channel
Contributing to these docs
Submitting suggestions
Authoring content
Setup authoring environment
Edit the docs
Resources
GitHub Guidelines
OLCF User Documentation
Docs
»
Services and Applications
»
Jupyter
Edit on GitHub
Jupyter
ΒΆ
Overview
Jupyter at OLCF
Access
CPU vs. GPU JupyterLab (Available Resources)
Working within GPFS and NFS (Launching a notebook)
Conda environments and custom notebooks
Manually stopping your JupyterLab session
Things to be aware of
Example Jupyter Notebooks