Log Everything
To be a good HPC system administrator for today’s environment, you need to be a lumberjack.
HPC Compilers
If you compile software on an expensive supercomputer, its a good idea to select the languages and compilers with particular care. We report on tried-and-proved tools used on SuperMUC, a supercomputer at the Leibniz Supercomputing Center in Germany.
More Small Tools
We look at some additional tools that you might find useful when troubleshooting HPC systems.
Freeing the GPU
Exploring AMD’s ambitious Radeon Open Compute Ecosystem with ROCm senior director Greg Stoner.
Exploring AMD’s Ambitious ROCm Initiative
AMD’s ROCm platform brings new freedom and portability to the GPU space.
It’s the Little Things
Several very sophisticated tools can be used to manage HPC systems, but it’s the little things that make them hum. Here are a few favorites.
Resource Monitoring For Remote Applications
Remora combines profiling and system monitoring to help you get to the root of application problems by revealing its use of resources.
Multiprocessing in Python with Fortran and OpenMP
Creating Python modules with Fortran OpenMP code makes all available cores accessible to Python functions.
ClusterHAT
Inexpensive, small, portable, low-power clusters are fantastic for many HPC applications. One of the coolest small clusters is the ClusterHAT for Raspberry Pi.
SquashFS
If you are an intensive, or even a typical, computer user, you store an amazing amount of data on your personal systems, servers, and HPC systems that you rarely touch. SquashFS is an underestimated filesystem that can address that needed, but little used, data.