Log Management

One of the more mundane, perhaps boring, but necessary administration tasks is checking system logs – the source of knowledge or intelligence of what is happening in the cluster.

Parallel I/O Chases Amdahl Away

Scalability abhors serial computation, but parallel I/O can defeat those limitations.

Rocky Reaches for a Role in HPC

Can this enterprise Linux fill the void left by CentOS?

Sharing a Linux Terminal Over the Web

The ability to share a terminal over the web could multiply the effectiveness of admins and users. The tty-share tool might be the answer.

Sharing Linux Terminals

Sometimes sharing a screen between two users is enormously helpful. We look at two terminal sharing tools: screen and tmux.

Performance Health Check

Many HPC systems check the state of a node before running an application, but not very many check that the performance of the node is acceptable before running the job.

Distributed Linear Algebra with Mahout

The Apache Mahout distributed linear algebra framework delivers new tools and methods for performing data analysis, building machine learning data pipelines, and implementing machine learning models in production.

Extended File Attributes

One way to store metadata is with the originating file in extended file attributes.

What Is an Inode?

Understanding inodes is key to a better understanding of HPC filesystems.

Working with the Lustre Filesystem

The Lustre open source distributed, parallel filesystem scales to high-performance computing environments.