Hadoop Performance Tuning Guide
Hadoop is a large and complex software framework involving a number of components interacting with each other across multiple hardware systems. Bottlenecks in a subset of the hardware systems within the cluster can cause overall poor performance of the underlying Hadoop workload. In this tuning guide, we attempt to provide the audience with a holistic approach of Hadoop performance tuning methodologies and best practices.