Since 1986 - Covering the Fastest Computers in the World and the People Who Run Them

Language Flags

Tag: hadoop distributed file systems

Accelerate Hadoop MapReduce Performance using Dedicated OrangeFS Servers

Sep 9, 2013 |

Recent tests performed at Clemson University achieved a 25 percent improvement in Apache Hadoop Terasort run times by replacing Hadoop Distributed File System (HDFS) with an OrangeFS configuration using dedicated servers. Key components included extension of the MapReduce “FileSystem” class and a Java Native Interface (JNI) shim to the OrangeFS client. No modifications of Hadoop were required, and existing MapReduce jobs require no modification to utilize OrangeFS.