An Open-Source Project for MapReduce Performance Self-Tuning

12/28/2019
by   Donghua Chen, et al.
0

Many Hadoop configuration parameters have significant influence in the performance of running MapReduce jobs on Hadoop. It is time-consuming and tedious for general users to manually tune the parameters for optimal MapReduce performance. Besides, most of existing self-tuning system have opaque implementation, making it difficult to use in practice. This study presents an open-source project that hosts the developing self-tuning system called Catla to address the issues. Catla integrates multiple direct search and derivative-free optimization-based techniques to facilitate tuning efficiency for users. An overview of the system and its usage are illustrated in this study. We also reported a simple example demonstrating the benefits of this ongoing project. Although this project is still developing and far from comprehensive, it is dedicated to contributing Hadoop ecosystem in terms of improving performance in big data analysis.

READ FULL TEXT

page 1

page 2

page 3

research
11/04/2021

Auto Tuning of Hadoop and Spark parameters

Data of the order of terabytes, petabytes, or beyond is known as Big Dat...
research
11/06/2019

Auptimizer – an Extensible, Open-Source Framework for Hyperparameter Tuning

Tuning machine learning models at scale, especially finding the right hy...
research
10/31/2017

A Prediction Model of the Project Life-span in Open Source Software Ecosystem

In nature ecosystems, animal life-spans are determined by genes and some...
research
10/26/2016

A self-tuning Firefly algorithm to tune the parameters of Ant Colony System (ACSFA)

Ant colony system (ACS) is a promising approach which has been widely us...
research
09/09/2018

Tuning the Performance of a Computational Persistent Homology Package

In recent years, persistent homology has become an attractive method for...
research
02/28/2021

Moroccan Dialect -Darija- Open Dataset

Darija Open Dataset (DODa) is an open-source project for the Moroccan di...
research
04/26/2021

Unikraft: Fast, Specialized Unikernels the Easy Way

Unikernels are famous for providing excellent performance in terms of bo...

Please sign up or login with your details

Forgot password? Click here to reset