YARN Schedulers for Hadoop MapReduce Jobs: Design Goals, Issues and Taxonomy

Gnanendra Kotikam; Lokesh Selvaraj

doi:10.2174/2666255816666220831125012

ISSN: 2666-2558
E-ISSN: 2666-2566

YARN Schedulers for Hadoop MapReduce Jobs: Design Goals, Issues and Taxonomy
By Gnanendra Kotikam and Lokesh Selvaraj
Source: Recent Advances in Computer Science and Communications, Volume 16, Issue 6, Jul 2023, p. 44 - 55
DOI: https://doi.org/10.2174/2666255816666220831125012
- Available online: 01 Jul 2023

Abstract

Objective: Big Data processing is a demanding task, and several big data processing frameworks have emerged in recent decades. The performance of these frameworks is greatly dependent on resource management models. Methods: YARN is one of such models which acts as a resource management layer and provides computational resources for execution engines (Spark, MapReduce, storm, etc.) through its schedulers. The most important aspect of resource management is job scheduling. Results: In this paper, we first present the design goal of YARN real-life schedulers (FIFO, Capacity, and Fair) for the MapReduce engine. Later, we discuss the scheduling issues of the Hadoop MapReduce cluster. Conclusion: Many efforts have been carried out in the literature to address issues of data locality, heterogeneity, straggling, skew mitigation, stragglers and fairness in Hadoop MapReduce scheduling. Lastly, we present the taxonomy of different scheduling algorithms available in the literature based on some factors like environment, scope, approach, objective and addressed issues.

Article metrics loading...

/content/journals/rascs/10.2174/2666255816666220831125012

2023-07-01

2026-02-28

From This Site

/content/journals/rascs/10.2174/2666255816666220831125012

dcterms_title,dcterms_subject,pub_keyword

-contentType:Contributor -contentType:Concept -contentType:Institution

10

5

Full text loading...

/content/journals/rascs/10.2174/2666255816666220831125012

Article Type: Other

Keyword(s): energy consumption; fair scheduling; Hadoop map reduce; scheduling issues; virtualization; YARN schedulers

YARN Schedulers for Hadoop MapReduce Jobs: Design Goals, Issues and Taxonomy

Abstract

From This Site

Most Read This Month

Most Cited Most Cited RSS feed

Key Issues in Software Reliability Growth Models

An Ensemble of Bacterial Foraging, Genetic, Ant Colony and Particle Swarm Approach EB-GAP: A Load Balancing Approach in Cloud Computing

Remaining Useful Life Prediction of Lithium-ion Batteries Using Multiple Kernel Extreme Learning Machine

ROUGE-SS: A New ROUGE Variant for the Evaluation of Text Summarization

Extensive Review of Literature on Explainable AI (XAI) in Healthcare Applications

An Analog Circuit Fault Diagnosis Approach Based on Wavelet-based Fractal Analysis and Multiple Kernel SVM

Research on Monitoring System of Daily Statistical Indexes Through Big Data

A Study on E-Learning and Recommendation System

Container Elasticity: Based on Response Time using Docker

Revolutionizing Agriculture: A Comprehensive Review of IoT Farming Technologies