Skip to content
2000
Volume 16, Issue 6
  • ISSN: 2666-2558
  • E-ISSN: 2666-2566

Abstract

Objective: Big Data processing is a demanding task, and several big data processing frameworks have emerged in recent decades. The performance of these frameworks is greatly dependent on resource management models. Methods: YARN is one of such models which acts as a resource management layer and provides computational resources for execution engines (Spark, MapReduce, storm, etc.) through its schedulers. The most important aspect of resource management is job scheduling. Results: In this paper, we first present the design goal of YARN real-life schedulers (FIFO, Capacity, and Fair) for the MapReduce engine. Later, we discuss the scheduling issues of the Hadoop MapReduce cluster. Conclusion: Many efforts have been carried out in the literature to address issues of data locality, heterogeneity, straggling, skew mitigation, stragglers and fairness in Hadoop MapReduce scheduling. Lastly, we present the taxonomy of different scheduling algorithms available in the literature based on some factors like environment, scope, approach, objective and addressed issues.

Loading

Article metrics loading...

/content/journals/rascs/10.2174/2666255816666220831125012
2023-07-01
2025-07-15
Loading full text...

Full text loading...

/content/journals/rascs/10.2174/2666255816666220831125012
Loading
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error
Please enter a valid_number test