2012 IEEE Seventh International Conference on Networking, Architecture, and Storage

Abstract

As organizations begin to use data-intensive cluster computing systems such as Hadoop MapReduce to handle large-scale data, job scheduling becomes critical to achieving efficiency. In the default Hadoop MapReduce implementation, jobs are scheduled in FIFO order, which easily starves small jobs when resources are occupied by large jobs; the Fair Scheduler, in turn, is inefficient when handling large jobs and suffers from the sticky-slots problem. In this paper, we propose a new job scheduling algorithm, TDWS. The algorithm takes into account the characteristics of different applications to meet their different needs. In addition, it is highly robust to heterogeneity and can easily achieve optimal data locality. Our experiments demonstrate the feasibility and efficiency of the solution.
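To make the starvation problem concrete, the following is a minimal, self-contained Java sketch (Java being Hadoop's implementation language). It illustrates only the background problem the abstract describes, not the paper's TDWS algorithm: the job names, task counts, slot count, and the equal-share fair policy are all hypothetical.

import java.util.*;

/**
 * Minimal simulation contrasting FIFO with equal-share ("fair") slot
 * allocation. Illustrative sketch only; not the TDWS algorithm.
 */
public class SchedulerSketch {
    record Job(String name, int tasks) {}

    // FIFO: the head-of-line job takes every free slot until it finishes,
    // so a small job queued behind a large one waits for the whole job.
    static void fifo(Deque<Job> queue, int slots) {
        int time = 0;
        while (!queue.isEmpty()) {
            Job head = queue.poll();
            int remaining = head.tasks();
            while (remaining > 0) {
                remaining -= slots;   // all slots go to the head job
                time++;
            }
            System.out.printf("FIFO: %s done at t=%d%n", head.name(), time);
        }
    }

    // Fair sharing: each waiting job gets an equal share of slots per
    // round, so small jobs finish early even while a large job runs.
    static void fair(List<Job> jobs, int slots) {
        Map<String, Integer> left = new LinkedHashMap<>();
        jobs.forEach(j -> left.put(j.name(), j.tasks()));
        int time = 0;
        while (!left.isEmpty()) {
            int share = Math.max(1, slots / left.size());
            time++;
            for (Iterator<Map.Entry<String, Integer>> it =
                     left.entrySet().iterator(); it.hasNext(); ) {
                var e = it.next();
                e.setValue(e.getValue() - share);
                if (e.getValue() <= 0) {
                    System.out.printf("Fair: %s done at t=%d%n",
                                      e.getKey(), time);
                    it.remove();
                }
            }
        }
    }

    public static void main(String[] args) {
        int slots = 4;  // hypothetical cluster capacity
        List<Job> jobs = List.of(new Job("large", 40), new Job("small", 4));
        fifo(new ArrayDeque<>(jobs), slots);
        fair(jobs, slots);
    }
}

In this run the small job finishes at t=11 under FIFO, because it waits behind the entire large job, but at t=2 under equal sharing. This is the starvation the abstract attributes to the default FIFO scheduler, while the fair policy's own weaknesses (inefficiency on large jobs and sticky slots) are what motivate TDWS.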
