Notice of Violation of IEEE Publication Principles
Pipelined-MapReduce: An Improved MapReduce Parallel Programing Model

Author

Wang, Li ; Ni, Zhiwei ; Zhang, Yiwen ; Wu, Zhang Jun ; Tang, Liyang

Author_Institution

Sch. of Manage., Hefei Univ. of Technol., Hefei, China

Volume

1

fYear

2011

fDate

28-29 March 2011

Firstpage

871

Lastpage

874

Abstract

Notice of Violation of IEEE Publication Principles

"Pipelined-MapReduced: An Improved MapReduce Parallel Programming Model"
by Li Wang, Zhiwei Ni, Yiwen Zhang, Zhang Jun Wu, Liyang Tang
in the Proceedings of the 2011 4th International Conference on Intelligent Computation Technology and Automation, March 2011, pp. 871-874

After careful and considered review of the content and authorship of this paper by a duly constituted expert committee, this paper has been found to be in violation of IEEE\´s Publication Principles.

This paper contains significant portions of original text from the paper cited below. The original text was copied without attribution (including appropriate references to the original author(s) and/or paper title) and without permission. The lead author, Li Wang, was solely responsible for the misconduct, and submitted the paper without the knowledge or consent of the other authors.

Due to the nature of this violation, reasonable effort should be made to remove all past references to this paper, and future references should be made to the following article:

"MapReduce Online"
by Tyson Condie, Neil Conway, Peter Alvaro, Joseph M. Hellerstein
in the Proceedings of the 7th USENIX Symposium on Networked Systems Design and Implementation (NSDI 2010), April 2010

MapReduce is a parallel programming model, and used to handle large datasets. The MapReduce program can be automatically concurrent executed in large-scale commodity machines. We proposed an improved MapReduce programming model-Pipelined-MapReduce, to solve the data intensive of information retrieval problems. Pipelined-MapReduce allows data transfer by pipeline between the operations, expanding the batched MapReduce programming model, and can reduce the completion time, and improve the system utilization rate. The experimental results demonstrate that the implementation of Pipelined-MapReduce can scale well and efficiently process large dataset- s on commodity machines.

Keywords

information retrieval; parallel programming; pipeline processing; MapReduce parallel programing model; batched MapReduce programming model; information retrieval problems; large scale commodity machines; pipelined-MapReduce; Computational modeling; Fault tolerance; Fault tolerant systems; File systems; Google; Programming; Training; Hadoop; MapReduce; Parallel processing; Pipelined-MapReduce;

fLanguage

English

Publisher

ieee

Conference_Titel

Intelligent Computation Technology and Automation (ICICTA), 2011 International Conference on

Conference_Location

Shenzhen, Guangdong

Print_ISBN

978-1-61284-289-9

Type

conf

DOI

10.1109/ICICTA.2011.593

Filename

5750651

Notice of Violation of IEEE Publication PrinciplesPipelined-MapReduce: An Improved MapReduce Parallel Programing Model

Wang, Li ; Ni, Zhiwei ; Zhang, Yiwen ; Wu, Zhang Jun ; Tang, Liyang

conf

Notice of Violation of IEEE Publication Principles
Pipelined-MapReduce: An Improved MapReduce Parallel Programing Model