Trong Nhan Phan, Josef Küng, Tran Khanh Dang,
"An Efficient Similarity search in Large Data Collections with MapReduce, in Future Data and Security Engineering,Proceedings of the first International Conference, FDSE 2014,HO Chi Minh City Vietnam Nov"
: Future Data and Security Engineering,Proceedings of the first International Conference, FDSE 2014,HO Chi Minh City Vietnam Nov., Serie Lecture Notes in Computer Science (LNCS), Vol. 8860, Springer, Berlin, Heidelberg, Seite(n) 44-57, 11-2014
Original Titel:
An Efficient Similarity search in Large Data Collections with MapReduce, in Future Data and Security Engineering,Proceedings of the first International Conference, FDSE 2014,HO Chi Minh City Vietnam Nov
Sprache des Titels:
Englisch
Original Buchtitel:
Future Data and Security Engineering,Proceedings of the first International Conference, FDSE 2014,HO Chi Minh City Vietnam Nov.
Original Kurzfassung:
The era of big data has been calling for many innovations on improving similarity search computing. Such unstoppable large amounts of data threaten both processing capacity and performance of existing information systems. Joining the challenges on scalability, we propose an efficient similarity search in large data collections with MapReduce. In addition, we make the best use of the proposed scheme for widespread similarity search cases including pairwise similarity, search by example, range query, and k-Nearest Neighbor query. Moreover, collaborative strategic refinements are utilized to effectively eliminate unnecessary computations and efficiently speed up the whole process. Last but not least, our methods are enhanced by experiments, along with a previous work, on real large datasets, which shows how well these methods are verified.