Title: Extraction of Automatic Search Result Records Using Content Density Algorithm Based on Node Similarity

Year of Publication: Nov - 2014
Page Numbers: 69-75
Authors: Yasar Gozudeli, Oktay Yildiz, Hacer Karacan, Mohammed R. Baker, Ali Minnet, Murat Kalender, Ozcan Ozay, M. Ali Akcayol
Conference Name: The International Conference on Data Mining, Internet Computing, and Big Data (BigData2014)
- Malaysia

Abstract:


In this paper, a new method proposed for finding and extracting the SRRs. The method first detects content dense nodes on HTML DOM and then extracts SRRs to suggest a list of candidate HTML DOM nodes for a given single research result Web page instance. Afterwards an evaluation algorithm has been applied to the candidate list to find the best solution without any human interaction and manual process. Experimental results show that the proposed methods are successful for finding and extracting the SRRs.