Title: COMBINATION OF STATISTICAL AND LANGUAGE PROCESSING METHODS IN NEWS SUMMARIZATION: A CASE STUDY FOR VIETNAMESE NEWS

Year of Publication: 2013
Page Numbers: 119-128
Authors: Vo Thanh Hung, Phan Thi Tuoi, Quan Thanh Tho
Conference Name: The Second International Conference on Digital Enterprise and Information Systems (DEIS2013)
- Malaysia

Abstract:


Automatic summarization refers to a technique of reducing of a text document by a computer program to create a summary that retains the most important points of the original document. Among various kinds of textual documents, news plays an important role in our life, which is always updated sequentially every day in large scale. A technique for automatic creation of new summarization would help readers to quickly capture significant information from vast amount of electronic newspapers available on the Internet nowadays. In this paper, we introduce an approach to handle this issue on Vietnamese news. By combining both statistical and linguistic methods, we are able to produce meaningful summarization of real articles collected from major newspaper channels in Vietnam.