Publication:
Structure-Preserving And Query-Biased Document Summarisation For Web Searching

dc.contributor.authorPembe, F. Canan
dc.contributor.authorGüngör, Tunga
dc.contributor.authorIDTR6099tr_TR
dc.date.accessioned2016-05-05T12:32:46Z
dc.date.available2016-05-05T12:32:46Z
dc.date.issued2009
dc.description.abstractPurpose - The purpose of this paper is to develop a new summarisation approach, namely structure-preserving and query-biased summarisation, to improve the effectiveness of web searching. During web searching, one aid for users is the document summaries provided in the search results. However, the summaries provided by current search engines have limitations in directing users to relevant documents. Design/methodology/approach - The proposed system consists of two stages: document structure analysis and summarisation. In the first stage, a rule-based approach is used to identify the sectional hierarchies of web documents. In the second stage, query-biased summaries are created, making use of document structure both in the summarisation process and in the output summaries. Findings - In structural processing, about 70 per cent accuracy in identifying document sectional hierarchies is obtained. The summarisation method is tested on a task-based evaluation method using English and Turkish document collections. The results show that the proposed method is a significant improvement over both unstructured query-biased summaries and Google snippets in terms of f-measure. Practical implications - The proposed summarisation system can be incorporated into search engines. The structural processing technique also has applications in other information systems, such as browsing, outlining and indexing documents. Originality/value - In the literature on summarisation, the effects of query-biased techniques and document structure are considered in only a few works and are researched separately. The research reported here differs from traditional approaches by combining these two aspects in a coherent framework. The work is also the first automatic summarisation study for Turkish targeting web search.tr_TR
dc.identifier.issn1468-4527
dc.identifier.scopus2-s2.0-70249124175
dc.identifier.urihttp://hdl.handle.net/11413/1283
dc.identifier.wos270768100004
dc.language.isoen
dc.publisherEmerald Group Publishing Limited, Howard House, Wagon Lane, Bingley Bd16 1Wa, W Yorkshire, England
dc.relationOnline Information Reviewtr_TR
dc.subjectData Structurestr_TR
dc.subjectDocument Deliverytr_TR
dc.subjectMarkup Languagestr_TR
dc.subjectSearch Enginestr_TR
dc.subjectWorldwide Webtr_TR
dc.subjectExtractiontr_TR
dc.subjectVeri Yapılarıtr_TR
dc.subjectBelge Teslimtr_TR
dc.subjectBiçimlendirme Dilleritr_TR
dc.subjectArama Motorlarıtr_TR
dc.subjectDünya Çapında Ağtr_TR
dc.subjectÇıkarmatr_TR
dc.titleStructure-Preserving And Query-Biased Document Summarisation For Web Searchingtr_TR
dc.typeArticle
dspace.entity.typePublication
local.indexed.atWOS
local.indexed.atScopus

Files

License bundle

Now showing 1 - 1 of 1
Placeholder
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: