MUDOS-NG: Multi-document Summaries Using N-gram Graphs (Tech Report)

12/09/2010
by George Giannakopoulos, et al.

This report describes MUDOS-NG, a summarization system that applies a set of generic, language-independent methods to generate extractive summaries. Most of the proposed methods are combinations of simple operators on a generic character n-gram graph representation of texts. The report defines the operators used on n-gram graphs and proposes applying them in several subtasks of multi-document summarization: document analysis, salient sentence selection, query expansion and redundancy control. It also introduces a novel chunking methodology and a novel way of assigning concepts to sentences for query expansion. Experiments on widely used corpora from the Document Understanding Conference (DUC) and the Text Analysis Conference (TAC) give promising results and provide evidence for the potential of the generic methods introduced. The work aims to establish core methods that exploit the n-gram graph representation and can serve as the basis for more advanced summarization systems.
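
To make the idea of "simple operators on a character n-gram graph" concrete, here is a minimal sketch in Python. The abstract does not specify the construction, so everything in it is an assumption made for illustration: the function names (`ngram_graph`, `intersect`, `merge`, `value_similarity`), the parameters `n` and `window`, the edge-weighting scheme, and the averaging rule used when combining graphs. It should not be read as the MUDOS-NG implementation.

```python
from collections import Counter


def ngram_graph(text, n=3, window=3):
    """Map a text to weighted edges between character n-grams.

    Assumption: an edge (a, b) counts how often n-gram b starts within
    `window` positions after n-gram a. Nodes are implicit in the edges.
    """
    grams = [text[i:i + n] for i in range(len(text) - n + 1)]
    edges = Counter()
    for i, a in enumerate(grams):
        for b in grams[i + 1:i + 1 + window]:
            edges[(a, b)] += 1
    return edges


def intersect(g1, g2):
    """Keep only edges present in both graphs, averaging their weights."""
    return {e: (g1[e] + g2[e]) / 2 for e in g1.keys() & g2.keys()}


def merge(g1, g2):
    """Union of edges; edges present in both graphs get the average weight."""
    out = dict(g1)
    for e, w in g2.items():
        out[e] = (out[e] + w) / 2 if e in out else w
    return out


def value_similarity(g1, g2):
    """Score in [0, 1]: shared edges contribute min/max of their weights,
    normalized by the size of the larger graph (illustrative choice)."""
    if not g1 or not g2:
        return 0.0
    shared = g1.keys() & g2.keys()
    score = sum(min(g1[e], g2[e]) / max(g1[e], g2[e]) for e in shared)
    return score / max(len(g1), len(g2))
```

Under these assumptions, a sentence could be scored for salience by comparing its graph with a merged graph of the whole document set, and redundancy could be controlled by skipping a candidate sentence whose graph is too similar to the graph of the summary built so far. Because the representation works on raw characters, none of these steps needs a tokenizer or other language-specific resources.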


