A Baseline Analysis for Podcast Abstractive Summarization

08/24/2020
by   Chujie Zheng, et al.
0

Podcast summary, an important factor affecting end-users' listening decisions, has often been considered a critical feature in podcast recommendation systems, as well as many downstream applications. Existing abstractive summarization approaches are mainly built on fine-tuned models on professionally edited texts such as CNN and DailyMail news. Different from news, podcasts are often longer, more colloquial and conversational, and noisier with contents on commercials and sponsorship, which makes automatic podcast summarization extremely challenging. This paper presents a baseline analysis of podcast summarization using the Spotify Podcast Dataset provided by TREC 2020. It aims to help researchers understand current state-of-the-art pre-trained models and hence build a foundation for creating better models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/12/2022

Implementing Deep Learning-Based Approaches for Article Summarization in Indian Languages

The research on text summarization for low-resource Indian languages has...
research
05/08/2023

The Current State of Summarization

With the explosive growth of textual information, summarization systems ...
research
06/23/2023

Abstractive Text Summarization for Resumes With Cutting Edge NLP Transformers and LSTM

Text summarization is a fundamental task in natural language processing ...
research
02/11/2016

Variations of the Similarity Function of TextRank for Automated Summarization

This article presents new alternatives to the similarity function for th...
research
04/17/2021

Transductive Learning for Abstractive News Summarization

Pre-trained language models have recently advanced abstractive summariza...
research
07/15/2020

Align then Summarize: Automatic Alignment Methods for Summarization Corpus Creation

Summarizing texts is not a straightforward task. Before even considering...
research
02/02/2023

Combining Deep Neural Reranking and Unsupervised Extraction for Multi-Query Focused Summarization

The CrisisFACTS Track aims to tackle challenges such as multi-stream fac...

Please sign up or login with your details

Forgot password? Click here to reset