Wikibook-Bot - Automatic Generation of a Wikipedia Book

12/28/2018
by Shahar Admati, et al.

A Wikipedia book (known as a Wikibook) is a collection of Wikipedia articles on a particular theme, organized as a book. We propose Wikibook-Bot, a machine-learning-based technique for automatically generating high-quality Wikibooks from a concept provided by the user. To create the Wikibook, we apply machine learning algorithms to the different steps of the proposed technique. First, we need to decide whether an article belongs to a specific Wikibook (a classification task). Then, we need to divide the chosen articles into chapters (a clustering task). Finally, we deal with the ordering task, which includes two subtasks: ordering the articles within each chapter and ordering the chapters themselves. We propose a set of structural, text-based, and Wikipedia-specific features, and we show that by using these features, a machine learning classifier can successfully address the above challenges. The predictive performance of the proposed method is evaluated by comparing the auto-generated books to 407 existing Wikibooks that were manually created by humans. For all tasks, we obtained high and statistically significant results when comparing the Wikibook-Bot books to books that were manually generated by Wikipedia contributors.
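
The three steps described above (classification, clustering, ordering) can be sketched as a toy pipeline. This is only a minimal illustration under loud assumptions, not the authors' implementation: keyword-overlap (Jaccard) similarity stands in for the trained classifier and for the paper's features, the thresholds are arbitrary, and every name here (`Article`, `select_articles`, `group_into_chapters`, `order_book`, `build_wikibook`) is hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Article:
    title: str
    keywords: frozenset  # hypothetical stand-in for the paper's features


def jaccard(a, b):
    """Jaccard similarity between two keyword sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def select_articles(concept, candidates, threshold=0.2):
    """Step 1 (classification): keep articles judged to belong to the book.
    Assumption: a similarity threshold replaces a trained classifier."""
    return [a for a in candidates if jaccard(concept, a.keywords) >= threshold]


def group_into_chapters(articles, threshold=0.3):
    """Step 2 (clustering): greedy single pass; an article joins the first
    chapter whose first member is similar enough, else starts a new one."""
    chapters = []
    for article in articles:
        for chapter in chapters:
            if jaccard(chapter[0].keywords, article.keywords) >= threshold:
                chapter.append(article)
                break
        else:
            chapters.append([article])
    return chapters


def order_book(concept, chapters):
    """Step 3 (ordering): sort articles within each chapter, then the
    chapters, by descending similarity to the concept (ties by title)."""
    def key(article):
        return (-jaccard(concept, article.keywords), article.title)

    ordered = [sorted(chapter, key=key) for chapter in chapters]
    return sorted(ordered, key=lambda chapter: key(chapter[0]))


def build_wikibook(concept, candidates):
    """Run the three steps in sequence and return a list of chapters."""
    selected = select_articles(concept, candidates)
    return order_book(concept, group_into_chapters(selected))


# Made-up example data, for illustration only.
concept = frozenset({"machine", "learning", "model", "data"})
candidates = [
    Article("Supervised learning", frozenset({"machine", "learning", "model", "labels"})),
    Article("Decision tree", frozenset({"learning", "model", "tree", "labels"})),
    Article("Cluster analysis", frozenset({"data", "learning", "groups", "distance"})),
    Article("Baroque music", frozenset({"music", "composer", "opera"})),
]
book = build_wikibook(concept, candidates)
titles = [[article.title for article in chapter] for chapter in book]
```

In this toy run the unrelated article is filtered out in step 1, and the remaining three articles end up in two chapters. A real system would replace each stand-in with a learned model, as the abstract describes.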
