Multiple Authors Detection: A Quantitative Analysis of Dream of the Red Chamber

12/19/2014
by   Xianfeng Hu, et al.
0

Inspired by the authorship controversy of Dream of the Red Chamber and the application of machine learning in the study of literary stylometry, we develop a rigorous new method for the mathematical analysis of authorship by testing for a so-called chrono-divide in writing styles. Our method incorporates some of the latest advances in the study of authorship attribution, particularly techniques from support vector machines. By introducing the notion of relative frequency as a feature ranking metric our method proves to be highly effective and robust. Applying our method to the Cheng-Gao version of Dream of the Red Chamber has led to convincing if not irrefutable evidence that the first 80 chapters and the last 40 chapters of the book were written by two different authors. Furthermore, our analysis has unexpectedly provided strong support to the hypothesis that Chapter 67 was not the work of Cao Xueqin either. We have also tested our method to the other three Great Classical Novels in Chinese. As expected no chrono-divides have been found. This provides further evidence of the robustness of our method.

READ FULL TEXT
research
11/06/2017

Authorship Analysis of Xenophon's Cyropaedia

In the past several decades, many authorship attribution studies have us...
research
03/26/2015

Unsupervised authorship attribution

We describe a technique for attributing parts of a written text to a set...
research
07/06/2021

A nonBayesian view of Hempel's paradox of the ravens

In Hempel's paradox of the ravens, seeing a red pencil is considered as ...
research
08/15/2022

Reproduction and Replication of an Adversarial Stylometry Experiment

Maintaining anonymity while communicating using natural language remains...
research
10/30/2022

Recognizing Handwriting Styles in a Historical Scanned Document Using Scikit-Fuzzy c-means Clustering

The forensic attribution of the handwriting in a digitized document to m...
research
03/01/2019

Quantitative Robustness of Localized Support Vector Machines

The huge amount of available data nowadays is a challenge for kernel-bas...

Please sign up or login with your details

Forgot password? Click here to reset