Finite and infinite Mallows ranking models, maximum likelihood estimator, and regeneration
In this paper we are concerned with various Mallows ranking models. First we study the statistical properties of the MLE of Mallows' ϕ model: P_θ, π_0(π) ∝(-θ inv(π∘π_0^-1)), where θ is the dispersion parameter and π_0 is the central ranking. We prove that (1). the MLE θ is biased upwards for both Mallows' ϕ model and the single parameter IGM model; (2). the MLE π_0 converges exponentially for Mallows' ϕ model. We also make connections of various Mallows ranking models, encompassing the work of Gnedin and Olshanski, and Pitman and Tang. Motivated by the infinite top-t ranking model of Meilǎ and Bao, we propose an algorithm to select the model size t automatically. The key idea relies on the regenerative property of such an infinite permutation. Finally, we apply our algorithm to several data sets including the APA election data and the University homepage search data.
READ FULL TEXT