Two Approaches to Survival Analysis of Open Source Python Projects

03/15/2022
by   Derek Robinson, et al.
0

A recent study applied frequentist survival analysis methods to a subset of the Software Heritage Graph and determined which attributes of an OSS project contribute to its health. This paper serves as an exact replication of that study. In addition, Bayesian survival analysis methods were applied to the same dataset, and an additional project attribute was studied to serve as a conceptual replication. Both analyses focus on the effects of certain attributes on the survival of open-source software projects as measured by their revision activity. Methods such as the Kaplan-Meier estimator, Cox Proportional-Hazards model, and the visualization of posterior survival functions were used for each of the project attributes. The results show that projects which publish major releases, have repositories on multiple hosting services, possess a large team of developers, and make frequent revisions have a higher likelihood of survival in the long run. The findings were similar to the original study; however, a deeper look revealed quantitative inconsistencies.

READ FULL TEXT

page 3

page 5

research
06/19/2019

On the abandonment and survival of open source projects: An empirical investigation

Background: Evolution of open source projects frequently depends on a sm...
research
05/02/2023

Towards a Critical Open-Source Software Database

Open-source software (OSS) plays a vital role in the modern software eco...
research
08/12/2019

How do Developers Promote Open Source Projects?

Open source projects have an increasing importance on modern software de...
research
02/21/2023

SurvLIMEpy: A Python package implementing SurvLIME

In this paper we present SurvLIMEpy, an open-source Python package that ...
research
03/19/2021

An Exploratory Study of Project Activity Changepoints in Open Source Software Evolution

To explore the prevalence of abrupt changes (changepoints) in open sourc...
research
06/12/2020

Predicting Project Health for Open Source Projects (using the DECART Hyperparameter Optimizer)

Software developed on public platforms are a source of data that can be ...
research
04/09/2021

Memory Error Detection in Security Testing

We study 10 C/C++ projects that have been using a static analysis securi...

Please sign up or login with your details

Forgot password? Click here to reset