Reproducible Domain-Specific Knowledge Graphs in the Life Sciences: a Systematic Literature Review

by Samira Babalou, et al.

Knowledge graphs (KGs) are widely used for representing and organizing structured knowledge in diverse domains. However, the creation and upkeep of KGs pose substantial challenges. Developing a KG demands extensive expertise in data modeling, ontology design, and data curation. Furthermore, KGs are dynamic, requiring continuous updates and quality control to ensure accuracy and relevance. These intricacies contribute to the considerable effort required for their development and maintenance. One critical dimension of KGs that warrants attention is reproducibility. The ability to replicate and validate KGs is fundamental for ensuring the trustworthiness and sustainability of the knowledge they represent. Reproducible KGs not only support open science by allowing others to build upon existing knowledge but also enhance transparency and reliability in disseminating information. Despite the growing number of domain-specific KGs, a comprehensive analysis concerning their reproducibility has been lacking. This paper addresses this gap by offering a general overview of domain-specific KGs and comparing them based on various reproducibility criteria. Our study over 19 different domains shows that only eight out of 250 domain-specific KGs (3.2%) provide publicly available source code. Among these, only one system could successfully pass our reproducibility assessment (14.3%). These findings highlight the challenges and gaps in achieving reproducibility across domain-specific KGs. Our finding that only 0.4% of domain-specific KGs are reproducible shows a clear need for further research and a shift in cultural practices.




