Ancestral causal learning in high dimensions with a human genome-wide application

05/27/2019
by   Umberto Noè, et al.
0

We consider learning ancestral causal relationships in high dimensions. Our approach is driven by a supervised learning perspective, with discrete indicators of causal relationships treated as labels to be learned from available data. We focus on the setting in which some causal (ancestral) relationships are known (via background knowledge or experimental data) and put forward a general approach that scales to large problems. This is motivated by problems in human biology which are characterized by high dimensionality and potentially many latent variables. We present a case study involving interventional data from human cells with total dimension p ∼ 19,000. Performance is assessed empirically by testing model output against previously unseen interventional data. The proposed approach is highly effective and demonstrably scalable to the human genome-wide setting. We consider sensitivity to background knowledge and find that results are robust to nontrivial perturbations of the input information. We consider also the case, relevant to some applications, where the only prior information available concerns a small number of known ancestral relationships.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/29/2020

Learning Latent Causal Structures with a Redundant Input Neural Network

Most causal discovery algorithms find causal structure among a set of ob...
research
12/09/2022

Deep Learning of Causal Structures in High Dimensions

Recent years have seen rapid progress at the intersection between causal...
research
07/05/2023

Causal Discovery with Language Models as Imperfect Experts

Understanding the causal relationships that underlie a system is a funda...
research
06/29/2021

Learning latent causal graphs via mixture oracles

We study the problem of reconstructing a causal graphical model from dat...
research
11/24/2021

Causal Regularization Using Domain Priors

Neural networks leverage both causal and correlation-based relationships...
research
10/30/2017

Implicit Causal Models for Genome-wide Association Studies

Progress in probabilistic generative models has accelerated, developing ...
research
12/02/2014

Learning interpretable models of phenotypes from whole genome sequences with the Set Covering Machine

The increased affordability of whole genome sequencing has motivated its...

Please sign up or login with your details

Forgot password? Click here to reset