Unique contribution of enhancer-driven and master-regulator genes to autoimmune disease revealed using functionally informed SNP-to-gene linking strategies

Kushal K. Dey, Steven Gazal, Bryce van de Geijn, Samuel Sungil Kim, Joseph Nasser, Jesse M. Engreitz, Alkes L. Price.
bioRxiv.   
Abstract
Gene regulation is known to play a fundamental role in human disease, but mechanisms of regulation vary greatly across genes. Here, we explore the contributions to disease of two types of genes: genes whose regulation is driven by enhancer regions as opposed to promoter regions (Enhancer-driven) and genes that regulate many other genes in trans (Master-regulator). We link these genes to SNPs using a comprehensive set of SNP-to-gene (S2G) strategies and apply stratified LD score regression to the resulting SNP annotations to draw three main conclusions about 11 autoimmune diseases and blood cell traits (average N =306K). First, Enhancer-driven genes defined in blood using functional genomics data (e.g. ATAC-seq, RNA-seq, PC-HiC) are uniquely informative for autoimmune disease heritability, after conditioning on a broad set of regulatory annotations from the baseline-LD model. Second, Master-regulator genes defined using trans-eQTL in blood are also uniquely informative for autoimmune disease heritability. Third, integrating Enhancer-driven and Master-regulator gene sets with protein-protein interaction (PPI) network information magnified their unique disease signal. The resulting PPI-enhancer gene score produced >2x stronger conditional signal (maximum standardized SNP annotation effect size (τ*) = 2.0 (s.e. 0.3) vs. 0.91 (s.e. 0.21)), and >2x stronger gene-level enrichment for approved autoimmune disease drug targets (5.3x vs. 2.1x), as compared to the recently proposed Enhancer Domain Score (EDS). In each case, using functionally informed S2G strategies to link genes to SNPs that may regulate them produced much stronger disease signals (4.1x-13x larger τ* values) than conventional window-based S2G strategies. We conclude that Enhancer-driven and Master-regulator genes are uniquely informative for human disease, and that PPI networks and S2G strategies magnify these signals.