Elucidation of disease etiology by trans-layer omics analysis

Shirai, Yuya; Okada, Yukinori

doi:10.1186/s41232-021-00155-w

Review
Open access
Published: 08 February 2021

Elucidation of disease etiology by trans-layer omics analysis

Yuya Shirai^1,2 &
Yukinori Okada^1,3

Inflammation and Regeneration volume 41, Article number: 6 (2021) Cite this article

4357 Accesses
1 Citations
1 Altmetric
Metrics details

Abstract

To date, genome-wide association studies (GWASs) have successfully identified thousands of associations between genetic polymorphisms and human traits. However, the pathways between the associated genotype and phenotype are often poorly understood. The transcriptome, proteome, and metabolome, the omics, are positioned along the pathway and can provide useful information to translate from genotype to phenotype. This review shows useful data resources for connecting each omics and describes how they are combined into a cohesive analysis. Quantitative trait loci (QTL) are useful information for connecting the genome and other omics. QTL represent how much genetic variants have effects on other omics and give us clues to how GWAS risk SNPs affect biological mechanisms. Integration of each omics provides a robust analytical framework for estimating disease causality, discovering drug targets, and identifying disease-associated tissues. Technological advances and the rise of consortia and biobanks have facilitated the analyses of unprecedented data, improving both the quality and quantity of research. Proficient management of these valuable datasets allows discovering novel insights into the genetic background and etiology of complex human diseases and contributing to personalized medicine.

Background

Genome-wide association studies (GWASs) are study designs for evaluating associations between genetic variants and phenotypic traits across the genomes, revealing the genetic impacts on a variety of phenotypes since its first success in 2002 [1,2,3]. In recent years, the rise of international consortia and biobanks has enabled GWAS’s application on a scale of more than 1 million samples, which can identify a large number of genetic risk variants. However, little is known about how GWAS results should be translated into disease etiology and novel drug discovery. To correctly interpret GWAS results, it is useful to utilize additional information, such as transcriptome, proteome, and metabolome, which are the components of the central dogma (Fig. 1a). Assessment of the translation from genotype to phenotype, including the role of omics, will enable us to understand how genetic diversity impacts our health (trans-layer omics analysis).

Ideally, researchers prefer to use identical samples with data across each omics. However, few datasets contain all the omics data in individuals. While it would be ideal for defining a new sample and collecting all of the omics, genetic and phenotypic data, that undertaking has enormous cost and time required. While individual raw data is often restricted in its use due to ethical issues, de-identified or anonymized analysis results are available in many biological databases. We can perform comprehensive analyses for various phenotypes by integrating the accumulated data with our own data. In particular, the quantitative trait loci (QTL), or loci associated with a variation of a quantitative trait (Fig. 1b), play an essential role in trans-layer omics analysis.

This review introduces several useful resources that can be applied to trans-layer omics analysis and describes how the data can be utilized, mainly focusing on QTL.

Transcriptome

More than 90% of the risk variants identified by GWASs have been found in non-coding regions [4, 5], and as a result, rarely alter the amino acid sequence. Several studies have reported that many GWAS risk variants in the non-coding region are enriched in regulatory regions involved in gene transcription, including enhancers and promoters [6, 7]. These findings motivated the investigation of genetic variants, called expression quantitative trait loci (eQTL), that affect gene expression levels. Researchers have identified numerous eQTL, some of which are available in several curated databases (Table 1). The effects of eQTL are tissue- and cell-specific [8, 18], and it is important to use eQTL data derived from the appropriate tissues and cells in relation to the target phenotype.

Table 1 Useful resources and databases related to QTL

Full size table

The Genotype-Tissue Expression (GTEx) Project has built a large-scale database of organ-specific eQTL data. The latest GTEx database is version 8, deployed in 2020, which includes curated data on as many as 15,201 RNA sequencing samples from 49 tissues of 838 donors [9]. In addition to eQTL information, the GTEx also contains genetic variants, splicing quantitative trait loci (sQTL), affecting gene splicing. The sQTL data allow us to develop hypotheses about genetic polymorphisms in splicing, leading to diversity in the RNA transcribed from a single gene.

Several eQTL analyses for individual blood cell types have been successfully performed due to easy access to the sample and the cell diversity [10, 19,20,21]. In these studies, cell types were clustered by sorting in the laboratory or estimation in silico.

The DICE (database of immune cell expression, expression quantitative trait loci (eQTL) and epigenomics) project was established to define the transcriptional and epigenomic landscape of many human immune cell types in relation to genetic variation [10]. As the first report from this project, Schmiedel et al. performed the eQTL analysis for 13 primary immune cell types (three innate immune cell types, four naive adaptive immune cell types, and six CD4⁺ T memory cell types) and two activated cell types isolated from 106 leukapheresis samples provided by 91 healthy subjects in the San Diego area. They identified a total of 12,254 genes with cis-eQTL and a large fraction (41%) of these genes showed a strong cis-association with genotype only in a single cell type. Furthermore, they confirmed several cases in which the cell types with eQTL corresponding to GWAS risk SNPs related to the well-known pathogenesis.

As for the eQTL analysis of non-Europeans, Ishigaki et al. conducted the eQTL analysis on five immune cell types (CD4+ T cells, CD8+ T cells, B cells, natural killer cells, and monocytes) and unfractionated peripheral blood from 105 healthy Japanese volunteers [21]. In this study, gene expression levels were predicted from individual genotype data based on the developed eQTL dataset and public epigenetic data. Subsequently, the association analysis between the estimated gene expression levels and 15 diseases state was performed. Finally, the cell-specific pathway activity was predicted by integrating the direction of eQTL effects. This framework applied to rheumatoid arthritis (RA) revealed that activation of the TNF pathway in CD4+ T cells plays a vital role in RA etiology.

Association analyses for genome-wide gene expression, as was done in this study, are called transcriptome-wide association studies (TWASs). TWASs use penalized regression techniques, such as LASSO, Ridge, or Elastic net, which incorporate eQTL reference data as training data and evaluate associations between predicted gene expression levels and a target trait. TWAS can also be performed with only GWAS summary statistics and external eQTL reference data [22, 23]. However, we should carefully interpret TWAS results because non-causal genes co-regulated with causal genes are likely significant, resulting in TWAS results’ bias. Fine-mapping of causal gene sets deals with this problem by incorporating linkage disequilibrium data and provides less unbiased results [24, 25].

Most transcriptome studies focus on coding genes because many non-coding genes have unknown functions, and the analysis results can be challenging to interpret. Sakaue et al. developed a method for estimating GWAS-target miRNAs and GWAS-related tissues by integrating GWAS summary statistics and miRNA expression data [26]. Analyses incorporating the transcriptome are expected to continue developing with increasing resources and maturating analytic techniques for non-coding gene data.

Single-cell analysis

Single-cell RNA sequencing makes it possible to evaluate gene expression at an unprecedentedly fine resolution in individual cell types and identify rare populations without assumptions. The previous classification of cells by surface markers could only identify existing cell types and had difficulty identifying heterogeneity in captured cell types [27]. Single-cell RNA sequencing is currently evolving with respect to technology and sample size. Monique et al. conducted eQTL analysis using single-cell RNA sequencing to identify eQTL for rare populations and reported variants that alter gene co-expression [28]. The eQTL analysis for single-cell RNA sequencing can analyze individual cell types and freely definable clusters such as cell lineages, providing a more flexible analysis framework. More and more cell-type-specific and cell-cluster-specific eQTL will be identified in the near future [29].

In general, eQTL analyses that handle RNA data have a limit in that they capture a snapshot at a single point in time, not reflective of transcriptome fluctuations. Therefore, many eQTL are conditional, and some can only be identified through cell activation or cell differentiation [30, 31]. Davenport et al. focused on transcriptome changes associated with drug administration [32]. They reported how much the eQTL were impacted by IL6 antibody treatment in SLE patients, resulting in expression changes. This study showed that the eQTL analysis using RNA-seq data at multiple times (at 0, 12, and 24 weeks of anti-IL6 drug administration) increased the number of identifiable eQTL compared to the analysis from one point in time. This study revealed that several eQTL effects were enhanced at a high total IFN level or by IL6 antibody administration, and each of the eQTL was enriched in ISRE motif and IRF4 motif. These findings suggested that these transcription factors (TFs) binding motifs may be key regulatory mediators of environmental stimuli and potential therapeutic targets [33].

Mass cytometry is another single-cell modality and provides different cell-type-specific profiles. Mass cytometry measures a limited number (~ 40) of pre-selected markers, but these markers are supported by decades of experimental evidence that they are useful for defining cellular heterogeneity [34]. Zhang et al. defined stromal and immune cell populations overabundant in RA joint synovial tissues by integrating single-cell RNA sequencing and mass cytometry data [35]. They found that several specific immune cells, categorized by genetic and proteomic profiles (e.g., THY1⁺HLA-DRA^hi sublining fibroblasts), were increased in RA synovium. This study showed that the integration of multiple experimental modalities helps us select trait-specific cells by increasing the individual cells’ information. QTL analyses of these cells central to the etiology should provide more evident and profound insights into genetic impacts on various diseases.

Proteome

The proteome comprises the entire protein complement produced in an organism or system. Sun et al. investigated the associations between genetic variants and 2994 proteins in 3301 Europeans [15]. They identified 1927 genetic loci, which altered the plasma protein amount between 1478 proteins and 764 genomic regions, known as protein quantitative trait loci (pQTL). Of note, only 10–20% of the previously reported cis eQTL were cis pQTL, while cis pQTL were significantly enriched in eQTL for the corresponding gene. Comparison between eQTL and pQTL studies may be influenced by differences in sample size, tissues, and technology platforms used in each analysis. Nonetheless, this study suggested that genetic effects on plasma protein abundance are often, but not exclusively, driven by regulation of mRNA.

Plasma proteins play essential roles through biological processes and represent a significant resource for drug targets [36]. Mendelian randomization (MR) is a useful method for exploring diseases-causing proteins, which can be therapeutic targets. Instead of allocating interventions or non-intervention as in randomized controlled trials (RCT), MR allocates subjects according to risk variants of the causative traits [37]. MR is an attractive method for conducting RCT-like research despite being feasible with existing data. MR studies can be performed if the two traits are in different cohorts. Thus, the analysis platform with access to various GWAS results has been currently established [38]. Zheng et al. conducted a large-scale MR analysis of 1002 proteins on 225 traits with five extensive pQTL datasets [39]. They found 111 causal relationships between 65 proteins and 52 disease-related phenotypes. When these relationships were queried against a curated drug database, previously defined associations between approved drugs and their target proteins were more likely to be identified. This finding supported MR as a useful tool to search for drug targets.

Metabolome

The metabolome consists of small molecules that are intermediates or products of metabolism, ranging from peptides and lipids to drugs and pollutants. Several studies that tested the associations between genetic variants and serum metabolites have reported hundreds of loci changing serum metabolite amount, or metabolite quantitative trait loci (mQTL), which were often mapped to genes encoding for enzymes or transporters [40,41,42]. The kidneys play an important role in the regulation of blood metabolite by controlling the amount of urinary metabolites [43]. As a result of excretion, urinary metabolites are more diverse than serum metabolites. Therefore, they can reflect individual differences that cannot be captured in blood metabolites. This fact inspired the investigation of associations between urinary metabolites and genetic variants [44, 45]. Schlosser et al. performed the GWAS for the urinary concentrations of 1172 metabolites among 1627 patients with reduced kidney function [44]. They identified 240 urine metabolite-mQTL associations and found that the loci included 90 unique genes. These genes’ expressions were seen in organs involved in the absorption and metabolism, such as the kidney, liver, and small intestine.

Furthermore, they used the colocalization method to confirm whether a GWAS risk variant was also responsible for mQTL signals in the locus. GWAS and QTL signals can overlap for three reasons: two independent causal variants in linkage disequilibrium (linkage), a single causal variant affecting the GWAS trait via gene expression modulation (causality), or a single causal variant affecting both traits independently (pleiotropy) [46]. Colocalization helps us distinguish causality and pleiotropy from linkage, which is essential for identifying targets that drive GWAS risk loci.

Epigenome

The epigenome describes a biological phenomenon where chemical compounds can modify or mark the genome to affect gene expressions. DNA methylation at CpG dinucleotides plays an important role in gene regulation by altering DNA affinity with TF or chromatin-binding proteins. DNA methylation can occur for various reasons: normal development such as genomic imprinting, aging, environmental factors, or genetic factors. In these, genetic factors can be inherited and diversify the innate methylation status among individuals. Gaunt et al. evaluated the genetic influences on DNA methylation, or methylation quantitative trait loci (meQTL), in the human blood at five different life stages: children at birth, childhood, adolescence, and their mothers during pregnancy and at middle age [16]. They identified 30,000 significant associations at each time point and revealed that the genetic heritability was highly stable at about 20% throughout the life stages. They also showed that meQTL likely overlapped with eQTL, as reported in GTEx, and enriched in GWAS risk loci for complex diseases, such as Alzheimer’s disease and schizophrenia. Because DNA methylation is generally involved in gene regulation, genetic variants that affect methylation status can also affect gene expression levels. Their result supported that some GWAS risk variants may impact our health by altering gene expression via methylation.

While QTL are beneficial information in evaluating the effects of GWAS risk SNPs, most QTL analyses do not cover all tissues and probably have insufficient power in some tissues due to a lack of adequate sample size. It is useful to consider what functional annotations target variants are located in (Table 2). ENCODE [47], ROADMAP Epigenomics [49], and BLUEPRINT [50] have accumulated considerable resources, mapping regulatory annotations in the genome by profiling chromatin functions, including DNase hypersensitivity sites, several types of histone markers, and the binding sites of chromatin-related proteins in many cells and tissues. ENCODE, which aims to catalog all functional elements of humans and mice, was updated in 2020 by expanding the target cell types and tissues and adding new annotations, including RNA-binding protein regions and chromatin loops [48]. ENCODE, version 3, integrates a vast amount of accumulated data into novel annotations for 926,535 candidate cis-regulatory elements (cCRE), covering 7.9% of the human genome and 339,815 cCRE covering 3.4% of the mouse genome. The registered data accumulated in several large consortia including ENCODE is tremendous; a tool for efficiently searching them has also been released [52].

Table 2 Useful resources and tools related to epigenetic data

Full size table

The FANTOM Consortium has used a unique technique called cap analysis of gene expression (CAGE) to identify promoters and enhancers across hundreds of cells and tissues [53]. Hirabayashi et al. developed NET-CAGE (native elongating transcript), which enables the detection of TSSs of nascent RNAs and quantifies true transcriptional activities of promoters and enhancers at high nucleotide resolution in diverse cell types as well as frozen cells and tissues [57].

Linkage disequilibrium score regression (LDSC regression) is an effective tool for combining GWAS summary statistics with these genome annotations. In the polygenic traits, the χ² association statistic for a given SNP in GWASs includes the effects of all SNPs tagged by this SNP [58]. The LD score is calculated by the sum of the linkage disequilibrium r² measures of the target variant and the surrounding variants (e.g., variants in a window size of 1 cM around the target variant). The regression of the χ² statistics by the LD score in a genome-wide manner provides an estimate of heritability. The modified form, or stratified LDSC regression, partitions SNP heritability by functional genomic annotations and tests whether the GWAS statistics is enriched in the annotations [59]. For stratified LDSC regression, the authors have developed 10 cell-group-specific annotations and 220 cell-type-specific annotations for histone modifications (H3K4me1, H3K4me3, H3K9ac, and H3K27ac) created from ROADMAP Epigenomics data. These annotation sets are useful to implement GWAS enrichment analyses in various tissues [60].

Ishigaki et al. estimated TF enrichment in various diseases using the annotation of TF binding sites defined by 2868 publicly available chromatin immunoprecipitation sequencing datasets for 410 unique TFs [61]. They identified 378 significant enrichments across nine diseases (e.g., NF-κB for immune-related diseases) and revealed that TF clusters characterized based on LD score included TF components which showed similar disease enrichment. LDSC regression provides a flexible framework because it can integrate GWAS summary statistics and customized genome annotations. In the future, functional annotations will continue their significant growth and will be available for genome annotation projects. Enrichment analyses combining these valuable annotations and GWAS data should contribute to the elucidation of complex human traits.

Conclusions

The available omics data has improved in both quantity and quality. State-of-the-art technology, such as whole-genome sequencing, long lead sequencing, mass cytometry, and single-cell RNA sequencing, should illuminate currently unreachable areas. Furthermore, trans-layer omics analysis empowers the information from the individual omics. If the relationship between the genome and each omics is revealed, various living organisms’ phenomena could be predicted from genome data. The tools introduced in this review are not limited to one type of omics pair but can be applied to various omics data combinations (e.g., colocalization between GWAS and sQTL). The trans-layer omics analyses using these sophisticated methods and a vast amount of data provide novel insights into the genetic background, etiology of complex diseases, and drug discovery, which should contribute to the implementation of personalized medicine.

Availability of data and materials

Not applicable

Abbreviations

GWAS:: Genome-wide association study
QTL:: Quantitative trait loci
eQTL:: Expression quantitative trait loci
GTEx:: Genotype-tissue expression
DICE:: Database of immune cell expression, expression of quantitative trait loci and epigenomics
RA:: Rheumatoid arthritis
TWAS:: Transcriptome-wide association study
TF:: Transcription factor
pQTL:: Protein quantitative trait loci
MR:: Mendelian randomization
RCT:: Randomized controlled trials
mQTL:: Metabolite quantitative trait loci
meQTL:: Methylation quantitative trait loci
cCRE:: Candidate cis-regulatory elements
CAGE:: Cap analysis of gene expression
LDSC:: Linkage disequilibrium score

References

Ozaki K, Ohnishi Y, Iida A, Sekine A, Yamada R, Tsunoda T, et al. Functional SNPs in the lymphotoxin-α gene that are associated with susceptibility to myocardial infarction. Nat Genet. 2002;32:650–4.
Article CAS PubMed Google Scholar
Akiyama M, Okada Y, Kanai M, Takahashi A, Momozawa Y, Ikeda M, et al. Genome-wide association study identifies 112 new loci for body mass index in the Japanese population. Nat Genet. 2017;49:1458–67.
Article CAS PubMed Google Scholar
Okada Y, Wu D, Trynka G, Raj T, Terao C, Ikari K, et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature. 2014;506:376–81.
Article CAS PubMed Google Scholar
Edwards SL, Beesley J, French JD, Dunning AM. Beyond GWASs: illuminating the dark road from association to function. Am J Hum Genet. 2013;93:779–97.
Article CAS PubMed PubMed Central Google Scholar
Ricaño-Ponce I, Wijmenga C. Mapping of immune-mediated disease genes. Annu Rev Genomics Hum Genet. 2013;14:325–53.
Article PubMed CAS Google Scholar
Maurano MT, Humbert R, Rynes E, Thurman RE, Haugen E, Wang H, et al. Systematic localization of common disease-associated variation in regulatory DNA. Science. 2012;337:1190–5.
Article CAS PubMed PubMed Central Google Scholar
Pasquali L, Gaulton KJ, Rodríguez-Seguí SA, Mularoni L, Miguel-Escalada I, Akerman İ, et al. Pancreatic islet enhancer clusters enriched in type 2 diabetes risk-associated variants. Nat Genet. 2014;46:136–43.
Article CAS PubMed PubMed Central Google Scholar
GTEx Consortium. Genetic effects on gene expression across human tissues. Nature. 2017;550:204–13.
Article PubMed Central Google Scholar
GTEx Consortium. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science. 2020;369:1318–30.
Article CAS Google Scholar
Schmiedel BJ, Singh D, Madrigal A, Valdovino-Gonzalez AG, White BM, Zapardiel-Gonzalo J, et al. Impact of genetic polymorphisms on human immune cell gene expression. Cell. 2018;175:1701–1715.e16.
Article CAS PubMed PubMed Central Google Scholar
Mostafavi S, Ortiz-Lopez A, Bogue MA, Hattori K, Pop C, Koller D, et al. Variation and Genetic control of gene expression in primary immunocytes across inbred mouse strains. J Immunol. 2014;193:4485–96.
Article CAS PubMed Google Scholar
Võsa U, Claringbould A, Westra H-J, Bonder MJ, Deelen P, Zeng B, et al. Unraveling the polygenic architecture of complex traits using blood eQTL metaanalysis. bioRxiv. 2018. https://doi.org/10.1101/447367.
Wang D, Liu S, Warrell J, Won H, Shi X, Navarro FCP, et al. Comprehensive functional genomic resource and integrative model for the human brain. Science. 2018;362:eaat8464. https://doi.org/10.1126/science.aat8464.
Article CAS PubMed PubMed Central Google Scholar
Kerimov N, Hayhurst JD, Manning JR, Walter P, Kolberg L, Peikova K, et al. eQTL catalogue: a compendium of uniformly processed human gene expression and splicing QTLs. bioRxiv. 2020. https://doi.org/10.1101/2020.01.29.924266.
Sun BB, Maranville JC, Peters JE, Stacey D, Staley JR, Blackshaw J, et al. Genomic atlas of the human plasma proteome. Nature. 2018;558:73–9.
Article CAS PubMed PubMed Central Google Scholar
Gaunt TR, Shihab HA, Hemani G, Min JL, Woodward G, Lyttleton O, et al. Systematic identification of genetic influences on methylation across the human life course. Genome Biol. 2016;17:61.
Article PubMed PubMed Central CAS Google Scholar
Zheng Z, Huang D, Wang J, Zhao K, Zhou Y, Guo Z, et al. QTLbase: an integrative resource for quantitative trait loci across multiple human molecular phenotypes. Nucleic Acids Res. 2020;48:D983–91.
Article CAS PubMed Google Scholar
Brown CD, Mangravite LM, Engelhardt BE. Integrative modeling of eQTLs and Cis-regulatory elements suggests mechanisms underlying cell type specificity of eQTLs. PLoS Genet. 2013;9:e1003649.
Article CAS PubMed PubMed Central Google Scholar
Sumitomo S, Nagafuchi Y, Tsuchida Y, Tsuchiya H, Ota M, Ishigaki K, et al. Transcriptome analysis of peripheral blood from patients with rheumatoid arthritis: a systematic review. Inflamm Regen. 2018;38:21.
Article CAS PubMed PubMed Central Google Scholar
Westra H-J, Arends D, Esko T, Peters MJ, Schurmann C, Schramm K, et al. Cell specific eQTL analysis without sorting cells. PLoS Genet. 2015;11:e1005223.
Article PubMed PubMed Central CAS Google Scholar
Ishigaki K, Kochi Y, Suzuki A, Tsuchida Y, Tsuchiya H, Sumitomo S, et al. Polygenic burdens on cell-specific pathways underlie the risk of rheumatoid arthritis. Nat Genet. 2017;49:1120–5.
Article CAS PubMed Google Scholar
Gusev A, Ko A, Shi H, Bhatia G, Chung W, Penninx BWJH, et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat Genet. 2016;48:245–52.
Article CAS PubMed PubMed Central Google Scholar
Barbeira AN, Dickinson SP, Bonazzola R, Zheng J, Wheeler HE, Torres JM, et al. Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat Commun. 2018;9:1825.
Article PubMed PubMed Central CAS Google Scholar
Mancuso N, Freund MK, Johnson R, Shi H, Kichaev G, Gusev A, et al. Probabilistic fine-mapping of transcriptome-wide association studies. Nat Genet. 2019;51:675–82.
Article CAS PubMed PubMed Central Google Scholar
Wainberg M, Sinnott-Armstrong N, Mancuso N, Barbeira AN, Knowles DA, Golan D, et al. Opportunities and challenges for transcriptome-wide association studies. Nat Genet. 2019;51:592–9.
Article CAS PubMed Google Scholar
Sakaue S, Hirata J, Maeda Y, Kawakami E, Nii T, Kishikawa T, et al. Integration of genetics and miRNA – target gene network identified disease biology implicated in tissue specificity. Nucleic Acids Res. 2018;46:11898–909.
Article CAS PubMed Google Scholar
Villani A-C, Satija R, Reynolds G, Sarkizova S, Shekhar K, Fletcher J, et al. Single-cell RNA-seq reveals new types of human blood dendritic cells, monocytes, and progenitors. Science. 2017;356:eaah4573.
Article PubMed PubMed Central CAS Google Scholar
van der Wijst MGP, Brugge H, de Vries DH, Deelen P, Swertz MA, Franke L. Single-cell RNA sequencing identifies celltype-specific cis-eQTLs and co-expression QTLs. Nat Genet. 2018;50:493–7.
Article PubMed PubMed Central CAS Google Scholar
van der Wijst MGP, de Vries D, Groot H, Trynka G, Hon C, Bonder M, et al. The single-cell eQTLGen consortium. Elife. 2020;9:e52155.
Article PubMed PubMed Central Google Scholar
Ye CJ, Feng T, Kwon H-K, Raj T, Wilson MT, Asinovski N, et al. Intersection of population variation and autoimmunity genetics in human T cell activation. Science. 2014;345:1254665.
Article PubMed PubMed Central CAS Google Scholar
Cuomo ASE, Seaton DD, McCarthy DJ, Martinez I, Bonder MJ, Garcia-Bernardo J, et al. Single-cell RNA-sequencing of differentiating iPS cells reveals dynamic genetic effects on gene expression. Nat Commun. 2020;11:810.
Article CAS PubMed PubMed Central Google Scholar
Davenport EE, Amariuta T, Gutierrez-Arcelus M, Slowikowski K, Westra H-J, Luo Y, et al. Discovering in vivo cytokine-eQTL interactions from a lupus clinical trial. Genome Biol. 2018;19:168.
Article PubMed PubMed Central CAS Google Scholar
Fujio K, Takeshima Y, Nakano M, Iwasaki Y. Review: transcriptome and trans-omics analysis of systemic lupus erythematosus. Inflamm Regen. 2020;40:11.
Article CAS PubMed PubMed Central Google Scholar
Bjornson ZB, Nolan GP, Fantl WJ. Single-cell mass cytometry for analysis of immune system functional states. Curr Opin Immunol. 2013;25:484–94.
Article CAS PubMed Google Scholar
Zhang F, Wei K, Slowikowski K, Fonseka CY, Rao DA, Kelly S, et al. Defining inflammatory cell states in rheumatoid arthritis joint synovial tissues by integrating single-cell transcriptomics and mass cytometry. Nat Immunol. 2019;20:928–42.
Article PubMed PubMed Central CAS Google Scholar
Santos R, Ursu O, Gaulton A, Bento AP, Donadi RS, Bologa CG, et al. A comprehensive map of molecular drug targets. Nat Rev Drug Discov. 2017;16:19–34.
Article CAS PubMed Google Scholar
Masuda T, Ogawa K, Kamatani Y, Murakami Y, Kimura T, Okada Y. A Mendelian randomization study identified obesity as a causal risk factor of uterine endometrial cancer in Japanese. Cancer Sci. 2020. https://doi.org/10.1111/cas.14667.
Hemani G, Zheng J, Elsworth B, Wade KH, Haberland V, Baird D, et al. The MR-Base platform supports systematic causal inference across the human phenome. Elife. 2018;7:e34408.
Article PubMed PubMed Central Google Scholar
Zheng J, Haberland V, Baird D, Walker V, Haycock PC, Hurle MR, et al. Phenome-wide Mendelian randomization mapping the influence of the plasma proteome on complex diseases. Nat Genet. 2020;52(10):1122–31.
Article PubMed CAS PubMed Central Google Scholar
Gallois A, Mefford J, Ko A, Vaysse A, Julienne H, Ala-Korpela M, et al. A comprehensive study of metabolite genetics reveals strong pleiotropy and heterogeneity across time and context. Nat Commun. 2019;10:4788.
Article PubMed PubMed Central CAS Google Scholar
Kettunen J, Demirkan A, Würtz P, Draisma HHM, Haller T, Rawal R, et al. Genome-wide study for circulating metabolites identifies 62 loci and reveals novel systemic effects of LPA. Nat Commun. 2016;7:11122.
Article CAS PubMed PubMed Central Google Scholar
Shin S-Y, Fauman EB, Petersen A-K, Krumsiek J, Santos R, Huang J, et al. An atlas of genetic influences on human blood metabolites. Nat Genet. 2014;46:543–50.
Article CAS PubMed PubMed Central Google Scholar
Köttgen A, Raffler J, Sekula P, Kastenmüller G. Genome-wide association studies of metabolite concentrations (mGWAS): relevance for nephrology. Semin Nephrol. 2018;38:151–74.
Article PubMed CAS Google Scholar
Schlosser P, Li Y, Sekula P, Raffler J, Grundner-Culemann F, Pietzner M, et al. Genetic studies of urinary metabolites illuminate mechanisms of detoxification and excretion in humans. Nat Genet. 2020;52:167–76.
Article CAS PubMed PubMed Central Google Scholar
Suhre K, Wallaschofski H, Raffler J, Friedrich N, Haring R, Michael K, et al. A genome-wide association study of metabolic traits in human urine. Nat Genet. 2011;43:565–9.
Article CAS PubMed Google Scholar
Cano-Gamez E, Trynka G. From GWAS to function: using functional genomics to identify the mechanisms underlying complex diseases. Front Genet. 2020;11:424.
Article CAS PubMed PubMed Central Google Scholar
ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74.
Article CAS Google Scholar
ENCODE Project Consortium. Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature. 2020;583:699–710.
Article CAS Google Scholar
Kundaje A, Meuleman W, Ernst J, Bilenky M, Yen A, Heravi-Moussavi A, et al. Integrative analysis of 111 reference human epigenomes. Nature. 2015;518:317–30.
Article CAS PubMed PubMed Central Google Scholar
Stunnenberg HG, Hirst M, Abrignani S, Adams D, de Almeida M, Altucci L, et al. The International Human Epigenome Consortium: a blueprint for scientific collaboration and discovery. Cell. 2016;167:1145–9.
Article CAS PubMed Google Scholar
Deutsches Epigenom Programm. Welcome to DEEP. 2012. http://www.deutsches-epigenom-programm.de/.
Google Scholar
Albrecht F, List M, Bock C, Lengauer T. DeepBlue epigenomic data server: programmatic data retrieval and analysis of epigenome region sets. Nucleic Acids Res. 2016;44:W581–6.
Article CAS PubMed PubMed Central Google Scholar
The FANTOM Consortium and the RIKEN PMI and CLST (DGT). A promoter-level mammalian expression atlas. Nature. 2014;507:462–70.
Article CAS Google Scholar
Oki S, Ohta T, Shioi G, Hatanaka H, Ogasawara O, Okuda Y, et al. ChIP-Atlas: a data-mining suite powered by full integration of public ChIP-seq data. EMBO Rep. 2018;19:e46255.
Article PubMed PubMed Central CAS Google Scholar
Ward LD, Kellis M. HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants. Nucleic Acids Res. 2012;40:D930–4.
Article CAS PubMed Google Scholar
Boyle AP, Hong EL, Hariharan M, Cheng Y, Schaub MA, Kasowski M, et al. Annotation of functional variation in personal genomes using RegulomeDB. Genome Res. 2012;22:1790–7.
Article CAS PubMed PubMed Central Google Scholar
Hirabayashi S, Bhagat S, Matsuki Y, Takegami Y, Uehata T, Kanemaru A, et al. NET-CAGE characterizes the dynamics and topology of human transcribed cis-regulatory elements. Nat Genet. 2019;51:1369–79.
Article CAS PubMed Google Scholar
Bulik-Sullivan B, Loh PR, Finucane HK, Ripke S, Yang J, Patterson N, et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet. 2015;47:291–5.
Article CAS PubMed PubMed Central Google Scholar
Finucane HK, Bulik-Sullivan B, Gusev A, Trynka G, Reshef Y, Loh P-R, et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat Genet. 2015;47:1228–35.
Article CAS PubMed PubMed Central Google Scholar
Kanai M, Akiyama M, Takahashi A, Matoba N, Momozawa Y, Ikeda M, et al. Genetic analysis of quantitative traits in the Japanese population links cell types to complex human diseases. Nat Genet. 2018;50:390–400.
Article CAS PubMed Google Scholar
Ishigaki K, Akiyama M, Kanai M, Takahashi A, Kawakami E, Sugishita H, et al. Large-scale genome-wide association study in a Japanese population identifies novel susceptibility loci across different diseases. Nat Genet. 2020;52:669–79.
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

Not applicable

Funding

Y.O. was supported by the Japan Society for the Promotion of Science (JSPS) KAKENHI (19H01021, 20 K21834), and AMED (JP20km0405211, JP20ek0109413, JP20ek0410075, JP20gm4010006, and JP20km0405217), Takeda Science Foundation, and Bioinformatics Initiative of Osaka University Graduate School of Medicine, Osaka University.

Author information

Authors and Affiliations

Department of Statistical Genetics, Osaka University Graduate School of Medicine, 2-2 Yamadaoka, Suita, Osaka, 565-0871, Japan
Yuya Shirai & Yukinori Okada
Department of Respiratory Medicine and Clinical Immunology, Osaka University Graduate School of Medicine, Suita, 565-0871, Japan
Yuya Shirai
Laboratory of Statistical Immunology, Immunology Frontier Research Center (WPI-IFReC), Osaka University, Suita, 565-0871, Japan
Yukinori Okada

Authors

Yuya Shirai
View author publications
You can also search for this author in PubMed Google Scholar
Yukinori Okada
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.S. wrote the manuscripts. Y.O. supervised the review. The authors read and approved the final manuscript.

Corresponding author

Correspondence to Yukinori Okada.

Ethics declarations

Ethics approval and consent to participate

Not applicable

Consent for publication

Not applicable

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Shirai, Y., Okada, Y. Elucidation of disease etiology by trans-layer omics analysis. Inflamm Regener 41, 6 (2021). https://doi.org/10.1186/s41232-021-00155-w

Download citation

Received: 07 December 2020
Accepted: 22 January 2021
Published: 08 February 2021
DOI: https://doi.org/10.1186/s41232-021-00155-w

Elucidation of disease etiology by trans-layer omics analysis

Abstract

Background

Transcriptome

Single-cell analysis

Proteome

Metabolome

Epigenome

Conclusions

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Inflammation and Regeneration

Contact us