epiomics: Analysis of Omics Data in Observational Studies

R-CMD-check

epiomics provides a collection of fast and flexible functions for the analysis of omics data in observational studies.

Installation

You can install epiomics from GitHub with:

# install.packages("devtools")
devtools::install_github("JAGoodrich/epiomics")

library(epiomics)

Omics wide association study (owas)

The basis of many omics analysis in epidemiology begin with an omics wide association study. The function owas() implements an omics wide association study with the option of using the ’omics data as either the dependent variable (i.e., for performing an exposure –> ’omics analysis) or using the ’omics as the independent variable (i.e., for performing an ’omics –> outcome analysis). owas() provides the option to adjust for covariates, and allows for either continuous or dichotomous outcomes. owas() can also handle multiple variables of interest (ie, multiple exposures or multiple traits).

Start with simulating data:

# Load Example Data
data("example_data")

# Get names of omics
colnames_omic_fts <- colnames(example_data)[grep("feature_",
                                               colnames(example_data))][1:10]

# Get names of traits
trait_nms = c("disease1", "disease2")

Run owas with continuous exposure as the variable of interest

owas(df = example_data, 
     var = "exposure1", 
     omics = colnames_omic_fts, 
     covars = c("age", "sex"), 
     var_exposure_or_outcome = "exposure", 
     family = "gaussian")
     
# Equivalent: 
owas(df = example_data, 
     var = "exposure1", 
     omics = colnames_omic_fts, 
     covars = c("age", "sex"), 
     var_exposure_or_outcome = "exposure")  

Run owas with dichotomous outcome as the variable of interest

owas(df = example_data, 
     var = "disease1", 
     omics = colnames_omic_fts, 
     covars = c("age", "sex"), 
     var_exposure_or_outcome = "outcome", 
     family = "binomial")

Run owas with multiple continuous exposures as the variable of interest

# Get names of exposures
expnms = c("exposure1", "exposure2", "exposure3")

owas(df = example_data, 
     var = expnms, 
     omics = colnames_omic_fts, 
     covars = c("age", "sex"), 
     var_exposure_or_outcome = "exposure", 
     family = "gaussian")

Meet in the Middle

The function meet_in_middle() conducts meet in the middle screening between an exposure, omics, and an outcome, as described by Cadiou et al., 2021. This function provides the option to adjust for covariates, and allows for either continuous or dichotomous outcomes. Examples are based on the simulated data created above.

Meet in the middle with a dichotomous outcome

res <- meet_in_middle(df = example_data,
                      exposure = "exposure1", 
                      outcome = "disease1", 
                      omics = colnames_omic_fts,
                      covars = c("age", "sex"), 
                      outcome_family = "binomial")
res

Meet in the middle with a continuous outcome

res <- meet_in_middle(df = example_data,
                      exposure = "exposure1", 
                      outcome = "weight", 
                      omics = colnames_omic_fts,
                      covars = c("age", "sex"), 
                      outcome_family = "gaussian")

Meet in the middle with a continuous outcome and no covariates

res <- meet_in_middle(df = example_data,
                      exposure = "exposure1", 
                      outcome = "weight", 
                      omics = colnames_omic_fts,
                      outcome_family = "gaussian")