An R package for evaluation of pair-wise synteny conservation at the genome-wide scale. It takes a table of orthologs and genome annotation files formatted as BED to automatically infer significantly conserved linkage groups, and order them on an oxford grid.
# Before this package is submitted to CRAN,
# get the development version from GitHub using devtools :
# install.packages("devtools")
devtools::install_github("SamiLhll/macrosyntR",build_vignettes = TRUE)
# building the vignette makes the installation a bit longer but its mandatory so ou can access it by doing :
vignette("macrosyntR")
Check out the vignette for a comprehensive step-by-step tutorial illustrating how the package works using publicly available data, and how to customize the analysis.
To start drawing an oxford grid using this package, you’ll need three
files :
* 1 - A two columns table of orthologous pairs of genes between species
to compare (as generated by rbhxpress)
* 2 - A bed file listing the genomic coordinates and sequence names of
all the orthologs of the 1st species (with names matching the first
column of file 1)
* 3 - A bed file, same as the other, for the 2nd species. (with names
matching those of the first column of file 2)
To illustrate the results of the package we compare the publicly
available data from the lancelet Branchiostoma floridae (Simakov et
al. 2020) with the Siboglinidae Paraescarpia echinospica (Sun et al. 2021)
Once you have your pairs of orthologs, getting an ordered oxford grid
using this package is achieved as following :
library(macrosyntR)
# Load table of orthologs and integrate with genomic coordinates :
my_orthologs <- load_orthologs(orthologs_table = "DATA/Bflo_vs_Pech.tab",
sp1_bed = "DATA/Bfloridae.protein_products.bed",
sp2_bed = "DATA/Pechinospica.protein_products.bed")
# Draw an oxford grid :
p1 <- plot_oxford_grid(my_orthologs,
sp1_label = "B.floridae",
sp2_label = "P.echinospica")
p1
# Automatically reorder the oxford grid and color the detected clusters (communities):
p2 <- plot_oxford_grid(my_orthologs,
sp1_label = "B.floridae",
sp2_label = "P.echinospica",
reorder = TRUE,
color_by = "clust")
p2
# Plot the significant linkage groups :
my_macrosynteny <- compute_macrosynteny(my_orthologs)
p3 <- plot_macrosynteny(my_macrosynteny)
p3
# Call the reordering function, test significance and plot it :
my_orthologs_reordered <- reorder_macrosynteny(my_orthologs)
my_macrosynteny <- compute_macrosynteny(my_orthologs_reordered)
p4 <- plot_macrosynteny(my_macrosynteny)
p4
Need help, Identified a bug, or want to see other features
implemented ?
Feel free to open an issue here or send an email at :
elhilali.sami@gmail.com