Bioconductor Packages

糗世界

糗世界2站

motifStack Workshop for Bioc2020

trackViewer Workshop for BiocAsia2020

deepToolsDownstream

Downstream plots for deepTools.

HMMtBroadPeak

The HMMtBroadPeak package is designed to call very broad peaks for data such as lamina-associated domains (LADs), nucleolus-associated domains (NADs), or other topologically associating domains.

uShuffleR

uShuffle is a useful tool for shuffling biological sequences published by Minghui Jiang et.al. This package is R wrapper for the tool.

HMMtBroadPeak

The HMMtBroadPeak package is designed to call very broad peaks for data such as lamina-associated domains (LADs), nucleolus-associated domains (NADs), or other topologically associating domains.

Basic and Advanced boostcamp

» Bioinformatics Basics/Advanced boostcamp: Sequence Analysis Bootcamp

This is a series of lectures on Unix, DCC cluster, R language and bioinformatics tools for molecular biologist. The main goal is to introduce the audience to the basics of bioinformatics with a focus on deep sequencing data analysis using a mixed data of RNA-seq and ChIP-seq as an example. Basic computer skills are required to follow the lectures. No programming skills are required. The boost camp stats with a quick introduction to how to read the analysis results. Next, genomic file formats will be introduced. Then the analysis of RNA-seq and ChIP-seq will be demonstrated. This is followed by the visualizing and downstream analyzing of the data. The last part of this boost camp is a brief introduction to R programming language.

ribosomeProfilingQC

» Ribosome Profiling Quality Control

Ribo-Seq (also named ribosome profiling or footprinting) measures translatome (unlike RNA-Seq, which sequences the transcriptome) by direct quantification of the ribosome-protected fragments (RPFs). This package provides the tools for quality assessment of ribosome profiling. In addition, it can preprocess Ribo-Seq data for subsequent differential analysis.

motifStack

» a bioconductor package to plot stacked logos for single or multiple DNA, RNA and amino acid sequence

motifStack is a package that is able to draw amino acid sequence as easy as to draw DNA/RNA sequence. motifStack provides the flexibility for users to select the font type and symbol colors. motifStack is designed for graphical representation of multiple motifs.

This package is involved in the pipeline of finding candidate binding sites for known transcription factors via sequence matching.

Ou J, Wolfe SA, Brodsky MH, Zhu LJ (2018). “motifStack for the analysis of transcription factor binding site evolution.” Nature Methods, 15, 8-9. doi: 10.1038/nmeth.4555.

ATACseqQC

» a bioconductor package for ATAC-seq Quality Control

ATAC-seq, an assay for Transposase-Accessible Chromatin using sequencing, is a rapid and sensitive method for chromatin accessibility analysis. It was developed as an alternative method to MNase-seq, FAIRE-seq and DNAse-seq. Comparing to the other methods, ATAC-seq requires less amount of the biological samples and time to process. In the process of analyzing several ATAC-seq dataset produced in our labs, we learned some of the unique aspects of the quality assessment for ATAC-seq data. To help users to quickly assess whether their ATAC-seq experiment is successful, we developed ATACseqQC package partially following the guideline published in Nature Method 2013 (Greenleaf et al.), including diagnostic plot of fragment size distribution, proportion of mitochondria reads, nucleosome positioning pattern, and CTCF or other Transcript Factor footprints.

Ou J, Liu H, Yu J, Kelliher MA, Castilla LH, Lawson ND, Zhu LJ (2018). “ATACseqQC: a Bioconductor package for post-alignment quality assessment of ATAC-seq data.” BMC Genomics, 19(1), 169. ISSN 1471-2164, doi: 10.1186/s12864-018-4559-3.

trackViewer

» A bioconductor package with minimalist design for plotting elegant track layers

Visualize mapped reads along with annotation as track layers for NGS dataset such as ChIP-seq, RNA-seq, miRNA-seq, DNA-seq.

Ou, J., Zhu, L.J. trackViewer: a Bioconductor package for interactive and integrative visualization of multi-omics data. Nat Methods 16, 453–454 (2019). doi: 10.1038/s41592-019-0430-y

ChIPpeakAnno

» a bioconductor package for annotating peaks identified from ChIP-seq, Chip-chip or any high-throuput experiments

Batch annotation of the peaks identified from either ChIP-seq or ChIP-chip experiments. The package includes functions to retrieve the sequences around the peak, obtain enriched Gene Ontology (GO) terms, find the nearest gene, exon, miRNA or custom features such as most conserved elements and other transcription factor binding sites supplied by users. This package leverages the biomaRt, IRanges, Biostrings, BSgenome, GO.db, multtest and stat packages

Zhu LJ*, Gazin C, Lawson ND, Pages H, Lin SM, Lapointe DS and Green MR. (2010) [* denotes corresponding author] ChIPpeakAnno: a Bioconductor package to annotate ChIP-seq and ChIP-chip data. BMC Bioinformatics 2010, 11:237.

GeneNetworkBuilder

» Build Regulatory Network from ChIP-chip/ChIP-seq and Expression Data

GeneNetworkBuilder (GNB) is an appliation for discovering the transcriptional regulatory network for a given transcription factor (TF) of Caenorhabditis elegans, Homo sapiens and so on, using ChIP-chip (ChIP-seq) combined with gene expression profile from either RNA-seq or expression microarray experiments.

dagLogo

» a bioconductor package to find and visualize signficantly enriched or depleted amino acid motif or amino acid group patterns in proteom dataset

In addition to implement iceLogo in R to visualize differential amino acid sequence pattern, dagLogo can also test and visualize significant amino acid group patterns by classifying the amino acids into groups according to charge, chemistry and hydrophobicity and etc.

Ou, Jianhong, Liu, Haibo, Nirala, K N, Stukalov, Alexey, Acharya, Usha, Green, R M, Zhu, Julie L (2020). “dagLogo: An R/Bioconductor package for identifying and visualizing differential amino acid group usage in proteomics data.” Plos one, 15(11), e0242030. >doi: 10.1371/journal.pone.0242030.

InPAS

» Identification of Novel alternative PolyAdenylation Sites (PAS)

Alternative polyadenylation (APA) is one of the important post-transcriptional regulation mechanisms which occurs in most human genes. InPAS facilitates the discovery of novel APA sites from RNAseq data. It leverages cleanUpdTSeq to fine tune identified APA sites.

This site is maintained by Jianhong Ou.