Software

Dynamont

Dynamont – A Dynamic Programming Approach to Segment ONT Signals. Dynamont is a segmentation/resquiggling tool for ONT signals. Dynamont was tested on
RNA002
RNA004
DNA R10.4.1 5kHz (I applied the trained transition parameters from the RNA004 model to the DNA R10 models. These should be fine-tuned for the DNA models.)

Download

Download from GitHub

Magnipore

Magnipore is a tool written in python3 to analyze and pair-wise compare sequencing samples from Oxford Nanopore Technologies (ONT) sequencing.
Magnipore compares two ONT samples on a signal level to find differential signals between them in single base resolution. Such differences are caused by mutations or modifications. Magnipore classifies these differences and provides the user with a position-wise comparison.

Download

Download from GitHub

AnchoRNA: Identify conserved regions within coding sequences of viral genomes

AnchoRNA is a Python-based command line tool designed to identify conserved regions, or anchors, within coding sequences of viral genomes.

Download

Download from GitHub

EpiDope: Prediction of B-cell epitopes from amino acid sequences

Features

Fast genome wide search
Interactive graphical results
Docker support

Download

Download from GitHub

Reference

Collatz, Maximilian; Mock, Florian; Barth, Emanuel; Hölzer, Martin; Sachse, Konrad; Marz, Manja

EpiDope: A Deep Neural Network for linear B-cell epitope prediction Journal Article

In: Bioinformatics, vol. 37, no. 4, pp. 448–455, 2020.

Abstract | Links | BibTeX

VIDHOP: virus host prediction

VIDHOP is a fast and accurate deep learning approach for viral host prediction, which is based on the viral genome sequence only. VIDHOP allows highly accurate predictions while using only fractions (100–400 bp) of the viral genome sequences. VIDHOP also allows the user to train and use models for other viruses.

Features

From input fasta to prediction of host in seconds.

Download

Download from GitHub

Reference

Mock, Florian; Viehweger, Adrian; Barth, Emanuel; Marz, Manja

VIDHOP, viral host prediction with Deep Learning Journal Article

In: Bioinformatics, vol. 37, no. 3, pp. 318–325, 2020.

Abstract | Links | BibTeX

RNAflow: Simple RNA-Seq differential gene expression pipeline using Nextflow

RNA-Seq enables the identification and quantification of RNA molecules, often with the aim of detecting differentially expressed genes (DEGs). Although RNA-Seq evolved into a standard technique, there is no universal gold standard for these data’s computational analysis. On top of that, previous studies proved the irreproducibility of RNA-Seq studies.
RNAflow is a portable, scalable, and parallelizable Nextflow RNA-Seq pipeline to detect DEGs, which assures a high level of reproducibility. The pipeline automatically takes care of common pitfalls, such as ribosomal RNA removal and low abundance gene filtering. Apart from various visualizations for the DEG results, we incorporated downstream pathway analysis for common species as Homo sapiens and Mus musculus.

Download

Download from GitHub

Reference

Lataretu, Marie; Hölzer, Martin

RNAflow: An Effective and Simple RNA-Seq Differential Gene Expression Pipeline Using Nextflow Journal Article

In: Genes, vol. 11, no. 12, pp. 1487, 2020.

Abstract | Links | BibTeX

SIM: SilentMutations

SilentMutations (SIM) can analyze the effect of multiple point mutations on the secondary structures of two interacting viral RNAs. It simulates destructive and compensatory mutants of two key regions from a single-stranded RNA, which can then be utilized for the combinatorial in vitro analysis of RNA-RNA interactions.

Download

Download from GitHub

Reference

Desiro, Daniel; Hölzer, Martin; Ibrahim, Bashar; Marz, Manja

SilentMutations (SIM): a tool for analyzing long-range RNA-RNA interactions in viral genomes and structured RNAs Journal Article

In: Virus Res, vol. 260, pp. 135-141, 2018.

Abstract | Links | BibTeX

@article{Desiro:18,

title = {SilentMutations (SIM): a tool for analyzing long-range RNA-RNA interactions in viral genomes and structured RNAs},

author = {Daniel Desiro and Martin Hölzer and Bashar Ibrahim and Manja Marz},

url = {https://github.com/desiro/silentMutations},

doi = {10.1016/j.virusres.2018.11.005},

year  = {2018},

date = {2018-11-12},

urldate = {2018-11-12},

journal = {Virus Res},

volume = {260},

pages = {135-141},

abstract = {A single nucleotide change in the coding region can alter the amino acid sequence of a protein. In consequence, natural or artificial sequence changes in viral RNAs may have various effects not only on protein stability, function and structure but also on viral replication. In recent decades, several tools have been developed to predict the effect of mutations in structured RNAs such as viral genomes or non-coding RNAs. Some tools use multiple point mutations and also take coding regions into account. However, none of these tools was designed to specifically simulate the effect of mutations on viral long-range interactions. Here, we developed SilentMutations (SIM), an easy-to-use tool to analyze the effect of multiple point mutations on the secondary structures of two interacting viral RNAs. The tool can simulate disruptive and compensatory mutants of two interacting single-stranded RNAs. This allows a fast and accurate assessment of key regions potentially involved in functional long-range RNA-RNA interactions and will eventually help virologists and RNA-experts to design appropriate experiments. SIM only requires two interacting single-stranded RNA regions as input. The output is a plain text file containing the most promising mutants and a graphical representation of all interactions. We applied our tool on two experimentally validated influenza A virus and hepatitis C virus interactions and we were able to predict potential double mutants for in vitro validation experiments. The source code and documentation of SIM are freely available at github.com/desiro/silentMutations.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

GORAP: Genomewide ncRNA Annotation Pipeline

GORAP is a pipeline for automated non-coding RNA annotation based on BioPerl, Infernal, Blast, RNAmmer, tRNAscan, Bcheck, RAxML, CRT, Mafft, Samtools.

Features & Facts

Input: FASTA file(s)
Uses in-house filters (e.g. phylogeny based) and TPM/FPKM computation from BAM files
Offers RNome based phylogeny reconstruction
Root less installation for Linux, Unix
Requirements: internet, gcc, wget, Perl, make

Download

Easy–installer including all necessary software, libraries and up to date databases (Rfam, NCBI Taxonomy, Silva)
Source @GitHub

PoSeiDon: Positive Selection Detection and Recombination Analysis

PoSeiDon is an easy-to-use pipeline to detect significant positively selected sites and possible recombination events in an alignment of multiple coding sequences.

Features & Facts

Input: nucleotide coding sequences as one multiple FASTA file
assigns unique ID that can be used to access all data when calculations are finished

Download

GitHub page
PoSeiDon now runs w/ Nextflow and Docker: nextflow run hoelzer/poseidon --help

Reference

Hölzer, Martin; Marz, Manja

PoSeiDon: a Nextflow pipeline for the detection of evolutionary recombination events and positive selection Journal Article

In: Bioinformatics, vol. 37, no. 7, pp. 1018-1020, 2020.

Abstract | Links | BibTeX

PCAGO: Principal component analysis for RNA-Seq read counts

PCAGO is an interactive web service that helps you analyze your RNA-Seq read counts with principal component analysis (PCA) and clustering.

Features & Facts

read count normalization
download annotations and GO terms for your genes
tool to find gene variance cut-off for PCA

Download

Reference

Gerst, Ruman; Hölzer, Martin

PCAGO: An interactive web service to analyze RNA-Seq data with principal component analysis Journal Article

In: bioRxiv, pp. 433078, 2018.

Abstract | Links | BibTeX

LRIscan: Long range RNA-RNA interactions

LRIscan is no longer maintained.

LRIscan is a tool to predict conserved, genome-wide long range RNA-RNA interactions based on a multiple sequence alignment in only a few hours on an average computer.

Download

Reference

Fricke, Markus; Marz, Manja

Prediction of conserved long-range RNA-RNA interactions in full viral genomes Journal Article

In: Bioinformatics, vol. 32, no. 19, pp. 2928–2935, 2016.

Abstract | Links | BibTeX

VrAP: Viral Assembly Pipeline

VrAP is no longer maintained.

VrAP is a viral assembly pipeline based on the genome assembler SPAdes combined with an additional read correction and several filter steps. VrAP classifies the contigs to distinguish host from viral sequences by annotation and ORF density scores.

Features & Facts

new ORF density method to identify viruses without any sequence homology to known references
tested on real datasets generated with different sequencing technologies

Download

RNAgraphdist: Graph distance between to bases

RNAgraphdist is no longer maintained.

RNAgraphdist finds the shortest graph distance between to bases i and j.

Features & Facts

Handles thousands of input constraints
Plots all results with gnuplot
Optimized for multi-cores
Runtime complexity: O(n log n)

Download

References

Qin, Jing; Fricke, Markus; Marz, Manja; Stadler, Peter F; Backofen, Rolf

Graph-distance distribution of the Boltzmann ensemble of RNA secondary structures Journal Article

In: Algorithms Mol Biol, vol. 9, pp. 19, 2014.

Abstract | Links | BibTeX

@article{Qin:14,

title = {Graph-distance distribution of the Boltzmann ensemble of RNA secondary structures},

author = {Jing Qin and Markus Fricke and Manja Marz and Peter F Stadler and Rolf Backofen},

url = {http://www.rna.uni-jena.de/RNAgraphdist.html},

doi = {10.1186/1748-7188-9-19},

year  = {2014},

date = {2014-09-11},

urldate = {2014-09-11},

journal = {Algorithms Mol Biol},

volume = {9},

pages = {19},

abstract = {Large RNA molecules are often composed of multiple functional domains whose spatial arrangement strongly influences their function. Pre-mRNA splicing, for instance, relies on the spatial proximity of the splice junctions that can be separated by very long introns. Similar effects appear in the processing of RNA virus genomes. Albeit a crude measure, the distribution of spatial distances in thermodynamic equilibrium harbors useful information on the shape of the molecule that in turn can give insights into the interplay of its functional domains. Spatial distance can be approximated by the graph-distance in RNA secondary structure. We show here that the equilibrium distribution of graph-distances between a fixed pair of nucleotides can be computed in polynomial time by means of dynamic programming. While a naïve implementation would yield recursions with a very high time complexity of O(n (6) D (5)) for sequence length n and D distinct distance values, it is possible to reduce this to O(n (4)) for practical applications in which predominantly small distances are of of interest. Further reductions, however, seem to be difficult. Therefore, we introduced sampling approaches that are much easier to implement. They are also theoretically favorable for several real-life applications, in particular since these primarily concern long-range interactions in very large RNA molecules. The graph-distance distribution can be computed using a dynamic programming approach. Although a crude approximation of reality, our initial results indicate that the graph-distance can be related to the smFRET data. The additional file and the software of our paper are available from http://www.rna.uni-jena.de/RNAgraphdist.html.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

POMAGO: Multiple Genome Aligner

POMAGO is no longer maintained.

POMAGO is a multiple genome aligner designed for, but not limited to, bacterial genomes.

Features & Facts

Based on the whole set of all known bacterial orthologous genes and their syntenic information determined by Proteinortho

Download

References

Wieseke, Nicolas; Lechner, Marcus; Ludwig, Marcus; Marz, Manja

POMAGO: Multiple Genome-Wide Alignment Tool for Bacteria Proceedings Article

In: Cai, Zhipeng; Eulenstein, Oliver; Janies, Daniel; Schwartz, Daniel (Ed.): Proceedings of the 9th International Symposium on Bioinformatics Research and Applications (ISBRA 2013), Charlotte, NC, USA, May 20-22, 2013., pp. pp 249-260, Springer, 2013.

Abstract | Links | BibTeX

@inproceedings{Wieseke:13,

title = {POMAGO: Multiple Genome-Wide Alignment Tool for Bacteria},

author = {Nicolas Wieseke and Marcus Lechner and Marcus Ludwig and Manja Marz},

editor = {Zhipeng Cai and Oliver Eulenstein and Daniel Janies and Daniel Schwartz},

url = {http://www.rna.uni-jena.de/supplements/pomago},

doi = {10.1007/978-3-642-38036-5_25},

year  = {2013},

date = {2013-01-01},

urldate = {2013-01-01},

booktitle = {Proceedings of the 9th International Symposium on Bioinformatics Research and Applications (ISBRA 2013), Charlotte, NC, USA, May 20-22, 2013.},

volume = {7875},

number = {1},

pages = {pp 249-260},

publisher = {Springer},

series = {Lecture Notes in Computer Science},

abstract = {Multiple Genome-wide Alignments are a first crucial step to compare genomes. Gain and loss of genes, duplications and genomic rearrangements are challenging problems that aggravate with increasing phylogenetic distances. We describe a multiple genome-wide alignment tool for bacteria, called POMAGO, which is based on orthologous genes and their syntenic information determined by Proteinortho.This strategy enables POMAGO to efficiently define anchor points even across wide phylogenetic distances and outperform existing approaches in this field of application. The given set of orthologous genes is enhanced by several cleaning and completion steps, including the addition of previously undetected orthologous genes. Protein-coding genes are aligned on nucleotide and protein level, whereas intergenic regions are aligned on nucleotide level only. We tested and compared our program at three very different sets of bacteria that exhibit different degrees of phylogenetic distances: 1) 15 closely related, well examined and described E. coli species, 2) six more divergent Aquificales, as putative basal bacteria, and 3) a set of eight extreme divergent species, distributed among the whole phylogenetic tree of bacteria. POMAGO is written in a modular way which allows extending or even exchanging algorithms in different stages of the alignment process. Intergenic regions might for instance be aligned using an RNA secondary structure aware algorithm rather than to rely on sequence data alone. The software is freely available from 



},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

Proteinortho: Orthology detection tool

Proteinortho is no longer maintained.

Proteinortho is a tool to detect orthologous proteins across hundreds of species.

Features & Facts

Small memory footprint
Optimized for multi-core and cluster environments
Runtime complexity: O(n2)

Download

Download and Manual

References

Lechner, Marcus; Findeiss, Sven; Steiner, Lydia; Marz, Manja; Stadler, Peter F; Prohaska, Sonja J

Proteinortho: detection of (co-)orthologs in large-scale analysis Journal Article

In: BMC Bioinf, vol. 12, pp. 124, 2011.

Abstract | Links | BibTeX

Galculator: Nucleotide counter for fasta files

Galculator is no longer maintained.

Galculator is a nucleotide counter for fasta files that counts mononucleotide frequencies, dinucleotide frequencies, and gapped dinucleotide frequencies (XnY).

Features & Facts

Small constant memory footprint
Handles hundreds of petabytes if necessary
Runtime complexity: O(n)

Download

Download and Manual

Research » Software

Download

Download

Download

Features

Download

Reference

Features

Download

Reference

Download

Reference

Download

Reference

Features & Facts

Download

Features & Facts

Download

Reference

Features & Facts

Download

Reference

Download

Reference

Features & Facts

Download

Features & Facts

Download

References

Features & Facts

Download

References

Features & Facts

Download

References

Features & Facts

Download