Projects

List of Projects

Identifying DNA methylation biomarkers using nanopore sequencing

Carcinogenesis is associated with changes in DNA methylation patterns, especially in CpG islands, which are regions of the human genome with an increased density of CpG dinucleotides. Methylated cytosine loci (cytosines methylated to 5-methylcytosine), an epigenetic change that already occurs in the early stages of cancer, can be detected by methylation-specific PCR on bisulfite-treated DNA and used for clinical screening tests. In cooperation with oncgnostics GmbH, we are using nanopore sequencing to identify potential methylation-specific cancer biomarker regions in the human genome, which can serve as targets for the development of methylation-specific screening tests. In the scope of this project, the tool diffMONT has been developed, which predicts genomic regions suitable for methylation-specific PCR.
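To make the idea of "regions suitable for methylation-specific PCR" more concrete, here is a minimal sketch of a window scan over per-CpG methylation fractions. It is not the diffMONT algorithm. It only illustrates the kind of filters named in the diffMONT abstract listed under the related publications (a fixed primer length, an unmethylated control condition, a minimum number of differentially methylated cytosines). The function name, the thresholds and the example values are all hypothetical.

```python
from typing import List, Tuple

# One CpG site: (genomic position, methylation fraction in tumor, fraction in control)
CpG = Tuple[int, float, float]


def candidate_primer_windows(
    cpgs: List[CpG],
    window: int = 25,          # hypothetical primer length in bp
    min_cpgs: int = 3,         # hypothetical minimum number of differential CpGs
    min_diff: float = 0.5,     # hypothetical minimum tumor-minus-control difference
    max_control: float = 0.1,  # control has to stay (nearly) unmethylated
) -> List[Tuple[int, int, int]]:
    """Return (start, end, n_differential_cpgs) for every window passing all filters."""
    cpgs = sorted(cpgs)
    hits = []
    for i, (start, _, _) in enumerate(cpgs):
        end = start + window - 1
        inside = [c for c in cpgs[i:] if c[0] <= end]
        # Reject the window if any CpG inside it is methylated in the control.
        if any(ctrl > max_control for _, _, ctrl in inside):
            continue
        n_diff = sum(1 for _, tum, ctrl in inside if tum - ctrl >= min_diff)
        if n_diff >= min_cpgs:
            hits.append((start, end, n_diff))
    return hits


# Invented example data: one cluster hypermethylated only in tumor,
# one cluster methylated in both conditions.
example = [
    (100, 0.90, 0.02), (108, 0.85, 0.05), (119, 0.80, 0.00),
    (300, 0.90, 0.60), (305, 0.95, 0.70), (312, 0.90, 0.65),
]
windows = candidate_primer_windows(example)
```

In this toy input only the first cluster passes, because the second cluster is also methylated in the control and would therefore not discriminate between conditions in a PCR assay.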

People involved: Daria Meyer

Collaborations: oncgnostics GmbH

Related publications:

Meyer, Daria; Göttsch, Winfried; Spangenberg, Jannes; Stieber, Bettina; Krautwurst, Sebastian; Hölzer, Martin; Brandt, Christian; Linde, Jörg; zu Siederdissen, Christian Höner; Srivastava, Akash; Zarkovic, Milena; Wollny, Damian; Marz, Manja

Unlocking the full potential of nanopore sequencing: tips, tricks, and advanced data analysis techniques Journal Article

In: Nucleic Acids Research, vol. 54, iss. 3, no. gkag023, 2026.


Meyer, Daria; Barth, Emanuel; Wiehle, Laura; Marz, Manja

diffMONT: predicting methylation-specific PCR biomarkers based on nanopore sequencing data for clinical application Journal Article

In: Bioinformatics, vol. 42, iss. 2, no. btag039, 2026.


@article{nokey_102,

title = {diffMONT: predicting methylation-specific PCR biomarkers based on nanopore sequencing data for clinical application},

author = {Daria Meyer and Emanuel Barth and Laura Wiehle and Manja Marz},

doi = {10.1093/bioinformatics/btag039},

year  = {2026},

date = {2026-02-01},

urldate = {2026-02-01},

journal = {Bioinformatics},

volume = {42},

number = {btag039},

issue = {2},

abstract = {Motivation

DNA methylation serves as a key biomarker in clinical diagnostics, especially in cancer detection. With methylation-specific PCR (MSP), a widely used approach, patient samples can be screened fast and efficiently for differential methylation. During MSP, methylated regions are selectively amplified with specific primers. With nanopore sequencing, knowledge about DNA methylation is generated during direct DNA sequencing without needing pretreatment of the DNA. Multiple methods, mainly developed for whole-genome bisulfite sequencing (WGBS) data, exist to predict differentially methylated regions (DMRs) in the genome. However, the predicted DMRs are often very large and not sufficiently discriminating to generate meaningful results in MSP, creating a gap between theoretical cancer marker research and practical application, as no tool currently provides methylation difference predictions tailored for PCR-based diagnostics.



Results

Here, we present diffMONT, a tool that predicts differentially methylated regions specifically suited for MSP primer design, enabling rapid translation into practical applications. diffMONT takes into account (i) the specific length of primer and amplicon regions, (ii) the fact that one condition should be unmethylated, and (iii) a minimal required amount of differentially methylated cytosines within the primer regions. We compared the results of diffMONT to metilene and DSS based on a publicly available nanopore sequencing dataset and show that the regions predicted by diffMONT are more specific toward hypermethylated regions. diffMONT accelerates the design of methylation-specific diagnostic assays, bridging the gap between theoretical research and clinical application.



Availability and implementation

The source code for diffMONT, an open-source Python-based tool, is available at https://github.com/rnajena/diffMONT/, with an archived release under https://zenodo.org/records/17641031.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Meyer, Daria; Hennig, Anne; Hums, Anna-Bawany; Guntinas-Lichius, Orlando; Schmitz, Martina; Marz, Manja

Nanopore sequencing-derived methylation biomarker prediction for methylation-specific PCR in patients with head and neck squamous cell carcinoma Journal Article

In: Clinical Epigenetics, vol. 17, 2025.


Quantifying Liver Perfusion

The liver is crucial for whole-body metabolic homeostasis, handling lipid and carbohydrate metabolism as well as detoxification, all of which are coordinated by its spatial organization and perfusion. With the rise in liver tumors and metastases, advanced surgical procedures are required that allow removal of up to 70% of the liver volume but lead to an increased risk of postoperative liver failure.
In cooperation with both clinicians and modellers, our aim is to understand and describe transcriptomic changes after these extended partial hepatectomy surgeries.
We analyzed data from rats that underwent portal vein ligation, a procedure performed before partial hepatectomy during which diseased liver lobes are ligated with the expectation that the non-ligated liver lobes increase in size. We sequenced tissue extracted at different time points (2 d, 5 d) from different parts of the liver (right median liver lobe, left median liver lobe) and compared gene expression between the conditions as well as the resulting enriched pathways.

People involved: Daria Meyer, Emanuel Barth, Manja Marz

Collaborations: Bruno Christ, Uta Dahmen, Michael Tautenhahn, Matthias König

Grants: DFG FOR 5151 QuaLiPerF / MA5082/15-1

Related publications:

Meyer, Daria; Kosacka, Joanna; von Bergen, Martin; Christ, Bruno; Marz, Manja

Data report on gene expression after hepatic portal vein ligation (PVL) in rats Journal Article

In: Front Genet, vol. 15, pp. 1421955, 2024.


Christ, Bruno; Collatz, Maximilian; Dahmen, Uta; Herrmann, Karl-Heinz; Höpfl, Sebastian; König, Matthias; Lambers, Lena; Marz, Manja; Meyer, Daria; Radde, Nicole; Reichenbach, Jürgen R.; Ricken, Tim; Tautenhahn, Hans-Michael

Hepatectomy-Induced Alterations in Hepatic Perfusion and Function - Toward Multi-Scale Computational Modeling for a Better Prediction of Post-hepatectomy Liver Function Journal Article

In: Front Physiol, vol. 12, pp. 733868, 2021.


@article{Christ2021,

title = {Hepatectomy-Induced Alterations in Hepatic Perfusion and Function - Toward Multi-Scale Computational Modeling for a Better Prediction of Post-hepatectomy Liver Function},

author = {Bruno Christ and Maximilian Collatz and Uta Dahmen and Karl-Heinz Herrmann and Sebastian Höpfl and Matthias König and Lena Lambers and Manja Marz and Daria Meyer and Nicole Radde and Jürgen R. Reichenbach and Tim Ricken and Hans-Michael Tautenhahn},

doi = {10.3389/fphys.2021.733868},

year  = {2021},

date = {2021-11-18},

urldate = {2021-11-18},

journal = {Front Physiol},

volume = {12},

pages = {733868},

abstract = {Liver resection causes marked perfusion alterations in the liver remnant both on the organ scale (vascular anatomy) and on the microscale (sinusoidal blood flow on tissue level). These changes in perfusion affect hepatic functions via direct alterations in blood supply and drainage, followed by indirect changes of biomechanical tissue properties and cellular function. Changes in blood flow impose compression, tension and shear forces on the liver tissue. These forces are perceived by mechanosensors on parenchymal and non-parenchymal cells of the liver and regulate cell-cell and cell-matrix interactions as well as cellular signaling and metabolism. These interactions are key players in tissue growth and remodeling, a prerequisite to restore tissue function after PHx. Their dysregulation is associated with metabolic impairment of the liver eventually leading to liver failure, a serious post-hepatectomy complication with high morbidity and mortality. Though certain links are known, the overall functional change after liver surgery is not understood due to complex feedback loops, non-linearities, spatial heterogeneities and different time-scales of events. Computational modeling is a unique approach to gain a better understanding of complex biomedical systems. This approach allows (i) integration of heterogeneous data and knowledge on multiple scales into a consistent view of how perfusion is related to hepatic function; (ii) testing and generating hypotheses based on predictive models, which must be validated experimentally and clinically. In the long term, computational modeling will (iii) support surgical planning by predicting surgery-induced perfusion perturbations and their functional (metabolic) consequences; and thereby (iv) allow minimizing surgical risks for the individual patient. Here, we review the alterations of hepatic perfusion, biomechanical properties and function associated with hepatectomy. 
Specifically, we provide an overview over the clinical problem, preoperative diagnostics, functional imaging approaches, experimental approaches in animal models, mechanoperception in the liver and impact on cellular metabolism, omics approaches with a focus on transcriptomics, data integration and uncertainty analysis, and computational modeling on multiple scales. Finally, we provide a perspective on how multi-scale computational models, which couple perfusion changes to hepatic function, could become part of clinical workflows to predict and optimize patient outcome after complex liver surgery.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}


VirJenDB: the virus database based in Jena

The NFDI4Microbiota consortium is a DFG-funded project involving 10 German institutions, comprising research groups with a focus on microbiology and research data management. NFDI4Microbiota aims to be the central hub in Germany for supporting the microbiology community with FAIR and open infrastructure, access to data, analysis services, data/metadata standards, and training. VirJenDB is a service and use case of NFDI4Microbiota, funded since 2021 by DFG grant number NFDI 28/1 (DFG project number 460129525) at the Friedrich Schiller University in Jena, Germany. The website has been live since 10 February 2024 and is currently available as beta version v0.1.

The VirJenDB team aims to provide a data analysis platform for researchers to find, access, curate, download and analyze sequences and (meta)data from all viruses. Virus sequences and metadata are ingested from repositories (INSDC) and secondary databases and knowledge resources (BV-BRC, NCBI Virus, ViralZone, ICTV), curated, harmonized and integrated into a metadata model. The website provides search, browse, download and summary functionality for the VJDB dataset. The VJDB is developed on the de.NBI cloud as an OpenStack project and uses the NFDI service Aruna Object Storage for sequence storage as well as the NFDI4Microbiota ClowM management system for workflows. Upcoming plans include the integration and visualization of sequence alignments and metagenomic sequences.

People involved: Noriko Cassman

Grants: NFDI4Microbiota

Related publications:

Ritsch, Muriel; Cassman, Noriko A.; Saghaei, Shahram; Marz, Manja

Navigating the Landscape: A Comprehensive Review of Current Virus Databases Journal Article

In: Viruses, vol. 15, iss. 9, no. 1834, 2023, ISSN: 1999-4915.


@article{nokey_43,

title = {Navigating the Landscape: A Comprehensive Review of Current Virus Databases},

author = {Muriel Ritsch and Noriko A. Cassman and Shahram Saghaei and Manja Marz},

doi = {10.3390/v15091834},

issn = {1999-4915},

year  = {2023},

date = {2023-08-29},

journal = {Viruses},

volume = {15},

number = {1834},

issue = {9},

abstract = {Viruses are abundant and diverse entities that have important roles in public health, ecology, and agriculture. The identification and surveillance of viruses rely on an understanding of their genome organization, sequences, and replication strategy. Despite technological advancements in sequencing methods, our current understanding of virus diversity remains incomplete, highlighting the need to explore undiscovered viruses. Virus databases play a crucial role in providing access to sequences, annotations and other metadata, and analysis tools for studying viruses. However, there has not been a comprehensive review of virus databases in the last five years. This study aimed to fill this gap by identifying 24 active virus databases and included an extensive evaluation of their content, functionality and compliance with the FAIR principles. In this study, we thoroughly assessed the search capabilities of five database catalogs, which serve as comprehensive repositories housing a diverse array of databases and offering essential metadata. Moreover, we conducted a comprehensive review of different types of errors, encompassing taxonomy, names, missing information, sequences, sequence orientation, and chimeric sequences, with the intention of empowering users to effectively tackle these challenges. We expect this review to aid users in selecting suitable virus databases and other resources, and to help databases in error management and improve their adherence to the FAIR principles. The databases listed here represent the current knowledge of viruses and will help aid users find databases of interest based on content, functionality, and scope. The use of virus databases is integral to gaining new insights into the biology, evolution, and transmission of viruses, and developing new strategies to manage virus outbreaks and preserve global health.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Signal Segmentation in Oxford Nanopore Technologies Data Using HMM Models

The goal of this project is to accurately align measured Oxford Nanopore Technologies (ONT) signals to nucleotide sequences by leveraging Hidden Markov Models (HMMs). The ONT sequencing platform generates raw signal data that needs to be segmented and mapped to the corresponding nucleotide sequences for further biological interpretation.

To achieve this, I use the Baum-Welch algorithm, a standard expectation-maximization technique for training HMMs that relies on the forward-backward algorithm to compute state posteriors. It iteratively adjusts the model parameters to increase the likelihood of the observed signal data. By aligning the raw ONT signals to the nucleotide sequences, this project aims to improve the accuracy and reliability of ONT-based sequencing analysis.
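As an illustration of the training principle, the following is a minimal pure-Python sketch of one Baum-Welch update for a small HMM with discrete emissions. It is not the project's implementation. Real ONT signals are continuous and would call for continuous emission densities, sequence-dependent states and log-space or scaled arithmetic for long reads. The two states, the three signal bins and all starting parameters are assumptions chosen for the example.

```python
from typing import List, Tuple

Matrix = List[List[float]]


def forward(obs: List[int], pi: List[float], A: Matrix, B: Matrix) -> Matrix:
    """alpha[t][s] = P(o_1..o_t, state_t = s)."""
    n = len(pi)
    alpha = [[pi[s] * B[s][obs[0]] for s in range(n)]]
    for o in obs[1:]:
        prev = alpha[-1]
        alpha.append([B[s][o] * sum(prev[r] * A[r][s] for r in range(n))
                      for s in range(n)])
    return alpha


def backward(obs: List[int], A: Matrix, B: Matrix) -> Matrix:
    """beta[t][s] = P(o_{t+1}..o_T | state_t = s)."""
    n = len(A)
    beta = [[1.0] * n]
    for o in reversed(obs[1:]):
        nxt = beta[0]
        beta.insert(0, [sum(A[s][r] * B[r][o] * nxt[r] for r in range(n))
                        for s in range(n)])
    return beta


def baum_welch_step(obs: List[int], pi: List[float], A: Matrix, B: Matrix
                    ) -> Tuple[List[float], Matrix, Matrix, float]:
    """One EM update; also returns the likelihood under the input parameters."""
    n, m, T = len(pi), len(B[0]), len(obs)
    alpha, beta = forward(obs, pi, A, B), backward(obs, A, B)
    likelihood = sum(alpha[-1])
    # gamma[t][s]: posterior probability of being in state s at time t
    gamma = [[alpha[t][s] * beta[t][s] / likelihood for s in range(n)]
             for t in range(T)]
    # xi[t][r][s]: posterior probability of the transition r -> s at time t
    xi = [[[alpha[t][r] * A[r][s] * B[s][obs[t + 1]] * beta[t + 1][s] / likelihood
            for s in range(n)] for r in range(n)] for t in range(T - 1)]
    new_pi = gamma[0][:]
    new_A = [[sum(xi[t][r][s] for t in range(T - 1)) /
              sum(gamma[t][r] for t in range(T - 1))
              for s in range(n)] for r in range(n)]
    new_B = [[sum(gamma[t][s] for t in range(T) if obs[t] == k) /
              sum(gamma[t][s] for t in range(T))
              for k in range(m)] for s in range(n)]
    return new_pi, new_A, new_B, likelihood


# Invented toy signal, discretized into three bins (0 = low, 1 = mid, 2 = high).
obs = [0, 0, 1, 0, 2, 2, 1, 2, 2, 0, 0, 0]
pi = [0.5, 0.5]
A = [[0.7, 0.3], [0.3, 0.7]]
B = [[0.5, 0.3, 0.2], [0.2, 0.3, 0.5]]

likelihoods = []
for _ in range(10):
    pi, A, B, lik = baum_welch_step(obs, pi, A, B)
    likelihoods.append(lik)
```

Each update can only keep or increase the likelihood of the observations, which is the property that makes the procedure usable for fitting a segmentation model to measured signal.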

The successful implementation of this approach will provide a robust framework for decoding ONT signal data, paving the way for more precise genomic studies and applications.

People involved: Jannes Spangenberg, Manja Marz

Past people involved: Christian Hoener zu Siederdissen

Detection of RNA Modifications in Oxford Nanopore Technologies Data Using Neural Networks

This project focuses on the detection of RNA modifications within Oxford Nanopore Technologies (ONT) signal data using advanced neural network models. RNA modifications play a crucial role in various biological processes, and their detection is essential for understanding gene expression regulation and other cellular mechanisms.

ONT sequencing generates complex signal data that contains information about both the nucleotide sequence and potential modifications. The challenge lies in accurately identifying these modifications amidst the noise and variability inherent in the raw signal data. In this project, I employ deep learning techniques, specifically neural networks, to analyze and classify ONT signals to detect RNA modifications.

The neural network models are trained on labeled datasets, where modified and unmodified RNA sequences are known. By learning the subtle differences in the signal patterns, the models aim to distinguish between modified and unmodified regions with high accuracy. This project has the potential to significantly enhance the understanding of RNA biology by providing a powerful tool for the detection of RNA modifications in sequencing data.
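The supervised setup described above can be sketched with the smallest possible network, a single sigmoid unit (equivalent to logistic regression), trained on labeled examples. This is only an illustration of learning from labeled modified and unmodified sites, not the models used in the project. The two per-site features, the synthetic data and the hyperparameters are all invented for the example.

```python
import math
import random
from typing import List, Sequence, Tuple


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def predict(x: Sequence[float], w: Sequence[float], b: float) -> float:
    """Probability that a site with feature vector x is modified."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)


def train(samples: List[Tuple[float, float]], labels: List[int],
          epochs: int = 1000, lr: float = 0.5) -> Tuple[List[float], float]:
    """Full-batch gradient descent on the cross-entropy loss."""
    w = [0.0] * len(samples[0])
    b = 0.0
    n = len(samples)
    for _ in range(epochs):
        grad_w = [0.0] * len(w)
        grad_b = 0.0
        for x, y in zip(samples, labels):
            err = predict(x, w, b) - y  # derivative of the loss w.r.t. the logit
            for j, xj in enumerate(x):
                grad_w[j] += err * xj
            grad_b += err
        w = [wi - lr * g / n for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


# Synthetic stand-in for labeled training data: two hypothetical signal
# features per site, drawn from different distributions for each class.
rng = random.Random(0)
unmodified = [(rng.gauss(0.0, 0.2), rng.gauss(1.0, 0.2)) for _ in range(100)]
modified = [(rng.gauss(1.0, 0.2), rng.gauss(1.5, 0.2)) for _ in range(100)]
samples = unmodified + modified
labels = [0] * 100 + [1] * 100

w, b = train(samples, labels)
accuracy = sum((predict(x, w, b) > 0.5) == bool(y)
               for x, y in zip(samples, labels)) / len(samples)
```

A practical model would use deeper architectures operating on the raw signal and would be evaluated on held-out reads rather than on its own training data.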

People involved: Jannes Spangenberg, Manja Marz

Past people involved: Christian Hoener zu Siederdissen

Related publications:

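Meyer, Daria; Göttsch, Winfried; Spangenberg, Jannes; Stieber, Bettina; Krautwurst, Sebastian; Hölzer, Martin; Brandt, Christian; Linde, Jörg; zu Siederdissen, Christian Höner; Srivastava, Akash; Zarkovic, Milena; Wollny, Damian; Marz, Manja

Unlocking the full potential of nanopore sequencing: tips, tricks, and advanced data analysis techniques Journal Article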

In: Nucleic Acids Research, vol. 54, iss. 3, no. gkag023, 2026.


Spangenberg, Jannes; zu Siederdissen, Christian Höner; Goettsch, Winfried; Köhler, Lennart; Luke, Liz Maria; Papenfort, Kai; Marz, Manja

Dynamont: A comprehensive cross-species comparison of ONT segmentation tools Journal Article

In: GigaScience, vol. 15, no. giag005, 2026.


@article{nokey_103,

title = {Dynamont: A comprehensive cross-species comparison of ONT segmentation tools},

author = {Jannes Spangenberg and Christian Höner zu Siederdissen and Winfried Goettsch and Lennart Köhler and Liz Maria Luke and Kai Papenfort and Manja Marz},

doi = {10.1093/gigascience/giag005},

year  = {2026},

date = {2026-01-19},

journal = {GigaScience},

volume = {15},

number = {giag005},

abstract = {Background

Oxford Nanopore Technologies (ONT) sequencing enables direct, long-read sequencing of DNA and RNA, preserving nucleotide modifications. During basecalling, deep neural networks translate raw nanopore signals into nucleotide sequences, internally segmenting the signal to align it with the corresponding bases. This is a challenging task due to uneven motor protein rotation, signal variability, low-quality reads, and the presence of nucleotide modifications. However, the signal to nucleotide assignment is critical for novel downstream signal analysis. Existing tools, such as Tombo Resquiggle, f5c Eventalign, f5c Resquiggle, and Uncalled4, operate after basecalling and rely on event-based segmentation and mapping approaches that often fail to align low-quality or modified reads and lack confidence estimates for segmentation accuracy.



Results

Here, we present a large-scale comparative study in which 5 segmentation tools, including our novel tool Dynamont, are applied to 16 ONT-sequenced datasets spanning different kingdoms of life. Overall, we segmented 160,000 reads and evaluated the tools’ performance on a combination of 12 signal and downstream assembly metrics. Our study is accompanied by a comprehensive and extensible supplement that summarizes all datasets, execution instructions, and evaluation results. We score the segmentation results using an aggregated metric score, created from all our analyzed metrics.



Conclusions

No tool delivered the best results for all datasets. We recommend a careful choice and normalization of evaluation metrics to select the best segmentation tool as a critical step in the process of ONT signal segmentation. Across nearly all RNA datasets, Dynamont outperforms other segmentation tools in terms of aggregated metric scores. For DNA datasets, however, the performance is more variable, with mixed results observed across tools.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Spangenberg, Jannes; Mündnich, Stefan; Busch, Anne; Pastore, Stefan; Wierczeiko, Anna; Goettsch, Winfried; Dietrich, Vincent; Pryszcz, Leszek P.; Cruciani, Sonia; Novoa, Eva Maria; Joshi, Kandarp; Perera, Ranjan; Giorgio, Salvatore Di; Arrubarrena, Paola; Tellioglu, Irem; Poon, Chi-Lam; Wan, Yuk Kei; Göke, Jonathan; Hildebrandt, Andreas; Dieterich, Christoph; Helm, Mark; Marz, Manja; Gerber, Susanne; Alagna, Nicolo

The RMaP challenge of predicting RNA modifications by nanopore sequencing Journal Article

In: Communications Chemistry, vol. 8, iss. 1, 2024.


@article{nokey_79,

title = {The RMaP challenge of predicting RNA modifications by nanopore sequencing},

author = {Jannes Spangenberg and Stefan Mündnich and Anne Busch and Stefan Pastore and Anna Wierczeiko and Winfried Goettsch and Vincent Dietrich and Leszek P. Pryszcz and Sonia Cruciani and Eva Maria Novoa and Kandarp Joshi and Ranjan Perera and Salvatore Di Giorgio and Paola Arrubarrena and Irem Tellioglu and Chi-Lam Poon and Yuk Kei Wan and Jonathan Göke and Andreas Hildebrandt and Christoph Dieterich and Mark Helm and Manja Marz and Susanne Gerber and Nicolo Alagna},

doi = {10.1038/s42004-025-01507-0},

year  = {2024},

date = {2024-12-04},

urldate = {2024-12-04},

journal = {Communications Chemistry},

volume = {8},

issue = {1},

abstract = {The field of epitranscriptomics is undergoing a technology-driven revolution. During past decades, RNA modifications like N6-methyladenosine (m6A), pseudouridine (ψ), and 5-methylcytosine (m5C) became acknowledged for playing critical roles in cellular processes. Direct RNA sequencing by Oxford Nanopore Technologies (ONT) enabled the detection of modifications in native RNA, by detecting noncanonical RNA nucleosides properties in raw data. Consequently, the field’s cutting edge has a heavy component in computer science, opening new avenues of cooperation across the community, as exchanging data is as impactful as exchanging samples. Therefore, we seize the occasion to bring scientists together within the RNA Modification and Processing (RMaP) challenge to advance solutions for RNA modification detection and discuss ideas, problems and approaches. We show several computational methods to detect the most researched mRNA modifications (m6A, ψ, and m5C). Results demonstrate that a low prediction error and a high prediction accuracy can be achieved on these modifications across different approaches and algorithms. The RMaP challenge marks a substantial step towards improving algorithms’ comparability, reliability, and consistency in RNA modification prediction. It points out the deficits in this young field that need to be addressed in further challenges.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

zu Siederdissen, Christian Höner; Spangenberg, Jannes; Bisdorf, Kevin; Krautwurst, Sebastian; Srivastava, Akash; Marz, Manja; Taubert, Martin

Nanopore sequencing enables novel detection of deuterium incorporation in DNA Journal Article

In: Computational and Structural Biotechnology Journal, vol. 23, 2024.


@article{nokey_74,

title = {Nanopore sequencing enables novel detection of deuterium incorporation in DNA},

author = {Christian {Höner zu Siederdissen} and Jannes Spangenberg and Kevin Bisdorf and Sebastian Krautwurst and Akash Srivastava and Manja Marz and Martin Taubert},

doi = {10.1016/j.csbj.2024.09.027},

year  = {2024},

date = {2024-10-03},

urldate = {2024-10-03},

journal = {Computational and Structural Biotechnology Journal},

volume = {23},

abstract = {Identifying active microbes is crucial to understand their role in ecosystem functions. Metabolic labeling with heavy, non-radioactive isotopes, i.e., stable isotope probing (SIP), can track active microbes by detecting heavy isotope incorporation in biomolecules such as DNA. However, the detection of heavy isotope-labeled nucleotides directly during sequencing has, to date, not been achieved. In this study, Oxford nanopore sequencing was utilized to detect heavy isotopes incorporation in DNA molecules. Two isotopes widely used in SIP experiments were employed to label a bacterial isolate: deuterium (D, as D2O) and carbon-13 (13C, as glucose). We hypothesize that labeled DNA is distinguishable from unlabeled DNA by changes in the nanopore signal. To verify this distinction, we employed a Bayesian classifier trained on signal distributions of short oligonucleotides (k-mers) from labeled and unlabeled sequencing reads. Our results show a clear distinction between D-labeled and unlabeled reads, based on changes in median and median absolute deviation (MAD) of the nanopore signals for different k-mers. In contrast, 13C-labeled DNA cannot be distinguished from unlabeled DNA. For D, the model employed correctly predicted more than 85% of the reads. Even when metabolic labeling was conducted with only 30% D2O, 80% of the obtained reads were correctly classified with a 5% false discovery rate. Our work demonstrates the feasibility of direct detection of deuterium incorporation in DNA molecules during Oxford nanopore sequencing. This finding represents a first step in establishing the combined use of nanopore sequencing and SIP for tracking active organisms in microbial ecology.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Spangenberg, Jannes; zu Siederdissen, Christian Höner; Žarković, Milena; Triebel, Sandra; Rose, Ruben; Christophersen, Christina Martínez; Paltzow, Lea; Hegab, Mohsen M.; Wansorra, Anna; Srivastava, Akash; Krumbholz, Andi; Marz, Manja

Magnipore: Prediction of differential single nucleotide changes in the Oxford Nanopore Technologies sequencing signal of SARS-CoV-2 samples Journal Article

In: bioRxiv, 2023.


Molecular genetic, metagenomic and bioinformatic studies on the endometrium and placenta

As a member of the Collaborative Research Center CEPRE (Center for Early Pregnancy and Reproductive Health), our subproject also aims to improve the understanding of possible misregulation of non-coding genes and metagenomics in infertile women. We will investigate the molecular genetic differences between infertile and fertile women before pregnancy, in the first trimester and during pregnancy. The study will mainly be conducted bioinformatically by transcriptomic analyses of the endometrium, decidua and placenta. In addition to protein-coding transcripts, the main focus will be on non-coding RNAs. In addition, we will investigate the metagenomic composition of viruses and bacteria in the uterus and their differences in infertile and fertile women.

People involved: Stephan Kastner, Manja Marz

Collaborations: Udo Markert, Ekkehard Schleußner, Diana Maria Morales Prieto, Regine Heller

Grants: BMBF – 01GR2305B CEPRE – Center for Early Pregnancy and Reproductive Health

The virome of the lung

Even today, it still happens that patients die in hospital from pneumonia or other respiratory symptoms without the exact cause being known. One possible cause could be infection with an unknown virus. To find out, our first goal is to develop a method to identify rare viruses in the superimposed human background using metagenomic approaches. The method will then be validated in microbe-positive patients. The next step will be to analyse BAL (bronchoalveolar lavage) samples from patients with respiratory symptoms but without positive clinical microbial findings to identify a possible viral cause of the disease. A correlation between the symptoms and the individual genes of the virus is expected to determine whether an identified virus is truly pathogenic.

People involved: Stephan Kastner, Noriko Cassman, Manja Marz

Past people involved: Johanna Luise Gorki

Collaborations: Susanne Lang

Grants: Digitalisierung der Lebenswissenschaften: Wege in die Zukunft

FRESCO-Phage: Longterm transfer of FRozen, Encapsulated multidonor Stool bacteriophage filtrates for active ulcerative COlitis

Inflammatory bowel disease (IBD) is a global disease of the 21st century. In the US alone, an estimated 1 to 1.3 million people currently suffer from IBD. The exact etiology of IBD is unknown; one of the debated causes is dysbiosis of the intestinal microbiota. Ulcerative colitis (UC) and Crohn’s disease are the two main forms of IBD. The diversity of the microbiota in the gut of IBD patients is reduced, and multidonor intensive faecal microbiota transplantation during active UC is maintained. Kill-the-winner dynamics suggest that bacteriophages regulate the microbiome, making them an interesting target in UC. FRESCO, a BMBF research program longitudinally tracking multidonor stool transplantations in hundreds of UC patients (Prof. Stallmach, 1.6 MEUR), includes a conservative treatment for active mild to moderate UC by a) sterile faecal microbiota filtrate transplantation (SFMFT), b) classical faecal microbiota transplantation (FMT), and c) placebo. As the 0.22 µm filters used for SFMFT allow bacteriophages to pass, this experimental setup allows us to investigate the influence of bacteriophages on the composition of the gut microbiome and their role in UC.
In FRESCO-Phage, we want to understand the role of viruses as potential regulators of the bacterial community and therefore of the microbiome. To this end, we aim to analyze the bacterial community and, separately, the viruses from donor stool, and to transplant only the viruses into patients with active UC. We will analyze the microbiome (bacteria and viruses) before and after transplantation.

People involved: Stephan Kastner, Noriko Cassman, Tom Eulenfeld, Manja Marz

Past people involved: Johanna Luise Gorki

Collaborations: Andreas Stallmach

Grants: DFG EXC2051 – Balance of the Microverse

Alternative splicing and isoforms

Alternative splicing is an important process of gene regulation whereby a single gene can produce multiple mRNA isoforms with different functions. The formation of different isoforms leads to protein diversity and increased functional complexity. Understanding how alternative splicing patterns change in response to different conditions can provide valuable insights into disease mechanisms and help identify potential therapeutic targets. Our research investigates the role of alternative splicing and the use of different isoforms in health and disease, based on RNA-Seq data from humans, rats, mice and fish. We want to identify genes that undergo differential splicing and use different isoforms under specific conditions, such as viral infection or ageing.
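One simple way to express a change in isoform use is to compare, per gene, the fraction of reads assigned to each isoform between two conditions. The sketch below illustrates that idea only. It is not the group's analysis pipeline and includes no statistical testing or replicate handling. The function names, isoform labels and counts are hypothetical.

```python
from typing import Dict


def isoform_usage(counts: Dict[str, float]) -> Dict[str, float]:
    """Fraction of a gene's expression contributed by each isoform."""
    total = sum(counts.values())
    if total == 0:
        # Gene not expressed: report zero usage instead of dividing by zero.
        return {iso: 0.0 for iso in counts}
    return {iso: c / total for iso, c in counts.items()}


def usage_shift(control: Dict[str, float], treated: Dict[str, float]) -> Dict[str, float]:
    """Change in isoform fraction (treated minus control) for every isoform seen."""
    u_c, u_t = isoform_usage(control), isoform_usage(treated)
    return {iso: u_t.get(iso, 0.0) - u_c.get(iso, 0.0)
            for iso in sorted(set(u_c) | set(u_t))}


# Invented read counts for one gene with two isoforms.
control = {"isoform_A": 80, "isoform_B": 20}
infected = {"isoform_A": 30, "isoform_B": 70}
shift = usage_shift(control, infected)
```

Here the gene's total expression is identical in both conditions, so a gene-level differential expression test would see no change, while the isoform fractions show a switch from isoform_A to isoform_B.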

People involved: Maria Schreiber

A comprehensive toxin-antitoxin atlas

The threat posed by antibiotic-resistant bacteria is considerable. However, the existence of toxin-antitoxin (TA) systems in these microbes may facilitate the development of novel treatment options. TA systems function as a kind of self-destruct mechanism with antidotes. Further research is required to ascertain their potential as a means of combating or controlling antibiotic-resistant bacteria. The next step is to identify the presence of these systems in all bacteria and archaea at different levels (primary, secondary, tertiary) and to gain a deeper understanding of their function. This will entail analysing their interactions and investigating their evolutionary development. Finally, the aim is to demonstrate in the laboratory that the systems identified are indeed functional.

People involved: Maria Schreiber, Emanuel Barth

Bioinformatics support for researchers of the FSU and associated research institutes

The Bioinformatics Core Facility Jena (BiC) provides free support for researchers of the Friedrich Schiller University and associated research institutes in Jena at all stages of bioinformatics analysis. The support we offer ranges from consultations, basic bioinformatics services, and scientific workshops to full research collaborations. For our basic bioinformatics services, we have established modern, standardized workflows for numerous tasks in the field of high-throughput analysis and related research areas, starting with data quality control up to the final visualization of the results. For special applications, where our standardized methods reach their limits or a deeper interpretation of the results is desired, we offer individually adapted solutions in the form of full research collaboration. In the end, our aim is to contribute to the interdisciplinarity and development of life science research projects in Jena. Through our still-growing network of scientific partners, we have the opportunity to work on a huge variety of different topics.

People involved: Emanuel Barth, Muriel Ritsch, Sebastian Krautwurst, Daria Meyer

Viruses in groundwater

The principal aim of the CRC AquaDiva is to increase our understanding of the links between surface and subsurface, especially how organisms inhabiting the subsurface critical zone reflect and influence their physical, ecological and geochemical environment. In project A06, we will determine the diversity of previously known viruses in groundwater by high-throughput sequencing of viral genomes. A challenge in virology is the identification of previously undetected viruses, which we will tackle with our new approach to virus assembly. We complement this approach with a comparison of different protocols for the metagenomic identification of the groundwater virome. Finally, we will determine the largely unknown half-lives of various viruses in groundwater in order to gain insights into their ability to reproduce even in the long-term absence of the host.

People involved: Christian Höner zu Siederdissen, Stephan Kastner, Franziska Hufsky, Janina Rahlff, Noriko Cassman, Manja Marz

Past people involved: Akash Srivastava, Sebastian Krautwurst, Muriel Ritsch, Milena Žarković

Collaborations: Kirsten Küsel, Antonis Chatzinotas, René Kallies, Christian Jogler

Grants: CRC 1076 — AquaDiva: A6: Viral Diversity, Viral de novo Assembly, and Viral Decay in Groundwater

Related publications:

Overholt, Will A.; Trumbore, Susan; Xu, Xiaomei; Bornemann, Till L. V.; Probst, Alexander J.; Krüger, Markus; Herrmann, Martina; Thamdrup, Bo; Bristow, Laura A.; Taubert, Martin; Schwab, Valérie F.; Hölzer, Martin; Marz, Manja; Küsel, Kirsten

Carbon fixation rates in groundwater similar to those in oligotrophic marine systems Journal Article

In: Nat Geosci, vol. 15, pp. 561–567, 2022.


@article{nokey,

title = {Carbon fixation rates in groundwater similar to those in oligotrophic marine systems},

author = {Will A. Overholt and Susan Trumbore and Xiaomei Xu and Till L. V. Bornemann and Alexander J. Probst and Markus Krüger and Martina Herrmann and Bo Thamdrup and Laura A. Bristow and Martin Taubert and Valérie F. Schwab and Martin Hölzer and Manja Marz and Kirsten Küsel},

doi = {10.1038/s41561-022-00968-5},

year  = {2022},

date = {2022-06-30},

journal = {Nat Geosci},

volume = {15},

pages = {561--567},

abstract = {The terrestrial subsurface contains nearly all of Earth’s freshwater reserves and harbours the majority of our planet’s total prokaryotic biomass. Although genetic surveys suggest these organisms rely on in situ carbon fixation, rather than the photosynthetically derived organic carbon transported from surface environments, direct measurements of carbon fixation in the subsurface are absent. Using an ultra-low level 14C-labelling technique, we estimate in situ carbon fixation rates in a carbonate aquifer. We find these rates are similar to those measured in oligotrophic marine surface waters and up to six-fold greater than those observed in the lower euphotic zone. Our empirical carbon fixation rates agree with nitrification rate data. Metagenomic analyses reveal abundant putative chemolithoautotrophic members of an uncharacterized order of Nitrospiria that may be behind the carbon fixation. On the basis of our determined carbon fixation rates, we conservatively extrapolate global primary production in carbonate groundwaters (10% of global reserves) to be 0.11 Pg carbon per year. These rates fall within the range found for oligotrophic marine surface waters, indicating a substantial contribution of in situ primary production to subsurface ecosystem processes. We further suggest that, just as phototrophs are for marine biogeochemical cycling, such subsurface carbon fixation is potentially foundational to subsurface trophic webs.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Chaudhari, Narendrakumar M.; Overholt, Will A.; Figueroa-Gonzalez, Perla Abigail; Taubert, Martin; Bornemann, Till L. V.; Probst, Alexander J.; Hölzer, Martin; Marz, Manja; Küsel, Kirsten

The economical lifestyle of CPR bacteria in groundwater allows little preference for environmental drivers Journal Article

In: Environ Microbiome, vol. 16, no. 1, pp. 24, 2021.


@article{nokey,

title = {The economical lifestyle of CPR bacteria in groundwater allows little preference for environmental drivers},

author = {Narendrakumar M. Chaudhari and Will A. Overholt and Perla Abigail Figueroa-Gonzalez and Martin Taubert and Till L. V. Bornemann and Alexander J. Probst and Martin Hölzer and Manja Marz and Kirsten Küsel},

doi = {10.1186/s40793-021-00395-w},

year  = {2021},

date = {2021-12-14},

urldate = {2021-12-14},

journal = {Environ Microbiome},

volume = {16},

number = {1},

pages = {24},

abstract = {Background: The highly diverse Cand. Patescibacteria are predicted to have minimal biosynthetic and metabolic pathways, which hinders understanding of how their populations differentiate in response to environmental drivers or host organisms. Their mechanisms employed to cope with oxidative stress are largely unknown. Here, we utilized genome-resolved metagenomics to investigate the adaptive genome repertoire of Patescibacteria in oxic and anoxic groundwaters, and to infer putative host ranges.



Results: Within six groundwater wells, Cand. Patescibacteria was the most dominant (up to 79%) super-phylum across 32 metagenomes sequenced from DNA retained on 0.2 and 0.1 µm filters after sequential filtration. Of the reconstructed 1275 metagenome-assembled genomes (MAGs), 291 high-quality MAGs were classified as Cand. Patescibacteria. Cand. Paceibacteria and Cand. Microgenomates were enriched exclusively in the 0.1 µm fractions, whereas candidate division ABY1 and Cand. Gracilibacteria were enriched in the 0.2 µm fractions. On average, Patescibacteria enriched in the smaller 0.1 µm filter fractions had 22% smaller genomes, 13.4% lower replication measures, higher proportion of rod-shape determining proteins, and of genomic features suggesting type IV pili mediated cell-cell attachments. Near-surface wells harbored Patescibacteria with higher replication rates than anoxic downstream wells characterized by longer water residence time. Except prevalence of superoxide dismutase genes in Patescibacteria MAGs enriched in oxic groundwaters (83%), no major metabolic or phylogenetic differences were observed. The most abundant Patescibacteria MAG in oxic groundwater encoded a nitrate transporter, nitrite reductase, and F-type ATPase, suggesting an alternative energy conservation mechanism. Patescibacteria consistently co-occurred with one another or with members of phyla Nanoarchaeota, Bacteroidota, Nitrospirota, and Omnitrophota. Among the MAGs enriched in 0.2 µm fractions, only 8% Patescibacteria showed highly significant one-to-one correlation, mostly with Omnitrophota. Motility and transport related genes in certain Patescibacteria were highly similar to genes from other phyla (Omnitrophota, Proteobacteria and Nanoarchaeota).



Conclusion: Other than genes to cope with oxidative stress, we found little genomic evidence for niche adaptation of Patescibacteria to oxic or anoxic groundwaters. Given that we could detect specific host preference only for a few MAGs, we speculate that the majority of Patescibacteria is able to attach multiple hosts just long enough to loot or exchange supplies.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}


Overholt, Will A.; Hölzer, Martin; Geesink, Patricia; Diezel, Celia; Marz, Manja; Küsel, Kirsten

Inclusion of Oxford Nanopore long reads improves all microbial and viral metagenome-assembled genomes from a complex aquifer system Journal Article

In: Environ Microbiol, vol. 22, no. 9, pp. 4000-4013, 2020.


@article{Overholt:20,

title = {Inclusion of Oxford Nanopore long reads improves all microbial and viral metagenome-assembled genomes from a complex aquifer system},

author = {Will A. Overholt and Martin Hölzer and Patricia Geesink and Celia Diezel and Manja Marz and Kirsten Küsel},

doi = {10.1111/1462-2920.15186},

year  = {2020},

date = {2020-08-05},

urldate = {2020-08-05},

journal = {Environ Microbiol},

volume = {22},

number = {9},

pages = {4000-4013},

publisher = {Wiley},

abstract = {Assembling microbial and viral genomes from metagenomes is a powerful and appealing method to understand structure–function relationships in complex environments. To compare the recovery of genomes from microorganisms and their viruses from groundwater, we generated shotgun metagenomes with Illumina sequencing accompanied by long reads derived from the Oxford Nanopore Technologies (ONT) sequencing platform. Assembly and metagenome-assembled genome (MAG) metrics for both microbes and viruses were determined from an Illumina-only assembly, ONT-only assembly, and a hybrid assembly approach. The hybrid approach recovered 2× more mid to high-quality MAGs compared to the Illumina-only approach and 4× more than the ONT-only approach. A similar number of viral genomes were reconstructed using the hybrid and ONT methods, and both recovered nearly fourfold more viral genomes than the Illumina-only approach. While yielding fewer MAGs, the ONT-only approach generated MAGs with a high probability of containing rRNA genes, 3× higher than either of the other methods. Of the shared MAGs recovered from each method, the ONT-only approach generated the longest and least fragmented MAGs, while the hybrid approach yielded the most complete. This work provides quantitative data to inform a cost–benefit analysis of the decision to supplement shotgun metagenomic projects with long reads towards the goal of recovering genomes from environmentally abundant groups.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Kallies, René; Hölzer, Martin; Toscan, Rodolfo Brizola; da Rocha, Ulisses Nunes; Anders, John; Marz, Manja; Chatzinotas, Antonis

Evaluation of Sequencing Library Preparation Protocols for Viral Metagenomic Analysis from Pristine Aquifer Groundwaters. Journal Article

In: Viruses, vol. 11, no. 6, pp. 484, 2019.


Wegner, Carl-Eric; Gaspar, Michael; Geesink, Patricia; Herrmann, Martina; Marz, Manja; Küsel, Kirsten

Biogeochemical regimes in shallow aquifers reflect the metabolic coupling of elements of nitrogen, sulfur and carbon. Journal Article

In: Appl Environ Microbiol, vol. 85, no. 5, pp. e02346-18, 2019.


@article{Wegner:19,

title = {Biogeochemical regimes in shallow aquifers reflect the metabolic coupling of elements of nitrogen, sulfur and carbon.},

author = {Carl-Eric Wegner and Michael Gaspar and Patricia Geesink and Martina Herrmann and Manja Marz and Kirsten Küsel},

doi = {10.1128/AEM.02346-18},

year  = {2019},

date = {2019-02-20},

urldate = {2019-01-01},

journal = {Appl Environ Microbiol},

volume = {85},

number = {5},

pages = {e02346-18},

abstract = {Near-surface groundwaters are prone to receive (in)organic matter input from their recharge areas and are known to harbour autotrophic microbial communities linked to nitrogen and sulfur metabolism. Here, we use multi-"omic" profiling to gain holistic insights into the turnover of inorganic nitrogen compounds, carbon fixation processes and organic matter processing in groundwater. We sampled microbial biomass from two superimposed aquifers via monitoring wells that follow groundwater flow from its recharge area through differences in hydrogeochemical settings and land use. Functional profiling revealed that groundwater microbiomes are mainly driven by nitrogen (nitrification, denitrification, anammox) and to a lesser extent sulfur cycling (sulfur oxidation and sulfate reduction), dependent on local hydrochemical differences. Surprisingly, the differentiation potential of the groundwater microbiome surpasses that of hydrochemistry for individual monitoring wells. Dominated by few phyla (Bacteroidetes, Proteobacteria, Planctomycetes, Thaumarchaeota), the taxonomic profiling of groundwater metagenomes and metatranscriptomes revealed pronounced differences between merely present microbiome members and those actively participating in community gene expression and biogeochemical cycling. Unexpectedly, we observed a constitutive expression of carbohydrate-active enzymes, encoded by different microbiome members, along with the groundwater flow path. The turnover of organic carbon apparently complements for lithoautotrophic carbon assimilation pathways mainly used by the groundwater microbiome dependent on the availability of oxygen and inorganic electron donors like ammonium. Groundwater is a key resource for drinking water production and irrigation. The interplay between geological setting, hydrochemistry, carbon storage and groundwater microbiome ecosystem functioning is crucial for our understanding of these important ecosystem services. 
We targeted the encoded and expressed metabolic potential of groundwater microbiomes along an aquifer transect that diversifies in terms of hydrochemistry and land use. Our results showed that the groundwater microbiome has a higher spatial differentiation potential than hydrochemistry.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}


Starke, Robert; Müller, Martina; Gaspar, Michael; Marz, Manja; Küsel, Kirsten; Totsche, Kai Uwe; von Bergen, Martin; Jehmlich, Nico

Candidate Brocadiales dominates C, N and S cycling in anoxic groundwater of a pristine limestone-fracture aquifer Journal Article

In: J Proteomics, vol. 152, pp. 153–160, 2016.


@article{Starke:17,

title = {Candidate Brocadiales dominates C, N and S cycling in anoxic groundwater of a pristine limestone-fracture aquifer},

author = {Robert Starke and Martina Müller and Michael Gaspar and Manja Marz and Kirsten Küsel and Kai Uwe Totsche and Martin von Bergen and Nico Jehmlich},

doi = {10.1016/j.jprot.2016.11.003},

year  = {2016},

date = {2016-11-10},

urldate = {2016-11-10},

journal = {J Proteomics},

volume = {152},

pages = {153--160},

abstract = {Groundwater-associated microorganisms are known to play an important role in the biogeochemical C, N and S cycling. Metaproteomics was applied to characterize the diversity and the activity of microbes to identify key species in major biogeochemical processes in the anoxic groundwater of a pristine karstic aquifer located in Hainich, central Germany. Sampling was achieved by pumping 1000L water from two sites of the upper aquifer assemblage and filtration on 0.3μm glass filters. In total, 3808 protein groups were identified. Interestingly, the two wells (H4/2 and H5/2) differed not only in microbial density but also in the prevalence of different C, N and S cycling pathways. The well H5/2 was dominated by the anaerobic ammonia-oxidizing (anammox) candidate Brocadiales (31%) while other orders such as Burkholderiales (2%) or Nitrospirales (3%) were less abundant. Otherwise, the well H4/2 featured only low biomass and remarkably fewer proteins (391 to 3631 at H5/2). Candidate Brocadiales was affiliated to all major carbon fixation strategies, and to the cycling of N and S implying a major role in biogeochemical processes of groundwater aquifers. The findings of our study support functions which can be linked to the ecosystem services provided by the microbial communities present in aquifers. Subsurface environments especially the groundwater ecosystems represent a large habitat for microbial activity. Microbes are responsible for energy and nutrient cycling and are massively involved in the planet's sustainability. Microbial diversity is tremendous and the central question in current microbial ecology is "Who eats what, where and when?". In this study, we characterize a natural aquifer inhabiting microbial community to obtain evidence for the phylogenetic diversity and the metabolic activity by protein abundance and we highlight important biogeochemical cycling processes. 
The aquifer was dominated by Candidatus Brocadiales while other phylotypes such as Burkholderiales, Caulobacterales and Nitrospirales were less abundant. The candidate comprised all major carbon fixation strategies, ammonification, anammox and denitrification as well as assimilatory sulfate reduction. Our findings have broad implications for the understanding of microbial activities in this aquifer and consequently specific functions can be linked to the ecosystem services provided by the microbial communities present in aquifers.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}


Fabisch, Maria; Freyer, Gina; Johnson, Carol A.; Büchel, G.; Akob, Denise M.; Neu, R.; Küsel, Kirsten

Dominance of ‘Gallionella capsiferriformans’ and heavy metal association with Gallionella-like stalks in metal-rich pH 6 mine water discharge Journal Article

In: Geobiology, vol. 14, iss. 1, pp. 68-90, 2015.


@article{nokey,

title = {Dominance of ‘Gallionella capsiferriformans’ and heavy metal association with Gallionella-like stalks in metal-rich pH 6 mine water discharge},

author = {Maria Fabisch and Gina Freyer and Carol A. Johnson and G. Büchel and Denise M. Akob and R. Neu and Kirsten Küsel},

doi = {10.1111/gbi.12162},

year  = {2015},

date = {2015-09-26},

urldate = {2015-09-26},

journal = {Geobiology},

volume = {14},

issue = {1},

pages = {68-90},

abstract = {Heavy metal-contaminated, pH 6 mine water discharge created new streams and iron-rich terraces at a creek bank in a former uranium-mining area near Ronneburg, Germany. The transition from microoxic groundwater with ~5 mM Fe(II) to oxic surface water may provide a suitable habitat for microaerobic iron-oxidizing bacteria (FeOB). In this study, we investigated the potential contribution of these FeOB to iron oxidation and metal retention in this high-metal environment. We (i) identified and quantified FeOB in water and sediment at the outflow, terraces, and creek, (ii) studied the composition of biogenic iron oxides (Gallionella-like twisted stalks) with scanning and transmission electron microscopy (SEM, TEM) as well as confocal laser scanning microscopy (CLSM), and (iii) examined the metal distribution in sediments. Using quantitative PCR, a very high abundance of FeOB was demonstrated at all sites over a 6-month study period. Gallionella spp. clearly dominated the communities, accounting for up to 88% of Bacteria, with a minor contribution of other FeOB such as Sideroxydans spp. and ‘Ferrovum myxofaciens’. Classical 16S rRNA gene cloning showed that 96% of the Gallionella-related sequences had ≥97% identity to the putatively metal-tolerant ‘Gallionella capsiferriformans ES-2’, in addition to known stalk formers such as Gallionella ferruginea and Gallionellaceae strain R-1. Twisted stalks from glass slides incubated in water and sediment were composed of the Fe(III) oxyhydroxide ferrihydrite, as well as polysaccharides. SEM and scanning TEM-energy-dispersive X-ray spectroscopy revealed that stalk material contained Cu and Sn, demonstrating the association of heavy metals with biogenic iron oxides and the potential for metal retention by these stalks. Sequential extraction of sediments suggested that Cu (52–61% of total sediment Cu) and other heavy metals were primarily bound to the iron oxide fractions. These results show the importance of ‘G. capsiferriformans’ and biogenic iron oxides in slightly acidic but highly metal-contaminated freshwater environments.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}


Johnson, Carol A.; Freyer, Gina; Fabisch, Maria; Caraballo, Manuela A.; Küsel, Kirsten; Hochella Jr, Michael F.

Observations and assessment of iron oxide and green rust nanoparticles in metal-polluted mine drainage within a steep redox gradient Journal Article

In: Environmental Chemistry, vol. 11, iss. 4, pp. 377-391, 2014.


@article{nokey,

title = {Observations and assessment of iron oxide and green rust nanoparticles in metal-polluted mine drainage within a steep redox gradient},

author = {Carol A. Johnson and Gina Freyer and Maria Fabisch and Manuela A. Caraballo and Kirsten Küsel and Michael F. Hochella Jr},

doi = {10.1071/EN13184},

year  = {2014},

date = {2014-05-19},

urldate = {2014-05-19},

journal = {Environmental Chemistry},

volume = {11},

issue = {4},

pages = {377-391},

abstract = {In this study of iron- and silica-bearing nanoparticle and colloid aggregates in slightly acidic mine drainage, we combined bulk scale geochemistry techniques with detailed nanoscale analyses using high-resolution transmission electron microscopy (HR-TEM) to demonstrate the complexity of iron oxide formation and transformation at a steep redox gradient (groundwater outflow into a stream), and the resulting role in metal(loid) uptake. We also identified pseudohexagonal nanosheets of Zn-bearing green rust in outflowing groundwater using HR-TEM. This is only the second study where green rust was identified in groundwater, and the second to examine naturally occurring green rust with analytical TEM. In aerated downstream waters, we found aggregates of poorly crystalline iron oxide particles (20–200 nm in diameter). Inductively coupled plasma–mass spectrometry (ICP-MS) analysis of water fractions shows that most elements such as Ni and Zn were found almost exclusively in the dissolved–nanoparticulate (<0.1 μm) fraction, whereas Cu and As were primarily associated with suspended particles. In the underlying sediments composed of deposited particles, goethite nanoneedles formed on the ferrihydrite surfaces of larger aggregated particles (100–1000 nm), resulting in more reactive surface area for metal(loid) uptake. Sequential extraction of sediments showed that many metal(loid)s, particularly As and Zn, were associated with iron oxides identified as ferrihydrite, goethite and possibly schwertmannite. Amorphous silica co-precipitation with iron oxides was prevalent at all sampling sites, but its effect on metal(loid) sorption is unknown. Fine-grained iron oxide sediments are easily remobilised during turbulent flow events, adding to the mobility of the associated metals.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Fabisch, Maria; Beulig, Felix; Akob, Denise M.; Küsel, Kirsten

Surprising abundance of Gallionella-related iron oxidizers in creek sediments at pH 4.4 or at high heavy metal concentrations Journal Article

In: Frontiers in Microbiology, vol. 4, 2013.

@article{nokey,

title = {Surprising abundance of Gallionella-related iron oxidizers in creek sediments at pH 4.4 or at high heavy metal concentrations},

author = {Maria Fabisch and Felix Beulig and Denise M. Akob and Kirsten Küsel},

doi = {10.3389/fmicb.2013.00390},

year  = {2013},

date = {2013-12-18},

urldate = {2013-12-18},

journal = {Frontiers in Microbiology},

volume = {4},

abstract = {We identified and quantified abundant iron-oxidizing bacteria (FeOB) at three iron-rich, metal-contaminated creek sites with increasing sediment pH from extremely acidic (R1, pH 2.7), to moderately acidic (R2, pH 4.4), to slightly acidic (R3, pH 6.3) in a former uranium-mining district. The geochemical parameters showed little variations over the 1.5 year study period. The highest metal concentrations found in creek sediments always coincided with the lowest metal concentrations in creek water at the slightly acidic site R3. Sequential extractions of R3 sediment revealed large portions of heavy metals (Ni, Cu, Zn, Pb, U) bound to the iron oxide fraction. Light microscopy of glass slides exposed in creeks detected twisted stalks characteristic of microaerobic FeOB of the family Gallionellaceae at R3 but also at the acidic site R2. Sequences related to FeOB such as Gallionella ferruginea, Sideroxydans sp. CL21, Ferritrophicum radicicola, and Acidovorax sp. BrG1 were identified in the sediments. The highest fraction of clone sequences similar to the acidophilic “Ferrovum myxofaciens” was detected in R1. Quantitative PCR using primer sets specific for Gallionella spp., Sideroxydans spp., and “Ferrovum myxofaciens” revealed that ~72% (R2 sediment) and 37% (R3 sediment) of total bacterial 16S rRNA gene copies could be assigned to groups of FeOB with dominance of microaerobic Gallionella spp. at both sites. Gallionella spp. had similar and very high absolute and relative gene copy numbers in both sediment communities. Thus, Gallionella-like organisms appear to exhibit a greater acid and metal tolerance than shown before. Microaerobic FeOB from R3 creek sediment enriched in newly developed metal gradient tubes tolerated metal concentrations of 35 mM Co, 24 mM Ni, and 1.3 mM Cd, higher than those in sediments. Our results will extend the limited knowledge of FeOB at contaminated, moderately to slightly acidic environments.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Ecology and species barriers in emerging viral diseases

Emerging viruses that exist in animal reservoirs may cause epidemic or epizootic diseases if transmitted to humans or livestock. While we understand the pathogenicity and epidemiology of prototypic emerging viral diseases, we know little about the mechanisms driving virus emergence from animal reservoirs. To move ahead, we need to generalize our view of emerging viruses, taking into consideration the ecology of viruses in their natural reservoirs. We hypothesize that small mammals, mainly bats and rodents, constitute the most relevant virus reservoirs due to their large group sizes, population density, mixing, and turnover, as well as their exposure to arthropod vectors.

People involved: Marie Lataretu, Gabriel Lencioni Lovate, Daria Meyer

Past people involved: Martin Hölzer, Nelly Fernanda Mostajo Berrospi

Collaborations: Christian Drosten, Friedemann Weber, Stephan Becker, Martin Beer, Georg Kochs

Grants: DFG SPP 1596: Ecology and Species Barriers in Emerging Viral Diseases

Related publications:

Mostajo, Nelly F.; Lataretu, Marie; Krautwurst, Sebastian; Mock, Florian; Desirò, Daniel; Lamkiewicz, Kevin; Collatz, Maximilian; Schoen, Andreas; Weber, Friedemann; Marz, Manja; Hölzer, Martin

A comprehensive annotation and differential expression analysis of short and long non-coding RNAs in 16 bat genomes Journal Article

In: NAR Genomics Bioinf, vol. 2, no. 1, pp. lqz006, 2019.

@article{Mostajo:20,

title = {A comprehensive annotation and differential expression analysis of short and long non-coding RNAs in 16 bat genomes},

author = {Nelly F. Mostajo and Marie Lataretu and Sebastian Krautwurst and Florian Mock and Daniel Desirò and Kevin Lamkiewicz and Maximilian Collatz and Andreas Schoen and Friedemann Weber and Manja Marz and Martin Hölzer},

url = {https://www.rna.uni-jena.de/supplements/bats/index.html},

doi = {10.1093/nargab/lqz006},

year  = {2019},

date = {2019-09-30},

urldate = {2019-09-30},

journal = {NAR Genomics Bioinf},

volume = {2},

number = {1},

pages = {lqz006},

abstract = {Although bats are increasingly becoming the focus of scientific studies due to their unique properties, these exceptional animals are still among the least studied mammals. Assembly quality and completeness of bat genomes vary a lot and especially non-coding RNA (ncRNA) annotations are incomplete or simply missing. Accordingly, standard bioinformatics pipelines for gene expression analysis often ignore ncRNAs such as microRNAs or long antisense RNAs. The main cause of this problem is the use of incomplete genome annotations. We present a complete screening for ncRNAs within 16 bat genomes. NcRNAs affect a remarkable variety of vital biological functions, including gene expression regulation, RNA processing, RNA interference and, as recently described, regulatory processes in viral infections. Within all investigated bat assemblies, we annotated 667 ncRNA families including 162 snoRNAs and 193 miRNAs as well as rRNAs, tRNAs, several snRNAs and lncRNAs, and other structural ncRNA elements. We validated our ncRNA candidates by six RNA-Seq data sets and show significant expression patterns that have never been described before in a bat species on such a large scale. Our annotations will be usable as a resource (rna.uni-jena.de/supplements/bats) for deeper studying of bat evolution, ncRNAs repertoire, gene expression and regulation, ecology and important host–virus interactions.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Hölzer, Martin; Krähling, Verena; Amman, Fabian; Barth, Emanuel; Bernhart, Stephan H.; Carmelo, Victor A. O.; Collatz, Maximilian; Doose, Gero; Eggenhofer, Florian; Ewald, Jan; Fallmann, Jörg; Feldhahn, Lasse M.; Fricke, Markus; Gebauer, Juliane; Gruber, Andreas J.; Hufsky, Franziska; Indrischek, Henrike; Kanton, Sabina; Linde, Jörg; Mostajo, Nelly F.; Ochsenreiter, Roman; Riege, Konstantin; Rivarola-Duarte, Lorena; Sahyoun, Abdullah H.; Saunders, Sita J.; Seemann, Stefan E.; Tanzer, Andrea; Vogel, Bertram; Wehner, Stefanie; Wolfinger, Michael T.; Backofen, Rolf; Gorodkin, Jan; Grosse, Ivo; Hofacker, Ivo; Hoffmann, Steve; Kaleta, Christoph; Stadler, Peter F.; Becker, Stephan; Marz, Manja

Differential transcriptional responses to Ebola and Marburg virus infection in bat and human cells Journal Article

In: Sci Rep, vol. 6, pp. 34589, 2016.

@article{Hoelzer:16,

title = {Differential transcriptional responses to Ebola and Marburg virus infection in bat and human cells},

author = {Martin Hölzer and Verena Krähling and Fabian Amman and Emanuel Barth and Stephan H. Bernhart and Victor A. O. Carmelo and Maximilian Collatz and Gero Doose and Florian Eggenhofer and Jan Ewald and Jörg Fallmann and Lasse M. Feldhahn and Markus Fricke and Juliane Gebauer and Andreas J. Gruber and Franziska Hufsky and Henrike Indrischek and Sabina Kanton and Jörg Linde and Nelly F. Mostajo and Roman Ochsenreiter and Konstantin Riege and Lorena Rivarola-Duarte and Abdullah H. Sahyoun and Sita J. Saunders and Stefan E. Seemann and Andrea Tanzer and Bertram Vogel and Stefanie Wehner and Michael T. Wolfinger and Rolf Backofen and Jan Gorodkin and Ivo Grosse and Ivo Hofacker and Steve Hoffmann and Christoph Kaleta and Peter F. Stadler and Stephan Becker and Manja Marz},

doi = {10.1038/srep34589},

year  = {2016},

date = {2016-10-07},

urldate = {2016-10-07},

journal = {Sci Rep},

volume = {6},

pages = {34589},

abstract = {The unprecedented outbreak of Ebola in West Africa resulted in over 28,000 cases and 11,000 deaths, underlining the need for a better understanding of the biology of this highly pathogenic virus to develop specific counter strategies. Two filoviruses, the Ebola and Marburg viruses, result in a severe and often fatal infection in humans. However, bats are natural hosts and survive filovirus infections without obvious symptoms. The molecular basis of this striking difference in the response to filovirus infections is not well understood. We report a systematic overview of differentially expressed genes, activity motifs and pathways in human and bat cells infected with the Ebola and Marburg viruses, and we demonstrate that the replication of filoviruses is more rapid in human cells than in bat cells. We also found that the most strongly regulated genes upon filovirus infection are chemokine ligands and transcription factors. We observed a strong induction of the JAK/STAT pathway, of several genes encoding inhibitors of MAP kinases (DUSP genes) and of PPP1R15A, which is involved in ER stress-induced cell death. We used comparative transcriptomics to provide a data resource that can be used to identify cellular responses that might allow bats to survive filovirus infections.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Limiting viral outbreaks with Machine Learning

Zoonosis, the natural transmission of infections from animal to human, is a far-reaching global problem, occurring more often due to globalization. In case of a virus outbreak, it is helpful to know which host organism was the original carrier of the virus, so people can be separated from these hosts. The sooner this happens, the greater the chances of limiting an outbreak.
To identify the original host quickly, we are developing methods that predict the host of a virus from its genome sequence. Because little is known about virus adaptation, it is difficult to find practical features for classical machine learning methods. We therefore apply deep learning methods, which do not require predefined features and are among today’s most powerful machine learning methods.
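The description above does not say how genome sequences are presented to a neural network. One common choice, shown here purely as an illustrative assumption and not as the project's actual pipeline, is to cut each genome into fixed-length subsequences and one-hot encode the nucleotides. The function names `one_hot` and `fixed_length_windows` and the window length are hypothetical.

```python
from typing import List

ALPHABET = "ACGT"  # assumed alphabet; RNA input is mapped from U to T


def one_hot(seq: str) -> List[List[int]]:
    """One-hot encode a nucleotide sequence.

    Bases outside ACGT (for example N) become all-zero rows.
    """
    rows = []
    for base in seq.upper().replace("U", "T"):
        rows.append([1 if base == letter else 0 for letter in ALPHABET])
    return rows


def fixed_length_windows(seq: str, length: int) -> List[str]:
    """Cut a genome into non-overlapping windows of equal length.

    A trailing remainder shorter than `length` is dropped.
    """
    return [seq[i:i + length] for i in range(0, len(seq) - length + 1, length)]


if __name__ == "__main__":
    genome = "ACGTACGTAC"  # toy sequence, not real data
    for window in fixed_length_windows(genome, 4):
        print(window, one_hot(window))
```

Each encoded window could then serve as one training example labeled with the known host of the virus it came from.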

People involved: Florian Mock

Past people involved: Adrian Viehweger

Related publications:

Mock, Florian; Viehweger, Adrian; Barth, Emanuel; Marz, Manja

VIDHOP, viral host prediction with Deep Learning Journal Article

In: Bioinformatics, vol. 37, no. 3, pp. 318–325, 2020.

Viruses in host genomes

Overall, little is known about the composition of the human genome. For example, about 50% is assumed to consist of repetitive sequences whose function is still undescribed. These include SINEs and LINEs, which are thought to be of viral origin. Another component of the genome is the rarely described endogenous viral elements (EVEs). These can be divided into rarely analyzed retrotransposons and other previously undescribed viral elements. Normally, retroviruses infect somatic cells and integrate their genetic material into the host genome. Infrequently, retroviruses integrate into germline cells, in which case the viral genome can be passed on to offspring. This process of integration is called endogenization. It was long assumed that only retroviruses could integrate into the host genome, since integration is a necessary step in their life cycle. Surprisingly, several non-retroviral elements have already been found in different genomes, but these are confirmed individual occurrences, and there is no detailed catalog of EVEs. Little is known about the integration of non-retroviral EVEs. It has been frequently reported that 8% of the human genome consists of viral elements. However, this number refers only to retroviruses and is almost 20 years old. At that time, much less was known about viruses and their replication mechanisms. Sequencing techniques and bioinformatic programs have also evolved enormously since then. It is therefore very surprising that, despite the description of individual non-retroviral elements in the human genome, there has been no fundamental, systematic re-evaluation of the viral elements in the human genome.

In this project, we address three fundamental questions: (1) How many viral elements are there in the human genome? (2) Are they functional? (3) Are the viral fragments accumulated over a human lifetime, or are they essentially inherited?

People involved: Muriel Ritsch

RNA structures and functions in viruses

Non-coding RNAs (ncRNAs) are known regulatory elements in organisms from all kingdoms. The secondary structure of an RNA is often linked to its function. For a viral genome (especially an RNA genome), it is therefore plausible that viruses make extensive use of ncRNAs to evade the host’s immune response, regulate their own genes, or shut down the expression of host genes. We are concerned with the analysis of conserved RNA structures in well-described virus families and the de novo prediction of potentially functional structural elements in less well-known families. We use combinations of machine learning, clustering, and homology-based methods. The identification of functional structural elements could help to develop new antiviral therapies in the future, as important replication mechanisms of the virus could be disrupted.
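RNA secondary structures are commonly written in dot-bracket notation, where matching parentheses mark paired positions and dots mark unpaired ones. The following minimal sketch, which is not taken from any of the group's tools, parses such a string into base pairs and reports pairs that are not canonical (A-U, G-C, or G-U wobble). The function names are hypothetical, and pseudoknots are not supported.

```python
from typing import List, Tuple

# Watson-Crick pairs plus the G-U wobble pair
CANONICAL = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def parse_dot_bracket(structure: str) -> List[Tuple[int, int]]:
    """Return sorted 0-based (i, j) base pairs of a dot-bracket string."""
    stack: List[int] = []
    pairs: List[Tuple[int, int]] = []
    for i, symbol in enumerate(structure):
        if symbol == "(":
            stack.append(i)
        elif symbol == ")":
            if not stack:
                raise ValueError(f"unmatched ')' at position {i}")
            pairs.append((stack.pop(), i))
        elif symbol != ".":
            raise ValueError(f"unsupported symbol {symbol!r} at position {i}")
    if stack:
        raise ValueError(f"unmatched '(' at position {stack[-1]}")
    return sorted(pairs)


def noncanonical_pairs(sequence: str, structure: str) -> List[Tuple[int, int]]:
    """List the pairs of `structure` that `sequence` cannot form canonically."""
    if len(sequence) != len(structure):
        raise ValueError("sequence and structure must have equal length")
    seq = sequence.upper()
    return [(i, j) for i, j in parse_dot_bracket(structure)
            if (seq[i], seq[j]) not in CANONICAL]


if __name__ == "__main__":
    print(parse_dot_bracket("((..))"))             # pairs of a small hairpin
    print(noncanonical_pairs("GAAAAC", "((..))"))  # the A-A pair is flagged
```

A check of this kind is one simple way to test whether a structure proposed for one genome is still compatible with the sequence of a related genome.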

People involved: Kevin Lamkiewicz, Gabriel Lovate, Sandra Triebel

Related publications:

Krautwurst, Sarah; Lamkiewicz, Kevin

RNA-protein interaction prediction without high-throughput data: An overview and benchmark of in silico tools Journal Article

In: Computational and Structural Biotechnology Journal, vol. 23, pp. 4036-4046, 2024, ISSN: 2001-0370.

@article{nokey_66,

title = {RNA-protein interaction prediction without high-throughput data: An overview and benchmark of \textit{in silico} tools},

author = {Sarah Krautwurst and Kevin Lamkiewicz},

doi = {10.1016/j.csbj.2024.11.015},

issn = {2001-0370},

year  = {2024},

date = {2024-11-08},

journal = {Computational and Structural Biotechnology Journal},

volume = {23},

pages = {4036-4046},

abstract = {RNA-protein interactions (RPIs) are crucial for accurately operating various processes in and between organisms across kingdoms of life. Mutual detection of RPI partner molecules depends on distinct sequential, structural, or thermodynamic features, which can be determined via experimental and bioinformatic methods. Still, the underlying molecular mechanisms of many RPIs are poorly understood. It is further hypothesized that many RPIs are not even described yet. Computational RPI prediction is continuously challenged by the lack of data and detailed research of very specific examples. With the discovery of novel RPI complexes in all kingdoms of life, adaptations of existing RPI prediction methods are necessary. Continuously improving computational RPI prediction is key in advancing the understanding of RPIs in detail and supplementing experimental RPI determination. The growing amount of data covering more species and detailed mechanisms support the accuracy of prediction tools, which in turn support specific experimental research on RPIs. Here, we give an overview of RPI prediction tools that do not use high-throughput data as the user's input. We review the tools according to their input, usability, and output. We then apply the tools to known RPI examples across different kingdoms of life. Our comparison shows that the investigated prediction tools do not favor a certain species and equip the user with results varying in degree of information, from an overall RPI score to detailed interacting residues. Furthermore, we provide a guide tree to assist users which RPI prediction tool is appropriate for their available input data and desired output.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Triebel, Sandra; Lamkiewicz, Kevin; Ontiveros, Nancy; Sweeney, Blake; Stadler, Peter F.; Petrov, Anton I.; Niepmann, Michael; Marz, Manja

Comprehensive survey of conserved RNA secondary structures in full-genome alignment of Hepatitis C virus Journal Article

In: Scientific Reports, vol. 14, iss. 1, 2024.

@article{nokey_62,

title = {Comprehensive survey of conserved RNA secondary structures in full-genome alignment of Hepatitis C virus},

author = {Sandra Triebel and Kevin Lamkiewicz and Nancy Ontiveros and Blake Sweeney and Peter F. Stadler and Anton I. Petrov and Michael Niepmann and Manja Marz},

doi = {10.1038/s41598-024-62897-0},

year  = {2024},

date = {2024-07-02},

urldate = {2024-07-02},

journal = {Scientific Reports},

volume = {14},

issue = {1},

abstract = {Hepatitis C virus (HCV) is a plus-stranded RNA virus that often chronically infects liver hepatocytes and causes liver cirrhosis and cancer. These viruses replicate their genomes employing error-prone replicases. Thereby, they routinely generate a large ‘cloud’ of RNA genomes (quasispecies) which—by trial and error—comprehensively explore the sequence space available for functional RNA genomes that maintain the ability for efficient replication and immune escape. In this context, it is important to identify which RNA secondary structures in the sequence space of the HCV genome are conserved, likely due to functional requirements. Here, we provide the first genome-wide multiple sequence alignment (MSA) with the prediction of RNA secondary structures throughout all representative full-length HCV genomes. We selected 57 representative genomes by clustering all complete HCV genomes from the BV-BRC database based on k-mer distributions and dimension reduction and adding RefSeq sequences. We include annotations of previously recognized features for easy comparison to other studies. Our results indicate that mainly the core coding region, the C-terminal NS5A region, and the NS5B region contain secondary structure elements that are conserved beyond coding sequence requirements, indicating functionality on the RNA level. In contrast, the genome regions in between contain less highly conserved structures. The results provide a complete description of all conserved RNA secondary structures and make clear that functionally important RNA secondary structures are present in certain HCV genome regions but are largely absent from other regions. Full-genome alignments of all branches of Hepacivirus C are provided in the supplement.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Lamkiewicz, Kevin

RNA secondary structures in RNA viruses: Why viruses would not exist without RNA secondary structures PhD Thesis

2022.

Jordan-Paiz, Ana; Nevot, Maria; Lamkiewicz, Kevin; Lataretu, Marie; Franco, Sandra; Marz, Manja; Martinez, Miguel Angel

HIV-1 lethality and loss of Env protein expression induced by single synonymous substitutions in the virus genome intronic splicing silencer Journal Article

In: J Virol, vol. 94, no. 21, 2020.

@article{Jordan-Paiz:20,

title = {HIV-1 lethality and loss of Env protein expression induced by single synonymous substitutions in the virus genome intronic splicing silencer},

author = {Ana Jordan-Paiz and Maria Nevot and Kevin Lamkiewicz and Marie Lataretu and Sandra Franco and Manja Marz and Miguel Angel Martinez},

doi = {10.1128/jvi.01108-20},

year  = {2020},

date = {2020-10-14},

urldate = {2020-01-01},

journal = {J Virol},

volume = {94},

number = {21},

publisher = {American Society for Microbiology},

abstract = {Synonymous genome recoding has been widely used to study different aspects of virus biology. Codon usage affects the temporal regulation of viral gene expression. In this study, we performed synonymous codon mutagenesis to investigate whether codon usage affected HIV-1 Env protein expression and virus viability. We replaced the codons AGG, GAG, CCU, ACU, CUC, and GGG of the HIV-1 env gene with the synonymous codons CGU, GAA, CCG, ACG, UUA, and GGA, respectively. We found that recoding the Env protein gp120 coding region (excluding the Rev response element [RRE]) did not significantly affect virus replication capacity, even though we introduced 15 new CpG dinucleotides. In contrast, changing a single codon (AGG to CGU) located in the gp41 coding region (HXB2 env position 2125 to 2127), which was included in the intronic splicing silencer (ISS), completely abolished virus replication and Env expression. Computational analyses of this mutant revealed a severe disruption in the ISS RNA secondary structure. A variant that restored ISS secondary RNA structure also reestablished Env production and virus viability. Interestingly, this codon variant prevented both virus replication and Env translation in a eukaryotic expression system. These findings suggested that disrupting mRNA splicing was not the only means of inhibiting translation. Our findings indicated that synonymous gp120 recoding was not always deleterious to HIV-1 replication. Importantly, we found that disrupting an external ISS loop strongly affected HIV-1 replication and Env translation.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Madhugiri, Ramakanth; Karl, Nadja; Petersen, Daniel; Lamkiewicz, Kevin; Fricke, Markus; Wend, Ulrike; Scheuer, Robina; Marz, Manja; Ziebuhr, John

Structural and functional conservation of cis-acting RNA elements in coronavirus 5'-terminal genome regions Journal Article

In: Virology, vol. 517, pp. 44–55, 2017.

Deciphering the RNA genome packaging code of influenza A viruses

Most current bioinformatics tools are not specifically designed for viruses. However, viruses have unique features that require dedicated tools to trace virus-host interactions. For example, the number of sequences in a quasispecies is extremely high due to the high mutation rate, but only a few of them go on to interact with host cells again. Some viruses, such as IAV or as used by AG, are segmented RNA viruses, which urgently require tools with specific features: tools for RNA viruses should include standardized secondary structure predictions, leading to the RNA-RNA interaction prediction needed to study the packaging of segmented RNA viruses.

The aims of this project are to develop a bioinformatics tool to predict RNA-RNA interactions as packaging signals for segmented viruses, such as IAV; to develop a virus-specific full-genome multiple sequence alignment algorithm to track the quasispecies; and to establish RNA-RNA interaction sets and, more importantly, non-interaction sets.
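To make the idea of an RNA-RNA interaction between two segments concrete, the sketch below finds the longest stretch of perfect Watson-Crick complementarity between two RNAs in antiparallel orientation. This is a deliberately naive stand-in: real interaction prediction relies on thermodynamic energy models and also allows wobble pairs, bulges, and mismatches. The function names and the toy sequences are hypothetical.

```python
from typing import Tuple

COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna: str) -> str:
    """Reverse complement of an RNA sequence (A, C, G, U only)."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def longest_complementary_stretch(rna_a: str, rna_b: str) -> Tuple[int, int, int]:
    """Return (length, start_a, start_b) of the longest perfect duplex.

    The duplex pairs rna_a[start_a:start_a + length] antiparallel with
    rna_b[start_b:start_b + length]. Starts are 0-based; (0, -1, -1) means
    that no complementary base was found. Ties keep the first hit.
    """
    target = reverse_complement(rna_b)
    best = (0, -1, -1)
    previous = [0] * (len(target) + 1)
    # Longest-common-substring dynamic programming between rna_a and target
    for i in range(1, len(rna_a) + 1):
        current = [0] * (len(target) + 1)
        for j in range(1, len(target) + 1):
            if rna_a[i - 1] == target[j - 1]:
                current[j] = previous[j - 1] + 1
                if current[j] > best[0]:
                    # Map the end of the match in `target` back to rna_b
                    best = (current[j], i - current[j], len(rna_b) - j)
        previous = current
    return best


if __name__ == "__main__":
    segment_a = "GGGAUCG"    # toy sequences, not real viral segments
    segment_b = "CCCGAUAAA"
    print(longest_complementary_stretch(segment_a, segment_b))
```

Running the same scan on shuffled sequences gives a rough feeling for how long a complementary stretch can arise by chance, which relates to the non-interaction sets mentioned above.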

People involved: Gabriel Lovate, Daniel Desiro, Kevin Lamkiewicz

Collaborators: Roland Marquet, Andreas Henke

Grants: HORIZON 2020 MSCA ITN — VIROINF: Understanding (harmful) virus-host interactions by linking virology and bioinformatics.

Jakob, Celia; Lovate, Gabriel L.; Desirò, Daniel; Gießler, Lara; Smyth, Redmond Patrick; Marquet, Roland; Lamkiewicz, Kevin; Marz, Manja; Schwemmle, Martin; Bolte, Hardin

Sequential disruption of SPLASH-identified vRNA–vRNA interactions challenges their role in influenza A virus genome packaging Journal Article

In: Nucleic Acids Research, 2023, ISSN: 0305-1048.

Desiro, Daniel

The complexity of packaging mechanisms in segmented RNA viruses PhD Thesis

2022.

Desiro, Daniel; Hölzer, Martin; Ibrahim, Bashar; Marz, Manja

SilentMutations (SIM): a tool for analyzing long-range RNA-RNA interactions in viral genomes and structured RNAs Journal Article

In: Virus Res, vol. 260, pp. 135-141, 2018.

@article{Desiro:18,

title = {SilentMutations (SIM): a tool for analyzing long-range RNA-RNA interactions in viral genomes and structured RNAs},

author = {Daniel Desiro and Martin Hölzer and Bashar Ibrahim and Manja Marz},

url = {https://github.com/desiro/silentMutations},

doi = {10.1016/j.virusres.2018.11.005},

year  = {2018},

date = {2018-11-12},

urldate = {2018-11-12},

journal = {Virus Res},

volume = {260},

pages = {135-141},

abstract = {A single nucleotide change in the coding region can alter the amino acid sequence of a protein. In consequence, natural or artificial sequence changes in viral RNAs may have various effects not only on protein stability, function and structure but also on viral replication. In recent decades, several tools have been developed to predict the effect of mutations in structured RNAs such as viral genomes or non-coding RNAs. Some tools use multiple point mutations and also take coding regions into account. However, none of these tools was designed to specifically simulate the effect of mutations on viral long-range interactions. Here, we developed SilentMutations (SIM), an easy-to-use tool to analyze the effect of multiple point mutations on the secondary structures of two interacting viral RNAs. The tool can simulate disruptive and compensatory mutants of two interacting single-stranded RNAs. This allows a fast and accurate assessment of key regions potentially involved in functional long-range RNA-RNA interactions and will eventually help virologists and RNA-experts to design appropriate experiments. SIM only requires two interacting single-stranded RNA regions as input. The output is a plain text file containing the most promising mutants and a graphical representation of all interactions. We applied our tool on two experimentally validated influenza A virus and hepatitis C virus interactions and we were able to predict potential double mutants for in vitro validation experiments. The source code and documentation of SIM are freely available at github.com/desiro/silentMutations.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Virus Database, interface, and quality control

The NFDI4Microbiota consortium comprises 10 German partner institutions (including FSU Jena) and aims to build a centralized infrastructure with services for microbiome research. Viruses are a fundamental part of the microbiome, and their investigation requires specialized tools and resources. Here at FSU Jena, we are building a virus genome sequence database encompassing all viruses, which will be usable by virologists, viral ecologists, and others worldwide in accordance with the FAIR principles. We will do this by consulting the global network of virus experts, integrating expert knowledge and existing database structures, and incorporating international metadata standards. We plan to curate virus genome sequences from public repositories, e.g., the European Nucleotide Archive (ENA), the Sequence Read Archive (SRA), and GenBank/NCBI Virus, and to provide an interface to access them. Further, we will offer visualization, analysis, and sharing of user-uploaded virus data, ensuring data protection for embargoed and private datasets.

People involved: Shahram Saghaei, Noriko Cassman, Sandra Triebel

Collaborations: NFDI4Microbiota

High-quality full-genome alignments of viruses

Multiple sequence alignments (MSAs) reveal homologous regions of input sequences and thus serve as a starting point for phylogenetic analyses at the molecular level. High-quality MSAs can be used to assess the conservation of primary sequence and even secondary structure. Alignments of viral sequences are particularly challenging due to the high mutation rate of viruses. Our goal is to generate high-quality full-genome alignments of virus families and clades by first selecting representative genomes of a dataset via clustering. The alignment is then generated by incorporating information at the amino acid, nucleotide, and RNA secondary structure levels, using current homology-based methods and developing new methods to cope with the diversity of viral sequences. High-quality alignments of viruses can provide insights into differences and similarities at the DNA/RNA level within a virus family. At the same time, providing information such as codon corrections (RNA sequence to protein) or compensatory mutations (RNA sequence to RNA secondary structure) as part of the results is a major challenge.
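The first step named above, selecting representative genomes via clustering, can be illustrated with k-mer frequency profiles. The sketch below is an assumption-laden toy version: it uses a simple greedy threshold scheme with Euclidean distance, which is a stand-in and not the clustering method used in the project. The function names, the value of `k`, and the threshold are hypothetical.

```python
from collections import Counter
from itertools import product
from math import sqrt
from typing import Dict, List


def kmer_profile(seq: str, k: int = 3) -> List[float]:
    """Relative frequencies of all 4**k k-mers over ACGT, in sorted order.

    K-mers containing other symbols (for example N) are ignored.
    """
    seq = seq.upper().replace("U", "T")
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    total = sum(counts[kmer] for kmer in kmers)
    return [counts[kmer] / total if total else 0.0 for kmer in kmers]


def euclidean(u: List[float], v: List[float]) -> float:
    """Euclidean distance between two equally long vectors."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def pick_representatives(genomes: Dict[str, str], k: int = 3,
                         threshold: float = 0.1) -> List[str]:
    """Greedy selection: a genome becomes a representative only if its
    profile is farther than `threshold` from every representative so far."""
    representatives = []  # list of (name, profile) tuples
    for name, seq in genomes.items():
        profile = kmer_profile(seq, k)
        if all(euclidean(profile, p) > threshold for _, p in representatives):
            representatives.append((name, profile))
    return [name for name, _ in representatives]


if __name__ == "__main__":
    toy = {  # toy sequences, not real genomes
        "a": "ACGTACGTACGT",
        "b": "ACGTACGTACGA",
        "c": "TTTTTTTTTTTT",
    }
    print(pick_representatives(toy, k=2, threshold=0.2))
```

Because the greedy scheme depends on input order, a real workflow would more likely cluster all profiles first and then pick one genome per cluster; the sketch only shows why near-identical genomes collapse onto a single representative.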

People involved: Sandra Triebel, Tom Eulenfeld, Kevin Lamkiewicz, Noriko Cassman

Related publications:

Muston, Philippa; Triebel, Sandra; Nawrocki, Eric; Ontiveros-Palacios, Nancy; Jandalala, Isaac; Sweeney, Blake; Bateman, Alex; Marz, Manja; Petrov, Anton I.; Madrigal, Pedro

Viral non-coding RNA structure annotation and API-based data retrieval with Rfam and R2DT Journal Article

In: bioRxiv, no. 2026.05.10.724034, 2026.

Triebel, Sandra; Eulenfeld, Tom; Ontiveros-Palacios, Nancy; Sweeney, Blake; Tautz, Norbert; Marz, Manja

First full-genome alignment representative for the genus Pestivirus Journal Article

In: RNA, vol. 32, iss. 3, pp. 311-327, 2026.

@article{nokey_77,

title = {First full-genome alignment representative for the genus \textit{Pestivirus}},

author = {Sandra Triebel and Tom Eulenfeld and Nancy Ontiveros-Palacios and Blake Sweeney and Norbert Tautz and Manja Marz},

doi = {10.1261/rna.080732.125},

year  = {2026},

date = {2026-03-01},

urldate = {2025-05-27},

journal = {RNA},

volume = {32},

issue = {3},

pages = {311-327},

abstract = {Members of the genus Pestivirus (family Flaviviridae) comprise economically important livestock pathogens like classical swine fever virus (CSFV) and bovine viral diarrhea virus (BVDV). Research over recent years revealed 11 recognized and eight proposed species. The single-stranded, positive-sense RNA genome encodes one large polyprotein that is processed by viral and cellular proteases into 12 mature proteins. In addition to its protein-coding function, the RNA genome contains secondary structures critical for various stages of the viral life cycle. Some of these structures, including the internal ribosome entry site (IRES) and a 3′ stem–loop, essential for genome replication, have been studied in individual pestiviruses. Here, we present the first genome-wide multiple sequence alignment comprising all known pestivirus species (accepted and tentative) and a comprehensive analysis of phylogenetically conserved RNA secondary structures across the genus. Well-characterized elements, such as a 5′ stem–loop, the IRES, and the 3′ stem–loop SL I, were conserved in all pestiviruses, whereas additional 3′ untranslated region structures were conserved only in subsets of species. We identified 29 novel conserved RNA secondary structures within the protein-coding region, with thus far unresolved functional importance. A miR-17 binding site, previously described in species A, B, and C, was detected in ten additional species but absent in species K, S, Q, and R. We identified a putative long-distance RNA interaction between the IRES and the 3′ end of the genome. Together, these findings and the comprehensive MSA of all 19 pestivirus species provide a valuable resource for future research and diagnostic applications.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Ontiveros-Palacios, Nancy; Cooke, Emma; Nawrocki, Eric P.; Triebel, Sandra; Marz, Manja; Rivas, Elena; Griffiths-Jones, Sam; Petrov, Anton I.; Bateman, Alex; Sweeney, Blake

Rfam 15: RNA families database in 2025 Journal Article

In: Nucleic Acids Research, 2025.


Triebel, Sandra; Lamkiewicz, Kevin; Ontiveros, Nancy; Sweeney, Blake; Stadler, Peter F.; Petrov, Anton I.; Niepmann, Michael; Marz, Manja

Comprehensive survey of conserved RNA secondary structures in full-genome alignment of Hepatitis C virus Journal Article

In: Scientific Reports, vol. 14, iss. 1, 2024.


@article{nokey_62,

title = {Comprehensive survey of conserved RNA secondary structures in full-genome alignment of Hepatitis C virus},

author = {Sandra Triebel and Kevin Lamkiewicz and Nancy Ontiveros and Blake Sweeney and Peter F. Stadler and Anton I. Petrov and Michael Niepmann and Manja Marz},

doi = {10.1038/s41598-024-62897-0},

year  = {2024},

date = {2024-07-02},

urldate = {2024-07-02},

journal = {Scientific Reports},

volume = {14},

issue = {1},

abstract = {Hepatitis C virus (HCV) is a plus-stranded RNA virus that often chronically infects liver hepatocytes and causes liver cirrhosis and cancer. These viruses replicate their genomes employing error-prone replicases. Thereby, they routinely generate a large ‘cloud’ of RNA genomes (quasispecies) which—by trial and error—comprehensively explore the sequence space available for functional RNA genomes that maintain the ability for efficient replication and immune escape. In this context, it is important to identify which RNA secondary structures in the sequence space of the HCV genome are conserved, likely due to functional requirements. Here, we provide the first genome-wide multiple sequence alignment (MSA) with the prediction of RNA secondary structures throughout all representative full-length HCV genomes. We selected 57 representative genomes by clustering all complete HCV genomes from the BV-BRC database based on k-mer distributions and dimension reduction and adding RefSeq sequences. We include annotations of previously recognized features for easy comparison to other studies. Our results indicate that mainly the core coding region, the C-terminal NS5A region, and the NS5B region contain secondary structure elements that are conserved beyond coding sequence requirements, indicating functionality on the RNA level. In contrast, the genome regions in between contain less highly conserved structures. The results provide a complete description of all conserved RNA secondary structures and make clear that functionally important RNA secondary structures are present in certain HCV genome regions but are largely absent from other regions. Full-genome alignments of all branches of Hepacivirus C are provided in the supplement.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

RNA-Protein interactions

RNA-protein interactions (RPIs) are ubiquitous throughout the kingdoms of life and play essential roles between and within organisms. The goal of this project is to develop computational models and algorithms to better understand the mechanisms of these interactions.

Inspired by compensatory mutations in RNAs, we calculate co-alignments between homologous RNAs and proteins of a potential RPI complex. By combining these co-alignments with structural information on both the RNA and the protein, we aim to correlate mutations with potential interaction sites. From there, we will deduce rules and restrictions to predict other RPI complexes. Viral RPIs are of particular interest because they are generally less well investigated.
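One way to make the co-alignment idea concrete is to score how strongly an RNA alignment column and a protein alignment column change together across species, for example with mutual information. The following is only a minimal sketch under stated assumptions: the helper name `column_mutual_information` and the toy columns are hypothetical and not taken from the project, and both columns are assumed to list the same species in the same order.

```python
from collections import Counter
from math import log2


def column_mutual_information(rna_column, protein_column):
    """Mutual information (in bits) between two alignment columns.

    Hypothetical helper for illustration only. Each column holds one
    character per species, with species in the same order in both columns.
    """
    if len(rna_column) != len(protein_column):
        raise ValueError("columns must cover the same species")
    n = len(rna_column)
    rna_counts = Counter(rna_column)
    protein_counts = Counter(protein_column)
    pair_counts = Counter(zip(rna_column, protein_column))
    mi = 0.0
    for (rna_char, protein_char), count in pair_counts.items():
        p_joint = count / n
        p_rna = rna_counts[rna_char] / n
        p_protein = protein_counts[protein_char] / n
        mi += p_joint * log2(p_joint / (p_rna * p_protein))
    return mi


# Toy data (assumed): four species, one RNA column, two protein columns.
rna_col = "GGAA"
covarying_protein_col = "KKEE"  # changes together with the RNA column
conserved_protein_col = "LLLL"  # never changes, so it carries no signal

covarying_score = column_mutual_information(rna_col, covarying_protein_col)
conserved_score = column_mutual_information(rna_col, conserved_protein_col)
```

In this toy case the covarying column pair scores higher than the conserved one. A real analysis would additionally have to correct for phylogenetic relatedness and small sample sizes, which this sketch ignores.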

So far, we have reviewed and evaluated available RPI prediction tools on selected RPI examples. Since the landscape of computational methods still needs improvement, we hope to contribute to the development of an RPI prediction tool that enables the reliable detection of promising novel RPIs in various organisms.

People involved: Sarah Krautwurst, Kevin Lamkiewicz

Related publications:

Krautwurst, Sarah; Lamkiewicz, Kevin

RNA-protein interaction prediction without high-throughput data: An overview and benchmark of in silico tools Journal Article

In: Computational and Structural Biotechnology Journal, vol. 23, pp. 4036-4046, 2024, ISSN: 2001-0370.


@article{nokey_66,

title = {RNA-protein interaction prediction without high-throughput data: An overview and benchmark of \textit{in silico} tools},

author = {Sarah Krautwurst and Kevin Lamkiewicz},

doi = {10.1016/j.csbj.2024.11.015},

issn = {2001-0370},

year  = {2024},

date = {2024-11-08},

journal = {Computational and Structural Biotechnology Journal},

volume = {23},

pages = {4036-4046},

abstract = {RNA-protein interactions (RPIs) are crucial for accurately operating various processes in and between organisms across kingdoms of life. Mutual detection of RPI partner molecules depends on distinct sequential, structural, or thermodynamic features, which can be determined via experimental and bioinformatic methods. Still, the underlying molecular mechanisms of many RPIs are poorly understood. It is further hypothesized that many RPIs are not even described yet. Computational RPI prediction is continuously challenged by the lack of data and detailed research of very specific examples. With the discovery of novel RPI complexes in all kingdoms of life, adaptations of existing RPI prediction methods are necessary. Continuously improving computational RPI prediction is key in advancing the understanding of RPIs in detail and supplementing experimental RPI determination. The growing amount of data covering more species and detailed mechanisms support the accuracy of prediction tools, which in turn support specific experimental research on RPIs. Here, we give an overview of RPI prediction tools that do not use high-throughput data as the user's input. We review the tools according to their input, usability, and output. We then apply the tools to known RPI examples across different kingdoms of life. Our comparison shows that the investigated prediction tools do not favor a certain species and equip the user with results varying in degree of information, from an overall RPI score to detailed interacting residues. Furthermore, we provide a guide tree to assist users which RPI prediction tool is appropriate for their available input data and desired output.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Improvement of ONT basecaller

People involved: Christian Höner zu Siederdissen, Jannes Spangenberg, Sebastian Krautwurst

Detecting RNA modifications with nanopore sequencing

RNA modifications such as the highly abundant N6-methyladenosine (m6A) are an important aspect of RNA biology. For example, m6A has been shown to be involved in the regulation of mRNA processing as well as in RNA virus replication and translation. Second-generation sequencing methods for m6A detection are limited to position-only inference on known reference sequences. Nanopore direct RNA sequencing enables the assessment of the modification status of individual reads at single-nucleotide resolution, but current detection models are still limited to position-only inference. We aim to use deep neural networks for de novo modification detection on nanopore data that achieves high accuracy at single-read, single-nucleotide resolution.

People involved: Christian Höner zu Siederdissen, Sebastian Krautwurst, Jannes Spangenberg

Related publications:

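Meyer, Daria; Göttsch, Winfried; Spangenberg, Jannes; Stieber, Bettina; Krautwurst, Sebastian; Hölzer, Martin; Brandt, Christian; Linde, Jörg; zu Siederdissen, Christian Höner; Srivastava, Akash; Zarkovic, Milena; Wollny, Damian; Marz, Manja

Unlocking the full potential of nanopore sequencing: tips, tricks, and advanced data analysis techniques Journal Article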

In: Nucleic Acids Research, vol. 54, iss. 3, no. gkag023, 2026.


Meyer, Daria; Barth, Emanuel; Wiehle, Laura; Marz, Manja

diffMONT: predicting methylation-specific PCR biomarkers based on nanopore sequencing data for clinical application Journal Article

In: Bioinformatics, vol. 42, iss. 2, no. btag039, 2026.


@article{nokey_102,

title = {diffMONT: predicting methylation-specific PCR biomarkers based on nanopore sequencing data for clinical application},

author = {Daria Meyer and Emanuel Barth and Laura Wiehle and Manja Marz},

doi = {10.1093/bioinformatics/btag039},

year  = {2026},

date = {2026-02-01},

urldate = {2026-02-01},

journal = {Bioinformatics},

volume = {42},

number = {btag039},

issue = {2},

abstract = {Motivation

DNA methylation serves as a key biomarker in clinical diagnostics, especially in cancer detection. With methylation-specific PCR (MSP), a widely used approach, patient samples can be screened fast and efficiently for differential methylation. During MSP, methylated regions are selectively amplified with specific primers. With nanopore sequencing, knowledge about DNA methylation is generated during direct DNA sequencing without needing pretreatment of the DNA. Multiple methods, mainly developed for whole-genome bisulfite sequencing (WGBS) data, exist to predict differentially methylated regions (DMRs) in the genome. However, the predicted DMRs are often very large and not sufficiently discriminating to generate meaningful results in MSP, creating a gap between theoretical cancer marker research and practical application, as no tool currently provides methylation difference predictions tailored for PCR-based diagnostics.



Results

Here, we present diffMONT, a tool that predicts differentially methylated regions specifically suited for MSP primer design, enabling rapid translation into practical applications. diffMONT takes into account (i) the specific length of primer and amplicon regions, (ii) the fact that one condition should be unmethylated, and (iii) a minimal required amount of differentially methylated cytosines within the primer regions. We compared the results of diffMONT to metilene and DSS based on a publicly available nanopore sequencing dataset and show that the regions predicted by diffMONT are more specific toward hypermethylated regions. diffMONT accelerates the design of methylation-specific diagnostic assays, bridging the gap between theoretical research and clinical application.



Availability and implementation

The source code for diffMONT, an open-source Python-based tool, is available at https://github.com/rnajena/diffMONT/, with an archived release under https://zenodo.org/records/17641031.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Spangenberg, Jannes; zu Siederdissen, Christian Höner; Goettsch, Winfried; Köhler, Lennart; Luke, Liz Maria; Papenfort, Kai; Marz, Manja

Dynamont: A comprehensive cross-species comparison of ONT segmentation tools Journal Article

In: GigaScience, vol. 15, no. giag005, 2026.


@article{nokey_103,

title = {Dynamont: A comprehensive cross-species comparison of ONT segmentation tools},

author = {Jannes Spangenberg and Christian Höner zu Siederdissen and Winfried Goettsch and Lennart Köhler and Liz Maria Luke and Kai Papenfort and Manja Marz},

doi = {10.1093/gigascience/giag005},

year  = {2026},

date = {2026-01-19},

journal = {GigaScience},

volume = {15},

number = {giag005},

abstract = {Background

Oxford Nanopore Technologies (ONT) sequencing enables direct, long-read sequencing of DNA and RNA, preserving nucleotide modifications. During basecalling, deep neural networks translate raw nanopore signals into nucleotide sequences, internally segmenting the signal to align it with the corresponding bases. This is a challenging task due to uneven motor protein rotation, signal variability, low-quality reads, and the presence of nucleotide modifications. However, the signal to nucleotide assignment is critical for novel downstream signal analysis. Existing tools, such as Tombo Resquiggle, f5c Eventalign, f5c Resquiggle, and Uncalled4, operate after basecalling and rely on event-based segmentation and mapping approaches that often fail to align low-quality or modified reads and lack confidence estimates for segmentation accuracy.



Results

Here, we present a large-scale comparative study in which 5 segmentation tools, including our novel tool Dynamont, are applied to 16 ONT-sequenced datasets spanning different kingdoms of life. Overall, we segmented 160,000 reads and evaluated the tools’ performance on a combination of 12 signal and downstream assembly metrics. Our study is accompanied by a comprehensive and extensible supplement that summarizes all datasets, execution instructions, and evaluation results. We score the segmentation results using an aggregated metric score, created from all our analyzed metrics.



Conclusions

No tool delivered the best results for all datasets. We recommend a careful choice and normalization of evaluation metrics to select the best segmentation tool as a critical step in the process of ONT signal segmentation. Across nearly all RNA datasets, Dynamont outperforms other segmentation tools in terms of aggregated metric scores. For DNA datasets, however, the performance is more variable, with mixed results observed across tools.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Meyer, Daria; Hennig, Anne; Hums, Anna-Bawany; Guntinas-Lichius, Orlando; Schmitz, Martina; Marz, Manja

Nanopore sequencing-derived methylation biomarker prediction for methylation-specific PCR in patients with head and neck squamous cell carcinoma Journal Article

In: Clinical Epigenetics, vol. 17, 2025.


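Spangenberg, Jannes; Mündnich, Stefan; Busch, Anne; Pastore, Stefan; Wierczeiko, Anna; Goettsch, Winfried; Dietrich, Vincent; Pryszcz, Leszek P.; Cruciani, Sonia; Novoa, Eva Maria; Joshi, Kandarp; Perera, Ranjan; Di Giorgio, Salvatore; Arrubarrena, Paola; Tellioglu, Irem; Poon, Chi-Lam; Wan, Yuk Kei; Göke, Jonathan; Hildebrandt, Andreas; Dieterich, Christoph; Helm, Mark; Marz, Manja; Gerber, Susanne; Alagna, Nicolo

The RMaP challenge of predicting RNA modifications by nanopore sequencing Journal Article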

In: Communications Chemistry, vol. 8, iss. 1, 2024.


@article{nokey_79,

title = {The RMaP challenge of predicting RNA modifications by nanopore sequencing},

author = {Jannes Spangenberg and Stefan Mündnich and Anne Busch and Stefan Pastore and Anna Wierczeiko and Winfried Goettsch and Vincent Dietrich and Leszek P. Pryszcz and Sonia Cruciani and Eva Maria Novoa and Kandarp Joshi and Ranjan Perera and Salvatore Di Giorgio and Paola Arrubarrena and Irem Tellioglu and Chi-Lam Poon and Yuk Kei Wan and Jonathan Göke and Andreas Hildebrandt and Christoph Dieterich and Mark Helm and Manja Marz and Susanne Gerber and Nicolo Alagna},

doi = {10.1038/s42004-025-01507-0},

year  = {2024},

date = {2024-12-04},

urldate = {2024-12-04},

journal = {Communications Chemistry},

volume = {8},

issue = {1},

abstract = {The field of epitranscriptomics is undergoing a technology-driven revolution. During past decades, RNA modifications like N6-methyladenosine (m6A), pseudouridine (ψ), and 5-methylcytosine (m5C) became acknowledged for playing critical roles in cellular processes. Direct RNA sequencing by Oxford Nanopore Technologies (ONT) enabled the detection of modifications in native RNA, by detecting noncanonical RNA nucleosides properties in raw data. Consequently, the field’s cutting edge has a heavy component in computer science, opening new avenues of cooperation across the community, as exchanging data is as impactful as exchanging samples. Therefore, we seize the occasion to bring scientists together within the RNA Modification and Processing (RMaP) challenge to advance solutions for RNA modification detection and discuss ideas, problems and approaches. We show several computational methods to detect the most researched mRNA modifications (m6A, ψ, and m5C). Results demonstrate that a low prediction error and a high prediction accuracy can be achieved on these modifications across different approaches and algorithms. The RMaP challenge marks a substantial step towards improving algorithms’ comparability, reliability, and consistency in RNA modification prediction. It points out the deficits in this young field that need to be addressed in further challenges.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

zu Siederdissen, Christian Höner; Spangenberg, Jannes; Bisdorf, Kevin; Krautwurst, Sebastian; Srivastava, Akash; Marz, Manja; Taubert, Martin

Nanopore sequencing enables novel detection of deuterium incorporation in DNA Journal Article

In: Computational and Structural Biotechnology Journal, vol. 23, 2024.


@article{nokey_74,

title = {Nanopore sequencing enables novel detection of deuterium incorporation in DNA},

author = {Christian {Höner zu Siederdissen} and Jannes Spangenberg and Kevin Bisdorf and Sebastian Krautwurst and Akash Srivastava and Manja Marz and Martin Taubert},

doi = {10.1016/j.csbj.2024.09.027},

year  = {2024},

date = {2024-10-03},

urldate = {2024-10-03},

journal = {Computational and Structural Biotechnology Journal},

volume = {23},

abstract = {Identifying active microbes is crucial to understand their role in ecosystem functions. Metabolic labeling with heavy, non-radioactive isotopes, i.e., stable isotope probing (SIP), can track active microbes by detecting heavy isotope incorporation in biomolecules such as DNA. However, the detection of heavy isotope-labeled nucleotides directly during sequencing has, to date, not been achieved. In this study, Oxford nanopore sequencing was utilized to detect heavy isotopes incorporation in DNA molecules. Two isotopes widely used in SIP experiments were employed to label a bacterial isolate: deuterium (D, as D2O) and carbon-13 (13C, as glucose). We hypothesize that labeled DNA is distinguishable from unlabeled DNA by changes in the nanopore signal. To verify this distinction, we employed a Bayesian classifier trained on signal distributions of short oligonucleotides (k-mers) from labeled and unlabeled sequencing reads. Our results show a clear distinction between D-labeled and unlabeled reads, based on changes in median and median absolute deviation (MAD) of the nanopore signals for different k-mers. In contrast, 13C-labeled DNA cannot be distinguished from unlabeled DNA. For D, the model employed correctly predicted more than 85% of the reads. Even when metabolic labeling was conducted with only 30% D2O, 80% of the obtained reads were correctly classified with a 5% false discovery rate. Our work demonstrates the feasibility of direct detection of deuterium incorporation in DNA molecules during Oxford nanopore sequencing. This finding represents a first step in establishing the combined use of nanopore sequencing and SIP for tracking active organisms in microbial ecology.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Magnipore: Prediction of differential single nucleotide changes in the Oxford Nanopore Technologies sequencing signal of SARS-CoV-2 samples Journal Article

In: bioRxiv, 2023.


Krautwurst, Sebastian; Dijkman, Ronald; Thiel, Volker; Krumbholz, Andi; Marz, Manja

Direct RNA Sequencing for Complete Viral Genomes Book Section

In: Frishman, Dmitrij; Marz, Manja (Ed.): Virus Bioinformatics, CRC Press, 2021.


@incollection{Krautwurst:21,

title = {Direct RNA Sequencing for Complete Viral Genomes},

author = {Sebastian Krautwurst and Ronald Dijkman and Volker Thiel and Andi Krumbholz and Manja Marz},

editor = {Dmitrij Frishman and Manja Marz},

url = {https://www.taylorfrancis.com/chapters/edit/10.1201/9781003097679-3/direct-rna-sequencing-complete-viral-genomes-sebastian-krautwurst-ronald-dijkman-volker-thiel-andi-krumbholz-manja-marz},

year  = {2021},

date = {2021-01-01},

urldate = {2021-01-01},

booktitle = {Virus Bioinformatics},

publisher = {CRC Press},

abstract = {Determination of nucleotide sequences present in biological samples (termed “sequencing”) has become a key method in almost all fields of bioscience, including virology. Since the advent of high-throughput sequencing (“second-generation sequencing”), it is possible to sequence millions of DNA fragments (“reads”) in parallel at very high accuracy, enabling the inference of single nucleotide polymorphisms (SNPs) between virus strains.



In this chapter, we provide details on how the long-read sequencing technologies (“third-generation sequencing”) which were developed in recent years have expanded the toolkit for researchers beyond the possibilities of short-read sequencing, with a focus on virus sequencing. With increased read lengths, it is possible to sequence full viral transcripts and genomes in single contiguous reads, enabling detailed studies of transcript isoforms, haplotypes, and viral quasispecies. In comparison, long-read technologies have generally higher raw read error rates, but an accurate assembly of transcripts and genomes is facilitated or made unnecessary due to the long contiguous sequences. One of the technologies, namely nanopore sequencing, also uniquely allows for direct RNA sequencing without the need for the creation or amplification of complementary DNA. This enables accurate capture of RNA content in a sample “as is,” e.g., in cells infected by RNA viruses. The protocol also leaves RNA modifications intact, which can be inferred during sequencing. Nanopore sequencing can be implemented at low costs and with constant genome coverage using cDNA amplicon sequencing methods, e.g., for highly parallel screening during virus outbreaks.},

keywords = {},

pubstate = {published},

tppubtype = {incollection}

}

Viehweger, Adrian; Krautwurst, Sebastian; Lamkiewicz, Kevin; Madhugiri, Ramakanth; Ziebuhr, John; Hölzer, Martin; Marz, Manja

Direct RNA nanopore sequencing of full-length coronavirus genomes provides novel insights into structural variants and enables modification analysis. Journal Article

In: Genome Res, vol. 29, pp. 1545-1554, 2019.


@article{Viehweger:19a,

title = {Direct RNA nanopore sequencing of full-length coronavirus genomes provides novel insights into structural variants and enables modification analysis.},

author = {Adrian Viehweger and Sebastian Krautwurst and Kevin Lamkiewicz and Ramakanth Madhugiri and John Ziebuhr and Martin Hölzer and Manja Marz},

doi = {10.1101/gr.247064.118},

year  = {2019},

date = {2019-08-22},

urldate = {2019-08-22},

journal = {Genome Res},

volume = {29},

pages = {1545-1554},

publisher = {Cold Spring Harbor Laboratory},

abstract = {Sequence analyses of RNA virus genomes remain challenging owing to the exceptional genetic plasticity of these viruses. Because of high mutation and recombination rates, genome replication by viral RNA-dependent RNA polymerases leads to populations of closely related viruses, so-called “quasispecies.” Standard (short-read) sequencing technologies are ill-suited to reconstruct large numbers of full-length haplotypes of (1) RNA virus genomes and (2) subgenome-length (sg) RNAs composed of noncontiguous genome regions. Here, we used a full-length, direct RNA sequencing (DRS) approach based on nanopores to characterize viral RNAs produced in cells infected with a human coronavirus. By using DRS, we were able to map the longest (∼26-kb) contiguous read to the viral reference genome. By combining Illumina and Oxford Nanopore sequencing, we reconstructed a highly accurate consensus sequence of the human coronavirus (HCoV)-229E genome (27.3 kb). Furthermore, by using long reads that did not require an assembly step, we were able to identify, in infected cells, diverse and novel HCoV-229E sg RNAs that remain to be characterized. Also, the DRS approach, which circumvents reverse transcription and amplification of RNA, allowed us to detect methylation sites in viral RNAs. Our work paves the way for haplotype-based analyses of viral quasispecies by showing the feasibility of intra-sample haplotype separation. Even though several technical challenges remain to be addressed to exploit the potential of the nanopore technology fully, our work illustrates that DRS may significantly advance genomic studies of complex virus populations, including predictions on long-range interactions in individual full-length viral RNA haplotypes.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Lechner, Marcus; Marz, Manja; Ihling, Christian; Sinz, Andrea; Stadler, Peter F; Krauss, Veiko

The correlation of genome size and DNA methylation rate in metazoans Journal Article

In: Theory Biosci, vol. 132, pp. 47–60, 2012.


@article{Lechner:13,

title = {The correlation of genome size and DNA methylation rate in metazoans},

author = {Marcus Lechner and Manja Marz and Christian Ihling and Andrea Sinz and Peter F Stadler and Veiko Krauss},

doi = {10.1007/s12064-012-0167-y},

year  = {2012},

date = {2012-11-07},

urldate = {2012-11-07},

journal = {Theory Biosci},

volume = {132},

pages = {47--60},

abstract = {Total DNA methylation rates are well known to vary widely between different metazoans. The phylogenetic distribution of this variation, however, has not been investigated systematically. We combine here publicly available data on methylcytosine content with the analysis of nucleotide compositions of genomes and transcriptomes of 78 metazoan species to trace the evolution of abundance and distribution of DNA methylation. The depletion of CpG and the associated enrichment of TpG and CpA dinucleotides are used to infer the intensity and localization of germline CpG methylation and to estimate its evolutionary dynamics. We observe a positive correlation of the relative methylation of CpG motifs with genome size. We tested this trend successfully by measuring total DNA methylation with LC/MS in orthopteran insects with very different genome sizes: house crickets, migratory locusts and meadow grasshoppers. We hypothesize that the observed correlation between methylation rate and genome size is due to a dependence of both variables from long-term effective population size and is driven by the accumulation of repetitive sequences that are typically methylated during periods of small population sizes. This process may result in generally methylated, large genomes such as those of jawed vertebrates. In this case, the emergence of a novel demethylation pathway and of novel reader proteins for methylcytosine may have enabled the usage of cytosine methylation for promoter-based gene regulation. On the other hand, persistently large populations may lead to a compression of the genome and to the loss of the DNA methylation machinery, as observed, e.g., in nematodes.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Identifying DNA methylation biomarkers using nanopore sequencing

Carcinogenesis is associated with changes in DNA methylation. The methylation of cytosine to 5-methylcytosine (5mC) plays a particular role here, especially in regions with numerous 5′-cytosine-phosphate-guanine-3′ (CpG) occurrences, so-called CpG islands. These DNA methylation changes already occur at an early stage of cancer. Additionally, once the regions of differential methylation are known, it is relatively simple to develop molecular biological tests for them. DNA methylation is therefore a compelling basis for cancer screening. Nanopore sequencing makes it possible to identify DNA base modifications (e.g., 5mC) at nucleotide resolution. We aim to develop a workflow to identify DNA methylation cancer biomarkers for cancer subtypes that are not yet well studied.
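As a rough illustration of what a differentially methylated position looks like in per-read methylation calls, the sketch below compares per-CpG methylation fractions between a tumor and a control sample and keeps positions that are strongly methylated in tumor but nearly unmethylated in control. Everything in it is an assumption made for illustration (function names, thresholds, and the toy read counts); it is not the workflow developed in this project.

```python
def methylation_fraction(methylated_reads, total_reads):
    """Fraction of reads calling a CpG as methylated (0.0 if uncovered)."""
    return methylated_reads / total_reads if total_reads else 0.0


def candidate_cpgs(tumor, control, min_difference=0.5, max_control=0.1):
    """Return sorted positions methylated in tumor but not in control.

    tumor and control map a CpG position to (methylated_reads, total_reads).
    Both thresholds are illustrative assumptions, not recommended values.
    """
    hits = []
    # Only positions covered in both samples can be compared.
    for position in sorted(tumor.keys() & control.keys()):
        tumor_frac = methylation_fraction(*tumor[position])
        control_frac = methylation_fraction(*control[position])
        if (control_frac <= max_control
                and tumor_frac - control_frac >= min_difference):
            hits.append(position)
    return hits


# Toy read counts (assumed): position -> (methylated reads, total reads).
tumor_counts = {100: (18, 20), 150: (10, 20), 200: (2, 20)}
control_counts = {100: (1, 20), 150: (9, 20), 200: (1, 20)}
hits = candidate_cpgs(tumor_counts, control_counts)
```

Requiring the control to be nearly unmethylated, rather than only asking for a large difference, reflects the screening setting in which a test should stay negative for healthy tissue.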

People involved: Daria Meyer, Milena Žarković

Collaborations: oncgnostics GmbH

Cell-free RNA sequencing

Cell-free RNA is present in the blood of every human being as a result of vesicular secretion from the cells of the human body. Sequencing cell-free RNA has shown great promise for the diagnosis of diseases ranging from cancer to cardiovascular disease. In contrast to protein-based biomarkers, RNA has the major advantage that it can be amplified. In recent years, technological developments have enabled the amplification and sequencing of tiny amounts of RNA, even from single cells. Another advantage of RNA over DNA (which can also be amplified) is that it is continuously shed from cells, whereas DNA exits cells only when the cell is dying. Hence, cell-free RNA holds the promise of continuous probing of transcriptional changes in the cells of the human body. We aim to apply methods from the field of single-cell RNA sequencing to sequence cell-free RNA in different medically relevant contexts. This approach enables the sequencing of minute amounts of RNA. Our expertise in sequencing very small amounts of RNA from cells, or even from phase-separated condensates, will facilitate the detection of even the smallest amounts of RNA in blood or other body fluids.

People involved: Damian Wollny, Julia Micheel, Aram Safrastyan, Franziska Aron

Antibiotic resistance in the Ganges river valley

Antibiotics increasingly fail to treat a growing number of medical conditions due to antimicrobial resistance. This trend is especially acute in developing countries such as India, where broad resistances are known to have emerged. Densely populated cities are known to drive the emergence and spread of antimicrobial resistance through, for example, industrial production sites, wastewater management practices, and other cultural characteristics. Proximity to waterways or associated water collections seems especially relevant.
To identify controllable drivers of resistance emergence and spread, we investigate two cities on the river Ganges in India: Allahabad (now officially Prayagraj) and Kanpur. We also investigate the effect of human interference by analyzing samples before and after the Kumbh Mela, by far the largest religious gathering in Prayagraj. This will allow us to discern naturally occurring resistance from resistance created by humans.

People involved: Akash Srivastava, Sebastian Krautwurst

Collaborations: Ziauddin Ahammad Shaikh, Adrian Viehweger

Grants: BMBF – DBT Cooperative Science Program: Development of metagenomics assisted surveillance tools for tracking antibiotic resistance in river bodies — A study in the Ganges river valley (NANOLOG)

Epigenetic profiling of aging mouse brain at base resolution

Recent studies have shown that epigenetics, especially 5-methylcytosine (5mC), plays a pivotal role in aging. Along these lines, previous studies have reported diverse epigenetic profiles among different cell types of the same individual, such as neurons and oligodendrocytes. Besides methylation, DNA undergoes various other types of epigenetic modification. It remains to be investigated whether these modifications change with age and can thus serve as alternative, reliable molecular markers of an individual's epigenetic age. It is therefore essential to identify variations in other epigenetic modifications of DNA in specific cell types from the same individual. We plan to study various modifications in a single chain reaction using long-read sequencing on the MinION platform from ONT.

People involved: Akash Srivastava

The role of non-coding RNAs in human placental development

Inside the placenta, the fetal syncytiotrophoblast forms the interface between fetus and mother, from which exosomes and microvesicles are continuously released into the maternal circulation. These particles contain fetal proteins and ncRNAs for communication with neighboring and distant maternal cells. The number, size, and content of these particles may reflect or predict placental disorders. Several severe pregnancy pathologies, including preeclampsia, are human-specific, and their pathomechanisms are not yet understood.
To date, most ncRNAs identified as specific to fetal tissues such as the placenta are microRNAs (miRNAs). Long ncRNAs have only been marginally investigated. We need to expand the knowledge about ncRNAs in the placenta and ncRNAs released from it to revolutionize the understanding of regulatory processes inside the placenta and of fetal-maternal communication.

People involved: Sebastian Krautwurst, Milena Žarković, Franziska Hufsky

Collaborations: Udo Markert, Diana Maria Morales Prieto

Grants: DFG MA 5082/9-1: Embryonale nicht-kodierende RNAs in der menschlichen Plazenta und dem mütterlichen Blutkreislauf

Related publications:

Hölzer, Martin; Reuschel, Charlotte; Vorimore, Fabien; Laroucau, Karine; Sachse, Konrad

Exploring the genomic landscape of Chlamydiifrater species: novel features include multiple truncated major outer membrane proteins, unique genes and chlamydial plasticity zone orthologs Journal Article

In: Access Microbiology, vol. 7, 2025.

@article{nokey_80,

title = {Exploring the genomic landscape of Chlamydiifrater species: novel features include multiple truncated major outer membrane proteins, unique genes and chlamydial plasticity zone orthologs},

author = {Martin Hölzer and Charlotte Reuschel and Fabien Vorimore and Karine Laroucau and Konrad Sachse},

doi = {10.1099/acmi.0.000936.v3},

year  = {2025},

date = {2025-02-03},

urldate = {2025-02-03},

journal = {Access Microbiology},

volume = {7},

abstract = {Recently discovered obligate intracellular bacteria belonging to the genus Chlamydiifrater with the species of Chlamydiifrater phoenicopteri and Chlamydiifrater volucris were studied to explore the composition of their genomes and their relatedness to Chlamydia, the other genus of the family Chlamydiaceae. We investigated 4 isolates of Cf. volucris, 2 of them newly sequenced, and one of Cf. phoenicopteri alongside 12 representatives of the Chlamydia species. Our study uncovers previously unrecognized genomic structures within Chlamydiifrater using a hybrid sequencing approach and advanced annotation pipelines, providing insights into species-specific adaptations and evolutionary dynamics. The integration of long-read sequencing data, comprehensive re-annotation strategies and pan-genomics enabled the localization of the unique plasticity zone and the identification of novel gene clusters in Chlamydiifrater strains, which improves our understanding of chlamydial genome architecture and plasticity in the family Chlamydiaceae. Our analysis revealed that 761 CDS (~80%) are shared among members of both genera. We further identified 158 unique genes of Chlamydiifrater species, but their annotation remains challenging because of the absence of functionally annotated orthologs in public databases. A full-length ompA gene encoding the major outer membrane porin was seen in all Chlamydiifrater strains. We also describe the localization and structure of multiple truncated CDS of ompA family members, representing one of this study’s most interesting findings. While genome analysis of Chlamydiifrater spp. confirmed numerous common features shared with representatives of the genus Chlamydia, many unique genomic elements were identified that underpin the distinct phenotype and separate genetic position of these new microorganisms.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Murrieta-Coxca, José M; Barth, Emanuel; Fuentes-Zacarias, Paulina; Gutiérrez-Samudio, Ruby N; Groten, Tanja; Gellhaus, Alexandra; Köninger, Angela; Marz, Manja; Markert, Udo R; Morales-Prieto, Diana M

Identification of altered miRNAs and their targets in placenta accreta Journal Article

In: Front Endocrinol, vol. 14, pp. 1021640, 2023.

@article{nokey,

title = {Identification of altered miRNAs and their targets in placenta accreta},

author = {José M Murrieta-Coxca and Emanuel Barth and Paulina Fuentes-Zacarias and Ruby N Gutiérrez-Samudio and Tanja Groten and Alexandra Gellhaus and Angela Köninger and Manja Marz and Udo R Markert and Diana M Morales-Prieto},

doi = {10.3389/fendo.2023.1021640},

year  = {2023},

date = {2023-03-03},

journal = {Front Endocrinol},

volume = {14},

pages = {1021640},

abstract = {Placenta accreta spectrum (PAS) is one of the major causes of maternal morbidity and mortality worldwide with increasing incidence. PAS refers to a group of pathological conditions ranging from the abnormal attachment of the placenta to the uterus wall to its perforation and, in extreme cases, invasion into surrounding organs. Among them, placenta accreta is characterized by a direct adhesion of the villi to the myometrium without invasion and remains the most common diagnosis of PAS. Here, we identify the potential regulatory miRNA and target networks contributing to placenta accreta development. Using small RNA-Seq followed by RT-PCR confirmation, altered miRNA expression, including that of members of placenta-specific miRNA clusters (e.g., C19MC and C14MC), was identified in placenta accreta samples compared to normal placental tissues. In situ hybridization (ISH) revealed expression of altered miRNAs mostly in trophoblast but also in endothelial cells and this profile was similar among all evaluated degrees of PAS. Kyoto encyclopedia of genes and genomes (KEGG) analyses showed enriched pathways dysregulated in PAS associated with cell cycle regulation, inflammation, and invasion. mRNAs of genes associated with cell cycle and inflammation were downregulated in PAS. At the protein level, NF-κB was upregulated while PTEN was downregulated in placenta accreta tissue. The identified miRNAs and their targets are associated with signaling pathways relevant to controlling trophoblast function. Therefore, this study provides miRNA:mRNA associations that could be useful for understanding PAS onset and progression.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Žarković, Milena; Hufsky, Franziska; Markert, Udo R; Marz, Manja

The Role of Non-Coding RNAs in the Human Placenta Journal Article

In: Cells, vol. 11, iss. 9, pp. 1588, 2022.

Morales-Prieto, Diana M.; Barth, Emanuel; Murrieta-Coxca, Jose Martín; Favaro, Rodolfo R.; Gutiérrez-Samudio, Ruby N.; Chaiwangyen, Wittaya; Ospina-Prieto, Stephanie; Gruhn, Bernd; Schleußner, Ekkehard; Marz, Manja; Markert, Udo R.

Identification of miRNAs and associated pathways regulated by Leukemia Inhibitory Factor in trophoblastic cell lines. Journal Article

In: Placenta, vol. 88, pp. 20–27, 2019.

@article{Morales-Prieto:19,

title = {Identification of miRNAs and associated pathways regulated by Leukemia Inhibitory Factor in trophoblastic cell lines.},

author = {Diana M. Morales-Prieto and Emanuel Barth and Jose Martín Murrieta-Coxca and Rodolfo R. Favaro and Ruby N. Gutiérrez-Samudio and Wittaya Chaiwangyen and Stephanie Ospina-Prieto and Bernd Gruhn and Ekkehard Schleußner and Manja Marz and Udo R. Markert},

doi = {10.1016/j.placenta.2019.09.005},

year  = {2019},

date = {2019-09-12},

urldate = {2019-09-12},

journal = {Placenta},

volume = {88},

pages = {20--27},

abstract = {Leukemia Inhibitory Factor (LIF) regulates behavior of trophoblast cells and their interaction with immune and endothelial cells. In vitro, trophoblast cell response to LIF may vary depending on the cell model. Reported differences in the miRNA profile of trophoblastic cells may be responsible for these observations. Therefore, miRNA expression was investigated in four trophoblastic cell lines under LIF stimulation followed by in silico analysis of altered miRNAs and their associated pathways. Low density TaqMan miRNA assays were used to quantify levels of 762 mature miRNAs under LIF stimulation in three choriocarcinoma-derived (JEG-3, ACH-3P and AC1-M59) and a trophoblast immortalized (HTR-8/SVneo) cell lines. Expression of selected miRNAs was confirmed in primary trophoblast cells and cell lines by qPCR. Targets and associated pathways of the differentially expressed miRNAs were inferred from the miRTarBase followed by a KEGG Pathway Enrichment Analysis. HTR-8/SVneo and JEG-3 cells were transfected with miR-21-mimics and expression of miR-21 targets was assessed by qPCR. A similar number of miRNAs changed in each tested cell line upon LIF stimulation, however, low coincidence of individual miRNA species was observed and occurred more often among choriocarcinoma-derived cells (complete data set at http://www.ncbi.nlm.nih.gov/geo/ under GEO accession number GSE130489). Altered miRNAs were categorized into pathways involved in human diseases, cellular processes and signal transduction. Six cascades were identified as significantly enriched, including JAK/STAT and TGFB-SMAD. Upregulation of miR-21-3p was validated in all cell lines and primary cells and STAT3 was confirmed as its target. Dissimilar miRNA responses may be involved in differences of LIF effects on trophoblastic cell lines.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Dukhovny, Anna; Lamkiewicz, Kevin; Chen, Qian; Fricke, Markus; Jabrane-Ferrat, Nabila; Marz, Manja; Jung, Jae U.; Sklan, Ella H.

A CRISPR activation screen identifies genes protecting from Zika virus infection Journal Article

In: J Virol, vol. 93, no. 16, 2019.

@article{Dukhovny:19,

title = {A CRISPR activation screen identifies genes protecting from Zika virus infection},

author = {Anna Dukhovny and Kevin Lamkiewicz and Qian Chen and Markus Fricke and Nabila Jabrane-Ferrat and Manja Marz and Jae U. Jung and Ella H. Sklan},

doi = {10.1128/JVI.00211-19},

year  = {2019},

date = {2019-07-30},

urldate = {2019-07-30},

journal = {J Virol},

volume = {93},

number = {16},

publisher = {American Society for Microbiology Journals},

abstract = {Zika virus (ZIKV) is an arthropod borne emerging pathogen causing febrile illness. ZIKV is associated with Guillain-Barré syndrome and other neurological complications. Infection during pregnancy is associated with pregnancy complications and developmental and neurological abnormalities collectively defined as congenital Zika syndrome. There is still no vaccine or specific treatment for ZIKV infection. To identify host factors that can rescue cells from ZIKV infection we used a genome scale CRISPR activation screen. Our highly ranking hits included a short list of interferon stimulated genes (ISGs) previously reported to have antiviral activity. Validation of the screen results highlighted IFNL2 and IFI6 as genes providing high levels of protection from ZIKV. Activation of these genes had an effect on an early stage in viral infection. In addition, infected cells expressing sgRNAs for both of these genes displayed lower levels of cell death compared to controls. Furthermore, the identified genes were significantly induced in ZIKV infected placenta explants. Thus, these results highlight a set of ISGs directly relevant for rescuing cells from ZIKV infection or its associated cell death and substantiates CRISPR activation screens as a tool to identify host factors impeding pathogen infection. IMPORTANCE: Zika virus (ZIKV) is an emerging vector-borne pathogen causing a febrile disease. ZIKV infection might also trigger Guillain-Barré syndrome, neuropathy and myelitis. Vertical transmission of ZIKV can cause fetus demise, still birth or severe congenital abnormalities and neurological complications. There is no vaccine or specific antiviral treatment against ZIKV. We used a genome wide CRISPR activation screen, where genes are activated from their native promoters to identify host cell factors that protect cells from ZIKV infection or associated cell death. The results provide better understanding of key host factors that protect cells from ZIKV infection and might assist in identifying novel antiviral targets.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

Chaiwangyen, Wittaya; Gutiérrez-Samudio, Ruby N.; Markert, Udo R.; Marz, Manja; Morales-Prieto, Diana M.; Ospina-Prieto, Stephanie

KL 5 Trophoblast-immune cell communication via microRNA transported in extracellular vesicles Journal Article

In: Pregnancy Hypertens, vol. 9, pp. 5, 2017.

Statistical modeling of genomic and transcriptomic data

Over the last two decades, one revolutionary advance in biotechnology has followed another, leading the life sciences into the big data era. However, despite the availability of vast amounts of diverse biological data, we still lack adequate statistical models and methods to process and evaluate these data accurately. We aim to develop specialized statistical tools for genomics (e.g., fuzzy k-mers) and transcriptomics (e.g., accurate modeling of read count distributions). We work on different aspects of statistical analysis, from the theoretical problem formulation to the implementation of statistical models and the appropriate visualization of results.

People involved: Emanuel Barth
