Author: Bertil Schmidt

0000000000269851

AUTHOR

Bertil Schmidt

Next-generation sequencing: big data meets high performance computing

The progress of next-generation sequencing has a major impact on medical and genomic research. This high-throughput technology can now produce billions of short DNA or RNA fragments in excess of a few terabytes of data in a single run. This leads to massive datasets used by a wide range of applications including personalized cancer treatment and precision medicine. In addition to the hugely increased throughput, the cost of using high-throughput technologies has been dramatically decreasing. A low sequencing cost of around US$1000 per genome has now rendered large population-scale projects feasible. However, to make effective use of the produced data, the design of big data algorithms and t…

0000000000269851

AUTHOR

Bertil Schmidt

Next-generation sequencing: big data meets high performance computing

CUDA-Accelerated Alignment of Subsequences in Streamed Time Series Data

Bit-Parallel Approximate Pattern Matching on the Xeon Phi Coprocessor

Suffix Array Construction on Multi-GPU Systems

kmcEx: memory-frugal and retrieval-efficient encoding of counted k-mers.

Multiple Protein Sequence Alignment with MSAProbs

RabbitMash: accelerating hash-based genome analysis on modern multi-core architectures

Accelerating short read mapping on an FPGA (abstract only)

CUDA-enabled hierarchical ward clustering of protein structures based on the nearest neighbour chain algorithm

Bit-parallel approximate pattern matching: Kepler GPU versus Xeon Phi

Automatische Detektion der primär sklerosierenden Cholangitis (PSC) anhand von 3D-MRCP Datensätzen mittels Deep Learning

AFS: identification and quantification of species composition by metagenomic sequencing

Fourth Workshop on using Emerging Parallel Architectures

All-Food-Seq (AFS) : a quantifiable screen for species in biological samples by deep DNA sequencing

Reconfigurable Accelerator for the Word-Matching Stage of BLASTN

parSRA: A framework for the parallel execution of short read aligners on compute clusters

Accelerating metagenomic read classification on CUDA-enabled GPUs.

Parallel and scalable short-read alignment on multi-core clusters using UPC++

CUSHAW2-GPU: Empowering Faster Gapped Short-Read Alignment Using GPU Computing

Gossip

GEM

Deep Learning für die automatische Bestimmung von klinisch relevanten Herzparametern mittels Kardio-MRT

Efficient Parallel Sort on AVX-512-Based Multi-Core and Many-Core Architectures

Long read alignment based on maximal exact match seeds

Deep Semantic Segmentation von 4D DCE MRT Untersuchungen der Lunge zum Erheben Klinischer Biomarker bei Chronisch Obstruktiver Lungenerkrankung

MetaCache: context-aware classification of metagenomic reads using minhashing.

Parallel algorithms for large-scale biological sequence alignment on Xeon-Phi based clusters

Iterative sparse matrix-vector multiplication for accelerating the block Wiedemann algorithm over GF(2) on multi-graphics processing unit systems

SparseHC: A Memory-efficient Online Hierarchical Clustering Algorithm

Fast dendrogram-based OTU clustering using sequence embedding

High-speed and accurate color-space short-read alignment with CUSHAW2

HECTOR : a parallel multistage homopolymer spectrum based error corrector for 454 sequencing data

Efficient and Accurate OTU Clustering with GPU-Based Sequence Alignment and Dynamic Dendrogram Cutting.

Massively parallel computation of atmospheric neutrino oscillations on CUDA-enabled accelerators

Deep learning in next-generation sequencing

Graphical Workflow System for Modification Calling by Machine Learning of Reverse Transcription Signatures

Additional file 1: Figure S1. of CLOVE: classification of genomic fusions into structural variation events

GPU-accelerated exhaustive search for third-order epistatic interactions in case–control studies

Millimeter-Scale and Billion-Atom Reactive Force Field Simulation on Sunway Taihulight

Parallel and Space-Efficient Construction of Burrows-Wheeler Transform and Suffix Array for Big Genome Data

WarpCore: A Library for fast Hash Tables on GPUs

CUSHAW3: Sensitive and Accurate Base-Space and Color-Space Short-Read Alignment with Hybrid Seeding

XLCS: A New Bit-Parallel Longest Common Subsequence Algorithm on Xeon Phi Clusters

CUDA-enabled Sparse Matrix–Vector Multiplication on GPUs using atomic operations

Scalable Clustering by Iterative Partitioning and Point Attractor Representation

Identification and quantification of meat product ingredients by whole-genome metagenomics (All-Food-Seq)

Automated detection and classification of synoptic-scale fronts from atmospheric data grids

Combining GPU and FPGA technology for efficient exhaustive interaction analysis in GWAS

CorCast: A Distributed Architecture for Bayesian Epidemic Nowcasting and its Application to District-Level SARS-CoV-2 Infection Numbers in Germany

MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems

Unified Parallel C++

Locality-sensitive hashing enables signal classification in high-throughput mass spectrometry raw data at scale

High-speed exhaustive 3-locus interaction epistasis analysis on FPGAs

DySC: software for greedy clustering of 16S rRNA reads.

SWAPHI-LS: Smith-Waterman Algorithm on Xeon Phi coprocessors for Long DNA Sequences

MetaCache-GPU: Ultra-Fast Metagenomic Classification

Massively Parallel ANS Decoding on GPUs

The Sliced COO Format for Sparse Matrix-Vector Multiplication on CUDA-enabled GPUs

FMapper: Scalable read mapper based on succinct hash index on SunWay TaihuLight

SNVSniffer: an integrated caller for germline and somatic single-nucleotide and indel mutations

WarpDrive: Massively Parallel Hashing on Multi-GPU Nodes

Parallelized Clustering of Protein Structures on CUDA-Enabled GPUs

mD3DOCKxb: An Ultra-Scalable CPU-MIC Coordinated Virtual Screening Framework

RabbitQC: high-speed scalable quality control for sequencing data

Ultra-Fast Detection of Higher-Order Epistatic Interactions on GPUs

Parallelized short read assembly of large genomes using de Bruijn graphs

SNVSniffer: An integrated caller for germline and somatic SNVs based on Bayesian models

AnySeq: A High Performance Sequence Alignment Library based on Partial Evaluation

CRiSPy-CUDA: Computing Species Richness in 16S rRNA Pyrosequencing Datasets with CUDA

Neighbor-list-free molecular dynamics on sunway TaihuLight supercomputer

Musket: a multistage k-mer spectrum-based error corrector for Illumina sequence data

CUDA-BLASTP: Accelerating BLASTP on CUDA-enabled graphics hardware

RNACache: Fast Mapping of RNA-Seq Reads to Transcriptomes Using MinHashing

SWMapper: Scalable Read Mapper on SunWay TaihuLight

FeatherCNN: Fast Inference Computation with TensorGEMM on ARM Architectures

Pairwise DNA Sequence Alignment Optimization

BGSA: a bit-parallel global sequence alignment toolkit for multi-core and many-core architectures