AUTHOR
Stefan Kramer
Multi-label Classification Using Stacked Hierarchical Dirichlet Processes with Reduced Sampling Complexity
Nonparametric topic models based on hierarchical Dirichlet processes (HDPs) allow for the number of topics to be automatically discovered from the data. The computational complexity of standard Gibbs sampling techniques for model training is linear in the number of topics. Recently, it was reduced to be linear in the number of topics per word using a technique called alias sampling combined with Metropolis Hastings (MH) sampling. We propose a different proposal distribution for the MH step based on the observation that distributions on the upper hierarchy level change slower than the document-specific distributions at the lower level. This reduces the sampling complexity, making it linear i…
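The core of the proposed speed-up is a Metropolis-Hastings step whose proposal distribution is cheap to sample from because it changes only slowly and is therefore updated rarely. The following is a minimal illustrative sketch of such an independence MH step, not the paper's sampler: p_true stands for the unnormalized true conditional p(topic | rest), q_proposal for a cached, slowly changing topic distribution; in the actual papers, drawing from the cached proposal is made O(1) with alias tables.

    import numpy as np

    def mh_topic_resample(current_topic, p_true, q_proposal, rng):
        # p_true: unnormalized true conditional p(topic | rest), recomputed per token
        # q_proposal: cached, slowly changing proposal over topics
        q = q_proposal / q_proposal.sum()
        proposal = rng.choice(len(q), p=q)
        # independence MH acceptance ratio; normalization constants cancel
        accept = min(1.0, (p_true[proposal] * q[current_topic]) /
                          (p_true[current_topic] * q[proposal] + 1e-300))
        return proposal if rng.random() < accept else current_topic

    rng = np.random.default_rng(0)
    new_topic = mh_topic_resample(2, np.array([0.1, 0.5, 0.2, 0.2]),
                                  np.array([0.3, 0.3, 0.2, 0.2]), rng)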
Prototype-based learning on concept-drifting data streams
Data stream mining has gained growing attention due to its many emerging applications, such as target marketing, email filtering and network intrusion detection. In this paper, we propose a prototype-based classification model for evolving data streams, called SyncStream, which dynamically models time-changing concepts and makes predictions in a local fashion. Instead of learning a single model on a sliding window or ensemble learning, SyncStream captures evolving concepts by dynamically maintaining a set of prototypes in a new data structure called the P-tree. The prototypes are obtained by error-driven representativeness learning and synchronization-inspired constrained clustering. To ide…
Online Density Estimation of Heterogeneous Data Streams in Higher Dimensions
The joint density of a data stream is suitable for performing data mining tasks without having access to the original data. However, the methods proposed so far only target a small to medium number of variables, since their estimates rely on representing all the interdependencies between the variables of the data. High-dimensional data streams, which are becoming more and more frequent due to increasing numbers of interconnected devices, are, therefore, pushing these methods to their limits. To mitigate these limitations, we present an approach that projects the original data stream into a vector space and uses a set of representatives to provide an estimate. Due to the structure of the est…
Towards identifying drug side effects from social media using active learning and crowd sourcing.
Motivation Social media is a largely untapped source of information on side effects of drugs. Twitter in particular is widely used to report on everyday events and personal ailments. However, labeling this noisy data is a difficult problem because labeled training data is sparse and automatic labeling is error-prone. Crowd sourcing can help in such a scenario to obtain more reliable labels, but is expensive in comparison because workers have to be paid. To remedy this, semi-supervised active learning may reduce the number of labeled data needed and focus the manual labeling process on important information. Results We extracted data from Twitter using the public API. We subsequently use Ama…
Polymeric Nanoparticles: Polymeric Nanoparticles with Neglectable Protein Corona (Small 18/2020)
Secure Sum Outperforms Homomorphic Encryption in (Current) Collaborative Deep Learning
Deep learning (DL) approaches are achieving extraordinary results in a wide range of domains, but often require a massive collection of private data. Hence, methods for training neural networks on the joint data of different data owners, that keep each party's input confidential, are called for. We address a specific setting in federated learning, namely that of deep learning from horizontally distributed data with a limited number of parties, where their vulnerable intermediate results have to be processed in a privacy-preserving manner. This setting can be found in medical and healthcare as well as industrial applications. The predominant scheme for this is based on homomorphic encryption…
Forest of Normalized Trees: Fast and Accurate Density Estimation of Streaming Data
Density estimation of streaming data is a relevant task in numerous domains. In this paper, a novel non-parametric density estimator called FRONT (forest of normalized trees) is introduced. It uses a structure of multiple normalized trees, segments the feature space of the data stream through a periodically updated linear transformation and is able to adapt to ever evolving data streams. FRONT provides accurate density estimation and performs favorably compared to existing online density estimators in terms of the average log score on multiple standard data sets. Its low complexity, linear runtime and constant memory usage make FRONT by design suitable for large data streams. Final…
A label compression method for online multi-label classification
Many modern applications deal with multi-label data, such as functional categorizations of genes, image labeling and text categorization. Classification of such data with a large number of labels and latent dependencies among them is a challenging task, and it becomes even more challenging when the data is received online and in chunks. Many of the current multi-label classification methods require a lot of time and memory, which makes them infeasible for practical real-world applications. In this paper, we propose a fast linear label space dimension reduction method that transforms the labels into a reduced encoded space and trains models on the obtained pseudo labels. Additionally…
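As an illustration of the general label-compression idea (not the paper's specific encoding), the sketch below compresses the label matrix with a truncated SVD, trains one regressor per pseudo label, and decodes predictions back to the original label space; the use of scikit-learn's Ridge and a fixed decoding threshold are assumptions made for the example only.

    import numpy as np
    from sklearn.linear_model import Ridge

    def fit_label_compression(X, Y, k):
        # encode: project the binary label matrix Y onto its top-k right singular vectors
        _, _, Vt = np.linalg.svd(Y, full_matrices=False)
        V_k = Vt[:k].T                      # (n_labels, k) encoding/decoding matrix
        Z = Y @ V_k                         # pseudo labels (n_samples, k)
        models = [Ridge().fit(X, Z[:, j]) for j in range(k)]
        return models, V_k

    def predict_labels(models, V_k, X_new, threshold=0.5):
        Z_hat = np.column_stack([m.predict(X_new) for m in models])
        return (Z_hat @ V_k.T >= threshold).astype(int)   # decode and threshold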
Machine Learning and Knowledge Discovery in Databases. Research Track
Focusing Knowledge-based Graph Argument Mining via Topic Modeling
Decision-making usually takes five steps: identifying the problem, collecting data, extracting evidence, identifying pro and con arguments, and making decisions. Focusing on extracting evidence, this paper presents a hybrid model that combines latent Dirichlet allocation and word embeddings to obtain external knowledge from structured and unstructured data. We study the task of sentence-level argument mining, as arguments mostly require some degree of world knowledge to be identified and understood. Given a topic and a sentence, the goal is to classify whether a sentence represents an argument in regard to the topic. We use a topic model to extract topic- and sentence-specific evidence from…
An inductive learning perspective on automated generation of feature models from given product specifications
For explicit representation of commonality and variability of a product line, a feature model is mostly used. An open question is how a feature model can be inductively learned in an automated way from a limited number of given product specifications in terms of features. We propose to address this problem through machine learning, more precisely inductive generalization from examples. However, no counter-examples are assumed to exist. Basically, a feature model needs to be complete with respect to all the given example specifications. First results indicate the feasibility of this approach, even for generating hierarchies, but many open challenges remain.
Hub-Centered Gene Network Reconstruction Using Automatic Relevance Determination
Network inference deals with the reconstruction of biological networks from experimental data. A variety of different reverse engineering techniques are available; they differ in the underlying assumptions and mathematical models used. One common problem for all approaches stems from the complexity of the task, due to the combinatorial explosion of different network topologies for increasing network size. To handle this problem, constraints are frequently used, for example on the node degree, number of edges, or constraints on regulation functions between network components. We propose to exploit topological considerations in the inference of gene regulatory networks. Such systems are often…
Forecast of Study Success in the STEM Disciplines Based Solely on Academic Records
We present an approach to forecasting study success in selected STEM disciplines (computer science, mathematics, physics, and meteorology), based solely on the academic record of a student so far, without access to demographic or socioeconomic data. The purpose of the analysis is to improve student counseling, which may be essential for finishing a study program in one of the above-mentioned fields. Technically, we show the successful use of propositionalization on relational data from educational data mining, based on standard aggregates and basic LSTM-trained aggregates.
Adapted Transfer of Distance Measures for Quantitative Structure-Activity Relationships and Data-Driven Selection of Source Datasets
Quantitative structure–activity relationships are regression models relating chemical structure to biological activity. Such models make it possible to predict toxicologically relevant endpoints, which constitute the target outcomes of experiments. The task is often tackled by instance-based methods, which are all based on the notion of chemical (dis-)similarity. Our starting point is the observation by Raymond and Willett that the two families of chemical distance measures, fingerprint-based and maximum common subgraph-based measures, provide orthogonal information about chemical similarity. This paper presents a novel method for finding suitable combinations of them, called adapted tran…
Online Sparse Collapsed Hybrid Variational-Gibbs Algorithm for Hierarchical Dirichlet Process Topic Models
Topic models for text analysis are most commonly trained using either Gibbs sampling or variational Bayes. Recently, hybrid variational-Gibbs algorithms have been found to combine the best of both worlds. Variational algorithms are fast to converge and more efficient for inference on new documents. Gibbs sampling enables sparse updates since each token is only associated with one topic instead of a distribution over all topics. Additionally, Gibbs sampling is unbiased. Although Gibbs sampling takes longer to converge, it is guaranteed to arrive at the true posterior after infinitely many iterations. By combining the two methods it is possible to reduce the bias of variational methods while …
HPMA-Based Nanoparticles for Fast, Bioorthogonal iEDDA Ligation
Fast and bioorthogonally reacting nanoparticles are attractive tools for biomedical applications such as tumor pretargeting. In this study, we designed an amphiphilic block copolymer system based on HPMA using different strategies to introduce the highly reactive click units 1,2,4,5-tetrazines (Tz) either at the chain end (Tz-CTA) or statistically into the hydrophobic block. This reactive group undergoes a rapid, bioorthogonal inverse electron-demand Diels-Alder reaction (iEDDA) with trans-cyclooctenes (TCO). Subsequently, this polymer platform was used for the preparation of different Tz-covered nanoparticles, such as micell…
Alternating model trees
Model tree induction is a popular method for tackling regression problems requiring interpretable models. Model trees are decision trees with multiple linear regression models at the leaf nodes. In this paper, we propose a method for growing alternating model trees, a form of option tree for regression problems. The motivation is that alternating decision trees achieve high accuracy in classification problems because they represent an ensemble classifier as a single tree structure. As in alternating decision trees for classification, our alternating model trees for regression contain splitter and prediction nodes, but we use simple linear regression functions as opposed to constant predicto…
Ensembles of Randomized Time Series Shapelets Provide Improved Accuracy while Reducing Computational Costs
Shapelets are discriminative time series subsequences that allow generation of interpretable classification models, which provide faster and generally better classification than the nearest neighbor approach. However, the shapelet discovery process requires the evaluation of all possible subsequences of all time series in the training set, making it extremely computationally intensive. Consequently, shapelet discovery for large time series datasets quickly becomes intractable. A number of improvements have been proposed to reduce the training time. These techniques use approximation or discretization and often lead to reduced classification accuracy compared to the exact method. We are proposin…
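The primitive that all shapelet methods, exact or randomized, rely on is the minimum distance between a candidate subsequence and a time series; randomized ensembles then evaluate only sampled candidates instead of every subsequence. A minimal sketch of that primitive and of drawing one random candidate follows; the variable names and the synthetic data are hypothetical, and z-normalized Euclidean distance is assumed.

    import numpy as np

    def shapelet_distance(series, shapelet):
        # minimum z-normalized Euclidean distance between the shapelet and any
        # subsequence of the series (the core primitive of shapelet classifiers)
        m = len(shapelet)
        s = (shapelet - shapelet.mean()) / (shapelet.std() + 1e-8)
        best = np.inf
        for start in range(len(series) - m + 1):
            w = series[start:start + m]
            w = (w - w.mean()) / (w.std() + 1e-8)
            best = min(best, float(np.sqrt(np.sum((w - s) ** 2))))
        return best

    # randomized shapelet discovery samples candidates instead of enumerating them
    rng = np.random.default_rng(0)
    X_train = [np.sin(np.linspace(0, 6, 150)) + rng.normal(0, 0.1, 150) for _ in range(20)]
    ts = X_train[rng.integers(len(X_train))]
    length = int(rng.integers(10, 40))
    start = int(rng.integers(len(ts) - length))
    candidate = ts[start:start + length]
    print(shapelet_distance(X_train[0], candidate))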
Polymeric Nanoparticles with Neglectable Protein Corona
Small: nano micro 16(18), 1907574 (2020). doi:10.1002/smll.201907574
Structural clustering of millions of molecular graphs
We propose an algorithm for clustering very large molecular graph databases according to scaffolds (i.e., large structural overlaps) that are common between cluster members. Our approach first partitions the original dataset into several smaller datasets using a greedy clustering approach named APreClus based on dynamic seed clustering. APreClus is an online and instance incremental clustering algorithm delaying the final cluster assignment of an instance until one of the so-called pending clusters the instance belongs to has reached significant size and is converted to a fixed cluster. Once a cluster is fixed, APreClus recalculates the cluster centers, which are used as representatives for…
A structural cluster kernel for learning on graphs
In recent years, graph kernels have received considerable interest within the machine learning and data mining community. Here, we introduce a novel approach enabling kernel methods to utilize additional information hidden in the structural neighborhood of the graphs under consideration. Our novel structural cluster kernel (SCK) incorporates similarities induced by a structural clustering algorithm to improve state-of-the-art graph kernels. The approach taken is based on the idea that graph similarity can not only be described by the similarity between the graphs themselves, but also by the similarity they possess with respect to their structural neighborhood. We applied our novel kernel in…
Optimization of curation of the dataset with data on repeated dose toxicity
Introduction: For some areas of risk assessment, the use of alternative methods is supported by current directives and guidance (e.g. REACH, Cosmetics, BPD, PPP). According to OECD principles alternative methods need to be scientifically valid. Methods: Within a project on grouping and development of predictive models supported by a grant of the Federal Ministry of Education and Research, we curated a dataset based on the RepDose and ELINCS databases. The final dataset consists of rat repeated dose toxicity studies for 1022 compounds representing 28 endpoints as organ-effect-combinations. Toxicological and modelling experts jointly performed the curation and selection of endpoints as an iterative proces…
Identification of ELF3 as an early transcriptional regulator of human urothelium
Despite major advances in high-throughput and computational modelling techniques, understanding of the mechanisms regulating tissue specification and differentiation in higher eukaryotes, particularly man, remains limited. Microarray technology has been explored exhaustively in recent years and several standard approaches have been established to analyse the resultant datasets on a genome-wide scale. Gene expression time series offer a valuable opportunity to define temporal hierarchies and gain insight into the regulatory relationships of biological processes. However, unless datasets are exactly synchronous, time points cannot be compared directly. Here we present a data-driven ana…
Online Estimation of Discrete Densities
We address the problem of estimating a discrete joint density online, that is, the algorithm is only provided the current example and its current estimate. The proposed online estimator of discrete densities, EDDO (Estimation of Discrete Densities Online), uses classifier chains to model dependencies among features. Each classifier in the chain estimates the probability of one particular feature. Because a single chain may not provide a reliable estimate, we also consider ensembles of classifier chains and ensembles of weighted classifier chains. For all density estimators, we provide consistency proofs and propose algorithms to perform certain inference tasks. The empirical evaluation of t…
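EDDO itself uses classifier chains (ensembles of per-feature classifiers); the sketch below substitutes smoothed conditional frequency tables for those classifiers to illustrate the underlying chain factorization p(x) = Π_i p(x_i | x_1, ..., x_{i-1}) in an online fashion. All names are hypothetical and the smoothing scheme is an assumption, not the paper's.

    from collections import defaultdict

    class ChainDensityEstimator:
        # online estimate of a discrete joint density via the chain rule,
        # one Laplace-smoothed conditional estimator per feature
        def __init__(self, n_features, alpha=1.0):
            self.n_features = n_features
            self.alpha = alpha
            self.counts = [defaultdict(lambda: defaultdict(int)) for _ in range(n_features)]
            self.values = [set() for _ in range(n_features)]   # observed domain per feature

        def update(self, x):                                   # processes only the current example
            for i in range(self.n_features):
                self.counts[i][tuple(x[:i])][x[i]] += 1
                self.values[i].add(x[i])

        def density(self, x):                                  # current estimate of p(x)
            p = 1.0
            for i in range(self.n_features):
                c = self.counts[i][tuple(x[:i])]
                total = sum(c.values())
                k = max(len(self.values[i]), 1)
                p *= (c.get(x[i], 0) + self.alpha) / (total + self.alpha * k)
            return p

    est = ChainDensityEstimator(n_features=3)
    for example in [(0, 1, 1), (0, 0, 1), (1, 1, 0)]:
        est.update(example)
    print(est.density((0, 1, 1)))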
HPMA-Based Nanocarriers for Effective Immune System Stimulation.
The selective activation of the immune system using nanoparticles as a drug delivery system is a promising field in cancer therapy. Block copolymers from HPMA and laurylmethacrylate-co-hymecromone-methacrylate allow the preparation of multifunctionalized core-crosslinked micelles of variable size. To activate dendritic cells (DCs) as antigen presenting cells, the carbohydrates mannose and trimannose are introduced into the hydrophilic corona as DC targeting units. To activate DCs, a lipophilic adjuvant (L18-MDP) is incorporated into the core of the micelles. To elicit an immune response, a model antigen peptide (SIINFEKL) is attached to the polymeric nanoparticle, in addition, via a click rea…
Scalable Clustering by Iterative Partitioning and Point Attractor Representation
Clustering very large datasets while preserving cluster quality remains a challenging data-mining task to date. In this paper, we propose an effective scalable clustering algorithm for large datasets that builds upon the concept of synchronization. Inherited from the powerful concept of synchronization, the proposed algorithm, CIPA (Clustering by Iterative Partitioning and Point Attractor Representations), is capable of handling very large datasets by iteratively partitioning them into thousands of subsets and clustering each subset separately. Using dynamic clustering by synchronization, each subset is then represented by a set of point attractors and outliers. Finally, CIPA identifies the…
Efficient Redundancy Reduced Subgroup Discovery via Quadratic Programming
Subgroup discovery is a task at the intersection of predictive and descriptive induction, aiming at identifying subgroups that have the most unusual statistical (distributional) characteristics with respect to a property of interest. Although a great deal of work has been devoted to the topic, one remaining problem concerns the redundancy of subgroup descriptions, which often effectively convey very similar information. In this paper, we propose a quadratic programming based approach to reduce the amount of redundancy in the subgroup rules. Experimental results on 12 datasets show that the resulting subgroups are in fact less redundant compared to standard methods. In addition, our experime…
Convolutional Neural Networks for the Identification of Regions of Interest in PET Scans: A Study of Representation Learning for Diagnosing Alzheimer’s Disease
When diagnosing patients suffering from dementia based on imaging data like PET scans, the identification of suitable predictive regions of interest (ROIs) is of great importance. We present a case study of 3-D Convolutional Neural Networks (CNNs) for the detection of ROIs in this context, just using voxel data, without any knowledge given a priori. Our results on data from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) suggest that the predictive performance of the method is on par with that of state-of-the-art methods, with the additional benefit of potential insights into affected brain regions.
Exploring Multi-Objective Optimization for Multi-Label Classifier Ensembles
Multi-label classification deals with the task of predicting multiple class labels for a given sample. Several performance metrics are designed in the literature to measure the quality of any multi-label classification technique. In general, existing multi-label classification approaches focus on optimizing only a single performance measure. The current work builds on the hypothesis that a weighted ensemble of multiple multi-label classifiers will lead to improved results. The appropriate weight combinations for combining the outputs of multiple classifiers can be selected after simultaneously optimizing different multi-label classification metrics like micro F1, hamming loss, 0/1 los…
Cinema Data Mining
While the physiological response of humans to emotional events or stimuli is well-investigated for many modalities (like EEG, skin resistance, ...), surprisingly little is known about the exhalation of so-called Volatile Organic Compounds (VOCs) at quite low concentrations in response to such stimuli. VOCs are molecules of relatively small mass that quickly evaporate or sublimate and can be detected in the air that surrounds us. The paper introduces a new field of application for data mining, where trace gas responses of people reacting on-line to films shown in cinemas (or movie theaters) are related to the semantic content of the films themselves. To do so, we measured the VOCs from a mov…
Targeting cells of the immune system: mannosylated HPMA–LMA block-copolymer micelles for targeting of dendritic cells
Background: Successful tumor immunotherapy depends on the induction of strong and sustained tumor antigen-specific immune responses by activated antigen-presenting cells (APCs) such as dendritic cells (DCs). Since nanoparticles have the potential to codeliver tumor-specific antigen and DC-stimulating adjuvant in a DC-targeting manner, we wanted to assess the suitability of mannosylated HPMA-LMA block polymers for immunotherapy. Materials & methods: Fluorescence-labeled block copolymer micelles derived from P(HPMA)-block-P(LMA) copolymers and corresponding statistical copolymers were synthesized via RAFT polymerization, and loaded with the APC activator L18-MDP. Both types of copolymers wer…
Innovative Strategies to Develop Chemical Categories Using a Combination of Structural and Toxicological Properties.
Interest is increasing in the development of non-animal methods for toxicological evaluations. These methods are, however, particularly challenging for complex toxicological endpoints such as repeated dose toxicity. European Legislation, e.g., the European Union's Cosmetic Directive and REACH, demands the use of alternative methods. Frameworks, such as the Read-across Assessment Framework or the Adverse Outcome Pathway Knowledge Base, support the development of these methods. The aim of the project presented in this publication was to develop substance categories for a read-across with complex endpoints of toxicity based on existing databases. The basic conceptual approach was to combine str…
DySC: software for greedy clustering of 16S rRNA reads.
Summary: Pyrosequencing technologies are frequently used for sequencing the 16S ribosomal RNA marker gene for profiling microbial communities. Clustering of the produced reads is an important but time-consuming task. We present Dynamic Seed-based Clustering (DySC), a new tool based on the greedy clustering approach that uses a dynamic seeding strategy. Evaluations based on the normalized mutual information (NMI) criterion show that DySC produces higher quality clusters than UCLUST and CD-HIT at a comparable runtime. Availability and implementation: DySC, implemented in C, is available at http://code.google.com/p/dysc/ under GNU GPL license. Contact: bertil.schmidt@uni-mainz.de Sup…
Incremental linear model trees on massive datasets
The existence of massive datasets raises the need for algorithms that make efficient use of resources like memory and computation time. Besides well-known approaches such as sampling, online algorithms are being recognized as good alternatives, as they often process datasets faster using much less memory. The important class of algorithms learning linear model trees online (incremental linear model trees or ILMTs in the following) offers interesting options for regression tasks in this sense. However, surprisingly little is known about their performance, as there exists no large-scale evaluation on massive stationary datasets under equal conditions. Therefore, this paper shows their applica…
A Large-Scale Empirical Evaluation of Cross-Validation and External Test Set Validation in (Q)SAR.
(Q)SAR model validation is essential to ensure the quality of inferred models and to indicate future model predictivity on unseen compounds. Proper validation is also one of the requirements of regulatory authorities in order to accept the (Q)SAR model, and to approve its use in real world scenarios as an alternative testing method. However, at the same time, the question of how to validate a (Q)SAR model, in particular whether to employ variants of cross-validation or external test set validation, is still under discussion. In this paper, we empirically compare a k-fold cross-validation with external test set validation. To this end, we introduce a workflow that allows us to realistically simulate t…
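For readers unfamiliar with the two validation schemes being compared, here is a minimal scikit-learn illustration on synthetic data; the paper's workflow simulates external validation far more realistically than this, and the classifier and data here are placeholders.

    from sklearn.datasets import make_classification
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.model_selection import cross_val_score, train_test_split

    X, y = make_classification(n_samples=500, n_features=30, random_state=0)

    # variant 1: k-fold cross-validation on all available compounds
    cv_scores = cross_val_score(RandomForestClassifier(random_state=0), X, y, cv=10)

    # variant 2: external test set validation on a disjoint hold-out set
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)
    ext_score = RandomForestClassifier(random_state=0).fit(X_tr, y_tr).score(X_te, y_te)

    print(f"10-fold CV accuracy: {cv_scores.mean():.3f}")
    print(f"external test accuracy: {ext_score:.3f}")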
Eawag-Soil in enviPath: a new resource for exploring regulatory pesticide soil biodegradation pathways and half-life data.
Developing models for the prediction of microbial biotransformation pathways and half-lives of trace organic contaminants in different environments requires as training data easily accessible and sufficiently large collections of respective biotransformation data that are annotated with metadata on study conditions. Here, we present the Eawag-Soil package, a public database that has been developed to contain all freely accessible regulatory data on pesticide degradation in laboratory soil simulation studies for pesticides registered in the EU (282 degradation pathways, 1535 reactions, 1619 compounds and 4716 biotransformation half-life values with corresponding metadata on study conditions)…
Extracting information from support vector machines for pattern-based classification
Statistical machine learning algorithms building on patterns found by pattern mining algorithms have to cope with large solution sets and thus the high dimensionality of the feature space. Vice versa, pattern mining algorithms are frequently applied to irrelevant instances, thus causing noise in the output. Solution sets of pattern mining algorithms also typically grow with increasing input datasets. The paper proposes an approach to overcome these limitations. The approach extracts information from trained support vector machines, in particular their support vectors and their relevance according to their coefficients. It uses the support vectors along with their coefficients as input to pa…
Effect of Core-Crosslinking on Protein Corona Formation on Polymeric Micelles.
Most nanomaterials acquire a protein corona upon contact with biological fluids. The magnitude of this effect is strongly dependent both on surface and structure of the nanoparticle. To define the contribution of the internal nanoparticle structure, protein corona formation of block copolymer micelles with poly(N-2-hydroxypropylmethacrylamide) (pHPMA) as hydrophilic shell, which are crosslinked (or not) in the hydrophobic core, is comparatively analyzed. Both types of micelles are incubated with human blood plasma and separated by asymmetrical flow field-flow fractionation (AF4). Their size is determined by dynamic light scattering and proteins within the micellar fraction are characterized by…
Privacy Preserving Client/Vertical-Servers Classification
We present a novel client/vertical-servers architecture for a hybrid multi-party classification problem. The model consists of clients whose attributes are distributed on multiple servers and remain secret during training and testing. Our solution builds privacy-preserving random forests and completes them with a special private set intersection protocol that provides a central commodity server with anonymous conditional statistics. Subsequently, the private set intersection protocol can be used to privately classify the queries of new clients using the commodity server’s statistics. The proviso is that the commodity server must not collude with other parties. In cases where this restriction …
Modeling recurrent distributions in streams using possible worlds
Discovering changes in the data distribution of streams and discovering recurrent data distributions are challenging problems in data mining and machine learning. Both have received a lot of attention in the context of classification. With the ever increasing growth of data, however, there is a high demand for compact and universal representations of data streams that enable the user to analyze current as well as historic data without having access to the raw data. As a first step in this direction, we propose a condensed representation that captures the various — possibly recurrent — data distributions of the stream by extending the notion of possible worlds. The representation en…
Modeling Multi-label Recurrence in Data Streams
Most of the existing data stream algorithms assume a single label as the target variable. However, in many applications, each observation is assigned to several labels with latent dependencies among them, and the underlying target function may change over time. Classification of such non-stationary multi-label streaming data with the consideration of dependencies among labels and potential drifts is a challenging task. The few existing studies mostly cope with drifts implicitly, and all learn models on the original label space, which requires a lot of time and memory. None of them consider recurrent drifts in multi-label streams and particularly drifts and recurrences visible in a latent label spa…
Scavenger – A Framework for Efficient Evaluation of Dynamic and Modular Algorithms
Machine Learning methods and algorithms are often highly modular in the sense that they rely on a large number of subalgorithms that are in principle interchangeable. For example, it is often possible to use various kinds of pre- and post-processing and various base classifiers or regressors as components of the same modular approach. We propose a framework, called Scavenger, that allows evaluating whole families of conceptually similar algorithms efficiently. The algorithms are represented as compositions, couplings and products of atomic subalgorithms. This allows partial results to be cached and shared between different instances of a modular algorithm, so that potentially expensive part…
Gaussian Mixture Models and Model Selection for [18F] Fluorodeoxyglucose Positron Emission Tomography Classification in Alzheimer’s Disease
We present a method to discover discriminative brain metabolism patterns in [18F] fluorodeoxyglucose positron emission tomography (PET) scans, facilitating the clinical diagnosis of Alzheimer's disease. In this work, the term "pattern" stands for a certain brain region that characterizes a target group of patients and can be used for classification as well as interpretation purposes. Thus, it can be understood as a so-called "region of interest (ROI)". In the literature, an ROI is often found by a given brain atlas that defines a number of brain regions, which corresponds to an anatomical approach. The present work introduces a semi-data-driven approach that is based on learning the charac…
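A semi-data-driven ROI discovery of this kind can be prototyped by fitting Gaussian mixtures to discriminative voxel coordinates and selecting the number of components with an information criterion. The sketch below uses scikit-learn's GaussianMixture with the BIC and random stand-in data; it illustrates the model-selection step only, not the paper's pipeline.

    import numpy as np
    from sklearn.mixture import GaussianMixture

    rng = np.random.default_rng(0)
    voxels = rng.normal(size=(1000, 3))      # stand-in for discriminative voxel coordinates

    # model selection: pick the number of Gaussian components by the BIC
    best_model, best_bic = None, np.inf
    for k in range(1, 11):
        gmm = GaussianMixture(n_components=k, covariance_type="full",
                              random_state=0).fit(voxels)
        bic = gmm.bic(voxels)
        if bic < best_bic:
            best_model, best_bic = gmm, bic

    print(best_model.n_components, best_bic)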
BMaD – A Boolean Matrix Decomposition Framework
Boolean matrix decomposition is a method to obtain a compressed representation of a matrix with Boolean entries. We present a modular framework that unifies several Boolean matrix decomposition algorithms, and provide methods to evaluate their performance. The main advantages of the framework are its modular approach and hence the flexible combination of the steps of a Boolean matrix decomposition and the capability of handling missing values. The framework is licensed under the GPLv3 and can be downloaded freely at http://projects.informatik.uni-mainz.de/bmad.
A probabilistic condensed representation of data for stream mining
Data mining and machine learning algorithms usually operate directly on the data. However, if the data is not available at once or consists of billions of instances, these algorithms easily become infeasible with respect to memory and run-time concerns. As a solution to this problem, we propose a framework, called MiDEO (Mining Density Estimates inferred Online), in which algorithms are designed to operate on a condensed representation of the data. In particular, we propose to use density estimates, which are able to represent billions of instances in a compact form and can be updated when new instances arrive. As an example for an algorithm that operates on density estimates, we consider t…
A Hybrid Machine Learning and Knowledge Based Approach to Limit Combinatorial Explosion in Biodegradation Prediction
One of the main tasks in chemical industry regarding the sustainability of a product is the prediction of its environmental fate, i.e., its degradation products and pathways. Current methods for the prediction of biodegradation products and pathways of organic environmental pollutants either do not take into account domain knowledge or do not provide probability estimates. In this chapter, we propose a hybrid knowledge-based and machine learning-based approach to overcome these limitations in the context of the University of Minnesota Pathway Prediction System (UM-PPS). The proposed solution performs relative reasoning in a machine learning framework, and obtains one probability estimate fo…
Session details: Volume I: Artificial intelligence & agents, distributed systems, and information systems: data mining track
Pairwise Learning to Rank by Neural Networks Revisited: Reconstruction, Theoretical Analysis and Practical Performance
We present a pairwise learning to rank approach based on a neural net, called DirectRanker, that generalizes the RankNet architecture. We show mathematically that our model is reflexive, antisymmetric, and transitive allowing for simplified training and improved performance. Experimental results on the LETOR MSLR-WEB10K, MQ2007 and MQ2008 datasets show that our model outperforms numerous state-of-the-art methods, while being inherently simpler in structure and using a pairwise approach only.
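The structural idea, that the pairwise output depends only on the difference of the two document representations and is therefore antisymmetric and reflexive by construction, can be sketched in a few lines of PyTorch. This is an illustrative sketch of that idea, not the published DirectRanker implementation; the layer sizes and the 136-feature input are assumptions.

    import torch
    import torch.nn as nn

    class PairwiseRanker(nn.Module):
        # shared feature network; the output depends only on the representation
        # difference, so f(x1, x2) = -f(x2, x1) and f(x, x) = 0 hold by design
        def __init__(self, n_features, hidden=64):
            super().__init__()
            self.features = nn.Sequential(
                nn.Linear(n_features, hidden), nn.ReLU(),
                nn.Linear(hidden, hidden), nn.ReLU(),
            )
            self.out = nn.Linear(hidden, 1, bias=False)   # no bias preserves antisymmetry

        def forward(self, x1, x2):
            diff = self.features(x1) - self.features(x2)
            return torch.tanh(self.out(diff))             # sign indicates the preferred order

    model = PairwiseRanker(n_features=136)                # e.g. LETOR-style feature vectors
    x1, x2 = torch.randn(8, 136), torch.randn(8, 136)
    target = torch.ones(8, 1)                             # +1: x1 should rank above x2
    loss = nn.MSELoss()(model(x1, x2), target)
    loss.backward()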
Integrating LSTMs with Online Density Estimation for the Probabilistic Forecast of Energy Consumption
In machine learning applications in the energy sector, it is often necessary to have both highly accurate predictions and information about the probabilities of certain scenarios to occur. We address this challenge by integrating and combining long short-term memory networks (LSTMs) and online density estimation into a real-time data streaming architecture of an energy trader. The online density estimation is done in the MiDEO framework, which estimates joint densities of data streams based on ensembles of chains of Hoeffding trees. One attractive feature of the solution is that queries can be sent to the here-called forecast-based point density estimators (FPDE) to derive information from …
cuBool: Bit-Parallel Boolean Matrix Factorization on CUDA-Enabled Accelerators
Boolean Matrix Factorization (BMF) is a commonly used technique in the field of unsupervised data analytics. The goal is to decompose a ground truth matrix C into a product of two matrices A and B, being either an exact or approximate rank-k factorization of C. Both exact and approximate factorization are time-consuming tasks due to their combinatorial complexity. In this paper, we introduce a massively parallel implementation of BMF - namely cuBool - in order to significantly speed up factorization of huge Boolean matrices. Our approach is based on alternately adjusting rows and columns of A and B using thousands of lightweight CUDA threads. The massively parallel manipulation of entries …
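A toy CPU analogue of alternately adjusting A and B, without the bit-parallel CUDA machinery, is a greedy single-bit local search over the factor matrices. The routine below is only meant to make the optimization target, the number of mismatches between C and the Boolean product A ∘ B, concrete; it is not cuBool.

    import numpy as np

    def boolean_product(A, B):
        # (A o B)[i, j] = OR_k (A[i, k] AND B[k, j])
        return (A.astype(int) @ B.astype(int) > 0).astype(np.uint8)

    def bmf_local_search(C, k, sweeps=20, seed=0):
        # greedily flip single bits of A and B whenever the flip reduces
        # the number of mismatched entries of C
        rng = np.random.default_rng(seed)
        n, m = C.shape
        A = rng.integers(0, 2, size=(n, k), dtype=np.uint8)
        B = rng.integers(0, 2, size=(k, m), dtype=np.uint8)
        error = int(np.sum(boolean_product(A, B) != C))
        for _ in range(sweeps):
            improved = False
            for M in (A, B):
                for i, j in np.ndindex(M.shape):
                    M[i, j] ^= 1
                    new_error = int(np.sum(boolean_product(A, B) != C))
                    if new_error < error:
                        error, improved = new_error, True
                    else:
                        M[i, j] ^= 1                      # revert the flip
            if not improved:
                break
        return A, B, error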
enviPath - The environmental contaminant biotransformation pathway resource
The University of Minnesota Biocatalysis/Biodegradation Database and Pathway Prediction System (UM-BBD/PPS) has been a unique resource covering microbial biotransformation pathways of primarily xenobiotic chemicals for over 15 years. This paper introduces the successor system, enviPath (The Environmental Contaminant Biotransformation Pathway Resource), which is a complete redesign and reimplementation of UM-BBD/PPS. enviPath uses the database from the UM-BBD/PPS as a basis, extends the use of this database, and allows users to include their own data to support multiple use cases. Relative reasoning is supported for the refinement of predictions and to allow its extensions in terms of previo…
Improving structural similarity based virtual screening using background knowledge
Background: Virtual screening in the form of similarity rankings is often applied in the early drug discovery process to rank and prioritize compounds from a database. This similarity ranking can be achieved with structural similarity measures. However, their general nature can lead to insufficient performance in some application cases. In this paper, we provide a link between ranking-based virtual screening and fragment-based data mining methods. The inclusion of binding-relevant background knowledge into a structural similarity measure improves the quality of the similarity rankings. This background knowledge in the form of binding relevant substructures can either be derived by hand selec…
Machine learning for a combined electroencephalographic anesthesia index to detect awareness under anesthesia
Spontaneous electroencephalogram (EEG) and auditory evoked potentials (AEP) have been suggested to monitor the level of consciousness during anesthesia. As both signals reflect different neuronal pathways, a combination of parameters from both signals may provide broader information about the brain status during anesthesia. Appropriate parameter selection and combination to a single index is crucial to take advantage of this potential. The field of machine learning offers algorithms for both parameter selection and combination. In this study, several established machine learning approaches including a method for the selection of suitable signal parameters and classification algorithms are a…
Rule Extraction From Binary Neural Networks With Convolutional Rules for Model Validation.
Classification approaches that allow the extraction of logical rules, such as decision trees, are often considered to be more interpretable than neural networks. Also, logical rules are comparatively easy to verify with any possible input. This is an important part in systems that aim to ensure correct operation of a given model. However, for high-dimensional input data such as images, the individual symbols, i.e., pixels, are not easily interpretable. Therefore, rule-based approaches are not typically used for this kind of high-dimensional data. We introduce the concept of first-order convolutional rules, which are logical rules that can be extracted using a convolutional neural network (CNN), and w…
Pruning Incremental Linear Model Trees with Approximate Lookahead
Incremental linear model trees with approximate lookahead are fast, but produce overly large trees. This is due to non-optimal splitting decisions boosted by a possibly unlimited number of examples obtained from a data source. To keep the processing speed high and the tree complexity low, appropriate incremental pruning techniques are needed. In this paper, we introduce a pruning technique for the class of incremental linear model trees with approximate lookahead on stationary data sources. Experimental results show that the advantage of approximate lookahead in terms of processing speed can be further improved by producing much smaller and consequently more explanatory, less memory consumi…
A Nonlinear Label Compression and Transformation Method for Multi-label Classification Using Autoencoders
Multi-label classification targets the prediction of multiple interdependent and non-exclusive binary target variables. Transformation-based algorithms transform the data set such that regular single-label algorithms can be applied to the problem. A special type of transformation-based classifiers are label compression methods, which compress the labels and then mostly use single label classifiers to predict the compressed labels. So far, there are no compression-based algorithms that follow a problem transformation approach and address non-linear dependencies in the labels. In this paper, we propose a new algorithm, called Maniac (Multi-lAbel classificatioN usIng AutoenCoders), which extra…
Long-term biodistribution study of HPMA-ran-LMA copolymers in vivo by means of 131I-labeling
Background: For the evaluation of macromolecular drug delivery systems suitable pre-clinical monitoring of potential nanocarrier systems is needed. In this regard, both short-term as well as long-term in vivo tracking is crucial to understand structure-property relationships of polymer carrier systems and their resulting pharmacokinetic profile. Based on former studies revealing favorable in vivo characteristics for 18F-labeled random (ran) copolymers consisting of N-(2-hydroxypropyl)methacrylamide (HPMA) and lauryl methacrylate (LMA) – including prolonged plasma half-life as well as enhanced tumor accumulation – the presented work focuses on their long-term investigation in the li…
CheS-Mapper - Chemical Space Mapping and Visualization in 3D
Analyzing chemical datasets is a challenging task for scientific researchers in the field of chemoinformatics. It is important, yet difficult to understand the relationship between the structure of chemical compounds, their physico-chemical properties, and biological or toxic effects. To that respect, visualization tools can help to better comprehend the underlying correlations. Our recently developed 3D molecular viewer CheS-Mapper (Chemical Space Mapper) divides large datasets into clusters of similar compounds and consequently arranges them in 3D space, such that their spatial proximity reflects their similarity. The user can indirectly determine similarity, by selecting which f…
Similarity boosted quantitative structure-activity relationship--a systematic study of enhancing structural descriptors by molecular similarity.
The concept of molecular similarity is one of the most central in the fields of predictive toxicology and quantitative structure-activity relationship (QSAR) research. Many toxicological responses result from a multimechanistic process and, consequently, structural diversity among the active compounds is likely. Combining this knowledge, we introduce similarity boosted QSAR modeling, where we calculate molecular descriptors using similarities with respect to representative reference compounds to aid a statistical learning algorithm in distinguishing between different structural classes. We present three approaches for the selection of reference compounds, one by literature search and two by…
Exploring Multiobjective Optimization for Multiview Clustering
We present a new multiview clustering approach based on multiobjective optimization. In contrast to existing clustering algorithms based on multiobjective optimization, it is generally applicable to data represented by two or more views and does not require specifying the number of clusters a priori. The approach builds upon the search capability of a multiobjective simulated annealing based technique, AMOSA, as the underlying optimization technique. In the first version of the proposed approach, an internal cluster validity index is used to assess the quality of different partitionings obtained using different views. A new way of checking the compatibility of these different partitioning…
A Survey of Multi-Label Topic Models
Every day, an enormous amount of text data is produced. Sources of text data include news, social media, emails, text messages, medical reports, scientific publications and fiction. To keep track of this data, there are categories, key words, tags or labels that are assigned to each text. Automatically predicting such labels is the task of multi-label text classification. Often however, we are interested in more than just the pure classification: rather, we would like to understand which parts of a text belong to the label, which words are important for the label or which labels occur together. Because of this, topic models may be used for multi-label classification as an interpretable mode…
Multi-label classification using boolean matrix decomposition
This paper introduces a new multi-label classifier based on Boolean matrix decomposition. Boolean matrix decomposition is used to extract, from the full label matrix, latent labels representing useful Boolean combinations of the original labels. Base level models predict latent labels, which are subsequently transformed into the actual labels by Boolean matrix multiplication with the second matrix from the decomposition. The new method is tested on six publicly available datasets with varying numbers of labels. The experimental evaluation shows that the new method works particularly well on datasets with a large number of labels and strong dependencies among them.
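Schematically, the classifier works as follows: given a Boolean decomposition Y ≈ A ∘ B obtained beforehand (e.g. with a Boolean matrix decomposition library such as BMaD), base models are trained to predict the latent labels in A, and their predictions are mapped back to the original labels by Boolean matrix multiplication with B. A hedged scikit-learn sketch of this two-step scheme, assuming each latent label column contains both classes:

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    def fit_bmd_classifier(X, A):
        # one base model per latent label, i.e. per column of A from Y ~ A o B
        return [LogisticRegression(max_iter=1000).fit(X, A[:, j])
                for j in range(A.shape[1])]

    def predict_bmd(models, B, X_new):
        A_hat = np.column_stack([m.predict(X_new) for m in models]).astype(int)
        # Boolean matrix multiplication maps latent labels back to the original labels
        return (A_hat @ B.astype(int) > 0).astype(int)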
A Brief History of Learning Symbolic Higher-Level Representations from Data (And a Curious Look Forward)
Learning higher-level representations from data has been on the agenda of AI research for several decades. In the paper, I will give a survey of various approaches to learning symbolic higher-level representations: feature construction and constructive induction, predicate invention, propositionalization, pattern mining, and mining time series patterns. Finally, I will give an outlook on how approaches to learning higher-level representations, symbolic and neural, can benefit from each other to solve current issues in machine learning.
Online Induction of Probabilistic Real Time Automata
Probabilistic real time automata (PRTAs) are a representation of dynamic processes arising in the sciences and industry. Currently, the induction of automata is divided into two steps: the creation of the prefix tree acceptor (PTA) and the merge procedure based on clustering of the states. These two steps can be very time-intensive when a PRTA is to be induced for massive or even unbounded data sets. The latter step can be processed efficiently, as scalable online clustering algorithms exist. However, the creation of the PTA can still be very time-consuming. To overcome this problem, we propose a genuine online PRTA induction approach that incorporates new instances by first collapsing…
Trading off accuracy for efficiency by randomized greedy warping
Dynamic Time Warping (DTW) is a widely used distance measure for time series data mining. Its quadratic complexity requires the application of various techniques (e.g. warping constraints, lower-bounds) for deployment in real-time scenarios. In this paper we propose a randomized greedy warping algorithm for finding similarity between time series instances. We show that the proposed algorithm outperforms the simple greedy approach and also provides very good time series similarity approximation consistently, as compared to DTW. We show that the Randomized Time Warping (RTW) can be used in place of DTW as a fast similarity approximation technique by trading some classification accuracy for ve…
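To make the idea of greedy warping concrete, the sketch below follows a single monotone warping path, always taking the locally cheapest of the three admissible moves and breaking ties at random; it is an O(n+m) approximation of DTW, not the published RTW algorithm, whose randomization goes beyond simple tie-breaking.

    import numpy as np

    def greedy_warping_distance(a, b, rng=None):
        # follow one monotone warping path from (0, 0) to (len(a)-1, len(b)-1),
        # always taking the locally cheapest step; ties are broken at random
        rng = rng or np.random.default_rng()
        i = j = 0
        cost = abs(a[0] - b[0])
        while i < len(a) - 1 or j < len(b) - 1:
            moves = []
            if i < len(a) - 1 and j < len(b) - 1:
                moves.append((abs(a[i + 1] - b[j + 1]), i + 1, j + 1))
            if i < len(a) - 1:
                moves.append((abs(a[i + 1] - b[j]), i + 1, j))
            if j < len(b) - 1:
                moves.append((abs(a[i] - b[j + 1]), i, j + 1))
            best = min(m[0] for m in moves)
            ties = [m for m in moves if m[0] == best]
            _, i, j = ties[rng.integers(len(ties))]
            cost += best
        return cost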
Model selection based product kernel learning for regression on graphs
The choice of a suitable graph kernel is intrinsically hard and often cannot be made in an informed manner for a given dataset. Methods for multiple kernel learning offer a possible remedy, as they combine and weight kernels on the basis of a labeled training set of molecules to define a new kernel. Whereas most methods for multiple kernel learning focus on learning convex linear combinations of kernels, we propose to combine kernels in products, which theoretically enables higher expressiveness. In experiments on ten publicly available chemical QSAR datasets we show that product kernel learning is on no dataset significantly worse than any of the competing kernel methods and on average the…
Deep neural networks to recover unknown physical parameters from oscillating time series.
PLOS ONE 17(5), e0268439 (2022). doi:10.1371/journal.pone.0268439
CheS-Mapper 2.0 for visual validation of (Q)SAR models
Background: Sound statistical validation is important to evaluate and compare the overall performance of (Q)SAR models. However, classical validation does not support the user in better understanding the properties of the model or the underlying data. Even though a number of visualization tools for analyzing (Q)SAR information in small molecule datasets exist, integrated visualization methods that allow the investigation of model validation results are still lacking. Results: We propose visual validation, as an approach for the graphical inspection of (Q)SAR model validation results. The approach applies the 3D viewer CheS-Mapper, an open-source application for the exploration of sm…
Towards Bankruptcy Prediction: Deep Sentiment Mining to Detect Financial Distress from Business Management Reports
Due to their disclosure required by law, business management reports have become publicly available for a large number of companies, and these reports offer the opportunity to assess the financial health or distress of a company, both quantitatively from the balance sheets and qualitatively from the text. In this paper, we analyze the potential of deep sentiment mining from the textual parts of business management reports and aim to detect signals for financial distress. We (1) created the largest corpus of business reports analyzed qualitatively to date, (2) defined a non-trivial target variable based on the so-called Altman Z-score, (3) developed a filtering of sentences based on class-co…
An In-Depth Experimental Comparison of RNTNs and CNNs for Sentence Modeling
The goal of modeling sentences is to accurately represent their meaning for different tasks. A variety of deep learning architectures have been proposed to model sentences, however, little is known about their comparative performance on a common ground, across a variety of datasets, and on the same level of optimization. In this paper, we provide such a novel comparison for two popular architectures, Recursive Neural Tensor Networks (RNTNs) and Convolutional Neural Networks (CNNs). Although RNTNs have been shown to work well in many cases, they require intensive manual labeling due to the vanishing gradient problem. To enable an extensive comparison of the two architectures, this paper empl…
Cinema audiences reproducibly vary the chemical composition of air during films, by broadcasting scene specific emissions on breath
Human beings continuously emit chemicals into the air by breath and through the skin. In order to determine whether these emissions vary predictably in response to audiovisual stimuli, we have continuously monitored carbon dioxide and over one hundred volatile organic compounds in a cinema. It was found that many airborne chemicals in cinema air varied distinctively and reproducibly with time for a particular film, even in different screenings to different audiences. Application of scene labels and advanced data mining methods revealed that specific film events, namely “suspense” or “comedy” caused audiences to change their emission of specific chemicals. These event-type synchronou…
Fair Pairwise Learning to Rank
Ranking algorithms based on Neural Networks have been a topic of recent research. Ranking is employed in everyday applications like product recommendations, search results, or even in finding good candidates for hiring. However, Neural Networks are mostly opaque tools, and it is hard to evaluate why a specific candidate, for instance, was not considered. Therefore, for neural-based ranking methods to be trustworthy, it is crucial to guarantee that the outcome is fair and that the decisions are not discriminating against people according to sensitive attributes such as gender, sexual orientation, or ethnicity. In this work we present a family of fair pairwise learning to rank approaches based on Neur…
Graph Clustering with Local Density-Cut
In this paper, we introduce a new graph clustering algorithm, called Dcut. The basic idea is to envision the graph clustering as a local density-cut problem. To identify meaningful communities in a graph, a density-connected tree is first constructed in a local fashion. Building upon the local intuitive density-connected tree, Dcut allows partitioning a graph into multiple densely tight-knit clusters effectively and efficiently. We have demonstrated that our method has several attractive benefits: (a) Dcut provides an intuitive criterion to evaluate the goodness of a graph clustering in a more precise way; (b) Building upon the density-connected tree, Dcut allows identifying high-quality cl…
Maximum Common Subgraph based locally weighted regression
This paper investigates a simple, yet effective method for regression on graphs, in particular for applications in chemoinformatics and for quantitative structure-activity relationships (QSARs). The method combines Locally Weighted Learning (LWL) with Maximum Common Subgraph (MCS) based graph distances. More specifically, we investigate a variant of locally weighted regression on graphs (structures) that uses the maximum common subgraph for determining and weighting the neighborhood of a graph and feature vectors for the actual regression model. We show that this combination, LWL-MCS, outperforms other methods that use the local neighborhood of graphs for regression. The performance of this…
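The combination can be sketched as generic locally weighted regression in which the neighborhood and the weights come from a (graph) distance, for example one minus an MCS-based similarity, while the local linear model is fit on ordinary feature vectors. All names below are hypothetical, and the Gaussian weighting kernel is an assumption for the example.

    import numpy as np

    def locally_weighted_prediction(dist_to_train, X_train_vec, y_train, x_query_vec, k=10):
        # dist_to_train: distances from the query graph to every training graph,
        # e.g. 1 - MCS-based similarity; the local model is fit on feature vectors
        order = np.argsort(dist_to_train)[:k]
        d = dist_to_train[order]
        w = np.exp(-(d / (d.max() + 1e-12)) ** 2)          # kernel weights from the distance
        sw = np.sqrt(w)                                    # weighted least squares scaling
        A = np.hstack([X_train_vec[order], np.ones((len(order), 1))]) * sw[:, None]
        coef, *_ = np.linalg.lstsq(A, y_train[order] * sw, rcond=None)
        return float(x_query_vec @ coef[:-1] + coef[-1])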