Author: Stefan Kramer

0000000000136109

AUTHOR

Stefan Kramer

showing 75 related works from this author

Multi-label Classification Using Stacked Hierarchical Dirichlet Processes with Reduced Sampling Complexity

2018

Nonparametric topic models based on hierarchical Dirichlet processes (HDPs) allow for the number of topics to be automatically discovered from the data. The computational complexity of standard Gibbs sampling techniques for model training is linear in the number of topics. Recently, it was reduced to be linear in the number of topics per word using a technique called alias sampling combined with Metropolis Hastings (MH) sampling. We propose a different proposal distribution for the MH step based on the observation that distributions on the upper hierarchy level change slower than the document-specific distributions at the lower level. This reduces the sampling complexity, making it linear i…

Topic modelComputational complexity theoryComputer science02 engineering and technologyLatent Dirichlet allocationDirichlet distributionsymbols.namesakeArtificial Intelligence020204 information systems0202 electrical engineering electronic engineering information engineeringMathematicsMulti-label classificationbusiness.industrySampling (statistics)Pattern recognitionHuman-Computer InteractionDirichlet processMetropolis–Hastings algorithmHardware and ArchitectureTest setsymbols020201 artificial intelligence & image processingArtificial intelligencebusinessAlgorithmSoftwareInformation SystemsGibbs sampling2017 IEEE International Conference on Big Knowledge (ICBK)

0000000000136109

AUTHOR

Stefan Kramer

Multi-label Classification Using Stacked Hierarchical Dirichlet Processes with Reduced Sampling Complexity

2018

Prototype-based learning on concept-drifting data streams

2014

Online Density Estimation of Heterogeneous Data Streams in Higher Dimensions

2016

Towards identifying drug side effects from social media using active learning and crowd sourcing.

2019

Polymeric Nanoparticles: Polymeric Nanoparticles with Neglectable Protein Corona (Small 18/2020)

2020

Forest of Normalized Trees: Fast and Accurate Density Estimation of Streaming Data

2018

A label compression method for online multi-label classification

2018

Machine Learning and Knowledge Discovery in Databases. Research Track

2021

An inductive learning perspective on automated generation of feature models from given product specifications

2018

Hub-Centered Gene Network Reconstruction Using Automatic Relevance Determination

2012

Forecast of Study Success in the STEM Disciplines Based Solely on Academic Records

2020

Adapted Transfer of Distance Measures for Quantitative Structure-Activity Relationships and Data-Driven Selection of Source Datasets

2012

Online Sparse Collapsed Hybrid Variational-Gibbs Algorithm for Hierarchical Dirichlet Process Topic Models

2017

HPMA-Based Nanoparticles for Fast, Bioorthogonal iEDDA Ligation

2019

Alternating model trees

2015

Polymeric Nanoparticles with Neglectable Protein Corona

2020

Structural clustering of millions of molecular graphs

2014

A structural cluster kernel for learning on graphs

2012

Optimization of curation of the dataset with data on repeated dose toxicity

2015

Identification of ELF3 as an early transcriptional regulator of human urothelium

2014

Online Estimation of Discrete Densities

2013

HPMA-Based Nanocarriers for Effective Immune System Stimulation.

2019

Scalable Clustering by Iterative Partitioning and Point Attractor Representation

2016

Efficient Redundancy Reduced Subgroup Discovery via Quadratic Programming

2012

Convolutional Neural Networks for the Identification of Regions of Interest in PET Scans: A Study of Representation Learning for Diagnosing Alzheimer…

2017

Exploring Multi-Objective Optimization for Multi-Label Classifier Ensembles

2019

Cinema Data Mining

2015

Targeting cells of the immune system: mannosylated HPMA–LMA block-copolymer micelles for targeting of dendritic cells

2016

Innovative Strategies to Develop Chemical Categories Using a Combination of Structural and Toxicological Properties.

2016

DySC: software for greedy clustering of 16S rRNA reads.

2012

Incremental linear model trees on massive datasets

2013

A Large-Scale Empirical Evaluation of Cross-Validation and External Test Set Validation in (Q)SAR.

2013

Eawag-Soil in enviPath: a new resource for exploring regulatory pesticide soil biodegradation pathways and half-life data.

2017

Extracting information from support vector machines for pattern-based classification

2014

Effect of Core-Crosslinking on Protein Corona Formation on Polymeric Micelles.

2021

Privacy Preserving Client/Vertical-Servers Classification

2019

Modeling recurrent distributions in streams using possible worlds

2015

Modeling Multi-label Recurrence in Data Streams

2019

Scavenger – A Framework for Efficient Evaluation of Dynamic and Modular Algorithms

Session details: Volume I: Artificial intelligence & agents, distributed systems, and information systems: data mining track