
AUTHOR

André Brinkmann

AIOC2: A deep Q-learning approach to autonomic I/O congestion control in Lustre

In high-performance computing systems, I/O congestion is a common problem in large-scale distributed file systems. However, current implementations mainly require administrators to manually design low-level implementations and optimizations. We therefore propose an adaptive I/O congestion control framework, named AIOC2, which can not only adaptively tune the I/O congestion control parameters, but also exploit the deep Q-learning method to train the tuning parameters and optimize the tuning for different types of workloads from both the server and the client at the same time. AIOC2 combines feedback-based dynamic I/O congestion control and deep Q-learning parameter tuning technology to …
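
The abstract gives no implementation details, so as a rough, hypothetical illustration of the deep Q-learning idea behind such tuning, the sketch below shows a plain tabular Q-learning update that picks a congestion-control parameter (here an invented RPC rate limit) from an observed I/O state. States, actions, and the reward are assumptions for illustration, not AIOC2's design.

```python
import random
from collections import defaultdict

# Hypothetical sketch: tabular Q-learning for picking an I/O rate limit.
# States, actions, and the reward are illustrative, not taken from AIOC2.

ACTIONS = [64, 128, 256, 512]          # candidate RPC rate limits (requests/s)
ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.2  # learning rate, discount, exploration

q_table = defaultdict(float)           # (state, action) -> estimated value

def choose_action(state):
    """Epsilon-greedy action selection over the candidate rate limits."""
    if random.random() < EPSILON:
        return random.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q_table[(state, a)])

def update(state, action, reward, next_state):
    """Standard Q-learning update: Q <- Q + alpha * (r + gamma * max_a' Q' - Q)."""
    best_next = max(q_table[(next_state, a)] for a in ACTIONS)
    td_target = reward + GAMMA * best_next
    q_table[(state, action)] += ALPHA * (td_target - q_table[(state, action)])

# Example step: observe congestion level, act, observe throughput as reward.
state = "high_queue_depth"
action = choose_action(state)
reward = 0.8                            # e.g., normalized throughput after acting
update(state, action, reward, "medium_queue_depth")
```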

research product

Quantum chemical meta-workflows in MoSGrid

Quantum chemical workflows can be built up within the science gateway Molecular Simulation Grid. Complex workflows required by the end users are dissected into smaller workflows that can be combined freely to larger meta-workflows. General quantum chemical workflows are described here as well as the real use case of a spectroscopic analysis resulting in an end-user desired meta-workflow. All workflow features are implemented via Web Services Parallel Grid Runtime and Developer Environment and submitted to UNICORE. The workflows are stored in the Molecular Simulation Grid repository and ported to the SHIWA repository. © 2014 John Wiley & Sons, Ltd.

research product

Improving Collective I/O Performance Using Non-volatile Memory Devices

Collective I/O is a parallel I/O technique designed to deliver high performance data access to scientific applications running on high-end computing clusters. In collective I/O, write performance is highly dependent upon the storage system response time and limited by the slowest writer. The storage system response time in conjunction with the need for global synchronisation, required during every round of data exchange and write, severely impacts collective I/O performance. Future Exascale systems will have an increasing number of processor cores, while the number of storage servers will remain relatively small. Therefore, the storage system concurrency level will further increase, worseni…

research product

Extending SSD lifetime in database applications with page overwrites

Flash-based Solid State Disks (SSDs) have been a great success story over the last years and are widely used in embedded systems, servers, and laptops. One often overlooked ability of NAND flash is that flash pages can be overwritten in certain circumstances. This can be used to decrease wear out and increase performance. In this paper, we analyze the potential of overwrites for the most used data structure in database applications: the B-Tree. We show that with overwrites it is possible to significantly reduce flash wear out and increase overall performance.
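
The physical constraint exploited here is that programming NAND flash can only clear bits, so a page can be overwritten in place only if the new content never needs a cleared bit to be set again. A minimal, hypothetical check of that condition (not the paper's B-Tree logic) could look as follows:

```python
def can_overwrite(old: bytes, new: bytes) -> bool:
    """NAND programming can only flip bits from 1 to 0, so an in-place
    overwrite is possible iff the new page never sets a bit that is
    already 0 in the old page (i.e., new has no 1 where old has a 0)."""
    assert len(old) == len(new)
    return all((n & ~o) & 0xFF == 0 for o, n in zip(old, new))

# Example: clearing bits is fine, setting an already-cleared bit is not.
print(can_overwrite(b"\xff\xf0", b"\x0f\xf0"))  # True: only 1 -> 0 transitions
print(can_overwrite(b"\x0f\xf0", b"\xff\xf0"))  # False: would need 0 -> 1
```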

research product

Improving LSM‐trie performance by parallel search

research product

Towards Dynamic Scripted pNFS Layouts

Today's network file systems consist of a variety of complex subprotocols and backend storage classes. The data is typically spread over multiple data servers to achieve higher levels of performance and reliability. A metadata server is responsible for creating the mapping of a file to these data servers. It is hard to map application-specific access patterns to storage system specific features, which can result in degraded I/O performance. We present an NFSv4.1/pNFS protocol extension that integrates the client's ability to provide hints and I/O advice to metadata servers. We define multiple storage classes and allow the client to choose which type of storage fits best for its desired ac…

research product

Online Management of Hybrid DRAM-NVMM Memory for HPC

Non-volatile main memories (NVMMs) offer a comparable performance to DRAM, while requiring lower static power consumption and enabling higher densities. NVMM therefore can provide opportunities for improving both energy efficiency and costs of main memory. Previous hybrid main memory management approaches for HPC either do not consider the unique characteristics of NVMMs, depend on high profiling costs, or need source code modifications. In this paper, we investigate HPC applications' behaviors in the presence of NVMM as part of the main memory. By performing a comprehensive study of HPC applications and based on several key observations, we propose an online hybrid memory architecture for …

research product

GekkoFS - A Temporary Distributed File System for HPC Applications

We present GekkoFS, a temporary, highly-scalable burst buffer file system which has been specifically optimized for new access patterns of data-intensive High-Performance Computing (HPC) applications. The file system provides relaxed POSIX semantics, only offering features which are actually required by most (not all) applications. It is able to provide scalable I/O performance and reaches millions of metadata operations already for a small number of nodes, significantly outperforming the capabilities of general-purpose parallel file systems. The work has been funded by the German Research Foundation (DFG) through the ADA-FS project as part of the Priority Programme 1648. It is also support…

research product

File system scalability with highly decentralized metadata on independent storage devices

This paper discusses using hard drives that integrate a key-value interface and network access in the actual drive hardware (Kinetic storage platform) to supply file system functionality in a large scale environment. Taking advantage of higher-level functionality to handle metadata on the drives themselves, a serverless system architecture is proposed. Skipping path component traversal during the lookup operation is the key technique discussed in this paper to avoid performance degradation with highly decentralized metadata. Scalability implications are reviewed based on a fuse file system implementation.

research product

One Phase Commit: A Low Overhead Atomic Commitment Protocol for Scalable Metadata Services

As the number of client machines in high-end computing clusters increases, a file system using a centralized metadata server cannot keep up with the resulting volume of requests. This problem will be even more prominent with the advent of the exascale computing age. In this context, the centralized metadata server represents a bottleneck for the scaling of the file system performance as well as a single point of failure. To overcome this problem, file systems are evolving from centralized metadata services to distributed metadata services. The metadata distribution raises a number of additional problems that must be taken into account. In this paper we will focus on the problem of managi…

research product

Building a Medical Research Cloud in the EASI-CLOUDS Project

The demand for IT resources is constantly growing in the scientific area. The ability to store and process increasing amounts of data has transformed many research disciplines, like the life-sciences, which now rely on complex data processing and data analytics. Cloud environments are able to integrate and encapsulate possibly distributed resources and allow convenient and on-demand access to the corresponding services, tools, and complete work environments. The European research project EASI-CLOUDS (http://www. easi-clouds.eu) develops a platform for a convenient service delivery with special regard to service integration, monitoring, management, and Service Level Agreement (SLA) negotiati…

research product

Improving checkpointing intervals by considering individual job failure probabilities

Checkpointing is a popular resilience method in HPC and its efficiency highly depends on the choice of the checkpoint interval. Standard analytical approaches optimize intervals for big, long-running jobs that fail with high probability, while they are unable to minimize checkpointing overheads for jobs with a low or medium probability of failing. Nevertheless, our analysis of batch traces of four HPC systems shows that these jobs are extremely common. We therefore propose an iterative checkpointing algorithm to compute efficient intervals for jobs with a medium risk of failure. The method also supports big and long-running jobs by converging to the results of various traditional methods for…
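
For context, the classical baseline such methods are compared against can be stated directly: Young's first-order approximation derives the checkpoint interval from the checkpoint cost C and the mean time between failures M as T ≈ sqrt(2·C·M). A small sketch with made-up numbers (not the paper's iterative algorithm):

```python
import math

def young_interval(checkpoint_cost_s: float, mtbf_s: float) -> float:
    """Young's first-order approximation of the optimal checkpoint interval:
    T_opt ~ sqrt(2 * C * MTBF). Shown here only as the classical baseline
    that interval-optimization methods converge to for long-running jobs."""
    return math.sqrt(2.0 * checkpoint_cost_s * mtbf_s)

# Example with made-up numbers: a 5-minute checkpoint and a 24-hour MTBF.
print(young_interval(300, 24 * 3600) / 3600, "hours between checkpoints")  # ~2.0
```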

research product

Constant Time Garbage Collection in SSDs

research product

Building a medical research cloud in the EASI-CLOUDS project

The demand for Information Technology (IT) resources is constantly growing in the scientific area. The ability to store and process increasing amounts of data has transformed many research disciplines like the life sciences, which now rely on complex data processing and data analytics. Cloud computing can provide researchers with scalable and easy-to-use hardware and software resources and allows on-demand access to services, tools, or even complete work environments. The European research project EASI-CLOUDS has developed a service delivery platform with special regard to service integration, monitoring and management, and the negotiation of service level agreements. In order to de…

research product

Fusing storage and computing for the domain of business intelligence and analytics: research opportunities

With the growing importance of external and shared data, the set of requirements for Business Intelligence and Analytics (BIA) is shifting. Current solutions still come with shortcomings, especially in multi-stakeholder environments where sensitive content is exchanged. We argue that a new level in the evolution of BIA can be unlocked by tearing down the barriers between storage and computing based on upcoming storage technologies. In particular, we propose a revitalization of ideas from object-oriented databases. We present results from a joint project that aimed at delineating design options for BIA solutions built upon this idea. The paper outlines the interplay of various architectural layers…

research product

MCD: Overcoming the Data Download Bottleneck in Data Centers

The data download problem in data centers describes the increasingly common task of coordinated loading of identical data to a large number of nodes. Data download is seen as a significant problem in exascale HPC applications. Uncoordinated reading from a central file server creates contention at the file server and its network interconnect. We propose and evaluate a reliable multicast-based approach to solve the data download problem. The MCD system builds a logical multi-rooted tree based on the physical network topology and uses the logical view for a two-phase approach. In the first phase, the data is multicasted to all nodes. In the second phase, the logical tree is used for an effi…

research product

On the Influence of PRNGs on Data Distribution

The amount of digital information produced grows rapidly and constantly. Storage systems use clustered architectures designed to store and process this information efficiently. Their use introduces new challenges in storage systems development, like load-balancing and data distribution. A variety of randomized solutions handling data placement issues have been proposed and utilized. However, to the best of our knowledge, there has not yet been a structured analysis of the influence of pseudo random number generators (PRNGs) on the data distribution. In the first part of this paper we consider Consistent Hashing [1] as a combination of two consecutive phases: distribution of bins and distrib…
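
As a minimal reference point for the two phases discussed (distributing bins, then distributing balls), a consistent-hashing ring can be sketched as below; the PRNG that places the virtual bins is exactly the component whose influence the paper analyzes. The sketch is illustrative and not the paper's experimental setup.

```python
import bisect
import hashlib
import random

class ConsistentHashRing:
    """Minimal consistent-hashing sketch: bins are placed on a ring by a PRNG,
    balls (keys) are hashed onto the ring and assigned to the next bin."""

    def __init__(self, bins, virtual_nodes=100, seed=42):
        rng = random.Random(seed)          # the PRNG whose quality is under study
        self.ring = sorted((rng.random(), b) for b in bins
                           for _ in range(virtual_nodes))
        self.points = [p for p, _ in self.ring]

    def lookup(self, key: str) -> str:
        h = int(hashlib.sha1(key.encode()).hexdigest(), 16)
        pos = (h % 10**12) / 10**12        # map the key hash onto [0, 1)
        idx = bisect.bisect(self.points, pos) % len(self.ring)
        return self.ring[idx][1]

ring = ConsistentHashRing([f"disk{i}" for i in range(8)])
print(ring.lookup("/data/file-0001"))
```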

research product

And Now for Something Completely Different: Running Lisp on GPUs

The internal parallelism of compute resources increases permanently, and graphics processing units (GPUs) and other accelerators have been gaining importance in many domains. Researchers from life science, bioinformatics or artificial intelligence, for example, use GPUs to accelerate their computations. However, languages typically used in some of these disciplines often do not benefit from the technical developments because they cannot be executed natively on GPUs. Instead existing programs must be rewritten in other, less dynamic programming languages. On the other hand, the gap in programming features between accelerators and common CPUs shrinks permanently. Since accelerators are becomi…

research product

Streamlining distributed Deep Learning I/O with ad hoc file systems

With evolving techniques to parallelize Deep Learning (DL) and the growing amount of training data and model complexity, High-Performance Computing (HPC) has become increasingly important for machine learning engineers. Although many compute clusters already use learning accelerators or GPUs, HPC storage systems are not suitable for the I/O requirements of DL workflows. Therefore, users typically copy the whole training data to the worker nodes or distribute partitions. Because DL depends on randomized input data, prior work stated that partitioning impacts DL accuracy. Their solutions focused mainly on training I/O performance on a high-speed network but did not cover the data stage-in pro…

research product

Deduplication Potential of HPC Applications’ Checkpoints

HPC systems contain an increasing number of components, decreasing the mean time between failures. Checkpoint mechanisms help to overcome such failures for long-running applications. A viable solution to remove the resulting pressure from the I/O backends is to deduplicate the checkpoints. However, there is little knowledge about the potential to save I/Os for HPC applications by using deduplication within the checkpointing process. In this paper, we perform a broad study about the deduplication behavior of HPC application checkpointing and its impact on system design.
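
Such a study essentially measures how many chunk fingerprints repeat across checkpoints. A hypothetical sketch of that measurement with fixed-size chunking (the paper's actual chunking and fingerprinting choices may differ):

```python
import hashlib

def dedup_ratio(data: bytes, chunk_size: int = 4096) -> float:
    """Estimate deduplication potential with fixed-size chunking:
    ratio of logical chunks to unique chunk fingerprints."""
    fingerprints = [hashlib.sha256(data[i:i + chunk_size]).digest()
                    for i in range(0, len(data), chunk_size)]
    return len(fingerprints) / max(1, len(set(fingerprints)))

# Example: a checkpoint-like buffer with heavily repeated content.
checkpoint = b"A" * 4096 * 10 + b"B" * 4096 * 2
print(dedup_ratio(checkpoint))   # 6.0: 12 chunks, 2 unique fingerprints
```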

research product

Accelerating Application Migration in HPC

It is predicted that the number of cores per node will rapidly increase with the upcoming era of exascale supercomputers. As a result, multiple applications will have to share one node and compete for the (often scarce) resources available on this node. Furthermore, the growing number of hardware components causes a decrease in the mean time between failures. Application migration between nodes has been proposed as a tool to mitigate these two problems: Bottlenecks due to resource sharing can be addressed by load balancing schemes which migrate applications; and hardware errors can often be tolerated by the system if faulty nodes are detected and processes are migrated ahead of time.

research product

Distributing Storage in Cloud Environments

Cloud computing has a major impact on today's IT strategies. Outsourcing applications from IT departments to the cloud relieves users from building big infrastructures as well as from building the corresponding expertise, and allows them to focus on their main competences and businesses. One of the main hurdles of cloud computing is that not only the application, but also the data has to be moved to the cloud. Networking speed severely limits the amount of data that can travel between the cloud and the user, between different sites of the same cloud provider, or indeed between different cloud providers. It is therefore important to keep applications near the data itself. This paper investig…

research product

FADaC

Solid state drives (SSDs) implement a log-structured write pattern, where obsolete data remains stored on flash pages until the flash translation layer (FTL) erases them. erase() operations, however, cannot erase a single page, but target entire flash blocks. Since these victim blocks typically store a mix of valid and obsolete pages, FTLs have to copy the valid data to a new block before issuing an erase() operation. This process therefore increases the latencies of concurrent I/Os and reduces the lifetime of flash memory. Data classification schemes identify data pages with similar update frequencies and group them together. FTLs can use this grouping to design garbage collection strategi…

research product

Pure Functions in C: A Small Keyword for Automatic Parallelization

The need for parallel task execution has been steadily growing in recent years since manufacturers mainly improve processor performance by increasing the number of installed cores instead of scaling the processor’s frequency. To make use of this potential, an essential technique to increase the parallelism of a program is to parallelize loops. Several automatic loop nest parallelizers have been developed in the past such as PluTo. The main restriction of these tools is that the loops must be statically analyzable which, among other things, disallows function calls within the loops. In this article, we present a seemingly simple extension to the C programming language which marks fun…
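
The guarantee that a loop body is a pure function (no side effects, result depends only on its arguments) is what makes parallelizing the loop safe. The article's mechanism is a C keyword plus compiler support; the sketch below only illustrates the underlying idea in Python by mapping a pure loop body over a process pool.

```python
from concurrent.futures import ProcessPoolExecutor

def body(i: int) -> int:
    """A 'pure' loop body: no side effects, output depends only on the input,
    so iterations can run in any order and in parallel."""
    return i * i + 3 * i

if __name__ == "__main__":
    # Sequential loop ...
    sequential = [body(i) for i in range(1000)]
    # ... and the same loop parallelized, safe only because body() is pure.
    with ProcessPoolExecutor() as pool:
        parallel = list(pool.map(body, range(1000)))
    assert sequential == parallel
```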

research product

Using On-Demand File Systems in HPC Environments

In modern HPC systems, parallel (distributed) file systems are used to allow fast access from and to the storage infrastructure. However, I/O performance in large-scale HPC systems has failed to keep up with the increase in computational power. As a result, the I/O subsystem which also has to cope with a large number of demanding metadata operations is often the bottleneck of the entire HPC system. In some cases, even a single bad behaving application can be held responsible for slowing down the entire HPC system, disrupting other applications that use the same I/O subsystem. These kinds of situations are likely to become more frequent in the future with larger and more powerful HPC systems…

research product

Scalable Monitoring System for Clouds

Although cloud computing has become an important topic over the last couple of years, the development of cloud-specific monitoring systems has been neglected. This is surprising considering their importance for metering services and, thus, being able to charge customers. In this paper we introduce a monitoring architecture that was developed and is currently implemented in the EASI-CLOUDS project. The demands on cloud monitoring systems are manifold. Regular checks of the SLAs and the precise billing of the resource usage, for instance, require the collection and converting of infrastructure readings in short intervals. To ensure the scalability of the whole cloud, the monitoring system mus…

research product

A configurable rule based classful token bucket filter network request scheduler for the Lustre file system

HPC file systems today work in a best-effort manner where individual applications can flood the file system with requests, effectively leading to a denial of service for all other tasks. This paper presents a classful Token Bucket Filter (TBF) policy for the Lustre file system. The TBF enforces Remote Procedure Call (RPC) rate limitations based on (potentially complex) Quality of Service (QoS) rules. The QoS rules are enforced in Lustre's Object Storage Servers, where each request is assigned to an automatically created QoS class. The proposed QoS implementation for Lustre enables various features for each class including the support for high-priority and real-time requests even under heavy …
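
A token bucket admits a request only if a token is available, with tokens refilled at the configured rate; keeping one bucket per QoS class yields the classful behavior described above. The sketch below is a generic token bucket, not Lustre's TBF implementation.

```python
import time

class TokenBucket:
    """Generic token bucket: refill at `rate` tokens/s up to `burst`,
    admit a request only if a whole token can be taken."""

    def __init__(self, rate: float, burst: float):
        self.rate, self.burst = rate, burst
        self.tokens = burst
        self.last = time.monotonic()

    def allow(self) -> bool:
        now = time.monotonic()
        self.tokens = min(self.burst, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False

# One bucket per QoS class, e.g. keyed by job ID or client (illustrative only).
buckets = {"interactive": TokenBucket(rate=1000, burst=100),
           "batch": TokenBucket(rate=100, burst=10)}
print(buckets["batch"].allow())
```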

research product

Reducing False Node Failure Predictions in HPC

Future HPC applications must be able to scale to thousands of compute nodes, while running for several days. The increased runtime and node count inconveniently raises the probability of hardware failures that may interrupt computations. Scientists must therefore protect their simulations against hardware failures. This is typically done using frequent checkpoint & restart, which may have significant overheads. Consequently, the frequency in which checkpoints are taken should be minimized. Predicting hardware failures ahead of time is a promising approach to address this problem, but has remaining issues like false alarms at large scales. In this paper, we introduce the probability of unnece…

research product

Challenges and Solutions for Tracing Storage Systems

IBM Spectrum Scale’s parallel file system General Parallel File System (GPFS) has a 20-year development history with over 100 contributing developers. Its ability to support strict POSIX semantics across more than 10K clients leads to a complex design with intricate interactions between the cluster nodes. Tracing has proven to be a vital tool to understand the behavior and the anomalies of such a complex software product. However, the necessary trace information is often buried in hundreds of gigabytes of by-product trace records. Further, the overhead of tracing can significantly impact running applications and file system performance, limiting the use of tracing in a production system. In…

research product

Simurgh

The availability of non-volatile main memory (NVMM) has started a new era for storage systems and NVMM specific file systems can support extremely high data and metadata rates, which are required by many HPC and data-intensive applications. Scaling metadata performance within NVMM file systems is nevertheless often restricted by the Linux kernel storage stack, while simply moving metadata management to the user space can compromise security or flexibility. This paper introduces Simurgh, a hardware-assisted user space file system with decentralized metadata management that allows secure metadata updates from within user space. Simurgh guarantees consistency, durability, and ordering of updat…

research product

MERCURY: A Transparent Guided I/O Framework for High Performance I/O Stacks

The performance gap between processors and I/O represents a serious scalability limitation for applications running on computing clusters. Parallel file systems often provide mechanisms that allow programmers to disclose their I/O pattern knowledge to the lower layers of the I/O stack through a hints API. This information can be used by the file system to boost the application performance. Unfortunately, programmers rarely make use of these features, missing the opportunity to exploit the full potential of the storage system. In this paper we propose MERCURY, a transparent guided I/O framework able to optimize file I/O patterns in scientific applications, allowing users to control the I/O b…

research product

VarySched: A Framework for Variable Scheduling in Heterogeneous Environments

Despite many efforts to better utilize the potential of GPUs and CPUs, it is far from being fully exploited. Although many tasks can be easily sped up by using accelerators, most of the existing schedulers are not flexible enough to really optimize the resource usage of the complete system. The main reasons are (i) that each processing unit requires a specific program code and that this code is often not provided for every task, and (ii) that schedulers may follow the run-until-completion model and, hence, disallow resource changes during runtime. In this paper, we present VarySched, a configurable task scheduler framework tailored to efficiently utilize all available computing resources in…

research product

Deriving and comparing deduplication techniques using a model-based classification

Data deduplication has been a hot research topic and a large number of systems have been developed. These systems are usually seen as an inherently linked set of characteristics. However, a detailed analysis shows independent concepts that can be used in other systems. In this work, we perform this analysis on the main representatives of deduplication systems. We embed the results in a model, which shows two yet unexplored combinations of characteristics. In addition, the model enables a comprehensive evaluation of the representatives and the two new systems. We perform this evaluation based on real world data sets.

research product

Random Slicing: Efficient and Scalable Data Placement for Large-Scale Storage Systems

The ever-growing amount of data requires highly scalable storage solutions. The most flexible approach is to use storage pools that can be expanded and scaled down by adding or removing storage devices. To make this approach usable, it is necessary to provide a solution to locate data items in such a dynamic environment. This article presents and evaluates the Random Slicing strategy, which incorporates lessons learned from table-based, rule-based, and pseudo-randomized hashing strategies and is able to provide a simple and efficient strategy that scales up to handle exascale data. Random Slicing keeps a small table with information about previous storage system insert and remove operations…
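
The lookup side of an interval-based placement strategy like this is simple: hash the item into [0, 1) and return the device owning the slice the hash falls into. The sketch below shows only that lookup over a hypothetical, already-built interval table; how Random Slicing builds and updates the table while minimizing data movement is the actual contribution and is not reproduced here.

```python
import bisect
import hashlib

# Hypothetical interval table: the unit range [0, 1) is partitioned into slices,
# each owned by a storage device. Building and updating this table on inserts
# and removals is the core of Random Slicing and is not shown here.
slice_starts = [0.0, 0.25, 0.5, 0.75]
slice_owner  = ["dev0", "dev1", "dev2", "dev3"]

def locate(key: str) -> str:
    """Map a key uniformly into [0, 1) and return the owning device."""
    h = int(hashlib.sha1(key.encode()).hexdigest(), 16)
    x = (h % 10**12) / 10**12
    return slice_owner[bisect.bisect_right(slice_starts, x) - 1]

print(locate("block-42"))
```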

research product

DelveFS - An Event-Driven Semantic File System for Object Stores

Data-driven applications are becoming increasingly important in numerous industrial and scientific fields, growing the need for scalable data storage, such as object storage. Yet, many data-driven applications cannot use object interfaces directly and often have to rely on third-party file system connectors that support only a basic representation of objects as files in a flat namespace. With sometimes millions of objects per bucket, this simple organization is insufficient for users and applications who are usually only interested in a small subset of objects. These huge buckets are not only lacking basic semantic properties and structure, but they are also challenging to manage from a tec…

research product

Randomized renaming in shared memory systems.

Renaming is a task in distributed computing where n processes are assigned new names from a name space of size m. The problem is called tight if m = n, and loose if m > n. In recent years renaming came to the fore again and new algorithms were developed. For tight renaming in asynchronous shared memory systems, Alistarh et al. describe a construction based on the AKS network that assigns all names within O(log n) steps per process. They also show that, depending on the size of the name space, loose renaming can be done considerably faster. For m = (1 + ϵ)·n and constant ϵ, they achieve a step complexity of O(log log n). In this paper we consider tight as well as loos…

research product

Balls into non-uniform bins

Balls-into-bins games for uniform bins are widely used to model randomized load balancing strategies. Recently, balls-into-bins games have been analysed under the assumption that the selection probabilities for bins are not uniformly distributed. These new models are motivated by properties of many peer-to-peer (P2P) networks, which are not able to perfectly balance the load over the bins. While previous evaluations try to find strategies for uniform bins under non-uniform bin selection probabilities, this paper investigates heterogeneous bins, where the "capacities" of the bins might differ significantly. We show that heterogeneous environments can even help to distribute the load more eve…
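
A quick way to build intuition for such results is to simulate the process: bins with heterogeneous capacities are selected with probability proportional to their capacity, and the quantity of interest is the load relative to capacity. The simulation below is an illustrative toy model, not the paper's exact setting.

```python
import random

def simulate(capacities, n_balls, seed=1):
    """Throw n_balls into bins chosen with probability proportional to capacity
    and report the maximum load relative to capacity (toy model only)."""
    rng = random.Random(seed)
    loads = [0] * len(capacities)
    for _ in range(n_balls):
        i = rng.choices(range(len(capacities)), weights=capacities)[0]
        loads[i] += 1
    return max(load / cap for load, cap in zip(loads, capacities))

# Heterogeneous bins: a few large bins and many small ones.
caps = [8] * 10 + [1] * 80
print(simulate(caps, n_balls=10_000))
```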

research product

Improving MLC flash performance and endurance with extended P/E cycles

The traditional usage pattern for NAND flash memory is the program/erase (P/E) cycle: the flash pages that make a flash block are all programmed in order and then the whole flash block needs to be erased before the pages can be programmed again. The erase operations are slow, wear out the medium, and require costly garbage collection procedures. Reducing their number is therefore beneficial both in terms of performance and endurance. The physical structure of flash cells limits the number of opportunities to overcome the 1 to 1 ratio between programming and erasing pages: a bit storing a logical 0 cannot be reprogrammed to a logical 1 before the end of the P/E cycle. This paper presents a t…

research product

Effects and Benefits of Node Sharing Strategies in HPC Batch Systems

Processor manufacturers today scale performance by increasing the number of cores on each CPU. Unfortunately, not all HPC applications can efficiently saturate all cores of a single node, even if they successfully scale to thousands of nodes. For these applications, sharing nodes with other applications can help to stress different resources on the nodes to more efficiently use them. Previous work has shown that the performance impact of node sharing is very application dependent but very little work has studied its effects within batch systems and for complex parallel application mixes. Administrators therefore typically fear the complexity of running a batch system supporting node sharing…

research product

Hyperion

Indexes are essential in data management systems to increase the speed of data retrievals. Widespread data structures to provide fast and memory-efficient indexes are prefix tries. Implementations like Judy, ART, or HOT optimize their internal alignments for cache and vector unit efficiency. While these measures usually improve the performance substantially, they can have a negative impact on memory efficiency. In this paper we present Hyperion, a trie-based main-memory key-value store achieving extreme space efficiency. In contrast to other data structures, Hyperion does not depend on CPU vector units, but scans the data structure linearly. Combined with a custom memory allocator, Hyperion…
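
For readers unfamiliar with prefix tries as index structures, the sketch below shows a deliberately naive, pointer-based trie key-value store; Hyperion's contribution lies precisely in replacing such a layout with a linearly scanned, allocator-aware representation, which is not reproduced here.

```python
class TrieKV:
    """Naive prefix-trie key-value store: one dict node per key character.
    Deliberately simple; real tries (Judy, ART, HOT, Hyperion) optimize the
    node layout for cache efficiency and memory footprint."""

    def __init__(self):
        self.root = {}

    def put(self, key: str, value):
        node = self.root
        for ch in key:
            node = node.setdefault(ch, {})
        node["__value__"] = value

    def get(self, key: str):
        node = self.root
        for ch in key:
            node = node.get(ch)
            if node is None:
                return None
        return node.get("__value__")

kv = TrieKV()
kv.put("user:42:name", "alice")
print(kv.get("user:42:name"))
```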

research product

POSTER: Optimizing scientific file I/O patterns using advice based knowledge

Before us, other works have used data prefetching to boost application performance [1]–[8]. Our approach differs from these works since we do not rely on precise I/O pattern information to predict and prefetch every chunk of data in advance. Instead we use data prefetching to group many small requests into a few big ones, improving application performance and the utilization of the whole storage system. Moreover, we provide the infrastructure that enables users to access file system specific interfaces for guided I/O without modifying applications and hiding the intrinsic complexity that such interfaces introduce.

research product

Persistent software transactional memory in Haskell

Emerging persistent memory in commodity hardware allows byte-granular accesses to persistent state at memory speeds. However, to prevent inconsistent state in persistent memory due to unexpected system failures, different write-semantics are required compared to volatile memory. Transaction-based library solutions for persistent memory facilitate the atomic modification of persistent data in languages where memory is explicitly managed by the programmer, such as C/C++. For languages that provide extended capabilities like automatic memory management, a more native integration into the language is needed to maintain the high level of memory abstraction. It is shown in this paper how persiste…

research product

Evaluation of a hash-compress-encrypt pipeline for storage system applications

Great efforts are made to store data in a secure, reliable, and authentic way in large storage systems. Specialized, system-specific clients help to achieve these goals. Nevertheless, often standard tools for hashing, compressing, and encrypting data are arranged in transparent pipelines. We analyze the potential of Unix shell pipelines with several high-speed and high-compression algorithms that can be used to achieve data security, reduction, and authenticity. Furthermore, we compare the pipelines of standard tools against an in-house pipeline implemented in C++ and show that there is great potential for performance improvement.
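
The kind of pipeline being benchmarked can be approximated with standard-library primitives; the sketch below chains a hashing and a compression stage in Python (an encryption stage is omitted since it would need a third-party library) and is only meant to make the pipeline structure concrete, not to reproduce the paper's measurement setup.

```python
import hashlib
import zlib

def hash_compress(data: bytes, level: int = 6):
    """Stream data through a hash stage and a compression stage, mimicking
    `... | sha256sum` and `... | gzip` in a shell pipeline. An encryption
    stage would follow the same chunk-by-chunk pattern with a cipher object."""
    digest = hashlib.sha256()
    compressor = zlib.compressobj(level)
    out = bytearray()
    chunk_size = 1 << 20
    for off in range(0, len(data), chunk_size):
        chunk = data[off:off + chunk_size]
        digest.update(chunk)                 # authenticity / integrity stage
        out += compressor.compress(chunk)    # data reduction stage
    out += compressor.flush()
    return digest.hexdigest(), bytes(out)

checksum, compressed = hash_compress(b"example payload " * 1000)
print(checksum[:16], len(compressed))
```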

research product

A gearbox model for processing large volumes of data by using pipeline systems encapsulated into virtual containers

Software pipelines enable organizations to chain applications for adding value to contents (e.g., confidentially, reliability, and integrity) before either sharing them with partners or sending them to the cloud. However, the pipeline components add overhead when processing large volumes of data, which can become critical in real-world scenarios. This paper presents a gearbox model for processing large volumes of data by using pipeline systems encapsulated into virtual containers. In this model, the gears represent applications, whereas gearboxes represent software pipelines. This model was implemented as a collaborative system that automatically performs Gear up (by using parallel patterns…

research product

Direct lookup and hash-based metadata placement for local file systems

New challenges to file systems' metadata performance are imposed by the continuously growing number of files existing in file systems. The total amount of metadata can become too big to be cached, potentially leading to multiple storage device accesses for a single metadata lookup operation. This paper takes a look at the limitations of traditional file system designs and discusses an alternative metadata handling approach, using hash-based concepts already established for metadata and data placement in distributed storage systems. Furthermore, a POSIX compliant prototype implementation based on these concepts is introduced and benchmarked. A variety of file system metadata and data operati…
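
The central idea can be illustrated with a small lookup sketch: the full path is hashed directly to a metadata bucket, so a lookup touches one location regardless of path depth instead of traversing every component. This is a conceptual sketch with invented helper names, not the paper's prototype.

```python
import hashlib

N_BUCKETS = 1024
buckets = [dict() for _ in range(N_BUCKETS)]   # hypothetical metadata buckets

def bucket_of(path: str) -> int:
    """Hash the full path to a bucket, avoiding per-component traversal."""
    return int(hashlib.sha1(path.encode()).hexdigest(), 16) % N_BUCKETS

def create(path: str, inode: dict):
    buckets[bucket_of(path)][path] = inode

def lookup(path: str):
    # One hash plus one bucket probe, independent of the number of components.
    return buckets[bucket_of(path)].get(path)

create("/home/alice/projects/paper/draft.tex", {"size": 4096})
print(lookup("/home/alice/projects/paper/draft.tex"))
```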

research product

Sorted deduplication: How to process thousands of backup streams

The requirements of deduplication systems have changed in the last years. Early deduplication systems had to process dozens to hundreds of backup streams at the same time while today they are able to process hundreds to thousands of them. Traditional approaches rely on stream-locality, which supports parallelism, but which easily leads to many non-contiguous disk accesses, as each stream competes with all other streams for the available resources. This paper presents a new exact deduplication approach designed for processing thousands of backup streams at the same time on the same fingerprint index. The underlying approach destroys the traditionally exploited temporal chunk locality and cre…

research product

Advanced Stochastic Petri Net Modeling with the Mercury Scripting Language

Formal models are widely used in performance and dependability studies of computational systems. Graphical modeling tools allow users to compose such models with ease, but they complicate the creation of models with a dynamic/complex structure, the hierarchical arrangement of different models, and the automatic execution of models with different parameter configurations. To overcome this problem, we created a scripting language for the Mercury tool that supports the combination of different modeling approaches (e.g., Stochastic Petri Nets and Reliability Block Diagrams) in a single project. In this paper, we focus on the extensions developed to improve the capabilities of Generalized Stocha…

research product

Lone Star Stack: Architecture of a Disk-Based Archival System

The need for huge storage systems rises with the ever growing creation of data. With growing capacities and shrinking prices, "write once read sometimes" workloads become more common. New data is constantly added, rarely updated or deleted, and every stored byte might be read at any time - a common pattern for digital archives or big data scenarios. We present the Lone Star Stack, a disk based archival storage system building block that is optimized for high reliability and energy efficiency. It provides a POSIX file system interface that uses flash based storage for write-offloading and metadata and the disk-based Lone Star RAID for user data storage. The RAID attempts to spin down disks a…

research product

NVMM-Oriented Hierarchical Persistent Client Caching for Lustre

In high-performance computing (HPC), data and metadata are stored on special server nodes and client applications access the servers’ data and metadata through a network, which induces network latencies and resource contention. These server nodes are typically equipped with (slow) magnetic disks, while the client nodes store temporary data on fast SSDs or even on non-volatile main memory (NVMM). Therefore, the full potential of parallel file systems can only be reached if fast client side storage devices are included into the overall storage architecture. In this article, we propose an NVMM-based hierarchical persistent client cache for the Lustre file system (NVMM-LPCC for short). NVMM-LPC…

research product

ESB: Ext2 Split Block Device

Solid State Disks (SSDs) are starting to replace rotating media (hard disks, HDDs) in many areas, but are still not cost-efficient enough in terms of capacity to completely replace them. One approach to use their superior performance properties is to use them as a cache for magnetic disks to speed up overall storage operations. In this paper, we present and evaluate a file system level optimization based on ext2. We split metadata and data and store the metadata on an SSD while the data remains on a common HDD. We evaluate our system with filebench under a file server, web server, and web proxy scenario and compare the results with flashcache. We find that many of the scenarios do not contain enough meta…

research product

Algorithmic differentiation for cloud schemes (IFS Cy43r3) using CoDiPack (v1.8.1)

Numerical models in atmospheric sciences not only need to approximate the flow equations on a suitable computational grid, they also need to include subgrid effects of many non-resolved physical processes. Among others, the formation and evolution of cloud particles is an example of such subgrid processes. Moreover, to date there is no universal mathematical description of a cloud, hence many cloud schemes have been proposed and these schemes typically contain several uncertain parameters. In this study, we propose the use of algorithmic differentiation (AD) as a method to identify parameters within the cloud scheme, to which the output of the cloud scheme is most sensitive. We il…

research product

LPCC

Most high-performance computing (HPC) clusters use a global parallel file system to enable high data throughput. The parallel file system is typically centralized and its storage media are physically separated from the compute cluster. Compute nodes as clients of the parallel file system are often additionally equipped with SSDs. The node internal storage media are rarely well-integrated into the I/O and compute workflows. How to make full and flexible use of these storage media is therefore a valuable research question. In this paper, we propose a hierarchical Persistent Client Caching (LPCC) mechanism for the Lustre file system. LPCC provides two modes: RW-PCC builds a read-write cache on…

research product

Challenges and Opportunities of User-Level File Systems for HPC

research product

GekkoFS — A Temporary Burst Buffer File System for HPC Applications

Many scientific fields increasingly use high-performance computing (HPC) to process and analyze massive amounts of experimental data while storage systems in today’s HPC environments have to cope with new access patterns. These patterns include many metadata operations, small I/O requests, or randomized file I/O, while general-purpose parallel file systems have been optimized for sequential shared access to large files. Burst buffer file systems create a separate file system that applications can use to store temporary data. They aggregate node-local storage available within the compute nodes or use dedicated SSD clusters and offer a peak bandwidth higher than that of the backend parallel f…

research product

Scheduling shared continuous resources on many-cores

We consider the problem of scheduling a number of jobs on m identical processors sharing a continuously divisible resource. Each job j comes with a resource requirement r_j ∈ [0, 1]. The job can be processed at full speed if granted its full resource requirement. If receiving only an x-portion of r_j, it is processed at an x-fraction of the full speed. Our goal is to find a resource assignment that minimizes the makespan (i.e., the latest completion time). Variants of such problems, relating the resource assignment of jobs to their processing speeds, have been studied under the term discrete-continuous scheduling. Known results are either very pessimistic or heuristic in nature. In this paper, …
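
The speed model is easy to state concretely: a job with processing time p_j and requirement r_j that is granted only g ≤ r_j units of the resource runs at speed x = g / r_j and therefore needs p_j / x time. A one-function sketch of exactly this rule (not of the paper's scheduling algorithms):

```python
def processing_time(p_j: float, r_j: float, granted: float) -> float:
    """A job with requirement r_j granted only `granted` units of the shared
    resource runs at speed x = granted / r_j (capped at 1) and thus needs
    p_j / x time units."""
    x = min(1.0, granted / r_j) if r_j > 0 else 1.0
    return p_j / x

# Example: a job needing 0.8 of the resource but granted only 0.4 runs at
# half speed, doubling its processing time.
print(processing_time(10.0, 0.8, 0.4))   # 20.0
```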

research product

Design of an exact data deduplication cluster

Data deduplication is an important component of enterprise storage environments. The throughput and capacity limitations of single node solutions have led to the development of clustered deduplication systems. Most implemented clustered inline solutions are trading deduplication ratio versus performance and are willing to miss opportunities to detect redundant data, which a single node system would detect. We present an inline deduplication cluster with a joint distributed chunk index, which is able to detect as much redundancy as a single node solution. The use of locality and load balancing paradigms enables the nodes to minimize information exchange. Therefore, we are able to show that, …

research product

LoneStar RAID

The need for huge storage archives rises with the ever growing creation of data. With today’s big data and data analytics applications, some of these huge archives become active in the sense that all stored data can be accessed at any time. Running and evolving these archives is a constant tradeoff between performance, capacity, and price. We present the LoneStar RAID, a disk-based storage architecture, which focuses on high reliability, low energy consumption, and cheap reads. It is designed for MAID systems with up to hundreds of disk drives per server and is optimized for “write once, read sometimes” workloads. We use dedicated data and parity disks, and export the data disks as individu…

research product

Topic 5: Parallel and Distributed Data Management

Nowadays we are facing an exponential growth of new data that is overwhelming the capabilities of companies, institutions and society in general to manage and use it in a proper way. Ever-increasing investments in Big Data, cutting-edge technologies and the latest advances in both application development and underlying storage systems can help deal with data of such magnitude. Especially parallel and distributed approaches will enable new data management solutions that operate effectively at large scale.

research product

The MoSGrid Science Gateway – A Complete Solution for Molecular Simulations

The MoSGrid portal offers an approach to carry out high-quality molecular simulations on distributed compute infrastructures to scientists with all kinds of background and experience levels. A user-friendly Web interface guarantees the ease-of-use of modern chemical simulation applications well established in the field. The usage of well-defined workflows annotated with metadata largely improves the reproducibility of simulations in the sense of good lab practice. The MoSGrid science gateway supports applications in the domains quantum chemistry (QC), molecular dynamics (MD), and docking. This paper presents the open-source MoSGrid architecture as well as lessons learned from its design.

research product

Smart grid-aware scheduling in data centres

In several countries the expansion and establishment of renewable energies result in widely scattered and often weather-dependent energy production, decoupled from energy demand. Large, fossil-fuelled power plants are gradually replaced by many small power stations that transform wind, solar and water power into electrical power. This leads to changes in the historically evolved power grid that favours top-down energy distribution from a backbone of large power plants to widespread consumers. Now, with the increase of energy production in lower layers of the grid, there is also a bottom-up flow within the grid infrastructure, compromising its stability. In order to locally adapt the energy deman…

research product

Migration Techniques in HPC Environments

Process migration is an important feature in modern computing centers as it allows for a more efficient use and maintenance of hardware. Especially in virtualized infrastructures it is successfully exploited by schemes for load balancing and energy efficiency. One can divide the tools and techniques into three groups: Process-level migration, virtual machine migration, and container-based migration.

research product

ADA-FS—Advanced Data Placement via Ad hoc File Systems at Extreme Scales

Today’s High-Performance Computing (HPC) environments increasingly have to manage relatively new access patterns (e.g., large numbers of metadata operations) which general-purpose parallel file systems (PFS) were not optimized for. Burst-buffer file systems aim to solve that challenge by spanning an ad hoc file system across node-local flash storage at compute nodes to relieve the PFS from such access patterns. However, existing burst-buffer file systems still support many of the traditional file system features, which are often not required in HPC applications, at the cost of file system performance.

research product

An Analysis of Flash Page Reuse With WOM Codes

Flash memory is prevalent in modern servers and devices. Coupled with the scaling down of flash technology, the popularity of flash memory motivates the search for methods to increase flash reliability and lifetime. Erasures are the dominant cause of flash cell wear, but reducing them is challenging because flash is a write-once medium: memory cells must be erased prior to writing. An approach that has recently received considerable attention relies on write-once memory (WOM) codes, designed to accommodate additional writes on write-once media. However, the techniques proposed for reusing flash pages with WOM codes are limited in their scope. Many focus on the coding theory alone, whereas o…
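
The canonical example of such a code is the Rivest–Shamir scheme, which stores two successive 2-bit values in three write-once cells using only 0→1 cell transitions between writes (on flash the polarity is mirrored, since programming clears bits). The sketch below is that textbook code, not the flash-management machinery discussed in the paper.

```python
# Rivest-Shamir write-once-memory (WOM) code: two writes of a 2-bit value
# into three write-once cells, using only 0 -> 1 cell transitions.
FIRST  = {0b00: 0b000, 0b01: 0b100, 0b10: 0b010, 0b11: 0b001}
SECOND = {0b00: 0b111, 0b01: 0b011, 0b10: 0b101, 0b11: 0b110}

def decode(cells: int) -> int:
    """Codewords with at most one set cell belong to the first write,
    all others to the second write."""
    table = FIRST if bin(cells).count("1") <= 1 else SECOND
    return {v: k for k, v in table.items()}[cells]

def write(cells: int, value: int) -> int:
    """Encode `value`; only 0 -> 1 transitions are ever required."""
    if decode(cells) == value:
        return cells                      # already stores the value
    new = FIRST[value] if cells == 0b000 else SECOND[value]
    assert new & cells == cells           # never clears a programmed cell
    return new

cells = write(0b000, 0b10)   # first write: value 10 -> cells 010
cells = write(cells, 0b01)   # second write: value 01 -> cells 011, bits only set
print(bin(cells), decode(cells))
```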

research product

Compiler Driven Automatic Kernel Context Migration for Heterogeneous Computing

Computer systems provide different heterogeneous resources (e.g., GPUs, DSPs and FPGAs) that accelerate applications and can reduce their energy consumption. Usually, these resources have an isolated memory and require target-specific code to be written. There exist tools that can automatically generate target-specific code for program parts, so-called kernels. The data objects required for a target kernel execution need to be moved to the target resource memory. It is the programmers' responsibility to serialize these data objects used in the kernel and to copy them to or from the resource's memory. Typically, the programmer writes his own serializing function or uses e…

research product

Algorithmic Differentiation for Cloud Schemes

Numerical models in atmospheric sciences do not only need to approximate the flow equations on a suitable computational grid, they also need to include subgrid effects of many non-resolved physical processes. Among others, the formation and evolution of cloud particles is an example of such subgrid processes. Moreover, to date there is no universal mathematical description of a cloud, hence many cloud schemes were proposed and these schemes typically contain several uncertain parameters. In this study, we propose the use of algorithmic differentiation (AD) as a method to identify parameters within the cloud scheme, to which the output of the cloud scheme is most sensitive.…

research product