6533b834fe1ef96bd129d4a9
RESEARCH PRODUCT
Parallelized Clustering of Protein Structures on CUDA-Enabled GPUs
Hoang-vu DangAnna Katharina HildebrandtAndreas HildebrandtBertil Schmidtsubject
CUDASpeedupComputer scienceNearest-neighbor chain algorithmParallel computingCluster analysisRoot-mean-square deviationPoseWard's methodHierarchical clusteringdescription
Estimation of the pose in which two given molecules might bind together to form a potential complex is a crucial task in structural biology. To solve this so-called "docking problem", most algorithms initially generate large numbers of candidate poses (or decoys) which are then clustered to allow for subsequent computationally expensive evaluations of reasonable representatives. Since the number of such candidates ranges from thousands to millions, performing the clustering on standard CPUs is highly time consuming. In this paper we analyze and evaluate different approaches to parallelize the nearest neighbor chain algorithm to perform hierarchical Ward clustering of protein structures using both atom-based root mean square deviation (RMSD) and rigid-based RMSD molecular distances on a GPU. This leads to a speedup of around one order-of-magnitude of our CUDA implementation on a GeForce Titan GPU compared to a multi-threaded CPU implementation on a Core-i7 2700.
year | journal | country | edition | language |
---|---|---|---|---|
2014-02-01 | 2014 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing |