Search results for "Supercomputer"

showing 10 items of 45 documents

Cell-List based Molecular Dynamics on Many-Core Processors: A Case Study on Sunway TaihuLight Supercomputer

2020

Molecular dynamics (MD) simulations are playing an increasingly important role in several research areas. The most frequently used potentials in MD simulations are pair-wise potentials. Due to the memory wall, computing pair-wise potentials on many-core processors are usually memory bounded. In this paper, we take the SW26010 processor as an exemplary platform to explore the possibility to break the memory bottleneck by improving data reusage via cell-list-based methods. We use cell-lists instead of neighbor-lists in the potential computation, and apply a number of novel optimization methods. Theses methods include: an adaptive replica arrangement strategy, a parameter profile data structur…

CoprocessorCell lists010304 chemical physicsComputer scienceReplica020207 software engineering02 engineering and technologyParallel computingSupercomputerData structure01 natural sciencesBottleneckMolecular dynamics0103 physical sciencesScalability0202 electrical engineering electronic engineering information engineeringSunway TaihuLightSC20: International Conference for High Performance Computing, Networking, Storage and Analysis
researchProduct

Portable Video Supercomputing

2004

As inexpensive imaging chips and wireless telecommunications are incorporated into an increasing array, of portable products, the need for high efficiency, high throughput embedded processing will become an important challenge in computer architecture. Videocentric applications, such wireless videoconferencing, real-time video enhancement and analysis, and new, immersive modes of distance education, will exceed the computational capabilities of current microprocessor and digital signal processor (DSP) architectures. A new class of embedded computers, portable video supercomputers, will combine supercomputer performance with the energy efficiency required for deployment in portable systems. …

Digital signal processorComputer scienceData parallelismVideo processingSupercomputerTheoretical Computer ScienceMicroarchitectureMPEG encodinglaw.inventionMicroprocessorComputational Theory and MathematicsComputer architectureHardware and ArchitecturelawSIMDSoftware
researchProduct

Use of Supercomputing towards the Generation of a Process Execution Plan in Distributed Real-Time Systems

2010

This work consider the scheduling of periodic tasks or processes with real-time constraints in a distributed environment. Each task must be executed meeting deadlines, precedence relationships and resources constraints. The problem of scheduling tasks on a distributed environment can be viewed as a problem of assigning processes to the processors but keeping the schedulability in local environments. Thus, the problem can be divided in two phases: the first phase is assigning processes to processors and the second is to schedule assigned processes in each processor in the distributed environment. This paper focuses in the first phase. It introduces a heuristic mechanism for assigning process…

Distributed Computing EnvironmentStatistical classificationComputer scienceDistributed computingReal-time computingExecution planTask analysisProcessor schedulingUtilization factorSupercomputerScheduling (computing)2010 IEEE Electronics, Robotics and Automotive Mechanics Conference
researchProduct

Computational nuclear quantum many-body problem: The UNEDF project

2013

The UNEDF project was a large-scale collaborative effort that applied high-performance computing to the nuclear quantum many-body problem. The primary focus of the project was on constructing, validating, and applying an optimized nuclear energy density functional, which entailed a wide range of pioneering developments in microscopic nuclear structure and reactions, algorithms, high-performance computing, and uncertainty quantification. UNEDF demonstrated that close associations among nuclear physicists, mathematicians, and computer scientists can lead to novel physics outcomes built on algorithmic innovations and computational developments. This review showcases a wide range of UNEDF scien…

Energy density functionalNuclear Theoryta114Computer scienceFOS: Physical sciencesGeneral Physics and AstronomyComputerApplications_COMPUTERSINOTHERSYSTEMSSupercomputerNuclear Theory (nucl-th)Many-body problemRange (mathematics)Hardware and ArchitectureSystems engineeringStatistical physicsUncertainty quantificationQuantumNuclear theoryComputer Physics Communications
researchProduct

AIOC2: A deep Q-learning approach to autonomic I/O congestion control in Lustre

2021

Abstract In high performance computing systems, I/O congestion is a common problem in large-scale distributed file systems. However, the current implementation mainly requires administrator to manually design low-level implementation and optimization, we proposes an adaptive I/O congestion control framework, named AIOC 2 , which can not only adaptively tune the I/O congestion control parameters, but also exploit the deep Q-learning method to start the training parameters and optimize the tuning for different types of workloads from the server and the client at the same time. AIOC 2 combines the feedback-based dynamic I/O congestion control and deep Q-learning parameter tuning technology to …

ExploitComputer Networks and CommunicationsComputer sciencebusiness.industryQ-learningInterference (wave propagation)SupercomputerComputer Graphics and Computer-Aided DesignTheoretical Computer ScienceNetwork congestionArtificial IntelligenceHardware and ArchitectureEmbedded systemLustre (file system)Latency (engineering)businessThroughput (business)SoftwareParallel Computing
researchProduct

Real-time computation of parameter fitting and image reconstruction using graphical processing units

2016

Abstract In recent years graphical processing units (GPUs) have become a powerful tool in scientific computing. Their potential to speed up highly parallel applications brings the power of high performance computing to a wider range of users. However, programming these devices and integrating their use in existing applications is still a challenging task. In this paper we examined the potential of GPUs for two different applications. The first application, created at Paul Scherrer Institut (PSI), is used for parameter fitting during data analysis of μ SR (muon spin rotation, relaxation and resonance) experiments. The second application, developed at ETH, is used for PET (Positron Emission T…

FOS: Computer and information sciencesMulti-core processorSpeedup010308 nuclear & particles physicsComputer scienceComputationFOS: Physical sciencesGeneral Physics and AstronomyIterative reconstructionComputational Physics (physics.comp-ph)Supercomputer01 natural sciences030218 nuclear medicine & medical imagingComputational science03 medical and health sciencesRange (mathematics)CUDA0302 clinical medicineComputer Science - Distributed Parallel and Cluster ComputingHardware and Architecture0103 physical sciencesSingle-coreDistributed Parallel and Cluster Computing (cs.DC)Physics - Computational PhysicsComputer Physics Communications
researchProduct

A prospect for computing in porous materials research: Very large fluid flow simulations

2016

Abstract Properties of porous materials, abundant both in nature and industry, have broad influences on societies via, e.g. oil recovery, erosion, and propagation of pollutants. The internal structure of many porous materials involves multiple scales which hinders research on the relation between structure and transport properties: typically laboratory experiments cannot distinguish contributions from individual scales while computer simulations cannot capture multiple scales due to limited capabilities. Thus the question arises how large domain sizes can in fact be simulated with modern computers. This question is here addressed using a realistic test case; it is demonstrated that current …

General Computer ScienceComputer scienceLattice Boltzmann method0208 environmental biotechnologyGPULattice Boltzmann methods02 engineering and technologyParallel computing01 natural sciencesPermeability010305 fluids & plasmasTheoretical Computer ScienceComputational sciencePorous materialPetascale computing0103 physical sciencesFluid dynamicsFluid flow simulationPorosityta113ta114Supercomputer020801 environmental engineeringAddressing modePermeability (earth sciences)Petascale computingModeling and SimulationPorous mediumJournal of Computational Science
researchProduct

The HPC Certification Forum: Toward a Globally Acknowledged HPC Certification

2020

The goal of the HPC Certification Forum is to categorize, define, and examine competencies expected from proficient HPC practitioners. The community-led forum is working toward establishing a globally acknowledged HPC certification process, a process that engages with HPC centers to identify gaps in users’ knowledge, and with users to identify the skills required to perform their tasks. In this article, we introduce the forum and summarize the progress made over the last two years. The release of the first officially supported certificate is planned for the second half of 2020.

General Computer ScienceComputer scienceProcess (engineering)05 social sciencesGeneral Engineering050301 educationCertificationCertificateSupercomputerEngineering managementParallel processing (DSP implementation)Task analysis0501 psychology and cognitive sciences0503 education050104 developmental & child psychologyComputing in Science & Engineering
researchProduct

High-speed exhaustive 3-locus interaction epistasis analysis on FPGAs

2015

Abstract Epistasis, the interaction between genes, has become a major topic in molecular and quantitative genetics. It is believed that these interactions play a significant role in genetic variations causing complex diseases. Several algorithms have been employed to detect pairwise interactions in genome-wide association studies (GWAS) but revealing higher order interactions remains a computationally challenging task. State of the art tools are not able to perform exhaustive search for all three-locus interactions in reasonable time even for relatively small input datasets. In this paper we present how a hardware-assisted design can solve this problem and provide fast, efficient and exhaus…

General Computer ScienceComputer sciencebusiness.industryEpistasis and functional genomicsBrute-force searchGenome-wide association studyMutual informationQuantitative geneticsMachine learningcomputer.software_genreSupercomputerTheoretical Computer ScienceModeling and SimulationEpistasisPairwise comparisonArtificial intelligencebusinesscomputerJournal of Computational Science
researchProduct

On the Use of GPU for Accelerating Communication-Aware Mapping Techniques

2015

Different communication-aware mapping techniques were proposed in recent years for improving the performance of distributed systems based on both, off-chip and on-chip networks. Some of these proposals were based on heuristic search for finding pseudo-optimal assignments of tasks and processing elements. However, the technology integration improvements have allowed a significant increase in the number of network nodes, requiring the acceleration of the heuristic search. In this paper, we propose a comparative study of the local search method used in a communication-aware mapping technique, when implemented on different parallel architectures. We compare the performance provided by a version…

General Computer Sciencebusiness.industryComputer scienceGraphics processing unit02 engineering and technologyParallel computingSupercomputer020202 computer hardware & architectureAcceleration0202 electrical engineering electronic engineering information engineeringTechnology integration020201 artificial intelligence & image processingLocal search (optimization)Mapping techniquesArchitecturebusinessThe Computer Journal
researchProduct