Search results for "PGAS"

showing 7 items of 7 documents

Nanoseconds Timing System Based on IEEE 1588 FPGA Implementation

2019

Clock synchronization procedures are mandatory in most physical experiments where event fragments are readout by spatially dislocated sensors and must be glued together to reconstruct key parameters (e.g. energy, interaction vertex etc.) of the process under investigation. These distributed data readout topologies rely on an accurate time information available at the frontend, where raw data are acquired and tagged with a precise timestamp prior to data buffering and central data collecting. This makes the network complexity and latency, between frontend and backend electronics, negligible within upper bounds imposed by the frontend data buffer capability. The proposed research work describ…

EthernetFOS: Computer and information sciencesNuclear and High Energy PhysicsEye diagram; field-programmable gate arrays (FPGAs); front-end electronics; hardware; synchronization; timing systemfront-end electronicEye diagramtiming systemSerial communicationData bufferNetwork topology01 natural sciencesClock synchronizationNOComputer Science - Networking and Internet ArchitecturePE2_20103 physical sciencesSynchronization (computer science)hardwareElectrical and Electronic EngineeringNetworking and Internet Architecture (cs.NI)010308 nuclear & particles physicsbusiness.industrySettore FIS/01 - Fisica Sperimentalefront-end electronicsNuclear Energy and Engineeringfield-programmable gate arrays (FPGAs)Precision Time ProtocolbusinesssynchronizationComputer hardwareData link layer

researchProduct

Parallel and scalable short-read alignment on multi-core clusters using UPC++

2016

[Abstract]: The growth of next-generation sequencing (NGS) datasets poses a challenge to the alignment of reads to reference genomes in terms of alignment quality and execution speed. Some available aligners have been shown to obtain high quality mappings at the expense of long execution times. Finding fast yet accurate software solutions is of high importance to research, since availability and size of NGS datasets continue to increase. In this work we present an efficient parallelization approach for NGS short-read alignment on multi-core clusters. Our approach takes advantage of a distributed shared memory programming model based on the new UPC++ language. Experimental results using the …

Parallel computingInternetGenome HumanBioinformaticsPGASShort read alignmentlcsh:RComputational BiologyHigh-Throughput Nucleotide SequencingReproducibility of Resultslcsh:Medicine004 InformatikHumansProgramming Languageslcsh:QHigh performance computinglcsh:ScienceSequence AlignmentAlgorithms004 Data processingResearch Article

researchProduct

A Portable Readout System for Microstrip Silicon Sensors (ALIBAVA)

2009

A readout system for microstrip silicon sensors has been developed. This system is able to measure the collected charge in one or two microstrip silicon sensors by reading out all the channels of the sensor(s), up to 256. The system can operate either with non-irradiated and irradiated sensors as well as with n-type and p-type microstrip silicon sensors. Heavily irradiated sensors will be used at the Super Large Hadron Collider, so this system can be used to research the performance of microstrip silicon sensors in conditions as similar as possible to the Super Large Hadron Collider operating conditions. The system has two main parts: a hardware part and a software part. The hardware part a…

Nuclear and High Energy PhysicsEngineeringanalog processing circuitsMotherboardPhysics::Instrumentation and DetectorsInterface (computing)Analog-digital conversionFPGAshigh energy physics instrumentationUSBMicrostripChargelaw.inventionCharge sensitive amplifiersData acquisitionmicroprocessorslawlogic designElectrical and Electronic Engineeringdetector instrumentationtime to digitalbusiness.industryReading (computer)electronicsDetectorElectrical engineeringConvertersCollectionsfront-end electronicssemiconductor detectorsNuclear Energy and Engineeringdata acquisition systemsdigital integrated circuitsbusinessDaughterboard

researchProduct

The Mu3e Data Acquisition

2020

The Mu3e experiment aims to find or exclude the lepton flavour violating decay $\mu^+\to e^+e^-e^+$ with a sensitivity of one in 10$^{16}$ muon decays. The first phase of the experiment is currently under construction at the Paul Scherrer Institute (PSI, Switzerland), where beams with up to 10$^8$ muons per second are available. The detector will consist of an ultra-thin pixel tracker made from High-Voltage Monolithic Active Pixel Sensors (HV-MAPS), complemented by scintillating tiles and fibres for precise timing measurements. The experiment produces about 100 Gbit/s of zero-suppressed data which are transported to a filter farm using a network of FPGAs and fast optical links. On the filte…

Nuclear and High Energy PhysicsParticle physicsPhysics - Instrumentation and DetectorsMesonPhysics::Instrumentation and Detectorsdata acquisitionfibre: opticalFOS: Physical scienceshigh energy physics instrumentationprinted circuits7. Clean energycomputer: networkOptical fiber communicationData acquisitionsemiconductor detector: pixelOptical switchesmultiprocessor: graphicshardwareSensitivity (control systems)muon+: decay[PHYS.PHYS.PHYS-INS-DET]Physics [physics]/Physics [physics]/Instrumentation and Detectors [physics.ins-det]Electrical and Electronic EngineeringGeneralLiterature_REFERENCE(e.g.dictionariesencyclopediasglossaries)scintillation counterFPGAClocksPhysicsData acquisition (DAQ)MuonPixelMesonsDetectorlepton: flavor: violationField programmable gate arraysDetectorsInstrumentation and Detectors (physics.ins-det)sensitivityNuclear Energy and EngineeringFilter (video)field programmable gate arrays (FPGAs)Data acquisition (DAQ); field programmable gate arrays (FPGAs); high energy physics instrumentation; printed circuitselectronics: readoutHigh Energy Physics::ExperimentLeptonelectronics: design

researchProduct

A novel hardware accelerator for the HEVC intra prediction

2015

International audience; A novel hardware accelerator for the High Efficiency Video Coding (HEVC) intra prediction is presented in this paper in order to reduce the computation complexity within this standard and to accelerate the concerned calculations. We propose a new pipelined structure that we called Processing Element (PE) to execute all angular modes, and we repeat it in five paths that our architecture composed of. We present also another structure to carry out the Planar mode. This architecture supports all intra prediction modes for all prediction unit sizes. The synthesis results show that our design can run at 213 MHz for Xilinx Virtex 6 and is capable to process real time 120 10…

HEVC0209 industrial biotechnologyAdderVirtexComputer scienceProcessing element020208 electrical & electronic engineering1080pFPGAs02 engineering and technologyParallel computingIntra prediction[SPI]Engineering Sciences [physics]020901 industrial engineering & automationPlanar0202 electrical engineering electronic engineering information engineering[ SPI ] Engineering Sciences [physics]Hardware accelerationField-programmable gate arrayCoding (social sciences)

researchProduct

Fully pipelined real time hardware solution for High Efficiency Video Coding (HEVC) intra prediction

2016

International audience; A fully pipelined hardware accelerator for the High Efficiency Video Coding (HEVC) intra prediction is presented in this paper in order to reduce the computation complexity coming with this module and to accelerate the concerned calculations. Two reconfigurable structures are developed in this paper, the first one concerns angular modes and is identified as Processing Element for Angular (PEA) modes, the other is made in order to handle with the Planar mode and is identified as Processing Element for the Planar (PEP) mode. Each structure is repeated in five paths, that our architecture composed of, working in parallel way. This architecture supports all intra predict…

HEVC[ INFO ] Computer Science [cs]Image compressionComputer scienceReal-time processing1080pFPGAs02 engineering and technologyIntra prediction0202 electrical engineering electronic engineering information engineering[INFO]Computer Science [cs]Field-programmable gate arrayVirtexbusiness.industryReconfigurable computing020206 networking & telecommunicationsFrame rateReconfigurable computingHardware and ArchitectureHardware acceleration020201 artificial intelligence & image processingbusinessSoftwareComputer hardwareImage compressionCoding (social sciences)

researchProduct

Large-scale genome-wide association studies on a GPU cluster using a CUDA-accelerated PGAS programming model

2015

[Abstract] Detecting epistasis, such as 2-SNP interactions, in genome-wide association studies (GWAS) is an important but time consuming operation. Consequently, GPUs have already been used to accelerate these studies, reducing the runtime for moderately-sized datasets to less than 1 hour. However, single-GPU approaches cannot perform large-scale GWAS in reasonable time. In this work we present multiEpistSearch, a tool to detect epistasis that works on GPU clusters. While CUDA is used for parallelization within each GPU, the workload distribution among GPUs is performed with Unified Parallel C++ (UPC++), a novel extension of C++ that follows the Partitioned Global Address Space (PGAS) model…

Scale (ratio)BioinformaticsComputer sciencePGASGPUCUDAGenome-wide association studyParallel computingGPU clusterSoftware_PROGRAMMINGTECHNIQUESTheoretical Computer ScienceComputational scienceCUDAHardware and ArchitectureUnified Parallel CProgramming paradigmPartitioned global address spacecomputerUPC++Softwarecomputer.programming_languageThe International Journal of High Performance Computing Applications

researchProduct