Search results for "prosessorit"

showing 8 items of 8 documents

Designing a graphics processing unit accelerated petaflop capable lattice Boltzmann solver: Read aligned data layouts and asynchronous communication

2017

The lattice Boltzmann method is a well-established numerical approach for complex fluid flow simulations. Recently, general-purpose graphics processing units (GPUs) have become available as high-performance computing resources at large scale. We report on designing and implementing a lattice Boltzmann solver for multi-GPU systems that achieves 1.79 PFLOPS performance on 16,384 GPUs. To achieve this performance, we introduce a GPU compatible version of the so-called bundle data layout and eliminate the halo sites in order to improve data access alignment. Furthermore, we make use of the possibility to overlap data transfer between the host central processing unit and the device GPU with comp…

load balancedata layoutlarge-scale I/Ovirtauslaskentaprosessoritasynchronous communicationgraphics processing unitTitanLattice Boltzmannmemory alignmentComputingMethodologies_COMPUTERGRAPHICS
researchProduct

Sulautettujen järjestelmien hyödyntäminen toisen asteen ammatillisessa opetuksessa

2017

Tässä Pro gradu -tutkielmassa paneudutaan sulautettujen järjestelmien hyödyntämiseen toisen asteen opetuksessa Tieto- ja tietoliikennetekniikan opetussuunnitelmassa. Tutkielmassa tutkitaan LEGO Mindstorms -, Arduino UNO - ja MatrixTSL:n ECIO-järjestelmien soveltuvuutta eri tietotekniikka-alueiden ammatilliseen opetukseen. This Master's Thesis focuses on the utilization of embedded systems in secondary education in the curriculum of Information and Communication Technology. The thesis examines the suitability of the LEGO Mindstorms, Arduino UNO and MatrixTSL ECIO systems for vocational education in various IT fields.

mikro-ohjaimetohjelmointitietotekniikkatietoliikennetekniikkasulautettu järjestelmämikroprosessorit
researchProduct

Protection against reverse engineering in ARM

2020

With the advent of the mobile industry, we face new security challenges. ARM architecture is deployed in most mobile phones, homeland security, IoT, autonomous cars and other industries, providing a hypervisor API (via virtualization extension technology). To research the applicability of this virtualization technology for security in this platform is an interesting endeavor. The hypervisor API is an addition available for some ARMv7-a and is available with any ARMv8-a processor. Some ARM platforms also offer TrustZone, which is a separate exception level designed for trusted computing. However, TrustZone may not be available to engineers as some vendors lock it. We present a method of appl…

IoTmobiililaitteetARMtakaisinmallinnusesineiden internetsecuritymobilelangaton tekniikkahypervisortietoturvamikroprosessorit
researchProduct

Investigating the Impact of Radiation-Induced Soft Errors on the Reliability of Approximate Computing Systems

2020

International audience; Approximate Computing (AxC) is a well-known paradigm able to reduce the computational and power overheads of a multitude of applications, at the cost of a decreased accuracy. Convolutional Neural Networks (CNNs) have proven to be particularly suited for AxC because of their inherent resilience to errors. However, the implementation of AxC techniques may affect the intrinsic resilience of the application to errors induced by Single Events in a harsh environment. This work introduces an experimental study of the impact of neutron irradiation on approximate computing techniques applied on the data representation of a CNN.

Approximate computingComputer scienceReliability (computer networking)Radiation effectsRadiation induced02 engineering and technologyneuroverkotExternal Data Representation01 natural sciencesConvolutional neural networkSoftwareHardware020204 information systems0103 physical sciences0202 electrical engineering electronic engineering information engineering[SPI.NANO]Engineering Sciences [physics]/Micro and nanotechnologies/MicroelectronicsResilience (network)mikroprosessoritNeutronsResilience010308 nuclear & particles physicsbusiness.industryReliabilityApproximate computingPower (physics)[SPI.TRON]Engineering Sciences [physics]/ElectronicsComputer engineeringsäteilyfysiikka[INFO.INFO-ES]Computer Science [cs]/Embedded SystemsbusinessSoftware
researchProduct

Designing a graphics processing unit accelerated petaflop capable lattice Boltzmann solver: Read aligned data layouts and asynchronous communication

2016

The lattice Boltzmann method is a well-established numerical approach for complex fluid flow simulations. Recently, general-purpose graphics processing units (GPUs) have become available as high-performance computing resources at large scale. We report on designing and implementing a lattice Boltzmann solver for multi-GPU systems that achieves 1.79 PFLOPS performance on 16,384 GPUs. To achieve this performance, we introduce a GPU compatible version of the so-called bundle data layout and eliminate the halo sites in order to improve data access alignment. Furthermore, we make use of the possibility to overlap data transfer between the host central processing unit and the device GPU with com…

virtauslaskentalarge-scale I/OComputer scienceGraphics processing unitLattice Boltzmann methodscomputational fluid dynamicsParallel computinggraphics processing unit01 natural sciencesmemory alignmentprocessors010305 fluids & plasmasTheoretical Computer Science0103 physical sciencesData structure alignment0101 mathematicsGraphicsComputingMethodologies_COMPUTERGRAPHICSta113data layoutta114prosessoritSolverLattice Boltzmann010101 applied mathematicsData accessHardware and ArchitectureAsynchronous communicationCentral processing unitasynchronous communicationTitanSoftwareThe International Journal of High Performance Computing Applications
researchProduct

Monisäikeistys pelimoottoreissa

2006

pelimoottoritrinnakkaisohjelmointiohjelmointirinnakkaiskäyttömoniydinprosessorit
researchProduct

ISAdetect

2020

Static and dynamic binary analysis techniques are actively used to reverse engineer software's behavior and to detect its vulnerabilities, even when only the binary code is available for analysis. To avoid analysis errors due to misreading op-codes for a wrong CPU architecture, these analysis tools must precisely identify the Instruction Set Architecture (ISA) of the object code under analysis. The variety of CPU architectures that modern security and reverse engineering tools must support is ever increasing due to massive proliferation of IoT devices and the diversity of firmware and malware targeting those devices. Recent studies concluded that falsely identifying the binary code's ISA ca…

Reverse engineeringprosessoritComputer scienceFirmware02 engineering and technologycomputer.file_formatcomputer.software_genrehaittaohjelmatInstruction setObject codeComputer engineering020204 information systemsEndianness0202 electrical engineering electronic engineering information engineeringMalwareesineiden internet020201 artificial intelligence & image processingBinary codeExecutabletietoturvacomputerProceedings of the Tenth ACM Conference on Data and Application Security and Privacy
researchProduct

On GPU-accelerated fast direct solvers and their applications in image denoising

2015

block cyclic reductionnäytönohjaimetOpenCLnumeeriset menetelmätprosessoritimage denoisingparallel computingmean curvatureGPU computingkuvankäsittelyimage processingfast Poisson solverseparable block tridiagonal linear systemPSCR methodoptimointialgoritmitohjelmointiaugmented Lagrangian methodkohinafast direct solverrinnakkaislaskentaalternating direction methods of multipliers
researchProduct