Search results for "Floating"
showing 10 items of 61 documents
Propuesta de elaboración de un glosario bilingüe (español-francés) en línea de términos arquitectónicos
2019
These pages set out how to build an online bilingual (Spanish-French) technical glossary of terms on Floating Architecture. Starting from a corpus of documents in French and Spanish published on the Internet, the work proceeds to locate and collect the terms specific to this field in each of the two languages; to describe the different types of lexical units characteristic of this domain; and to present the translation correspondences between the two languages or, failing that, to propose definitional equivalents. This work is a contribution to the study of technical translation in the field of Architecture.
A dynamic program analysis to find floating-point accuracy problems
2012
Programs using floating-point arithmetic are prone to accuracy problems caused by rounding and catastrophic cancellation. These phenomena provoke bugs that are notoriously hard to track down: the program does not necessarily crash and the results are not necessarily obviously wrong, but often subtly inaccurate. Further use of these values can lead to catastrophic errors. In this paper, we present a dynamic program analysis that supports the programmer in finding accuracy problems. Our analysis uses binary translation to perform every floating-point computation side by side in higher precision. Furthermore, we use a lightweight slicing approach to track the evolution of errors. We evaluate our…
Efficient and portable acceleration of quantum chemical many-body methods in mixed floating point precision using OpenACC compiler directives
2016
It is demonstrated how the non-proprietary OpenACC standard of compiler directives may be used to compactly and efficiently accelerate the rate-determining steps of two of the most routinely applied many-body methods of electronic structure theory, namely the second-order Møller-Plesset (MP2) model in its resolution-of-the-identity (RI) approximated form and the (T) triples correction to the coupled cluster singles and doubles model (CCSD(T)). By means of compute directives as well as the use of optimized device math libraries, the operations involved in the energy kernels have been ported to graphics processing unit (GPU) accelerators, and the associated data transfers correspondingly o…
LARGE-SCALE SIMULATIONS IN CONDENSED MATTER PHYSICS —THE NEED FOR A TERAFLOP COMPUTER
1992
The introduction of vector processors {“supercomputers” with a performance in the range of 10^9 floating point operations (1 GFLOP) per second} has had an enormous impact on computational condensed matter physics. The possibility of a substantially enhanced performance by massively parallel processors (“teraflop” machines with 10^12 floating point operations per second) will allow satisfactory treatment of a large range of important scientific problems which have to a great extent thus far escaped numerical resolution. The present paper describes only a few examples (out of a long list of interesting research problems!) for which the availability of “teraflops” will allow spectacular progres…
Hardware-efficient matrix inversion algorithm for complex adaptive systems
2012
This work shows an FPGA implementation for the matrix inversion algebra operation. Usually, a large matrix dimension is required for real-time signal processing applications, especially in the case of complex adaptive systems. A hardware-efficient matrix inversion procedure is described using QR decomposition of the original matrix and the modified Gram-Schmidt method. This work attempts a direct VHDL description using few predefined packages and fixed-point arithmetic for better optimization. New proposals for intermediate calculations are described, leading to efficient logic occupation together with better performance and accuracy in the vector space algebra. Results show that, for a relatively s…
The Discursive Constitution of a World-Spanning Region and the Role of Empty Signifiers: The Case of Francophonia
2007
The cultural turn in political science, history, and political geography has opened new perspectives on the division of the world into geographic entities. Nation-states, regions, districts, etc., are no longer qualified as quasi-natural objects based upon intrinsic qualities but, rather, as contingent results of social or accordingly discursive processes. The Organisation Internationale de la Francophonie (OIF) defines Francophonia as a “geocultural space” (espace geoculturel) and an international community of more than 50 states. In this contribution, the concept of political communities as “imagined communities” and the advancements of discourse theory by Laclau and Mouffe are used in o…
Fight on plankton! Or, phytoplankton shape and size as adaptive tools to get ahead in the struggle for life
2011
A renewed interest in investigating the relationships existing between body size and environmental variables is pervading ecological studies. Phytoplankton has a long tradition as model system in studies of community ecology and several research concepts were developed using these organisms. In this paper we try to review the relevance of analyzing the morphological features of phytoplankton in ecology. Starting with a brief account of allometric relationships existing in phytoplankton, we i) examine the physical context in which phytoplankton grow, and ii) highlight the role of their size in nutrient uptake, and that of their shape in light harvesting. Moreover, the way in which the morpho…
A Novel Systolic Parallel Hardware Architecture for the FPGA Acceleration of Feedforward Neural Networks
2019
New chips for machine learning applications appear; they are tuned for a specific topology, being efficient by using highly parallel designs at the cost of high power or large complex devices. However, the computational demands of deep neural networks require flexible and efficient hardware architectures able to fit different applications, neural network types, number of inputs, outputs, layers, and units in each layer, making the migration from software to hardware easy. This paper describes novel hardware implementing any feedforward neural network (FFNN): multilayer perceptron, autoencoder, and logistic regression. The architecture admits an arbitrary input and output number, units in la…
LightSpMV: Faster CSR-based sparse matrix-vector multiplication on CUDA-enabled GPUs
2015
Compressed sparse row (CSR) is a frequently used format for sparse matrix storage. However, the state-of-the-art CSR-based sparse matrix-vector multiplication (SpMV) implementations on CUDA-enabled GPUs do not exhibit very high efficiency. This has motivated the development of some alternative storage formats for GPU computing. Unfortunately, these alternatives are incompatible with most CPU-centric programs and require dynamic conversion from CSR at runtime, thus incurring significant computational and storage overheads. We present LightSpMV, a novel CUDA-compatible SpMV algorithm using the standard CSR format, which achieves high speed by benefiting from the fine-grained dynamic distribut…
Laser Floating Zone Growth: Overview, Singular Materials, Broad Applications, and Future Perspectives
2021
This article belongs to the Special Issue Laser-Induced Crystallization.