Search results for " Reinforcement"
showing 10 items of 51 documents
Using Inverse Reinforcement Learning with Real Trajectories to Get More Trustworthy Pedestrian Simulations
2020
Reinforcement learning is one of the most promising machine learning techniques to get intelligent behaviors for embodied agents in simulations. The output of the classic Temporal Difference family of Reinforcement Learning algorithms adopts the form of a value function expressed as a numeric table or a function approximator. The learned behavior is then derived using a greedy policy with respect to this value function. Nevertheless, sometimes the learned policy does not meet expectations, and the task of authoring is difficult and unsafe because the modification of one value or parameter in the learned value function has unpredictable consequences in the space of the policies it represents…
On the "Strength" of Behavior.
2020
AbstractThe place of the concept of response strength in a natural science of behavior has been the subject of much debate. This article reconsiders the concept of response strength for reasons linked to the foundations of a natural science of behavior. The notion of response strength is implicit in many radical behaviorists’ work. Palmer (2009) makes it explicit by applying the response strength concept to three levels: (1) overt behavior, (2) covert behavior, and (3) latent or potential behavior. We argue that the concept of response strength is superfluous in general, and an explication of the notion of giving causal status to nonobservable events like latent behavior or response strengt…
El consentimiento en el proceso penal : ¿un oxímoron?
2021
We are involved in a moment of deep changes in Criminal Procedure due, among other reasons, to the permanent expansion of Criminal Law. Legislative modifications have taken place one another over the world and they have led to a functional reformulation of the role played by the protagonists of Criminal Procedure. These changes have not finished yet and the future is still to come, but new elements are already emerging: the consent of the actors of the process and of the victims and the reinforcement of the principle of opportunity are undoubtedly some of them. Probation, diversion, compliance, criminal mediation are institutions that stand on the principle of consent for procedural purpose…
On the feasibility of personal audio systems over a network of distributed loudspeakers
2018
Los sistemas de reproducción de audio personal se ocupan de la creación de zonas sonoras personales dentro de una habitación sin necesidad de utilizar auriculares. Estos sistemas utilizan un conjunto de altavoces y diseñan los filtros necesarios en cada altavoz con el fin de que la señal de audio deseada llegue a cada persona en la sala lo más libre de interferencias posible. Existen propuestas muy interesantes en la literatura que hacen uso de arrays circulares o lineales, pero en este trabajo estudiamos el problema considerando una red de altavoces distribuidos controlados por un conjunto de nodos acústicos, que pueden intercambiar información a través de una red. Enunciamos el modelo de …
Stress-Strain Law for Confined Concrete with Hardening or Softening Behavior
2013
This paper provides a new general stress-strain law for concrete confined by steel, fiber reinforced polymer (FRP), or fiber reinforced cementitious matrix (FRCM), obtained by a suitable modification of the well-known Sargin’s curve for steel confined concrete. The proposed law is able to reproduce stress-strain curve of any shape, having both hardening or softening behavior, by using a single closed-form simple algebraic expression with constant coefficients. The coefficients are defined on the basis of the stress and the tangent modulus of the confined concrete in three characteristic points of the curve, thus being related to physical meaningful parameters. It will be shown that if the v…
Numerical analysis of delamination in through-thickness reinforced composite laminates
2009
Composite laminates show a high vulnerability to out-of-plane actions, responsible for localized damage between two adjacent laminae, i.e. delamination phenomenon. A recent technological solution to improve the strength of the composite laminates in the thickness direction consists in inserting through-thickness reinforcement. In this paper, the composite delamination is analyzed in the context of non-linear fracture mechanics by an original two-phase interface model able to describe the anisotropic elastic and post-elastic mechanical response given by the presence of the reinforcement fibres. The two phases (adhesive joint or matrix of the composite and the reinforcement) are characterized…
Experimental application of digital image correlation for the tensile characterization of basalt FRCM composites
2021
Abstract Composites made with inorganic matrix, namely fabric reinforced cementitious mortar (FRCM) composites are becoming widespread as strengthening materials for existing masonry structures. These composites are made of a dry grid of fibres embedded in an inorganic matrix. FRCMs can be considered a valid alternative to traditional organic composites such as fibre reinforced polymers (FRPs) because of their better compatibility with the masonry support. This work presents an experimental study for the tensile characterization of a basalt fabric reinforced cementitious mortar (BFRCM) composite. Tensile tests were carried out on coupons reinforced with one, two or three layers of grid to i…
Timber anti-seismic devices in historical architecture in the Mediterranean area
2017
Questo lavoro esamina l’uso di un dispositivo strutturale in legno storico utilizzato come rinforzo della muratura per i sistemi di prevenzione sismica nell'area mediterranea. Tale tecnologia viene realizzata mediante un telaio tridimensionale tridimensionale di legno incorporato nella muratura di pietra per legare insieme le varie parti strutturali e contribuire alla resistenza sismica complessiva. Molto spesso, questo principio costruttivo fu esteso non solo per le parti più deboli, ma per l'intero edificio, creando nuove configurazioni strutturali che erano in grado di assorbire gli effetti delle azioni sismiche. Dalle esperienze costruttive di epoca romana (opus craticium), questo siste…
Emergent behaviors and scalability for multi-agent reinforcement learning-based pedestrian models
2017
This paper analyzes the emergent behaviors of pedestrian groups that learn through the multiagent reinforcement learning model developed in our group. Five scenarios studied in the pedestrian model literature, and with different levels of complexity, were simulated in order to analyze the robustness and the scalability of the model. Firstly, a reduced group of agents must learn by interaction with the environment in each scenario. In this phase, each agent learns its own kinematic controller, that will drive it at a simulation time. Secondly, the number of simulated agents is increased, in each scenario where agents have previously learnt, to test the appearance of emergent macroscopic beha…
Robust Adaptive Modulation and Coding (AMC) selection in LTE systems using reinforcement learning
2014
Adaptive Modulation and Coding (AMC) in LTE networks is commonly employed to improve system throughput by ensuring more reliable transmissions. Most of existing AMC methods select the modulation and coding scheme (MCS) using pre-computed mappings between MCS indexes and channel quality indicator (CQI) feedbacks that are periodically sent by the receivers. However, the effectiveness of this approach heavily depends on the assumed channel model. In addition CQI feedback delays may cause throughput losses. In this paper we design a new AMC scheme that exploits a reinforcement learning algorithm to adjust at run-time the MCS selection rules based on the knowledge of the effect of previous AMC d…