Search results for "Reinforcement"

showing 10 items of 51 documents

Using Inverse Reinforcement Learning with Real Trajectories to Get More Trustworthy Pedestrian Simulations

2020

Reinforcement learning is one of the most promising machine learning techniques to get intelligent behaviors for embodied agents in simulations. The output of the classic Temporal Difference family of Reinforcement Learning algorithms adopts the form of a value function expressed as a numeric table or a function approximator. The learned behavior is then derived using a greedy policy with respect to this value function. Nevertheless, sometimes the learned policy does not meet expectations, and the task of authoring is difficult and unsafe because the modification of one value or parameter in the learned value function has unpredictable consequences in the space of the policies it represents…

0209 industrial biotechnology; reinforcement learning; Computer science; General Mathematics; 02 engineering and technology; pedestrian simulation; Task (project management); learning by demonstration; 020901 industrial engineering & automation; Aprenentatge; Informàtica; Bellman equation; 0202 electrical engineering electronic engineering information engineering; Computer Science (miscellaneous); Reinforcement learning; Engineering (miscellaneous); business.industry; causal entropy; lcsh:Mathematics; Process (computing); 020206 networking & telecommunications; Function (mathematics); inverse reinforcement learning; lcsh:QA1-939; Problem domain; Table (database); Artificial intelligence; Temporal difference learning; business; optimization; Mathematics

researchProduct

On the "Strength" of Behavior.

2020

The place of the concept of response strength in a natural science of behavior has been the subject of much debate. This article reconsiders the concept of response strength for reasons linked to the foundations of a natural science of behavior. The notion of response strength is implicit in many radical behaviorists’ work. Palmer (2009) makes it explicit by applying the response strength concept to three levels: (1) overt behavior, (2) covert behavior, and (3) latent or potential behavior. We argue that the concept of response strength is superfluous in general, and an explication of the notion of giving causal status to nonobservable events like latent behavior or response strengt…

050103 clinical psychology; Private events; Social Psychology; 05 social sciences; Overt behavior; Subject (philosophy); Strengthening by reinforcement; Experimental and Cognitive Psychology; Molar approach; Clinical Psychology; Explication; VDP::Medisinske Fag: 700::Helsefag: 800; Covert; Discrete units; 0501 psychology and cognitive sciences; Signposts; Response reservoir; 050102 behavioral science & comparative psychology; Psychology; Response strength; Cognitive psychology; Original Research; Perspectives on behavior science

researchProduct

El consentimiento en el proceso penal : ¿un oxímoron?

2021

We are living through a period of deep change in criminal procedure due, among other reasons, to the permanent expansion of criminal law. Legislative modifications have followed one another around the world and have led to a functional reformulation of the role played by the protagonists of criminal procedure. These changes are not yet complete and more lie ahead, but new elements are already emerging: the consent of the actors in the process and of the victims, and the reinforcement of the principle of opportunity, are undoubtedly among them. Probation, diversion, compliance, and criminal mediation are institutions that stand on the principle of consent for procedural purpose…

CIENCIAS JURÍDICAS [UNESCO]; UNESCO::CIENCIAS JURÍDICAS; Consentimiento; Consent; principio de oportunidad; principle of opportunity; probation; diversion; compliance; criminal mediation; Barona Vilar, Silvia; Revista Boliviana de Derecho, 2021, no. 31, pp. 208-235, ISSN 2070-8157

researchProduct

On the feasibility of personal audio systems over a network of distributed loudspeakers

2018

Personal audio reproduction systems deal with the creation of personal sound zones within a room without the need for headphones. These systems use a set of loudspeakers and design the filters required at each loudspeaker so that the desired audio signal reaches each person in the room as free of interference as possible. The literature contains very interesting proposals that use circular or linear arrays, but in this work we study the problem for a network of distributed loudspeakers controlled by a set of acoustic nodes that can exchange information over a network. We state the model of …

Audio signal; business.product_category; CIENCIAS TECNOLÓGICAS [UNESCO]; Microphone; Computer science; Acoustics; 020206 networking & telecommunications; 02 engineering and technology; personal audio systems; UNESCO::CIENCIAS TECNOLÓGICAS; GeneralLiterature_MISCELLANEOUS; Signal-to-noise ratio; Sound reinforcement system; 0202 electrical engineering electronic engineering information engineering; Electronic engineering; 020201 artificial intelligence & image processing; Loudspeaker; Directional sound; wireless acoustic sensor networks; business; Headphones

researchProduct

Stress-Strain Law for Confined Concrete with Hardening or Softening Behavior

2013

This paper provides a new general stress-strain law for concrete confined by steel, fiber reinforced polymer (FRP), or fiber reinforced cementitious matrix (FRCM), obtained by a suitable modification of the well-known Sargin’s curve for steel confined concrete. The proposed law is able to reproduce stress-strain curves of any shape, with either hardening or softening behavior, by using a single, simple closed-form algebraic expression with constant coefficients. The coefficients are defined on the basis of the stress and the tangent modulus of the confined concrete at three characteristic points of the curve, and are thus related to physically meaningful parameters. It will be shown that if the v…

Constant coefficients; Materials science; Fiber reinforced polymers (FRP); Article Subject; Stress–strain curve; fiber reinforced cementitious matrix (FRCM); Fibre-reinforced plastic; Confined concrete; models; Settore ICAR/09 - Tecnica Delle Costruzioni; lcsh:TA1-2040; Law; Tangent modulus; Hardening (metallurgy); Algebraic expression; Composite material; Confinement of concrete; general stress-strain law; transverse reinforcement; FRP; FRCM; Cementitious matrix; lcsh:Engineering (General). Civil engineering (General); Softening; Civil and Structural Engineering; Advances in Civil Engineering

researchProduct

Numerical analysis of delamination in through-thickness reinforced composite laminates

2009

Composite laminates are highly vulnerable to out-of-plane actions, which cause localized damage between two adjacent laminae, that is, delamination. A recent technological solution to improve the strength of composite laminates in the thickness direction consists of inserting through-thickness reinforcement. In this paper, composite delamination is analyzed in the context of non-linear fracture mechanics by an original two-phase interface model able to describe the anisotropic elastic and post-elastic mechanical response given by the presence of the reinforcement fibres. The two phases (adhesive joint or matrix of the composite and the reinforcement) are characterized…

Delamination; Through-thickness reinforcement; Interface; Settore ICAR/08 - Scienza Delle Costruzioni

researchProduct

Experimental application of digital image correlation for the tensile characterization of basalt FRCM composites

2021

Composites made with an inorganic matrix, namely fabric reinforced cementitious mortar (FRCM) composites, are becoming widespread as strengthening materials for existing masonry structures. These composites are made of a dry grid of fibres embedded in an inorganic matrix. FRCMs can be considered a valid alternative to traditional organic composites such as fibre reinforced polymers (FRPs) because of their better compatibility with the masonry support. This work presents an experimental study for the tensile characterization of a basalt fabric reinforced cementitious mortar (BFRCM) composite. Tensile tests were carried out on coupons reinforced with one, two or three layers of grid to i…

Digital image correlation; Materials science; Digital image correlation (DIC); Composite number; 0211 other engineering and technologies; 020101 civil engineering; 02 engineering and technology; Bending; FRCM; Tensile tests; 0201 civil engineering; 021105 building & construction; Ultimate tensile strength; General Materials Science; Composite material; Civil and Structural Engineering; business.industry; Building and Construction; Masonry; Crack pattern; Characterization (materials science); Settore ICAR/09 - Tecnica Delle Costruzioni; Reinforcement ratio; Cementitious; Mortar; business; Basalt grid

researchProduct

Timber anti-seismic devices in historical architecture in the Mediterranean area

2017

This work examines the use of a historical timber structural device employed as masonry reinforcement in seismic prevention systems in the Mediterranean area. The technology consists of a three-dimensional timber frame embedded in stone masonry to tie the various structural parts together and contribute to the overall seismic resistance. Very often, this construction principle was extended not only to the weakest parts but to the entire building, creating new structural configurations capable of absorbing the effects of seismic actions. From the construction experience of the Roman era (opus craticium), this syste…

Engineering; business.industry; Applied Mathematics; Computational Mechanics; architettura storica; tecniche costruttive; legno; muratura; presidi costruttivi antisismici; intelaiatura lignea; Settore ICAR/10 - Architettura Tecnica; construction materials; constructive technology; historical architecture; masonry; seismic reinforcement; timber frame; wood; Masonry; Civil engineering; Computer Science Applications; Computational Mathematics; Modeling and Simulation; Mediterranean area; Architecture; business; International Journal of Computational Methods and Experimental Measurements

researchProduct

Emergent behaviors and scalability for multi-agent reinforcement learning-based pedestrian models

2017

This paper analyzes the emergent behaviors of pedestrian groups that learn through the multi-agent reinforcement learning model developed in our group. Five scenarios from the pedestrian modeling literature, with different levels of complexity, were simulated in order to analyze the robustness and the scalability of the model. Firstly, a reduced group of agents must learn by interaction with the environment in each scenario. In this phase, each agent learns its own kinematic controller, which will drive it at simulation time. Secondly, the number of simulated agents is increased, in each scenario where agents have previously learnt, to test the appearance of emergent macroscopic beha…

Engineering; media_common.quotation_subject; 02 engineering and technology; Pedestrian; Machine learning; computer.software_genre; Consistency (database systems); Robustness (computer science); 0202 electrical engineering electronic engineering information engineering; Reinforcement learning; Quality (business); Macro; media_common; Informática; Pedestrian simulation and modeling; Kinematic controller; business.industry; 020207 software engineering; Emergent behaviours; Behavioural simulation; Hardware and Architecture; Modeling and Simulation; Scalability; 020201 artificial intelligence & image processing; Artificial intelligence; business; Multi-agent reinforcement learning (Marl); computer; Software; Simulation Modelling Practice and Theory

researchProduct

Robust Adaptive Modulation and Coding (AMC) selection in LTE systems using reinforcement learning

2014

Adaptive Modulation and Coding (AMC) in LTE networks is commonly employed to improve system throughput by ensuring more reliable transmissions. Most existing AMC methods select the modulation and coding scheme (MCS) using pre-computed mappings between MCS indexes and the channel quality indicator (CQI) feedback periodically sent by the receivers. However, the effectiveness of this approach heavily depends on the assumed channel model. In addition, CQI feedback delays may cause throughput losses. In this paper we design a new AMC scheme that exploits a reinforcement learning algorithm to adjust at run-time the MCS selection rules based on the knowledge of the effect of previous AMC d…

Engineering; reinforcement learning; Settore ING-INF/03 - Telecomunicazioni; business.industry; Link adaptation; channel quality; Channel models; performance evaluation; LTE; Robustness (computer science); Electronic engineering; Reinforcement learning; Decision process; business; Reinforcement learning algorithm; Coding (social sciences); adaptive modulation and coding (AMC)

researchProduct