Search results for "speech recognition"

showing 10 items of 357 documents

Visual Cortex Performs a Sort of Non-linear ICA

2010

Here, the standard V1 cortex model optimized to reproduce image distortion psychophysics is shown to have nice statistical properties, e.g. approximate factorization of the PDF of natural images. These results confirm the efficient encoding hypothesis that aims to explain the organization of biological sensors by information theory arguments.

business.industry, Speech recognition, Pattern recognition, Mutual information, Information theory, Visual cortex, medicine.anatomical_structure, Factorization, Encoding (memory), Distortion, medicine, Psychophysics, sort, Computer vision, Artificial intelligence, business, health care economics and organizations, Mathematics
researchProduct

Perception and replication of planar sonic gestures

2012

As tables, boards, and walls become surfaces where interaction can be supported by auditory displays, it becomes important to know how accurately and effectively a spatial gesture can be rendered by means of an array of loudspeakers embedded in the surface. Two experiments were designed and performed to assess: (i) how sequences of sound pulses are perceived as gestures when the pulses are distributed in space and time along a line; (ii) how the timing of pulses affects the perceived and reproduced continuity of sequences; and (iii) how effectively a second parallel row of speakers can extend sonic gestures to a two-dimensional space. Results show that azimuthal trajectories can be effectiv…

Surface (mathematics), Settore INF/01 - Informatica, General Computer Science, Computer science, Speech recognition, Acoustics, Computer Science (all), Auditory localization, Experimental and Cognitive Psychology, Sonic gesture, Replication (computing), Theoretical Computer Science, Azimuth, Auditory localization; sonic gestures, Interval (music), Planar, Line (geometry), sonic gestures, Loudspeaker, Gesture, ACM Transactions on Applied Perception
researchProduct

Selecting one of two regular sound sequences: Perceptual and motor effects of tempo

2008

This study assessed the influence of tempo on selecting a sound sequence. In Exp. 1, synchronization with one of the two regular subsequences in a complex sequence was measured. The 30 participants preferred the faster subsequence when the subsequences were in a slow tempo range (≥ 500 msec. IOI), and the slower subsequence when they were in the fast tempo range (≤ 300 msec. IOI). These results were replicated using a perceptual task (Exp. 2 and 3) in which the 30 listeners had to detect a temporal irregularity in one of the two subsequences. Detection was better when the temporal irregularity was in the faster subsequence than in the slower one when the complex sequence w…

Adult, Male, Adolescent, media_common.quotation_subject, Speech recognition, [SHS.PSY] Humanities and Social Sciences/Psychology, Experimental and Cognitive Psychology, Choice Behavior, Discrimination Psychological, Perception, Subsequence, Task Performance and Analysis, Humans, Attention, Mathematics, media_common, Communication, Sequence, business.industry, Equipment Design, Sensory Systems, Fast tempo, Sound, Acoustic Stimulation, Motor Skills, Pattern Recognition Physiological, Time Perception, Auditory Perception, Equipment Failure, Female, business, Psychomotor Performance, Psychoacoustics
researchProduct

Information domain approach to the investigation of cardio-vascular, cardio-pulmonary, and vasculo-pulmonary causal couplings

2011

The physiological mechanisms related to cardio-vascular (CV), cardio-pulmonary (CP), and vasculo-pulmonary (VP) regulation may be probed through multivariate time series analysis tools. This study applied an information domain approach for the evaluation of non-linear causality to the beat-to-beat variability series of heart period (t), systolic arterial pressure (s), and respiration (r) measured during tilt testing and paced breathing (PB) protocols. The approach quantifies the causal coupling from the series i to the series j (C(ij)) as the amount of information flowing from i to j. A measure of directionality is also obtained as the difference between two reciprocal causal couplings (D(i…

medicine.medical_specialty, Supine position, causality, Physiology, Speech recognition, Baroreflex, lcsh:Physiology, paced breathing, conditional entropy, head-up tilt, Internal medicine, Physiology (medical), medicine, Heart rate variability, baroreflex, arterial pressure variability, respiratory sinus arrhythmia, Vagal tone, Respiratory system, Original Research, lcsh:QP1-981, business.industry, heart rate variability, Cardiorespiratory fitness, Blood pressure, Settore ING-INF/06 - Bioingegneria Elettronica E Informatica, Cardiology, Breathing, Arterial pressure variability; Baroreflex; Causality; Conditional entropy; Head-up tilt; Heart rate variability; Paced breathing; Respiratory sinus arrhythmia; Physiology; Physiology (medical), business
researchProduct

Extraction of ERP from EEG data

2007

In this article, a simple but novel technique for extracting a linear subspace related to event related potentials (ERPs) from ElectroEncephaloGraphy (EEG) data is introduced. The technique consists of a sequence of basic linear operations applied to multidimensional EEG data in a problem-specific manner. The derivation of the proposed technique is given and results with real data are described together with overall conclusions.

Sequence, Quantitative Biology::Neurons and Cognition, medicine.diagnostic_test, Computer science, business.industry, Speech recognition, Pattern recognition, Electroencephalography, Independent component analysis, Linear subspace, ComputingMethodologies_PATTERNRECOGNITION, Signal-to-noise ratio, Eeg data, Event-related potential, medicine, Artificial intelligence, Noise (video), business, 2007 9th International Symposium on Signal Processing and Its Applications
researchProduct

2014

Due to its millisecond-scale temporal resolution, EEG allows neural correlates to be assessed with a precisely defined temporal relationship to a given event. This knowledge is generally lacking in data from functional magnetic resonance imaging (fMRI), which has a temporal resolution on the scale of seconds, so ways to combine the two modalities are sought. Previous applications combining event-related potentials (ERPs) with simultaneous fMRI BOLD generally aimed at measuring known ERP components in single trials and correlating the resulting time series with the fMRI BOLD signal. While it is a valuable first step, this procedure cannot guarantee that variability of the chosen …

Neural correlates of consciousness, genetic structures, medicine.diagnostic_test, General Neuroscience, Speech recognition, Electroencephalography, EEG-fMRI, behavioral disciplines and activities, Independent component analysis, Task (project management), nervous system, Temporal resolution, medicine, Generalizability theory, Functional magnetic resonance imaging, Psychology, psychological phenomena and processes, Frontiers in Neuroscience
researchProduct

Sign Languages Recognition Based on Neural Network Architecture

2017

In recent years, speech and natural language recognition have advanced considerably, and many virtual assistants have been developed, such as Apple’s Siri, Google Now, and Microsoft Cortana. Unfortunately, not everyone can use their voice to communicate with other people and digital devices. Our system is a first step toward extending virtual assistants to speech-impaired people by providing automatic sign language recognition based on a neural network architecture.

American Sign Language, Computer science, business.industry, Time delay neural network, Deep learning, Speech recognition, 020207 software engineering, 02 engineering and technology, language.human_language, Recurrent neural network, 0202 electrical engineering electronic engineering information engineering, Neural network architecture, language, 020201 artificial intelligence & image processing, Artificial intelligence, business, Natural language, Sign (mathematics)
researchProduct

The processing of consonants and vowels during letter identity and letter position assignment in visual-word recognition: an ERP study.

2009

Recent research suggests that there is a processing distinction between consonants and vowels in visual-word recognition. Here we conjointly examine the time course of consonants and vowels in processes of letter identity and letter position assignment. Event-related potentials (ERPs) were recorded while participants read words and pseudowords in a lexical decision task. The stimuli were displayed under different conditions in a masked priming paradigm with a 50-ms SOA: (i) identity/baseline condition (e.g., chocolate-CHOCOLATE); (ii) vowels-delayed condition (e.g., choc l te-CHOCOLATE); (iii) consonants-delayed condition (cho o ate-CHOCOLATE); (iv) consonants-transposed condition (…

Adult, Male, Linguistics and Language, Adolescent, Cognitive Neuroscience, media_common.quotation_subject, Speech recognition, Experimental and Cognitive Psychology, Language and Linguistics, Identity (music), Speech and Hearing, Young Adult, Event-related potential, Reading (process), Lexical decision task, Reaction Time, Humans, media_common, Visual word recognition, Brain, Electroencephalography, Linguistics, Pattern Recognition Visual, Reading, Word recognition, Time course, Evoked Potentials Visual, Female, Psychology, Priming (psychology), Brain and language
researchProduct

Auditory distance perception in an acoustic pipe

2008

In a study of auditory distance perception, we investigated the effects of exaggerating the acoustic cue of reverberation where the intensity of sound did not vary noticeably. The set of stimuli was obtained by moving a sound source inside a 10.2-m long pipe having a 0.3-m diameter. Twelve subjects were asked to listen to a speech sound while keeping their head inside the pipe and then to estimate the egocentric distance from the sound source using a magnitude production procedure. The procedure was repeated eighteen times using six different positions of the sound source. Results show that the point at which perceived distance equals physical distance is located approximately 3.5 m away fr…

Auditory display, Reverberation, Range (music), Critical distance, Sound and Music Computing, General Computer Science, Performance, Speech recognition, media_common.quotation_subject, Experimental and Cognitive Psychology, Sound and Music Computing; Auditory display; Distance perception, Theoretical Computer Science, Loudness, Perception, Experimentation, Sound (geography), media_common, Mathematics, Experimentation; Measurement; Performance; Acoustic pipe; Auditory display; Distance perception, Measurement, geography, geography.geographical_feature_category, Settore INF/01 - Informatica, Auditory display, Sound intensity, Acoustic pipe, Acoustic pipe; auditory display; distance perception, Distance perception, ACM Transactions on Applied Perception
researchProduct

Vocal fold strain and vocal pitch in singing: Radiographic observations of singers and nonsingers

1998

The relationship between vocal fold strain and vocal pitch in singers and nonsingers singing a rising pitch series has been indirectly investigated by means of lateral radiographs. Nonsingers tend to exhibit more strain than singers. To standardize the degree of strain, an index of strain per semitone is proposed. The semitone strain indicates the average amount of strain per 1 semitone of pitch increase or decrease. The index has been shown to be affected by several factors: gender, singing training, singing technique, voice class, age, and status of muscle function. Observations suggest that similar groups of individuals occupy different positions on the stress-strain curve, indica…

Adult, Male, medicine.medical_specialty, Adolescent, Voice Quality, Speech recognition, Thyroid Gland, Vocal Cords, Audiology, Semitone, Speech and Hearing, Sex Factors, Phonation, otorhinolaryngologic diseases, medicine, Humans, Speech, Aged, Mathematics, Age Factors, Middle Aged, LPN and LVN, humanities, Vocal pitch, Radiography, Otorhinolaryngology, Voice, Female, Singing, psychological phenomena and processes, Journal of Voice
researchProduct