Search results for "speech recognition"
showing 10 items of 357 documents
Visual Cortex Performs a Sort of Non-linear ICA
2010
Here, the standard V1 cortex model optimized to reproduce image distortion psychophysics is shown to have nice statistical properties, e.g. approximate factorization of the PDF of natural images. These results confirm the efficient encoding hypothesis that aims to explain the organization of biological sensors by information theory arguments.
Perception and replication of planar sonic gestures
2012
As tables, boards, and walls become surfaces where interaction can be supported by auditory displays, it becomes important to know how accurately and effectively a spatial gesture can be rendered by means of an array of loudspeakers embedded in the surface. Two experiments were designed and performed to assess: (i) how sequences of sound pulses are perceived as gestures when the pulses are distributed in space and time along a line; (ii) how the timing of pulses affects the perceived and reproduced continuity of sequences; and (iii) how effectively a second parallel row of speakers can extend sonic gestures to a two-dimensional space. Results show that azimuthal trajectories can be effectiv…
Selecting one of two regular sound sequences: Perceptual and motor effects of tempo
2008
This study assessed the influence of tempo on selecting a sound sequence. In Exp. 1, synchronization with one of the two regular subsequences in a complex sequence was measured. Thirty participants indicated a preference for the faster subsequence when subsequences were in a slow tempo range (≥ 500 msec. IOI), and for the slower subsequence when they were in the fast tempo range (≤ 300 msec. IOI). These results were replicated using a perceptual task (Exp. 2 and 3) in which the 30 listeners had to detect a temporal irregularity in one of the two subsequences. Detection was better when the temporal irregularity was in the faster subsequence than in the slower one when the complex sequence w…
Information domain approach to the investigation of cardio-vascular, cardio-pulmonary, and vasculo-pulmonary causal couplings
2011
The physiological mechanisms related to cardio-vascular (CV), cardio-pulmonary (CP), and vasculo-pulmonary (VP) regulation may be probed through multivariate time series analysis tools. This study applied an information domain approach for the evaluation of non-linear causality to the beat-to-beat variability series of heart period (t), systolic arterial pressure (s), and respiration (r) measured during tilt testing and paced breathing (PB) protocols. The approach quantifies the causal coupling from the series i to the series j (C(ij)) as the amount of information flowing from i to j. A measure of directionality is also obtained as the difference between two reciprocal causal couplings (D(i…
Extraction of ERP from EEG data
2007
In this article, a simple but novel technique for extracting a linear subspace related to event related potentials (ERPs) from ElectroEncephaloGraphy (EEG) data is introduced. The technique consists of a sequence of basic linear operations applied to multidimensional EEG data in a problem-specific manner. The derivation of the proposed technique is given and results with real data are described together with overall conclusions.
2014
Due to its millisecond-scale temporal resolution, EEG allows neural correlates to be assessed with a precisely defined temporal relationship to a given event. This knowledge is generally lacking in data from functional magnetic resonance imaging (fMRI), which has a temporal resolution on the scale of seconds, so ways to combine the two modalities are sought. Previous applications combining event-related potentials (ERPs) with simultaneous fMRI BOLD generally aimed at measuring known ERP components in single trials and correlating the resulting time series with the fMRI BOLD signal. While it is a valuable first step, this procedure cannot guarantee that variability of the chosen …
Sign Languages Recognition Based on Neural Network Architecture
2017
In recent years, many advances have been made in speech and natural language recognition, and many virtual assistants have been developed, such as Apple’s Siri, Google Now, and Microsoft Cortana. Unfortunately, not everyone can use voice to communicate with other people and digital devices. Our system is a first step toward extending the use of virtual assistants to speech-impaired people by providing artificial sign language recognition based on a neural network architecture.
The processing of consonants and vowels during letter identity and letter position assignment in visual-word recognition: an ERP study.
2009
Recent research suggests that there is a processing distinction between consonants and vowels in visual-word recognition. Here we conjointly examine the time course of consonants and vowels in processes of letter identity and letter position assignment. Event-related potentials (ERPs) were recorded while participants read words and pseudowords in a lexical decision task. The stimuli were displayed under different conditions in a masked priming paradigm with a 50-ms SOA: (i) identity/baseline condition (e.g., chocolate-CHOCOLATE); (ii) vowels-delayed condition (e.g., choc l te-CHOCOLATE); (iii) consonants-delayed condition (cho o ate-CHOCOLATE); (iv) consonants-transposed condition (…
Auditory distance perception in an acoustic pipe
2008
In a study of auditory distance perception, we investigated the effects of exaggerating the acoustic cue of reverberation where the intensity of sound did not vary noticeably. The set of stimuli was obtained by moving a sound source inside a 10.2-m long pipe having a 0.3-m diameter. Twelve subjects were asked to listen to a speech sound while keeping their head inside the pipe and then to estimate the egocentric distance from the sound source using a magnitude production procedure. The procedure was repeated eighteen times using six different positions of the sound source. Results show that the point at which perceived distance equals physical distance is located approximately 3.5 m away fr…
Vocal fold strain and vocal pitch in singing: Radiographic observations of singers and nonsingers
1998
The relationship between vocal fold strain and vocal pitch in singers and nonsingers singing a rising pitch series has been indirectly investigated by means of lateral radiographs. Nonsingers tend to exhibit more strain than singers. To standardize the degree of strain, an index of strain per semitone is proposed. The semitone strain indicates the average amount of strain per 1 semitone of pitch increase or decrease. The index has been shown to be affected by several factors: gender, singing training, singing technique, voice class, age, and status of muscle function. Observations suggest that similar groups of individuals occupy different positions on the stress-strain curve, indica…