Search results for "Speech recognition"
showing 10 items of 357 documents
Evidence for a spatial bias in the perception of sequences of brief tones
2013
Listeners are unable to report the physical order of particular sequences of brief tones. This phenomenon of temporal dislocation depends on tone durations and frequencies. The current study empirically shows that it also depends on the spatial location of the tones. Dichotically testing a three-tone sequence showed that the central tone tends to be reported as the first or the last element when it is perceived as part of a left-to-right motion. Since the central-tone dislocation does not occur for right-to-left sequences of the same tones, this indicates that there is a spatial bias in the perception of sequences. © 2013 Acoustical Society of America.
Temporal weighting of loudness: Comparison between two different psychophysical tasks
2016
International audience; Psychophysical studies on loudness have so far examined the temporal weighting of loudness solely in level-discrimination tasks. Typically, listeners were asked to discriminate hundreds of level-fluctuating sounds regarding their global loudness. Temporal weights, i.e., the importance of each temporal portion of the stimuli for the loudness judgment, were then estimated from listeners' responses. Consistent non-uniform " u-shaped " temporal weighting patterns were observed, with greater weights assigned to the first and the last temporal portions of the stimuli, revealing significant primacy and recency effects, respectively. In this study, the question was addressed…
Measuring and modeling real-time responses to music: the dynamics of tonality induction.
2003
We examined a variety of real-time responses evoked by a single piece of music, the organ Duetto BWV 805 by J S Bach. The primary data came from a concurrent probe-tone method in which the probe-tone is sounded continuously with the music. Listeners judged how well the probe tone fit with the music at each point in time. The process was repeated for all probe tones of the chromatic scale. A self-organizing map (SOM) [Kohonen 1997 Self-organizing Maps (Berlin: Springer)] was used to represent the developing and changing sense of key reflected in these judgments. The SOM was trained on the probe-tone profiles for 24 major and minor keys (Krumhansl and Kessler 1982 Psychological Review89 334–…
Interval between two sequential arrays determines their storage state in visual working memory.
2020
AbstractThe visual information can be stored as either “active” representations in the active state or “activity-silent” representations in the passive state during the retention period in visual working memory (VWM). Catering to the dynamic nature of visual world, we explored how the temporally dynamic visual input was stored in VWM. In the current study, the memory arrays were presented sequentially, and the contralateral delay activity (CDA), an electrophysiological measure, was used to identify whether the memory representations were transferred into the passive state. Participants were instructed to encode two sequential arrays and retrieve them respectively, with two conditions of int…
Ability for Voice Recognition Is a Marker for Dyslexia in Children
2014
A recent voice recognition experiment conducted by Perrachione, Del Tufo, and Gabrieli (2011) revealed that, in normal adult readers, the accuracy at identifying human voices was better in the participants’ mother tongue than in an unfamiliar language, while this difference was absent in a group of adults with dyslexia. This pattern favored a view of dyslexia as due to “fundamentally impoverished native-language phonological representations.” To further examine this issue, we conducted two voice recognition experiments, one with children with/without dyslexia, and the other with adults with/without dyslexia. Results revealed that children/adults with dyslexia were less accurate at identify…
Earlier timbre processing of instrumental tones compared to equally complex spectrally rotated sounds as revealed by the mismatch negativity.
2014
Harmonically rich sounds have been shown to be processed more efficiently by the human brain compared to single sinusoidal tones. To control for stimulus complexity as a potentially confounding factor, tones and equally complex spectrally rotated sounds, have been used in the present study to investigate the role of the overtone series in sensory auditory processing in non-musicians. Timbre differences in instrumental tones with equal pitch elicited a MMN which was earlier compared to that elicited by the spectrally rotated sounds, indicating that harmonically rich tones are processed faster compared to non-musical sounds without an overtone series, even when pitch is not the relevant infor…
Automatic and controlled processing of acoustic and phonetic contrasts
2003
Changes in the temporal properties of the speech signal provide important cues for phoneme identification. An impairment or inability to detect such changes may adversely affect one's ability to understand spoken speech. The difference in meaning between the Finnish words tuli (fire) and tuuli (wind), for example, lies in the difference between the duration of the vowel /u/. Detecting changes in the temporal properties of the speech signal, therefore, is critical for distinguishing between phonemes and identifying words. In the current study, we tested whether detection of changes in speech sounds, in native Finnish speakers, would vary as a function of the position within the word that the…
Selecting one of two regular sound sequences : Perceptual and motor effects of tempo
2008
This study assessed the influence of tempo on selecting a sound sequence. In Exp. 1, synchronization with one of the two regular subsequences in a complex sequence was measured. 30 participants indicated a preference for the fastest subsequence when subsequences were in a slow tempo range (≥ 500 msec. IOI), and with the slower subsequence when they were in the fast tempo range (≤ 300 msec. IOI). These results were replicated using a perceptual task (Exp. 2 and 3) in which the 30 listeners had to detect a temporal irregularity in one of the two subsequences. Detection was better when the temporal irregularity was in the fastest subsequence than in the slowest one when the complex sequence w…
Effects of Selective Attention on Syntax Processing in Music and Language
2010
Abstract The present study investigated the effects of auditory selective attention on the processing of syntactic information in music and speech using event-related potentials. Spoken sentences or musical chord sequences were either presented in isolation, or simultaneously. When presented simultaneously, participants had to focus their attention either on speech, or on music. Final words of sentences and final harmonies of chord sequences were syntactically either correct or incorrect. Irregular chords elicited an early right anterior negativity (ERAN), whose amplitude was decreased when music was simultaneously presented with speech, compared to when only music was presented. However, t…
The Effect of Adaptive Nonlinear Frequency Compression on Phoneme Perception
2017
Purpose This study implemented a fitting method, developed for use with frequency lowering hearing aids, across multiple testing sites, participants, and hearing aid conditions to evaluate speech perception with a novel type of frequency lowering. Method A total of 8 participants, including children and young adults, participated in real-world hearing aid trials. A blinded crossover design, including posttrial withdrawal testing, was used to assess aided phoneme perception. The hearing aid conditions included adaptive nonlinear frequency compression (NFC), static NFC, and conventional processing. Results Enabling either adaptive NFC or static NFC improved group-level detection and recognit…