Search results for "speech"

showing 10 items of 1281 documents

Part of Speech Tagging Using Hidden Markov Models

2020

Abstract In this paper, we present a wide range of models based on less adaptive and adaptive approaches for a PoS tagging system. These parameters for the adaptive approach are based on the n-gram of the Hidden Markov Model, evaluated for bigram and trigram, and based on three different types of decoding method, in this case forward, backward, and bidirectional. We used the Brown Corpus for the training and the testing phase. The bidirectional trigram model almost reaches state of the art accuracy but is disadvantaged by the decoding speed time while the backward trigram reaches almost the same results with a way better decoding speed time. By these results, we can conclude that the decodi…

Computer scienceBrown CorpusSpeech recognitionBigramTrigramHidden Markov modelTag systemSentenceWord (computer architecture)Decoding methodsInternational Journal of Advanced Statistics and IT&C for Economics and Life Sciences

researchProduct

Lists of Spanish sentences with equivalent predictability, phonetic content, length, and frequency of the last word.

2010

This paper presents a pool of Spanish sentences designed for use in cognitive research and speech processing in circumstances in which the effects of context are relevant. These lists of sentences are divided into six lists of 25 equivalent high-predictability sentences and six lists of 25 low-predictability sentences according to the extent to which the last word can be predicted by the preceding context. These lists were also equivalent in phonetic content, length and frequency of the last word. These lists are intended for use in psycholinguistic research with Spanish-speaking listeners.

Computer scienceExperimental and Cognitive PsychologyContext (language use)computer.software_genreYoung AdultPhoneticsCognitive researchHumansAttentionPredictabilityContent (Freudian dream analysis)LanguagePsycholinguisticsbusiness.industryResearchSpeech IntelligibilitySpeech processingSensory SystemsSemanticsWord lists by frequencySpeech PerceptionArtificial intelligencebusinesscomputerWord (computer architecture)Natural language processingPerceptual and motor skills

researchProduct

Towards Diagrammatic Patterns

2008

This article presents the idea that the graphical representation (concrete syntax) of a visual language can be specified based on some pre-defined diagrammatic patterns. A diagram from the Specification and Description Language (SDL) is used as illustration.

Computer scienceProgramming languagebusiness.industryObject languageComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Specification languagecomputer.software_genreSpecification and Description LanguageVisual languageDiagrammatic reasoningLanguage Of Temporal Ordering SpecificationUniversal Networking LanguageSoftware_SOFTWAREENGINEERINGProgramming language specificationComputer Science::Programming LanguagesArtificial intelligencebusinesscomputerNatural language processingcomputer.programming_language

researchProduct

Overview of the Second BUCC Shared Task: Spotting Parallel Sentences in Comparable Corpora

2017

This paper presents the BUCC 2017 shared task on parallel sentence extraction from comparable corpora. It recalls the design of the datasets, presents their final construction and statistics and the methods used to evaluate system results. 13 runs were submitted to the shared task by 4 teams, covering three of the four proposed language pairs: French-English (7 runs), German-English (3 runs), and Chinese-English (3 runs). The best F-scores as measured against the gold standard were 0.84 (German-English), 0.80 (French-English), and 0.43 (Chinese-English). Because of the design of the dataset, in which not all gold parallel sentence pairs are known, these are only minimum values. We examined …

Computer scienceSentence extractionbusiness.industrySpeech recognition020206 networking & telecommunications02 engineering and technologyGold standard (test)Spottingcomputer.software_genreTask (project management)0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingArtificial intelligencebusinesscomputerNatural language processingSentenceProceedings of the 10th Workshop on Building and Using Comparable Corpora

researchProduct

Morphological Analysis Combined with a Machine Learning Approach to Detect Utrasound Median Sagittal Sections for the Nuchal Translucency Measurement

2017

The screening of chromosomal defects, as trisomy 13, 18 and 21, can be obtained by the measurement of the nuchal translucency thickness scanning during the end of the first trimester of pregnancy. This contribution proposes an automatic methodology to detect mid-sagittal sections to identify the correct measurement of nuchal translucency. Wavelet analysis and neural network classifiers are the main strategies of the proposed methodology to detect the frontal components of the skull and the choroid plexus with the support of radial symmetry analysis. Real clinical ultrasound images were adopted to measure the performance and the robustness of the methodology, thus it can be highlighted an er…

Computer scienceSpeech recognition02 engineering and technologyWavelet analysi03 medical and health sciences0302 clinical medicineWaveletMid-sagittal section Neural network Nuchal translucency Symmetry transform Wavelet analysis.Nuchal translucencyRobustness (computer science)Nuchal Translucency Measurement0202 electrical engineering electronic engineering information engineeringmedicineMid-sagittal sectionSettore INF/01 - InformaticaArtificial neural networkbusiness.industrySymmetry transformPattern recognitionmedicine.diseaseNeural networkSagittal planemedicine.anatomical_structureNuchal translucencyMorphological analysis020201 artificial intelligence & image processingArtificial intelligenceTrisomybusiness030217 neurology & neurosurgery

researchProduct

Fusion Architectures for Word-Based Audiovisual Speech Recognition

2020

Computer scienceSpeech recognitionAudiovisual speechWord (computer architecture)Interspeech 2020

researchProduct

Spectrogram analysis of multipath fading channels

2015

The analysis of the Doppler power spectral density (PSD) of measured and simulated data is an important topic in the area of mobile radio channel modelling. In this paper, we estimate the Doppler PSD of multipath fading channels by using the concept of the spectrogram. The spectrogram is a spectral representation that gives insight into how the distribution of the spectral density of a signal changes over time. The multipath fading channel is modelled by a sum-of-cisoids (SOC) process. A closed-form solution is presented for the spectrogram and the corresponding time-dependent autocorrelation function (ACF). The closed-form solutions disclose several unwanted effects that come with the limi…

Computer scienceSpeech recognitionAutocorrelationBandwidth (signal processing)Spectral densitysymbols.namesakeComputer Science::SoundsymbolsSpectrogramFadingAlgorithmDoppler effectMultipath propagationComputer Science::Information TheoryCommunication channel2015 IEEE 26th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC)

researchProduct

Atrial activity extraction for atrial fibrillation analysis using blind source separation.

2004

This contribution addresses the extraction of atrial activity (AA) from real electrocardiogram (ECG) recordings of atrial fibrillation (AF). We show the appropriateness of independent component analysis (ICA) to tackle this biomedical challenge when regarded as a blind source separation (BSS) problem. ICA is a statistical tool able to reconstruct the unobservable independent sources of bioelectric activity which generate, through instantaneous linear mixing, a measurable set of signals. The three key hypothesis that make ICA applicable in the present scenario are discussed and validated: 1) AA and ventricular activity (VA) are generated by sources of independent bioelectric activity; 2) AA …

Computer scienceSpeech recognitionHeart VentriclesBiomedical EngineeringSignalBlind signal separationSensitivity and SpecificityElectrocardiographyRobustness (computer science)Heart Conduction SystemAtrial FibrillationmedicineHumansDiagnosis Computer-AssistedHeart AtriaPrincipal Component Analysismedicine.diagnostic_testBody Surface Potential MappingContrast (statistics)Reproducibility of ResultsAtrial fibrillationmedicine.diseaseIndependent component analysisKurtosisElectrocardiographyAlgorithmsIEEE transactions on bio-medical engineering

researchProduct

A Musical Pattern Discovery System Founded on a Modeling of Listening Strategies

2004

Music is a domain of expression that conveys a paramount degree of complexity. The musical surface, composed of a multitude of notes, results from the elaboration of numerous structures of different types and sizes. The composer constructs this structural complexity in a more or less explicit way. The listener, faced by such a complex phenomenon, is able to reconstruct only a limited part of it, mostly in a non-explicit way. One particular aim of music analysis is to objectify such complexity, thus offering to the listener a tool for enriching the appreciation of music (Lartillot and SaintJames, 2004). The trouble is, traditional musical analysis, although offering a valuable understanding …

Computer scienceSpeech recognitionMusical050105 experimental psychology060404 music[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI][INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing[STAT.ML]Statistics [stat]/Machine Learning [stat.ML][INFO.INFO-FL]Computer Science [cs]/Formal Languages and Automata Theory [cs.FL]Media Technology0501 psychology and cognitive sciencesSet (psychology)Musical formCognitive scienceStructure (mathematical logic)[INFO.INFO-PL]Computer Science [cs]/Programming Languages [cs.PL][SHS.MUSIQ]Humanities and Social Sciences/Musicology and performing arts05 social sciences06 humanities and the artsData structureComputer Science ApplicationsExpression (architecture)Music theory[INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD]NA0604 artsMusicMusical analysis

researchProduct

Depression Assessment by Fusing High and Low Level Features from Audio, Video, and Text

2016

International audience; Depression is a major cause of disability world-wide. The present paper reports on the results of our participation to the depression sub-challenge of the sixth Audio/Visual Emotion Challenge (AVEC 2016), which was designed to compare feature modalities ( audio, visual, interview transcript-based) in gender-based and gender-independent modes using a variety of classification algorithms. In our approach, both high and low level features were assessed in each modality. Audio features were extracted from the low-level descriptors provided by the challenge organizers. Several visual features were extracted and assessed including dynamic characteristics of facial elements…

Computer scienceSpeech recognitionPosterior probabilitymultimodal fusionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONImage processing02 engineering and technology[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI][SPI]Engineering Sciences [physics]AVEC 2016Histogram0202 electrical engineering electronic engineering information engineeringFeature (machine learning)[ SPI ] Engineering Sciences [physics]Affective computingaffective computing[ INFO.INFO-AI ] Computer Science [cs]/Artificial Intelligence [cs.AI]speech processing[SPI.ACOU]Engineering Sciences [physics]/Acoustics [physics.class-ph]Modality (human–computer interaction)[ SPI.ACOU ] Engineering Sciences [physics]/Acoustics [physics.class-ph]pattern recognition020206 networking & telecommunicationsSpeech processingimage processingStatistical classificationdepression assessment13. Climate actionPattern recognition (psychology)020201 artificial intelligence & image processing

researchProduct