Search results for "Language processing"
showing 10 items of 421 documents
Deriving Enhanced Universal Dependencies from a Hybrid Dependency-Constituency Treebank
2018
The treebanks provided by the Universal Dependencies (UD) initiative are a state-of-the-art resource for cross-lingual and monolingual syntax-based linguistic studies, as well as for multilingual dependency parsing. Creating a UD treebank for a language helps further the UD initiative by providing an important dataset for research and natural language processing in that language. In this paper, we describe how we created a UD treebank for Latvian, and how we obtained both the basic and enhanced UD representations from the data in Latvian Treebank which is annotated according to a hybrid dependency-constituency grammar model. The hybrid model was inspired by Lucien Tesniere’s dependency gram…
Different Languages - Different Sentence Types? On Exclamative Sentences
2016
It is not equally easy for all languages to establish an exclamative sentence type. It seems the easiest for those languages that feature a morphological marking for an exclamative sentence type like Turkish or Vietnamese. English on the other hand is a language that does not mark exclamative clauses with an easily identifiable marker but uses certain preferred constructions, which allow us to separate a class of ‘exclamative sentences’ from other sentence types. However, there is another class of languages for which it is even harder to determine if ‘exclamative sentences’ exist as a sentence type. In those languages, these sentences share a striking amount of formal properties with senten…
The Effects of Multiple‐Exposure Textual Enhancement on Child L2 Learners’ Development in Derivational Morphology: A Multi‐Site Study
2021
Combining Machine Translated Sentence Chunks from Multiple MT Systems
2018
This paper presents a hybrid machine translation (HMT) system that pursues syntactic analysis to acquire phrases of source sentences, translates the phrases using multiple online machine translation (MT) system application program interfaces (APIs) and generates output by combining translated chunks to obtain the best possible translation. The aim of this study is to improve translation quality of English – Latvian texts over each of the individual MT APIs. The selection of the best translation hypothesis is done by calculating the perplexity for each hypothesis using an n-gram language model. The result is a phrase-based multi-system machine translation system that allows to improve MT out…
Semi-automated annotation of page-based documents within the Genre and Multimodality framework
2016
This paper describes ongoing work on a tool developed for annotating document images for their multimodal features and compiling this information into a corpus. The tool leverages open source computer vision and natural language processing libraries to describe the content and structure of multimodal documents and to generate multiple layers of XML annotation. The paper introduces the annotation schema, describes the document processing pipeline and concludes with a brief description of future work.
Eye Movement Analyses for Obtaining Readability Formula for Latvian Texts for Primary School
2017
To determine the difficulty of text, readability formulas can be used. The research was made to find readability formula for Latvian. Readability formulas for English were used as guidelines. The novelty was the use of eye movement tracking during reading to obtain quantitative data that lead to readability formula. Eye fixation durations were gathered during readability measurements. Average values of fixation durations were calculated to characterize texts and readers. 15 texts with various difficulty levels were composed for exposing them to readers. More than 300 children of grades 1 - 4 were participating in measurements. Average values of eye fixation durations of readers from a certa…
Introduction to Gestural Similarity in Music. An Application of Category Theory to the Orchestra
2019
Mathematics, and more generally computational sciences, intervene in several aspects of music. Mathematics describes the acoustics of the sounds giving formal tools to physics, and the matter of music itself in terms of compositional structures and strategies. Mathematics can also be applied to the entire making of music, from the score to the performance, connecting compositional structures to acoustical reality of sounds. Moreover, the precise concept of gesture has a decisive role in understanding musical performance. In this paper, we apply some concepts of category theory to compare gestures of orchestral musicians, and to investigate the relationship between orchestra and conductor, a…
Semantic structures of timbre emerging from social and acoustic descriptions of music
2011
The perceptual attributes of timbre have inspired a considerable amount of multidisciplinary research, but because of the complexity of the phenomena, the approach has traditionally been confined to laboratory conditions, much to the detriment of its ecological validity. In this study, we present a purely bottom-up approach for mapping the concepts that emerge from sound qualities. A social media ( http://www.last.fm ) is used to obtain a wide sample of verbal descriptions of music (in the form of tags) that go beyond the commonly studied concept of genre, and from this the underlying semantic structure of this sample is extracted. The structure that is thereby obtained is then evaluated th…
Legibility of Text and Pictograms in Variable Message Signs: Can Single-Word Messages Outperform Pictograms?
2018
The current research shows the advantage of single-word messages in the particular case of variable message signs (VMSs) with a high aspect ratio.Early studies on traffic sign design proposed that pictorial information would advantage equivalent text messages in static signs.We used a driving simulator to present individually 36 VMSs, showing six words (e.g., "congestion") and six danger signs (e.g., congestion traffic sign). In Experiment 1, 18 drivers read aloud the text or orally identified the pictograms as soon as they could correctly do it. In Experiment 2, a different sample of 18 drivers gave a motor response, according to the meaning of the message. We analyzed the legibility dista…
A new set of 299 pictures for psycholinguistic studies : French norms for name agreement, image agreement, conceptual familiarity, visual complexity,…
2003
Pictures are often used as stimuli in studies of perception, language, and memory. Since performances on different sets of pictures are generally contrasted, stimulus selection requires the use of standardized material to match pictures across different variables. Unfortunately, the number of standardized pictures available for empirical research is rather limited. The aim of the present study is to provide French normative data for a new set of 299 black-and-white drawings. Alario and Ferrand (1999) were closely followed in that the pictures were standardized on six variables: name agreement, image agreement, conceptual familiarity, visual complexity, image variability, and age of acquisit…