Search results for "Natural language"
showing 10 items of 650 documents
Comparing the Quality of Neural Machine Translation and Professional Post-Editing
2019
This empirical corpus study explores the quality of neural machine translations (NMT) and their post-edits (NMTPE) at the German Department of the European Commission’s Directorate-General for Translation (DGT) by evaluating NMT outputs, NMTPE, and respective revisions (REV) with the automatic error annotation tool Hjerson (Popovic 2011) and the more fine-grained manual MQM framework (Lommel 2014). Results show that quality assurance measures by post-editors and revisors at the DGT are most often necessary for lexical errors. More specifically, if post-editors correct mistranslations, terminology or stylistic errors in an NMT sentence, revisors are likely to correct the same type of error i…
A Lexicon-based Approach for Sentiment Classification of Amazon Books Reviews in Italian Language
2016
We present a system aimed at the automatic classification of the sentiment orientation expressed into book reviews written in Italian language. The system we have developed is found on a lexicon-based approach and uses NLP techniques in order to take into account the linguistic relation between terms in the analyzed texts. The classification of a review is based on the average sentiment strenght of its sentences, while the classification of each sentence is obtained through a parsing process inspecting, for each term, a window of previous items to detect particular combinations of elements giving inversions or variations of polarity. The score of a single word depends on all the associated …
Models of the Translation Process
2017
Visual Methods in Researching Language Practices and Language Learning: Looking at, Seeing, and Designing Language
2016
The changing ways of using language and various understandings of what language is have consequences for the way we research language practices and language learning. When engaging in social contact, people use diverse and complex forms, modes, and varieties of language to communicate, and moreover, these resources often include icons, images, and other semiotic ways of meaning making. Visuality thus has a natural position in people’s language practices. In this chapter, we discuss how visual methods have been adopted and used as a methodological tool in researching language practices and language learning. With this focus, attention is geared to the materiality of language, on the one hand…
Erazm Rykaczewski’s A Complete Dictionary English and Polish… (1849): Uncovering the Compilation Process
2015
This paper looks at Erazm Rykaczewski’s A Complete Dictionary English and Polish... (1849), one of the milestones in the history of English-Polish / Polish-English lexicography. Despite its significance for the bilingual user in Poland and English-speaking countries with large Polish diasporas, where it came to be reprinted over the next century, it has attracted little scholarly attention so far. Based on a comparative analysis of the bilingual dictionary and its assumed sources, the paper sheds some light on the methodology of compilation in which borrowing, adaptation, and translation turn out to have been the lexicographer’s main working practice. The findings are presented in a framewo…
Figurative language and multicultural education: metaphors of language acquisition and retention
2015
Linguistics has long recognised that figurative language in the form of metaphorical expressions structures and communicates attitudes towards the ideas and concepts being expressed and that multilingual students also employ linguistic figures frequently in their writing. In this study, multilingual students use figurative language to both critique and describe experiences related to language acquisition and retention. Faced with the task of using three or more languages, the L3 English language students studied often turn to metaphor to describe the relationships between their languages and the different contexts in which they use the linguistic resources available to them. The following a…
Deriving Enhanced Universal Dependencies from a Hybrid Dependency-Constituency Treebank
2018
The treebanks provided by the Universal Dependencies (UD) initiative are a state-of-the-art resource for cross-lingual and monolingual syntax-based linguistic studies, as well as for multilingual dependency parsing. Creating a UD treebank for a language helps further the UD initiative by providing an important dataset for research and natural language processing in that language. In this paper, we describe how we created a UD treebank for Latvian, and how we obtained both the basic and enhanced UD representations from the data in Latvian Treebank which is annotated according to a hybrid dependency-constituency grammar model. The hybrid model was inspired by Lucien Tesniere’s dependency gram…
Different Languages - Different Sentence Types? On Exclamative Sentences
2016
It is not equally easy for all languages to establish an exclamative sentence type. It seems the easiest for those languages that feature a morphological marking for an exclamative sentence type like Turkish or Vietnamese. English on the other hand is a language that does not mark exclamative clauses with an easily identifiable marker but uses certain preferred constructions, which allow us to separate a class of ‘exclamative sentences’ from other sentence types. However, there is another class of languages for which it is even harder to determine if ‘exclamative sentences’ exist as a sentence type. In those languages, these sentences share a striking amount of formal properties with senten…
The Effects of Multiple‐Exposure Textual Enhancement on Child L2 Learners’ Development in Derivational Morphology: A Multi‐Site Study
2021
Combining Machine Translated Sentence Chunks from Multiple MT Systems
2018
This paper presents a hybrid machine translation (HMT) system that pursues syntactic analysis to acquire phrases of source sentences, translates the phrases using multiple online machine translation (MT) system application program interfaces (APIs) and generates output by combining translated chunks to obtain the best possible translation. The aim of this study is to improve translation quality of English – Latvian texts over each of the individual MT APIs. The selection of the best translation hypothesis is done by calculating the perplexity for each hypothesis using an n-gram language model. The result is a phrase-based multi-system machine translation system that allows to improve MT out…