Search results for "Language processing"
showing 10 items of 421 documents
Strength Training: Single Versus Multiple Sets
1999
Using Automatic Morphological Tools to Process Data from a Learner Corpus of Hungarian
2014
The aim of this article is to show how automatic morphological tools originally used to analyze native speaker data can be applied to process data from a learner corpus of Hungarian. We collected written data from 35 students majoring in Hungarian studies at the University of Zagreb, Croatia. The data were analyzed by magyarlanc, a sentence splitter, morphological analyzer, POS-tagger and dependency parser, which found 667 unknown word forms. We investigated the recommendations made by the Hungarian spellchecker hunspell for these unknown words and the correct forms were manually chosen. It was found that if the first suggestion made by hunspell was automatically accepted, an accuracy score…
Machine Learning Models for Measuring Syntax Complexity of English Text
2019
In this paper we propose a methodology to assess the syntax complexity of a sentence representing it as sequence of parts-of-speech and comparing Recurrent Neural Networks and Support Vector Machine. We have carried out experiments in English language which are compared with previous results obtained for the Italian one.
Explaining Causes Behind SQL Query Formulation Errors
2020
This Full Research Paper presents the most prominent query formulation errors in Structured Query Language (SQL), and maps these errors to their cognitive explanations. Understanding query formulation errors is a key to teaching SQL. more effectively. However, studies on what kind of errors novices struggle with are relatively scarce when compared to, for example, programming languages. Although committing errors is a crucial part in learning, some errors are relatively easy to fix, and their commonness is not necessarily an indication of their difficulty. Other errors, however, halt the learning process, and are never fixed by the query writer. Using a previously established error taxonomy…
Noisy Channel in Language-Pair Phenomena Identification
2014
The Fuzzy Concept of Idiom and What It Might Mean for Bilingual Dictionaries
2019
Linguistic categories were developed as tools for describing language systems and making them easier to learn. However, like many theoretical concepts and systems, they do not fully represent the real world and, in some cases, seek to imprison linguistic units within a well-ordered system – a procrustean bed as it were. Besides, although the most general categories are universal, the lower-ranking ones are often language-specific. Idiom (or phraseologism) is a very unclear linguistic concept, subject to never-ending debate. However, a strict adherence to categorisation is observable in practical bilingual lexicography and phraseography. This may lead to unwanted compartmentalisation and a…
Timbral Qualities of Semantic Structures of Music.
2010
The rapid expansion of social media in music has provided the field with impressive datasets that offer insights into the semantic structures underlying everyday uses and classification of music. We hypothesize that the organization of these structures are rather directly linked with the ”qualia” of the music as sound. To explore the ways in which these structures are connected with the qualities of sounds, a semantic space was extracted from a large collection of musical tags with latent semantic and cluster analysis. The perceptual and musical properties of 19 clusters were investigated by a similarity rating task that used spliced musical excerpts representing each cluster. The resulting…
Dynamic assessment of word derivational knowledge: Tracing the development of a learner
2016
The present paper reports on a case study that explored the applicability of dynamic assessment (DA) for promoting learners’ word derivational knowledge in English as a second or a foreign language (L2). One learner’s performance on tasks assessing his word derivational knowledge was measured four times. The first two measurements were conducted before and after three weekly human-mediated DA sessions and the last two, which took place a year and a half later, before and after three weekly computerised DA sessions. Think aloud protocols and interviews were used to trace changes in the learner’s use of strategies and knowledge sources. The results revealed that following the dynamic assessme…
Cognitive Linguistics as the Underlying Framework for Semantic Annotation
2012
In recent years many attempts have been made to design suitable sets of rules aimed at extracting the semantic meaning from plain text, and to achieve annotation, but very few approaches make extensive use of grammars. Current systems are mainly focused on extracting the semantic role of the entities described in the text. This approach has limitations: in such applications the semantic role is conceived merely as the meaning of the involved entities without considering their context. As an example, current semantic annotators often specify a date entity without any annotation regarding the kind of the date itself i.e. a birth date, a book publication date, and so on. Moreover, these system…
Cognitive Computing supported Medical Decision Support System for Patient’s Driving Assessment
2018
To smartly utilize a huge and constantly growing volume of data, improve productivity and increase competitiveness in various fields of life; human requires decision making support systems that efficiently process and analyze the data, and, as a result, significantly speed up the process. Similarly to all other areas of human life, healthcare domain also is lacking Artificial Intelligence (AI) based solution. A number of supervised and unsupervised Machine Learning and Data Mining techniques exist to help us to deal with structured data. However, in a real life, we pretty much deal with unstructured data that hides useful knowledge and valuable information inside human-readable plain texts,…