Search results for "Computer Science - Computation and Language"

showing 10 items of 31 documents

Investigating label suggestions for opinion mining in German Covid-19 social media

2021

This work investigates the use of interactively updated label suggestions to improve upon the efficiency of gathering annotations on the task of opinion mining in German Covid-19 social media data. We develop guidelines to conduct a controlled annotation study with social science students and find that suggestions from a model trained on a small, expert-annotated dataset already lead to a substantial improvement - in terms of inter-annotator agreement(+.14 Fleiss' $\kappa$) and annotation quality - compared to students that do not receive any label suggestions. We further find that label suggestions from interactively trained models do not lead to an improvement over suggestions from a stat…

FOS: Computer and information sciencesComputer Science - Computation and LanguageInformation retrievalCoronavirus disease 2019 (COVID-19)Computer sciencemedia_common.quotation_subjectSentiment analysislanguage.human_languageTask (project management)GermanAnnotationlanguageQuality (business)Social mediaTransfer of learningComputation and Language (cs.CL)media_common

researchProduct

Polysemy in Controlled Natural Language Texts

2015

Computational semantics and logic-based controlled natural languages (CNL) do not address systematically the word sense disambiguation problem of content words, i.e., they tend to interpret only some functional words that are crucial for construction of discourse representation structures. We show that micro-ontologies and multi-word units allow integration of the rich and polysemous multi-domain background knowledge into CNL thus providing interpretation for the content words. The proposed approach is demonstrated by extending the Attempto Controlled English (ACE) with polysemous and procedural constructs resulting in a more natural CNL named PAO covering narrative multi-domain texts.

FOS: Computer and information sciencesComputer Science - Computation and LanguageInterpretation (logic)Computer sciencebusiness.industryRepresentation (arts)Content wordcomputer.software_genrelanguage.human_languageControlled natural languageComputational semanticslanguageAttempto Controlled EnglishArtificial intelligencePolysemybusinessComputation and Language (cs.CL)computerNatural languageNatural language processing

researchProduct

Computational linguistic assessment of textbook and online learning media by means of threshold concepts in business education

2020

Threshold concepts are key terms in domain-based knowledge acquisition. They are regarded as building blocks of the conceptual development of domain knowledge within particular learners. From a linguistic perspective, however, threshold concepts are instances of specialized vocabularies, exhibiting particular linguistic features. Threshold concepts are typically used in specialized texts such as textbooks -- that is, within a formal learning environment. However, they also occur in informal learning environments like newspapers. In this article, a first approach is taken to combine both lines into an overarching research program - that is, to provide a computational linguistic assessment of…

FOS: Computer and information sciencesComputer Science - Computation and LanguageK.3.m68T50 (Primary) 68T30 91F20 (Secondary)I.2.7; J.5; K.3.mI.2.7J.5Computation and Language (cs.CL)

researchProduct

Towards the evaluation of automatic simultaneous speech translation from a communicative perspective

2021

In recent years, automatic speech-to-speech and speech-to-text translation has gained momentum thanks to advances in artificial intelligence, especially in the domains of speech recognition and machine translation. The quality of such applications is commonly tested with automatic metrics, such as BLEU, primarily with the goal of assessing improvements of releases or in the context of evaluation campaigns. However, little is known about how the output of such systems is perceived by end users or how they compare to human performances in similar communicative tasks. In this paper, we present the results of an experiment aimed at evaluating the quality of a real-time speech translation engine…

FOS: Computer and information sciencesComputer Science - Computation and LanguageMachine translationEnd userComputer sciencebusiness.industrymedia_common.quotation_subjectSample (statistics)Context (language use)Intelligibility (communication)computer.software_genreSpeech translationQuality (business)Artificial intelligencebusinessComputation and Language (cs.CL)computerInterpreterNatural language processingmedia_commonProceedings of the 18th International Conference on Spoken Language Translation (IWSLT 2021)

researchProduct

Combining a Context Aware Neural Network with a Denoising Autoencoder for Measuring String Similarities

2018

Measuring similarities between strings is central for many established and fast growing research areas including information retrieval, biology, and natural language processing. The traditional approach for string similarity measurements is to define a metric over a word space that quantifies and sums up the differences between characters in two strings. The state-of-the-art in the area has, surprisingly, not evolved much during the last few decades. The majority of the metrics are based on a simple comparison between character and character distributions without consideration for the context of the words. This paper proposes a string metric that encompasses similarities between strings bas…

FOS: Computer and information sciencesComputer Science - Machine LearningArtificial Intelligence (cs.AI)Computer Science - Computation and LanguageComputer Science - Artificial IntelligenceComputation and Language (cs.CL)Information Retrieval (cs.IR)Machine Learning (cs.LG)Computer Science - Information Retrieval

researchProduct

Structured query construction via knowledge graph embedding

2020

In order to facilitate the accesses of general users to knowledge graphs, an increasing effort is being exerted to construct graph-structured queries of given natural language questions. At the core of the construction is to deduce the structure of the target query and determine the vertices/edges which constitute the query. Existing query construction methods rely on question understanding and conventional graph-based algorithms which lead to inefficient and degraded performances facing complex natural language questions over knowledge graphs with large scales. In this paper, we focus on this problem and propose a novel framework standing on recent knowledge graph embedding techniques. Our…

FOS: Computer and information sciencesComputer Science - Machine LearningComputer Science - Computation and LanguageComputer Science - Artificial Intelligenceknowledge graph embeddingnatural language question answeringkyselykieletMachine Learning (cs.LG)luonnollinen kieliArtificial Intelligence (cs.AI)knowledge graphquery constructionComputation and Language (cs.CL)tietomallit

researchProduct

A Relational Tsetlin Machine with Applications to Natural Language Understanding

2021

TMs are a pattern recognition approach that uses finite state machines for learning and propositional logic to represent patterns. In addition to being natively interpretable, they have provided competitive accuracy for various tasks. In this paper, we increase the computing power of TMs by proposing a first-order logic-based framework with Herbrand semantics. The resulting TM is relational and can take advantage of logical structures appearing in natural language, to learn rules that represent how actions and consequences are related in the real world. The outcome is a logic program of Horn clauses, bringing in a structured view of unstructured data. In closed-domain question-answering, th…

FOS: Computer and information sciencesComputer Science - Machine LearningComputer Science - Logic in Computer ScienceComputer Science - Computation and LanguageI.2.4Computer Science - Artificial IntelligenceComputer Networks and CommunicationsI.2.7Machine Learning (cs.LG)Logic in Computer Science (cs.LO)Artificial Intelligence (cs.AI)Artificial IntelligenceHardware and ArchitectureComputation and Language (cs.CL)I.2.7; I.2.4SoftwareInformation Systems

researchProduct

Fast Neural Machine Translation Implementation

2018

This paper describes the submissions to the efficiency track for GPUs at the Workshop for Neural Machine Translation and Generation by members of the University of Edinburgh, Adam Mickiewicz University, Tilde and University of Alicante. We focus on efficient implementation of the recurrent deep-learning model as implemented in Amun, the fast inference engine for neural machine translation. We improve the performance with an efficient mini-batching algorithm, and by fusing the softmax operation with the k-best extraction algorithm. Submissions using Amun were first, second and third fastest in the GPU efficiency track.

FOS: Computer and information sciencesFocus (computing)Computer Science - Computation and LanguageMachine translationComputer sciencebusiness.industrycomputer.software_genreTrack (rail transport)Softmax functionArtificial intelligenceInference enginebusinesscomputerComputation and Language (cs.CL)

researchProduct

Visualization of Jacques Lacan's Registers of the Psychoanalytic Field, and Discovery of Metaphor and of Metonymy. Analytical Case Study of Edgar All…

2016

We start with a description of Lacan's work that we then take into our analytics methodology. In a first investigation, a Lacan-motivated template of the Poe story is fitted to the data. A segmentation of the storyline is used in order to map out the diachrony. Based on this, it will be shown how synchronous aspects, potentially related to Lacanian registers, can be sought. This demonstrates the effectiveness of an approach based on a model template of the storyline narrative. In a second and more comprehensive investigation, we develop an approach for revealing, that is, uncovering, Lacanian register relationships. Objectives of this work include the wide and general application of our met…

FOS: Computer and information sciencesI.2Computer Science - Computation and LanguageStatistics - Machine LearningI.5.3I.5.462H25 62H30G.3Machine Learning (stat.ML)G.2.2Computation and Language (cs.CL)I.5.3; I.5.4; I.2; G.2.2; G.3

researchProduct

Word-level human interpretable scoring mechanism for novel text detection using Tsetlin Machines

2021

Recent research in novelty detection focuses mainly on document-level classification, employing deep neural networks (DNN). However, the black-box nature of DNNs makes it difficult to extract an exact explanation of why a document is considered novel. In addition, dealing with novelty at the word-level is crucial to provide a more fine-grained analysis than what is available at the document level. In this work, we propose a Tsetlin machine (TM)-based architecture for scoring individual words according to their contribution to novelty. Our approach encodes a description of the novel documents using the linguistic patterns captured by TM clauses. We then adopt this description to measure how …

FOS: Computer and information sciencesI.2Computer Science - Machine LearningArtificial Intelligence (cs.AI)Computer Science - Computation and LanguageI.5Artificial IntelligenceComputer Science - Artificial IntelligenceI.2; I.5; I.7Computation and Language (cs.CL)I.7VDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550Machine Learning (cs.LG)

researchProduct