Search results for "Software"

showing 10 items of 7396 documents

A Methodology for Bilingual Lexicon Extraction from Comparable Corpora

2015

Dictionary extraction using parallel corpora is well established. However, for many language pairs parallel corpora are a scarce resource which is why in the current work we discuss methods for dictionary extraction from comparable corpora. Hereby the aim is to push the boundaries of current approaches, which typically utilize correlations between co-occurrence patterns across languages, in several ways: 1) Eliminating the need for initial lexicons by using a bootstrapping approach which only requires a few seed translations. 2) Implementing a new approach which first establishes alignments between comparable documents across languages, and then computes cross-lingual alignments between wor…

Text corpusInterlinguaComputer sciencebusiness.industrymedia_common.quotation_subjectBootstrapping (linguistics)computer.software_genrelanguage.human_languageParallel corporaBilingual lexiconResource (project management)languageQuality (business)Artificial intelligencebusinesscomputerWord (computer architecture)Natural language processingmedia_commonProceedings of the Fourth Workshop on Hybrid Approaches to Translation (HyTra)
researchProduct

Reflection Assignment as a Tool to Support Students’ Metacognitive Awareness in the Context of Computer-Supported Collaborative Learning

2021

The present study explores the potential of a reflection assignment as a tool for supporting master’s degree students’ metacognitive skills in the context of computer-supported collaborative learning (CSCL). The research question (RQ) is formulated as follows: How does a regularly submitted reflection assignment support the development of students’ individual metacognitive awareness in the context of CSCL? The empirical data is a text corpus (7878 words) extracted from individual students’ (N = 13) reflection assignments (N = 65) submitted during one semester. Qualitative content analysis was employed to analyze the data. The results demonstrate that by the end of the course, the students s…

Text corpusReflection (computer programming)05 social sciences050301 educationMetacognition050109 social psychologyCollaborative learningContext (language use)computer.software_genreScripting languageComputer-supported collaborative learningComputingMilieux_COMPUTERSANDEDUCATIONMathematics education0501 psychology and cognitive sciencesPsychology0503 educationcomputerResearch question
researchProduct

Supporting Emotion Automatic Detection and Analysis over Real-Life Text Corpora via Deep Learning: Model, Methodology, and Framework

2021

This paper describes an approach for supporting automatic satire detection through effective deep learning (DL) architecture that has been shown to be useful for addressing sarcasm/irony detection problems. We both trained and tested the system exploiting articles derived from two important satiric blogs, Lercio and IlFattoQuotidiano, and significant Italian newspapers.

Text corpusSettore ING-INF/05 - Sistemi Di Elaborazione Delle InformazioniSettore INF/01 - InformaticaComputer sciencebusiness.industryDeep learningcomputer.software_genreNLPDeep LearningArtificial intelligenceSatire DetectionbusinesscomputerNatural language processing
researchProduct

The computation of word associations

2002

It is shown that basic language processes such as the production of free word associations and the generation of synonyms can be simulated using statistical models that analyze the distribution of words in large text corpora. According to the law of association by contiguity, the acquisition of word associations can be explained by Hebbian learning. The free word associations as produced by subjects on presentation of single stimulus words can thus be predicted by applying first-order statistics to the frequencies of word co-occurrences as observed in texts. The generation of synonyms can also be conducted on co-occurrence data but requires second-order statistics. The reason is that synony…

Text corpusSyntagmatic analysisbusiness.industryComputer scienceSynonymSpeech recognitionStatistical modelcomputer.software_genreProduction (computer science)Artificial intelligencebusinessAssociation (psychology)computerNatural language processingWord (computer architecture)Proceedings of the 19th international conference on Computational linguistics -
researchProduct

Revisiting corpus creation and analysis tools for translation tasks

2016

Many translation scholars have proposed the use of corpora to allow professional translators to produce high quality texts which read like originals. Yet, the diffusion of this methodology has been modest, one reason being the fact that software for corpora analyses have been developed with the linguist in mind, which means that they are generally complex and cumbersome, offering many advanced features, but lacking the level of usability and the specific features that meet translators’ needs. To overcome this shortcoming, we have developed TranslatorBank, a free corpus creation and analysis tool designed for translation tasks. TranslatorBank supports the creation of specialized monolingual …

Text corpusTranslationProfessionalizationTraducciónLinguistics and LanguageLiterature and Literary TheoryComputer sciencetranslationCorpus toolsMonolingual corpuscomputer.software_genreProfesionalizaciónLanguage and LinguisticsTerminologyDomain (software engineering)Example-based machine translationCorpus linguisticsmonolingual corpusprofessionalizationcorpus toolsConcordancerCorpus monolingüeTerminology extractionbusiness.industrylcsh:Translating and interpretingUsabilitylcsh:P306-310Herramientas de corpusArtificial intelligencebusinesscomputerNatural language processingCadernos de Tradução
researchProduct

Discovering the Senses of an Ambiguous Word by Clustering its Local Contexts

2005

As has been shown recently, it is possible to automatically discover the senses of an ambiguous word by statistically analyzing its contextual behavior in a large text corpus. However, this kind of research is still at an early stage. The results need to be improved and there is considerable disagreement on methodological issues. For example, although most researchers use clustering approaches for word sense induction, it is not clear what statistical features the clustering should be based on. Whereas so far most researchers cluster global co-occurrence vectors that reflect the overall behavior of a word in a corpus, in this paper we argue that it is more appropriate to use local context v…

Text corpusbusiness.industryComputer scienceContext (language use)computer.software_genreWord senseWord-sense inductionArtificial intelligencebusinessCluster analysiscomputerNatural language processingWord (computer architecture)Strengths and weaknesses
researchProduct

Weights Space Exploration Using Genetic Algorithms for Meta-classifier in Text Document Classification

2012

Text document classificationGeneral Computer ScienceComputer sciencebusiness.industryArtificial intelligenceElectrical and Electronic EngineeringbusinessMachine learningcomputer.software_genreClassifier (UML)computerSpace explorationStudies in Informatics and Control
researchProduct

Aspects Concerning SVM Method’s Scalability

2008

In the last years the quantity of text documents is increasing continually and automatic document classification is an important challenge. In the text document classification the training step is essential in obtaining a good classifier. The quality of learning depends on the dimension of the training data. When working with huge learning data sets, problems regarding the training time that increases exponentially are occurring. In this paper we are presenting a method that allows working with huge data sets into the training step without increasing exponentially the training time and without significantly decreasing the classification accuracy.

Text document classificationStructured support vector machinebusiness.industryComputer scienceDocument classificationcomputer.software_genreSupport vector machineText miningScalabilityData miningbusinessCluster analysiscomputerClassifier (UML)
researchProduct

A Controllable Text Simplification System for the Italian Language

2021

Text simplification is a non-trivial task that aims at reducing the linguistic complexity of written texts. Researchers have studied the problem by proposing new methodologies for addressing the English language, but other languages, like the Italian one, are almost unexplored. In this paper, we give a contribution to the enhancement of the Automated Text Simplification research by presenting a deep learning-based system, inspired by a state of the art system for the English language, capable of simplifying Italian texts. The system has been trained and tested by leveraging the Italian version of Newsela; it has shown promising results by achieving a SARI value of 30.17.

Text simplificationComputer scienceText simplification02 engineering and technologyEnglish languagecomputer.software_genreTask (project management)03 medical and health sciences0302 clinical medicineLinguistic sequence complexityDeep Learning0202 electrical engineering electronic engineering information engineeringValue (semiotics)Natural Language ProcessingSettore ING-INF/05 - Sistemi Di Elaborazione Delle InformazioniDeep Neural NetworksSettore INF/01 - Informaticabusiness.industryDeep learningItalian language030221 ophthalmology & optometryComputingMethodologies_DOCUMENTANDTEXTPROCESSING020201 artificial intelligence & image processingArtificial intelligenceState (computer science)businesscomputerNatural language processing
researchProduct

FISICA QUANTISTICA E FUNZIONI MENTALI SUPERIORI III

2010

Questo saggio riprende alcuni concetti di anatomia comparata e di fisiologia, sviluppati in una mia precedente ricerca dal titolo Termodinamica, campi quantici e funzioni mentali. Il fine è di chiarire alcuni punti controversi posti dall’analisi scientifica, in riguardo a teoremi di neuro anatomia e di neurofisiologia. Il filosofo Heiddegger H. riteneva che il pensiero umano non può affidarsi interamente all’indagine scientifica, traducendo in schemi ed in formule la vivente realtà della natura, sia fisica che biologica. La ricerca scientifica si basa su un sapere rigoroso e preciso, altamente dimostrativo che può competere con la matematica, ritenuta la scienza delle scienze. Nonostante ci…

The Mind Human Brain Software Geometry.Settore BIO/06 - Anatomia Comparata E Citologia
researchProduct