Search results for "Language processing"

showing 10 items of 421 documents

A Semantic Layer on Semi-structured Data Sources for Intuitive Chatbots

2009

The main limits of chatbot technology are related to the building of their knowledge representation and to their rigid information retrieval and dialogue capabilities, usually based on simple "pattern matching rules". The analysis of distributional properties of words in a texts corpus allows the creation of semantic spaces where represent and compare natural language elements. This space can be interpreted as a "conceptual" space where the axes represent the latent primitive concepts of the analyzed corpus. The presented work aims at exploiting the properties of a data-driven semantic/conceptual space built using semi-structured data sources freely available on the web, like Wikipedia. Thi…

Information retrievalKnowledge representation and reasoningbusiness.industryComputer scienceComputer Science::Information Retrievalcomputer.software_genreChatbotsemantic spaces chatbotSemantic similarityExplicit semantic analysisEncyclopediaSemi-structured dataPattern matchingArtificial intelligencebusinesscomputerNatural language processingNatural language
researchProduct

A Comparison of Language Identification Approaches on Short, Query-Style Texts

2010

In a multi-language Information Retrieval setting, the knowledge about the language of a user query is important for further processing. Hence, we compare the performance of some typical approaches for language detection on very short, query-style texts. The results show that already for single words an accuracy of more than 80% can be achieved, for slightly longer texts we even observed accuracy values close to 100%.

Information retrievalLanguage identificationComputer sciencebusiness.industryArtificial intelligencecomputer.software_genrebusinesscomputerNatural language processingStyle (sociolinguistics)
researchProduct

Enriching Didactic Similarity Measures of Concept Maps by a Deep Learning Based Approach

2021

Concept maps are significant tools able to support several tasks in the educational area such as curriculum design, knowledge organization and modeling, students' assessment and many others. They are also successfully used in learning activities in which students have to represent domain knowledge according to teacher's assignment. In this context, the development of Learning Analytics approaches would benefit of methods that automatically compare concept maps. Detecting concept maps similarities is relevant to identify how the same concepts are used in different knowledge representations. Algorithms for comparing graphs have been extensively studied in the literature, but they do not appea…

Information retrievalLearning AnalyticKnowledge representation and reasoningComputer scienceConcept mapKnowledge organizationLearning analyticsContext (language use)SemanticsLearning AnalyticsConcept MapConcept MapsDeep LearningInfersentSimilarity (psychology)Semantic Similarity MeasuresDomain knowledgeNatural Language Processing
researchProduct

FrameNet CNL: A Knowledge Representation and Information Extraction Language

2014

The paper presents a FrameNet-based information extraction and knowledge representation framework, called FrameNet-CNL. The framework is used on natural language documents and represents the extracted knowledge in a tailor-made Frame-ontology from which unambiguous FrameNet-CNL paraphrase text can be generated automatically in multiple languages. This approach brings together the fields of information extraction and CNL, because a source text can be considered belonging to FrameNet-CNL, if information extraction parser produces the correct knowledge representation as a result. We describe a state-of-the-art information extraction parser used by a national news agency and speculate that Fram…

Information retrievalParsingKnowledge representation and reasoningbusiness.industryComputer scienceAgency (philosophy)computer.software_genreParaphraseInformation extractionArtificial intelligenceSource textFrameNetbusinesscomputerNatural language processingNatural language
researchProduct

Part-of-speech labeling for Reuters database

2015

Even if the Vector Space Model used for document representation in information retrieval systems integrates a small quantity of knowledge it continues to be used due to its computational cost, speed execution and simplicity. We try to improve this document representation by adding some syntactic information such as the parts of speech. In this paper, we have evaluated three different tagging algorithms in order to select the most suitable tagger for using it to tag the Reuters dataset. In this work, we have evaluated the taggers using only five different parts of speech: noun, verb, adverb, adjective and others. We considered these particular tags being the most representative for describin…

Information retrievalbusiness.industryComputer scienceInformationSystems_INFORMATIONSTORAGEANDRETRIEVALVerbAdverbSpace (commercial competition)Part of speechcomputer.software_genreSequence labelingNounVector space modelArtificial intelligencebusinesscomputerAdjectiveNatural language processing2015 19th International Conference on System Theory, Control and Computing (ICSTCC)
researchProduct

A KST-BASED SYSTEM FOR STUDENT TUTORING

2008

Abstract: A novel assessment procedure based on knowledge space theory (KST) is presented along with a complete implementation of an intelligent tutoring system. (ITS) that has been used to test our theoretical findings. The key idea is that correct assessment of the student knowledge is strictly related to the structure of the domain ontology. Suitable relationships between the concepts must be present to allow the creation of a reverse path from the "knowledge state" representing the student goal to the one that contains her actual knowledge about this topic. Knowledge space theory is a very good framework to guide the process of building the ontology used, by the artificial tutor The sys…

Intelligent systemStructure (mathematical logic)CorrectnessOntologyComputer scienceLatent semantic analysisbusiness.industryOntology (information science)computer.software_genreIntelligent tutoring systemDomain (software engineering)Artificial IntelligenceArtificial intelligenceDialog systemStudentsbusinesscomputerNatural language processing systemKST Student TutoringNatural languageNatural language processingApplied Artificial Intelligence
researchProduct

BUCC Shared Task: Cross-Language Document Similarity

2015

We summarise the organisation and results of the first shared task aimed at detecting the most similar texts in a large multilingual collection. The dataset of the shared was based on Wikipedia dumps with interlanguage links with further filtering to ensure comparability of the paired articles. The eleven system runs we received have been evaluated using the TREC evaluation metrics. 1 Task description Parallel corpora of original texts with their translations provide the basis for multilingual NLP applications since the beginning of the 1990s. Relative scarcity of such resources led to greater attention to comparable (=less parallel) resources to mine information about possible translations…

InterlanguageDocument similarityInformation retrievalComputer sciencebusiness.industryInformationSystems_INFORMATIONSTORAGEANDRETRIEVALArtificial intelligencecomputer.software_genrebusinesscomputerNatural language processingTask (project management)Proceedings of the Eighth Workshop on Building and Using Comparable Corpora
researchProduct

Rigotrio At Semeval-2017 Task 9: Combining Machine Learning And Grammar Engineering For Amr Parsing And Generation

2017

By addressing both text-to-AMR parsing and AMR-to-text generation, SemEval-2017 Task 9 established AMR as a powerful semantic interlingua. We strengthen the interlingual aspect of AMR by applying the multilingual Grammatical Framework (GF) for AMR-to-text generation. Our current rule-based GF approach completely covered only 12.3% of the test AMRs, therefore we combined it with state-of-the-art JAMR Generator to see if the combination increases or decreases the overall performance. The combined system achieved the automatic BLEU score of 18.82 and the human Trueskill score of 107.2, to be compared to the plain JAMR Generator results. As for AMR parsing, we added NER extensions to our SemEva…

InterlinguaGenerator (computer programming)Parsingbusiness.industryComputer scienceSpeech recognitionGrammatical Framework02 engineering and technologycomputer.software_genreComputer scienceSemEvallanguage.human_languageTask (project management)020204 information systems0202 electrical engineering electronic engineering information engineeringlanguage020201 artificial intelligence & image processingGrammar engineeringArtificial intelligencebusinesscomputerNatural language processingBLEU
researchProduct

Rough Pragmatic Description Logic

2013

In this chapter, a rough description logic is built on the basis of a pragmatic standpoint of representation of knowledge. The pragmatic standpoint has influenced the acceptance of a broader definition of the semantic network than that appearing in the literature. The definition of the semantic network is a motivation of the introduced semantics of the language of the descriptive logic. First, the theoretical framework of representation of knowledge that was proposed in the papers [24,25] is adjusted to the description of data processing. The pragmatic system of knowledge representation is determined, as well as situations of semantic adequacy and semantic inadequacy for represented knowled…

Interpretation (logic)Knowledge representation and reasoningbusiness.industrycomputer.software_genreSemanticsSemantic networkDescription logicFormal languageInformation systemRough setArtificial intelligencebusinesscomputerNatural language processingMathematics
researchProduct

ETAT: Expository Text Analysis Tool.

2002

Qualitative methods that analyze the coherence of expository texts not only are time consuming, but also present challenges in collecting data on coding reliability. We describe software that analyzes expository texts more rapidly and produces a notable level of objectivity. ETAT (Expository Text Analysis Tool) analyzes the coherence of expository texts. ETAT adopts a symbolic representational system, known as conceptual graph structures. ETAT follows three steps: segmentation of a text into nodes, classification of the unidentified nodes, and linking the nodes with relational arcs. ETAT automatically constructs a graph in the form of nodes and their interrelationships, along with various a…

JavaComputer scienceWritingExperimental and Cognitive Psychologycomputer.software_genreText comprehensionSoftwareMicrocomputersArtificial IntelligenceQuestion answeringSegmentationObjectivity (science)General Psychologycomputer.programming_languageInformation ServicesObserver Variationbusiness.industryReading comprehensionProgramming LanguagesPsychology (miscellaneous)Artificial intelligenceWord ProcessingbusinesscomputerNatural language processingSoftwareCoding (social sciences)Behavior research methods, instruments,computers : a journal of the Psychonomic Society, Inc
researchProduct