Search results for "language processing"

showing 10 items of 421 documents

A practical solution to the problem of automatic part-of-speech induction from text

2005

The problem of part-of-speech induction from text involves two aspects: Firstly, a set of word classes is to be derived automatically. Secondly, each word of a vocabulary is to be assigned to one or several of these word classes. In this paper we present a method that solves both problems with good accuracy. Our approach adopts a mixture of statistical methods that have been successfully applied in word sense induction. Its main advantage over previous attempts is that it reduces the syntactic space to only the most important dimensions, thereby almost eliminating the otherwise omnipresent problem of data sparseness.

Vocabularybusiness.industryComputer sciencemedia_common.quotation_subjectSpeech recognitionSpace (commercial competition)Part of speechcomputer.software_genreSyntaxSet (abstract data type)Word-sense inductionArtificial intelligencebusinesscomputerNatural language processingWord (computer architecture)media_commonProceedings of the ACL 2005 on Interactive poster and demonstration sessions - ACL '05

researchProduct

Ontology languages for the semantic web: A never completely updated review

2006

This paper gives a never completely account of approaches that have been used for the research community for representing knowledge. After underlining the importance of a layered approach and the use of standards, it starts with early efforts used for artificial intelligence researchers. Then recent approaches, aimed mainly at the semantic web, are described. Coding examples from the literature are presented in both sections. Finally, the semantic web ontology creation process, as we envision it, is introduced.

Web standardsOntology Inference LayerInformation Systems and ManagementKnowledge representation and reasoningComputer sciencecomputer.internet_protocolProcess ontologyOntology (information science)computer.software_genreSocial Semantic WebOWL-SManagement Information SystemsWorld Wide WebOpen Biomedical OntologiesArtificial IntelligenceSemantic computingSemantic analyticsUpper ontologySemantic Web StackSemantic Webbusiness.industryOntology-based data integrationSuggested Upper Merged OntologyOntology languageOntologyArtificial intelligencebusinessWeb intelligencecomputerOntology alignmentSoftwareNatural language processingKnowledge-Based Systems

researchProduct

Natural Language Processing Agents and Document Clustering in Knowledge Management

2008

While HTML provides the Web with a standard format for information presentation, XML has been made a standard for information structuring on the Web. The mission of the Semantic Web now is to provide meaning to the Web. Apart from building on the existing Web technologies, we need other tools from other areas of science to do that. This chapter shows how natural language processing methods and technologies, together with ontologies and a neural algorithm, can be used to help in the task of adding meaning to the Web, thus making the Web a better platform for knowledge management in general.

Web standardsmedicine.medical_specialtyInformation retrievalKnowledge managementWeb developmentbusiness.industryComputer sciencecomputer.software_genreSocial Semantic WebWorld Wide WebmedicineArtificial intelligenceSemantic Web StackWeb servicebusinessWeb modelingcomputerSemantic WebData WebNatural language processing

researchProduct

Within and between variations of texts elicited from nine wine experts

2006

Nine wine experts tasted in replicate six Chardonnay wines that had been aged in oak barrels from different forests and/or species. They freely gave their descriptions in writing; the only instruction given was to underline three words or expressions that best characterized each tasted wine. The texts were submitted to an objective lexical analysis that quantified the important variation among the experts. In addition a matching task was performed by 117 assessors in which each assessor received from each expert six white cards and six yellow cards representing the descriptions of the six white wines and six red wines. The assessors were incapable of matching the descriptions for the same e…

WineNutrition and Dieteticsbusiness.industryLexical analysisReplicateArtificial intelligencePsychologycomputer.software_genrebusinesscomputerNatural language processingFood ScienceFood Quality and Preference

researchProduct

An Extension of the VSM Documents Representation using Word Embedding

2017

Abstract In this paper, we will present experiments that try to integrate the power of Word Embedding representation in real problems for documents classification. Word Embedding is a new tendency used in the natural language processing domain that tries to represent each word from the document in a vector format. This representation embeds the semantically context in that the word occurs more frequently. We include this new representation in a classical VSM document representation and evaluate it using a learning algorithm based on the Support Vector Machine. This new added information makes the classification to be more difficult because it increases the learning time and the memory neede…

Word embeddingComputer sciencebusiness.industryRepresentation (systemics)Context (language use)Extension (predicate logic)computer.software_genreDomain (software engineering)Support vector machineVector graphicsArtificial intelligencebusinesscomputerWord (computer architecture)Natural language processingBalkan Region Conference on Engineering and Business Education

researchProduct

Interpretability in Word Sense Disambiguation using Tsetlin Machine

2021

Word-sense disambiguationComputer sciencebusiness.industryArtificial intelligencecomputer.software_genrebusinesscomputerNatural language processingInterpretabilityProceedings of the 13th International Conference on Agents and Artificial Intelligence

researchProduct

SisHiTra : A Hybrid Machine Translation System from Spanish to Catalan

2004

In the current European scenario, characterized by the coexistence of communities writing and speaking a great variety of languages, machine translation has become a technology of capital importance. In areas of Spain and of other countries, coofficiality of several languages implies producing several versions of public information. Machine translation between all the languages of the Iberian Peninsula and from them into English will allow for a better integration of Iberian linguistic communities among them and inside Europe. The purpose of this paper is to show a machine translation system from Spanish to Catalan that deals with text input. In our approach, both deductive (linguistic) and…

Word-sense disambiguationMachine translationComputer sciencebusiness.industryAutomatic translationWord error rateHybrid machine translationcomputer.software_genreVariety (linguistics)language.human_languagelanguageCatalanArtificial intelligencebusinesscomputerNatural languageNatural language processing

researchProduct

Experiments in Non-Coherent Post-editing

2017

Market pressure on translation productivity joined with technological innovation is likely to fragment and decontextualise translation jobs even more than is cur-rently the case. Many different translators increasingly work on one document at different places, collaboratively working in the cloud. This paper investigates the effect of decontextualised source texts on behaviour by comparing post-editing of sequentially ordered sentences with shuffled sentences from two different texts. The findings suggest that there is little or no effect of the decontextualised source texts on behaviour.

Work (electrical)Fragment (logic)business.industryComputer scienceNon coherentCloud computingArtificial intelligencebusinesscomputer.software_genrecomputerProductivityNatural language processingProceedings of the Workshop on Human-Informed Translation and Interpreting Technology

researchProduct

Qualifying semantic graphs using model checking

2011

International audience; Semantic interoperability problems have found their solutions using languages and techniques from the Semantic Web. The proliferation of ontologies and meta-information has improved the understanding of information and the relevance of search engine responses. However, the construction of semantic graphs is a source of numerous errors of interpretation or modeling and scalability remains a major problem. The processing of large semantic graphs is a limit to the use of semantics in current information systems. The work presented in this paper is part of a new research at the border of two areas: the semantic web and the model checking. This line of research concerns t…

[ INFO.INFO-MO ] Computer Science [cs]/Modeling and Simulation[INFO.INFO-WB] Computer Science [cs]/WebComputer science[ INFO.INFO-WB ] Computer Science [cs]/Web0102 computer and information sciences02 engineering and technologycomputer.software_genre01 natural sciencesSocial Semantic Webtemporal logicSemantic similaritySemantic computing0202 electrical engineering electronic engineering information engineeringSemantic analyticsSemantic integrationSemantic Web StackInformation retrievalbusiness.industry[INFO.INFO-WB]Computer Science [cs]/WebSemantic search020207 software engineeringSemantic interoperability[INFO.INFO-MO]Computer Science [cs]/Modeling and SimulationModel-checking010201 computation theory & mathematicsSemantic graphTheoryofComputation_LOGICSANDMEANINGSOFPROGRAMS[INFO.INFO-MO] Computer Science [cs]/Modeling and SimulationArtificial intelligencebusinesscomputerNatural language processing2011 International Conference on Innovations in Information Technology

researchProduct

Le grand débat national, une aide pour prendre des décisions locales?

2021

The Great National Debate, decided by Emmanuel Macron at the beginning of 2019 to respond to the Yellow Vests social movement, allowed the collection of citizens’ contributions on the ecological transition via an online platform. In this article, we use the corpus constituted by these contributions to identify areas where participants are asking for the development of bicycle paths and railway facilities. For this purpose, we have created a classification model to identify contributions dealing with the theme of transportation and proposed a method for extracting patterns that reflect the contributors’ proposals. We then represented these patterns on maps, using the contributors’ postal cod…

[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI]ACM: I.: Computing Methodologies/I.2: ARTIFICIAL INTELLIGENCE/I.2.7: Natural Language Processing/I.2.7.0: DiscourseMotifs[SHS.GEO] Humanities and Social Sciences/GeographyGrand Débat NationalTransport[SHS.GEO]Humanities and Social Sciences/GeographyPatternsACM: I.: Computing Methodologies/I.2: ARTIFICIAL INTELLIGENCE/I.2.7: Natural Language Processing[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]

researchProduct