Search results for "RETRIEVAL"

showing 10 items of 1176 documents

A Novel Approach to Improve the Accuracy of Web Retrieval

2010

General purpose search engines utilize a very simple view on text documents: They consider them as bags of words. It results that after indexing, the semantics of documents is lost. In this paper, we introduce a novel approach to improve the accuracy of Web retrieval. We utilize the WordNet and WordNet SenseRelate All Words Software as main tools to preserve the semantics of the sentences of documents and user queries. Nouns and verbs in the WordNet are organized in the tree hierarchies. The word meanings are presented by numbers that reference to the nodes on the semantic tree. The meaning of each word in the sentence is calculated when the sentence is analyzed. The goal is to put each nou…

Information retrievalConcept searchComputer sciencebusiness.industryInformationSystems_INFORMATIONSTORAGEANDRETRIEVALSearch engine indexingWord processingWordNetcomputer.software_genreSemanticsComputingMethodologies_ARTIFICIALINTELLIGENCETree (data structure)NounComputingMethodologies_DOCUMENTANDTEXTPROCESSINGArtificial intelligencebusinesscomputerNatural language processingSentence2010 5th International Conference on Future Information Technology
researchProduct

Extracting Semantic Knowledge from Unstructured Text Using Embedded Controlled Language

2016

Nowadays, most of the data on the Web is still in the form of unstructured text. Knowledge extraction from unstructured text is highly desirable but extremely challenging due to the inherent ambiguity of natural language. In this article, we present an architecture of an information extraction system based on the concept of Embedded Controlled Language that allows for extracting formal semantic knowledge from an unstructured text corpus. Moreover, the presented approach has a potential to support multilingual input and output.

Information retrievalConcept searchNoisy text analyticsbusiness.industryComputer scienceText simplification010401 analytical chemistryText graph02 engineering and technologycomputer.software_genre01 natural scienceslanguage.human_language0104 chemical sciencesInformation extractionControlled natural languageKnowledge extractionExplicit semantic analysis0202 electrical engineering electronic engineering information engineeringlanguage020201 artificial intelligence & image processingArtificial intelligencebusinesscomputerNatural language processing2016 IEEE Tenth International Conference on Semantic Computing (ICSC)
researchProduct

Semantic retrieval: an approach to representing, searching and summarising text documents

2011

Nowadays, the internet is the major source of information for millions of people. There are many search tools available on the net but finding appropriate text information is still difficult. The retrieval efficiency of the presently used systems cannot be significantly improved: ‘bag of words’ interpretation causes losing semantics of texts. We applied the functional approach to represent English text documents. It allows taking into account semantic relations between words when indexing documents and use ordinary English sentences as queries to a search engine. The proposed retrieval mechanisms return only highly relevant documents. They make it possible to generate content-aware summarie…

Information retrievalConcept searchbusiness.industryComputer scienceSearch engine indexingSemantic searchFunctional approachWord searchSemanticscomputer.software_genreBag-of-words modelVisual WordArtificial intelligencebusinesscomputerNatural language processingInternational Journal of Information Technology, Communications and Convergence
researchProduct

Automatic building of a visual interface for content-based multiresolution retrieval of paleontology images

2001

In this article we present research work in the field of content-based image retrieval in large databases applied to the paleontology image database of the Universite´ de Bourgogne, Dijon, France, called ‘‘TRANS’TYFIPAL.’’ Our indexing method is based on multiresolution decomposition of database images using wavelets. For each family of paleontology images we try to find a model image that represents it. The K-means automatic classification algorithm divides the space of parameters into several clusters. A model image for each cluster is computed from the wavelet transform of each image of the cluster. Then a search tree is built to offer users a graphic interface for retrieving images. So …

Information retrievalContextual image classificationComputer sciencebusiness.industrySearch engine indexingComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION020206 networking & telecommunicationsImage processing02 engineering and technologyContent-based image retrievalAtomic and Molecular Physics and OpticsSearch treeComputer Science ApplicationsPaleontologyAutomatic image annotation[INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV]0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingComputer visionVisual WordArtificial intelligenceElectrical and Electronic EngineeringbusinessImage retrievalComputingMilieux_MISCELLANEOUS
researchProduct

Heuristic Method to Improve Systematic Collection of Terminology

2016

In this paper, we propose an experimental tool for analysis and graphical representation of glossaries. The original heuristic algorithms and analysis methods incorporated into the tool appeared to be useful to improve the quality of the glossaries. The tool was used for analysis of ISTQB Standard Glossary of Terms Used in Software Testing. There are instances of problems found in ISTQB glossary related to its consistency, completeness, and correctness described in the paper.

Information retrievalCorrectnessGlossaryComputer scienceHeuristicConcept mapcomputer.software_genreTerminologyConsistency (database systems)Completeness (order theory)Data miningRepresentation (mathematics)GeneralLiterature_REFERENCE(e.g.dictionariesencyclopediasglossaries)computer
researchProduct

Combining OWL ontologies usingE-Connections

2006

The standardization of the Web Ontology Language (OWL) leaves (at least) two crucial issues for Web-based ontologies unsatisfactorily resolved, namely how to represent and reason with multiple distinct, but linked ontologies, and how to enable effective knowledge reuse and sharing on the Semantic Web. In this paper, we present a solution for these fundamental problems based on E-Connections. We aim to use E-Connections to provide modelers with suitable means for developing Web ontologies in a modular way and to provide an alternative to the owl:imports construct. With such motivation, we present in this paper a syntactic and semantic extension of the Web Ontology language that covers E-Conn…

Information retrievalDatabaseComputer Networks and Communicationsbusiness.industrySemantic Web Rule Languagecomputer.internet_protocolComputer scienceWeb Ontology LanguageOntology (information science)computer.software_genreSocial Semantic WebOWL-SHuman-Computer InteractionUpper ontologySemantic Web StackbusinesscomputerSemantic WebSoftwarecomputer.programming_languageJournal of Web Semantics
researchProduct

Contextual Metadata for Document Databases

2005

Metadata has always been an important means to support accessibility of information in document collections. Metadata can be, for example, bibliographic data manually created for each document at the time of document storage. The indexes created by Web search engines serve as metadata about the content of Web documents. In the semantic Web solutions, ontologies are used to store semantic metadata (Berners-Lee et al., 2001). Attaching a common ontology to a set of heterogeneous document databases may be used to support data integration. Creation of the common ontology requires profound understanding of the concepts used in the databases. It is a demanding task, especially in cases where the …

Information retrievalDatabaseComputer scienceDocument type declarationWell-formed documentcomputer.software_genreMetadata repositoryWorld Wide WebMetadataDocument Schema Definition LanguagesSynonym ringGeospatial metadatacomputerDatabase catalog
researchProduct

From Databases to Ontologies

2009

This chapter introduces the UML profile for OWL as an essential instrument for bridging the gap between the legacy relational databases and OWL ontologies. We address one of the long-standing relational database design problems where initial conceptual model (a semantically clear domain conceptualization ontology) gets “lost” during conversion into the normalized database schema. The problem is that such “loss” makes database inaccessible for direct query by domain experts familiar with the conceptual model only. This problem can be avoided by exporting the database into RDF according to the original conceptual model (OWL ontology) and formulating semantically clear queries in SPARQL over t…

Information retrievalDatabaseComputer scienceModeling languageInformationSystems_DATABASEMANAGEMENTWeb Ontology Languagecomputer.software_genreDatabase designClosed-world assumptionOntology componentsIDEFIDEF5Open-world assumptioncomputercomputer.programming_language
researchProduct

Self-service Ad-hoc Querying Using Controlled Natural Language

2016

The ad-hoc querying process is slow and error prone due to inability of business experts of accessing data directly without involving IT experts. The problem lies in complexity of means used to query data. We propose a new natural language- and semistar ontology-based ad-hoc querying approach which lowers the steep learning curve required to be able to query data. The proposed approach would significantly shorten the time needed to master the ad-hoc querying and to gain the direct access to data by business experts, thus facilitating the decision making process in enterprises, government institutions and other organizations.

Information retrievalDatabaseProcess (engineering)Computer science05 social sciences02 engineering and technologyOntology (information science)computer.software_genrelanguage.human_languageHierarchical database modelData accessControlled natural languageLearning curve020204 information systems0202 electrical engineering electronic engineering information engineeringlanguage0501 psychology and cognitive sciencesDecision-makingcomputer050107 human factorsNatural language
researchProduct

Unifying Access to Heterogeneous Document Databases through Contextual Metadata

2011

Document databases available on the Internet carry massive information resources. To a person needing a piece of information on a specific domain, finding the piece, however, is often quite problematic even though there were a representative collection of databases available on the domain. The languages used in the content, the names of document types, their structures, the ways documents are organized and their retrieval techniques often vary in the databases. The databases containing legal information on the Internet offer a typical example. For finding relevant documents and for being able to interpret the content of the documents correctly, the user may need information about the contex…

Information retrievalDatabasebusiness.industrycomputer.internet_protocolComputer scienceRelational databaseContext (language use)Document management systemcomputer.software_genreDomain (software engineering)MetadataData model (ArcGIS)The InternetbusinesscomputerXML
researchProduct