Search results for " retrieval."

showing 10 items of 1102 documents

<title>Combining multiple image descriptions for browsing and retrieval</title>

2000

Retrieving images form large collections using image content is an important problem, in this multimedia age. A quick content-based visual access to the stored image is capital for efficient navigation through image collections. In this paper we introduce several techniques which characterize color homogeneous object and their spatial relationships for efficient content-based image retrieval. We present a region growing technique for efficient color homogeneous objects segmentation and extend the 2D string to an accurate description of spatial information and relationships. In order to improve content-based image retrieval, our method emphasized several objectives, such as: automated extrac…

Information retrievalComputer sciencebusiness.industryComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONContent-based image retrievalAutomatic image annotationImage textureRegion growingHuman–computer information retrievalComputer visionSegmentationVisual WordArtificial intelligencebusinessImage retrievalFeature detection (computer vision)SPIE Proceedings
researchProduct

Content Code Blurring: A New Approach to Content Extraction

2008

Most HTML documents on the world wide web contain far more than the article or text which forms their main content. Navigation menus, functional and design elements or commercial banners are typical examples of additional contents. Content extraction is the process of identifying the main content and/or removing the additional contents. We introduce content code blurring, a novel content extraction algorithm. As the main text content is typically a long, homogeneously formatted region in a web document, the aim is to identify exactly these regions in an iterative process. Comparing its performance with existing content extraction solutions we show thatfor most documents content code blurrin…

Information retrievalComputer sciencebusiness.industryContent (measure theory)Content extractionProcess (computing)Code (cryptography)businessKnowledge acquisitionContent management2008 19th International Conference on Database and Expert Systems Applications
researchProduct

Semantic web service discovery system for road traffic information services

2015

Create a multi-agent platform for a traveller information system (FIPA standards).Extend Paulucci algorithm with the use of seven similarity measures.Weight the similarity measure according to semantic relation and parameter nature.Improved running-time with a filtering pre-process for non-functional parameters.Improved the recall by measuring the sibling relationship concepts. We describe a multi-agent platform for a traveller information system, allowing travellers to find the road traffic information web service (WSs) that best fits their requirements. After studying existing proposals for discovery of semantic WS, we implemented a hybrid matching algorithm, which is described in detail …

Information retrievalComputer sciencebusiness.industryGeneral EngineeringSemantic web servicesSimilarity measurecomputer.software_genreRoad traffic information systemsSocial Semantic WebComputer Science ApplicationsKnowledge discoverySemantic similarityKnowledge extractionArtificial IntelligenceInformation systemInformation retrievalSemantic integrationRelevance (information retrieval)Semantic Web StackData miningWeb servicebusinessMatchmakingcomputerSemantic matching
researchProduct

Semantic Portal for Legislative Information

2006

Semantic portals enabled by Semantic Web technologies have been suggested to provide a point of access to an integrated body of information about some domain. In the area of e-Government there are multiple possible domains for semantic portals, one of them being legislative work. In this paper we propose a semantic portal based on a rich metadata repository to support the retrieval of legislative information. The portal provides process oriented semantic browsing capabilities. A prototype of the portal has been implemented for the retrieval of Finnish legislative information.

Information retrievalComputer sciencebusiness.industryInformationSystems_INFORMATIONSTORAGEANDRETRIEVALComputingMilieux_LEGALASPECTSOFCOMPUTINGSocial Semantic WebWorld Wide WebSemantic gridSemantic computingSemantic analyticsSemantic technologySemantic integrationSemantic Web StackbusinessSemantic Web
researchProduct

Wordnet and semidiscrete decomposition for sub-symbolic representation of words

2009

A methodology for sub-symbolic semantic encoding of words is presented. The methodology uses the standard, semantically highly-structured WordNet lexical database and the SemiDiscrete matrix Decomposition to obtain a vector representation with low memory requirements in a semantic n-space. The application of the proposed algorithm over all the WordNet words would lead to a useful tool for the sub-symbolic processing of texts.

Information retrievalComputer sciencebusiness.industryWordNetDecomposition (computer science)Artificial intelligenceRepresentation (mathematics)computer.software_genrebusinessLexical databasecomputerNatural language processingMatrix decomposition
researchProduct

Publish By Example

2008

We propose an approach for producing database publishing programs by example. The main idea is to interactively build an example document, representative of the program output. The system infers from this document, without ambiguity, the publishing program. The end-user does not need to know a programming language, a query language or the database schema.

Information retrievalComputer sciencebusiness.industrycomputer.internet_protocolRelational databasemedia_common.quotation_subjectDatabase schemaInformationSystems_DATABASEMANAGEMENTAmbiguityQuery languageInformation engineeringNeed to knowbusinessPublicationcomputerXMLmedia_common2008 Eighth International Conference on Web Engineering
researchProduct

Towards semantic-based RSS merging

2009

Merging information can be of key importance in several XML-based applications. For instance, merging the RSS news from different sources and providers can be beneficial for end-users (journalists, economists, etc.) in various scenarios. In this work, we address this issue and mainly explore the relatedness relationships between RSS entities/ elements. To validate our approach, we also provide a set of experimental tests showing satisfactory results. © 2009 Springer-Verlag Berlin Heidelberg

Information retrievalComputer sciencecomputer.internet_protocolRSSINF/01 - INFORMATICAComputerApplications_COMPUTERSINOTHERSYSTEMScomputer.file_formatSet (abstract data type)Semantic similarityArtificial IntelligenceKey (cryptography)Document Object ModelcomputerXML
researchProduct

Combining content extraction heuristics

2008

The main text content of an HTML document on the WWW is typically surrounded by additional contents, such as navigation menus, advertisements, link lists or design elements. Content Extraction (CE) is the task to identify and extract the main content. Ongoing research has spawned several CE heuristics of different quality. However, so far only the Crunch framework combines several heuristics to improve its overall CE performance. Since Crunch, though, many new algorithms have been formulated. The CombinE system is designed to test, evaluate and optimise combinations of CE heuristics. Its aim is to develop CE systems which yield better and more reliable extracts of the main content of a web …

Information retrievalComputer sciencemedia_common.quotation_subjectDesign elements and principlescomputer.software_genreCrunchTask (project management)Content extractionQuality (business)Data miningHeuristicsWeb documentcomputermedia_commonProceedings of the 10th International Conference on Information Integration and Web-based Applications & Services
researchProduct

A Novel Approach to Improve the Accuracy of Web Retrieval

2010

General purpose search engines utilize a very simple view on text documents: They consider them as bags of words. It results that after indexing, the semantics of documents is lost. In this paper, we introduce a novel approach to improve the accuracy of Web retrieval. We utilize the WordNet and WordNet SenseRelate All Words Software as main tools to preserve the semantics of the sentences of documents and user queries. Nouns and verbs in the WordNet are organized in the tree hierarchies. The word meanings are presented by numbers that reference to the nodes on the semantic tree. The meaning of each word in the sentence is calculated when the sentence is analyzed. The goal is to put each nou…

Information retrievalConcept searchComputer sciencebusiness.industryInformationSystems_INFORMATIONSTORAGEANDRETRIEVALSearch engine indexingWord processingWordNetcomputer.software_genreSemanticsComputingMethodologies_ARTIFICIALINTELLIGENCETree (data structure)NounComputingMethodologies_DOCUMENTANDTEXTPROCESSINGArtificial intelligencebusinesscomputerNatural language processingSentence2010 5th International Conference on Future Information Technology
researchProduct

Extracting Semantic Knowledge from Unstructured Text Using Embedded Controlled Language

2016

Nowadays, most of the data on the Web is still in the form of unstructured text. Knowledge extraction from unstructured text is highly desirable but extremely challenging due to the inherent ambiguity of natural language. In this article, we present an architecture of an information extraction system based on the concept of Embedded Controlled Language that allows for extracting formal semantic knowledge from an unstructured text corpus. Moreover, the presented approach has a potential to support multilingual input and output.

Information retrievalConcept searchNoisy text analyticsbusiness.industryComputer scienceText simplification010401 analytical chemistryText graph02 engineering and technologycomputer.software_genre01 natural scienceslanguage.human_language0104 chemical sciencesInformation extractionControlled natural languageKnowledge extractionExplicit semantic analysis0202 electrical engineering electronic engineering information engineeringlanguage020201 artificial intelligence & image processingArtificial intelligencebusinesscomputerNatural language processing2016 IEEE Tenth International Conference on Semantic Computing (ICSC)
researchProduct