Search results for "information retrieval"

showing 10 items of 924 documents

Watermarking for ontologies

2011

In this paper, we study watermarking methods to prove the ownership of an ontology. Different from existing approaches, we propose to watermark not by altering existing statements, but by removing them. Thereby, our approach does not introduce false statements into the ontology. We show how ownership of ontologies can be established with provably tight probability bounds, even if only parts of the ontology are being re-used. We finally demonstrate the viability of our approach on real-world ontologies.

021110 strategic defence & security studiesInformation retrievalOpen worldComputer scienceOntology-based data integrationProcess ontologyData_MISCELLANEOUS0211 other engineering and technologiesWatermark02 engineering and technologyOntology (information science)computer.software_genre020204 information systems0202 electrical engineering electronic engineering information engineeringData miningDigital watermarkingSecurity parametercomputerISWC 2011: Proceedings of the 10th International Semantic Web Conference
researchProduct

Automatic Integration of Spatial Data into the Semantic Web

2017

International audience

021110 strategic defence & security studies[ INFO ] Computer Science [cs]Information retrieval[INFO.INFO-WB] Computer Science [cs]/Webbusiness.industryComputer scienceOntology-based data integration05 social sciences[INFO.INFO-WB]Computer Science [cs]/Web[ INFO.INFO-WB ] Computer Science [cs]/Web0211 other engineering and technologies050801 communication & media studies02 engineering and technology[INFO] Computer Science [cs]Social Semantic Web0508 media and communicationsSemantic computingSemantic analyticsSemantic integration[INFO]Computer Science [cs]Semantic Web StackbusinessSemantic WebData WebComputingMilieux_MISCELLANEOUS
researchProduct

EvalMSA: A Program to Evaluate Multiple Sequence Alignments and Detect Outliers

2016

8 páginas, 3 figuras, 2 tablas.

0301 basic medicineBiologiaComputer sciencemedia_common.quotation_subjectlcsh:EvolutionBinary numberGappinesscomputer.software_genre03 medical and health scienceslcsh:QH359-425GeneticsQuality (business)Relevance (information retrieval)Ecology Evolution Behavior and SystematicsOriginal Researchgappinessoutlier sequencecomputer.programming_languagemedia_commonSequenceMultiple sequence alignmentOutlier sequenceData scienceComputer Science ApplicationsIdentification (information)030104 developmental biologyOutliermultiple sequence alignmentMultiple sequence alignmentData miningPerlcomputerProgrames d'ordinadorEvolutionary Bioinformatics
researchProduct

Reactome graph database: Efficient access to complex pathway data

2018

Reactome is a free, open-source, open-data, curated and peer-reviewed knowledgebase of biomolecular pathways. One of its main priorities is to provide easy and efficient access to its high quality curated data. At present, biological pathway databases typically store their contents in relational databases. This limits access efficiency because there are performance issues associated with queries traversing highly interconnected data. The same data in a graph database can be queried more efficiently. Here we present the rationale behind the adoption of a graph database (Neo4j) as well as the new ContentService (REST API) that provides access to these data. The Neo4j graph database and its qu…

0301 basic medicineDatabases FactualComputer scienceData managementKnowledge BasesSocial SciencesInformation Storage and RetrievalNoSQLcomputer.software_genreComputer ApplicationsDatabase and Informatics MethodsUser-Computer Interface0302 clinical medicineKnowledge extractionPsychologyDatabase Searchinglcsh:QH301-705.5Data ManagementLanguageBiological dataEcologySystems BiologyGenomicsGenomic DatabasesComputational Theory and MathematicsModeling and SimulationWeb-Based ApplicationsGraph (abstract data type)Information TechnologyResearch ArticleComputer and Information SciencesRelational databaseQuery languageResearch and Analysis MethodsEcosystems03 medical and health sciencesCellular and Molecular NeuroscienceDatabasesGeneticsComputer GraphicsHumansMolecular BiologyEcology Evolution Behavior and SystematicsInternetInformation retrievalGraph databasebusiness.industryEcology and Environmental SciencesCognitive PsychologyBiology and Life SciencesComputational BiologyGenome AnalysisRelational Databases030104 developmental biologyBiological Databaseslcsh:Biology (General)Cognitive Sciencebusinesscomputer030217 neurology & neurosurgerySoftwareNeurosciencePLoS Computational Biology
researchProduct

Designing food packaging for the Spanish market: Do motivations differ between involved and non-involved adolescents?

2018

Abstract This paper investigates the relationships among food choice motivations and the relevance of packaging elements (visual and informative elements) in the adolescent market. In addition, these relationships are re-tested in two different frameworks: high-involved consumers and low-involved consumers. 590 young consumers between 13 and 17 years were interviewed at the door of their public or private schools. Structural Modelling was used to test our hypotheses. The first analysis was done considering the global sample. The second one split off the sample into two groups: 351 high-involved adolescents and 239 low-involved adolescents. Our results showed, on one side, that weight contro…

0301 basic medicineHealth Knowledge Attitudes PracticeAdolescentAttitude of Health PersonnelDecision MakingSample (statistics)Product LabelingAffect (psychology)Choice BehaviorFood Preferences03 medical and health sciencesSurveys and Questionnaires0502 economics and businessFood choicemedicineHumansRelevance (information retrieval)MarketingMarketingMotivationSchools030109 nutrition & dietetics05 social sciencesCommerceFood PackagingHispanic or LatinoWeight controlConsumer BehaviorTest (assessment)Food packaging050211 marketingCuesmedicine.symptomPsychologyFood ScienceDietingFood Research International
researchProduct

Coupling News Sentiment with Web Browsing Data Improves Prediction of Intra-Day Price Dynamics

2015

The new digital revolution of big data is deeply changing our capability of understanding society and forecasting the outcome of many social and economic systems. Unfortunately, information can be very heterogeneous in the importance, relevance, and surprise it conveys, affecting severely the predictive power of semantic and statistical methods. Here we show that the aggregation of web users' behavior can be elicited to overcome this problem in a hard to predict complex system, namely the financial market. Specifically, our in-sample analysis shows that the combined use of sentiment analysis of news and browsing activity of users of Yahoo! Finance greatly helps forecasting intra-day and dai…

0301 basic medicineINFORMATIONEconomicsComputer scienceBig datalcsh:MedicineSocial SciencesQuantitative Finance - Computational Financesocial and economic systemsMathematical and Statistical TechniquesSociologybig dataEconometrics050207 economicsComputer NetworksCapital Marketslcsh:ScienceFinancial Marketsmedia_common050208 financeMultidisciplinary05 social sciencesCommerceSocial CommunicationSettore FIS/02 - Fisica Teorica Modelli e Metodi MatematiciSurpriseModels EconomicSocial NetworksPhysical SciencesSocial SystemsEngineering and TechnologyComputational sociologyBEHAVIORStatistics (Mathematics)Network AnalysisResearch ArticleComputer and Information SciencesExploitmedia_common.quotation_subjectTwitterComputational Finance (q-fin.CP)Research and Analysis MethodsFOS: Economics and business03 medical and health sciencesSEARCH0502 economics and businessHumansRelevance (information retrieval)Web navigationInvestmentsStatistical MethodsInternetStatistical Finance (q-fin.ST)STOCK-MARKETbusiness.industrylcsh:RSentiment analysisFinancial marketATTENTIONQuantitative Finance - Statistical FinanceCommunicationsNoise ReductionFinancial Firms030104 developmental biologySignal ProcessingPredictive powerlcsh:QStock marketbusinessSocial MediaFinanceMathematicsForecastingPLOS ONE
researchProduct

RepeatsDB 2.0: improved annotation, classification, search and visualization of repeat protein structures

2017

RepeatsDB 2.0 (URL: http://repeatsdb.bio.unipd.it/) is an update of the database of annotated tandem repeat protein structures. Repeat proteins are a widespread class of non-globular proteins carrying heterogeneous functions involved in several diseases. Here we provide a new version of RepeatsDB with an improved classification schema including high quality annotations for ∼5400 protein structures. RepeatsDB 2.0 features information on start and end positions for the repeat regions and units for all entries. The extensive growth of repeat unit characterization was possible by applying the novel ReUPred annotation method over the entire Protein Data Bank, with data quality is guaranteed by a…

0301 basic medicineRepetitive Sequences Amino Acid[SDV.BC]Life Sciences [q-bio]/Cellular BiologyBiologyBioinformaticsSearch engineAnnotationStructure-Activity Relationship03 medical and health sciences0302 clinical medicineTandem repeatGeneticsAnimalsHumansDatabase IssueDatabases ProteinComputingMilieux_MISCELLANEOUSRepeat unit030304 developmental biology0303 health sciencesInformation retrievalProteinscomputer.file_formatProtein Data BankVisualizationSchema (genetic algorithms)030104 developmental biologyData qualityCorrigendumcomputerSoftware030217 neurology & neurosurgeryNucleic Acids Research
researchProduct

Variable Ranking Feature Selection for the Identification of Nucleosome Related Sequences

2018

Several recent works have shown that K-mer sequence representation of a DNA sequence can be used for classification or identification of nucleosome positioning related sequences. This representation can be computationally expensive when k grows, making the complexity in spaces of exponential dimension. This issue effects significantly the classification task computed by a general machine learning algorithm used for the purpose of sequence classification. In this paper, we investigate the advantage offered by the so-called Variable Ranking Feature Selection method to select the most informative k − mers associated to a set of DNA sequences, for the final purpose of nucleosome/linker classifi…

0301 basic medicineSequenceSettore INF/01 - InformaticaEpigenomic030102 biochemistry & molecular biologybusiness.industryComputer scienceDeep learningPattern recognitionFeature selectionDNA sequencesNucleosomesRanking (information retrieval)Set (abstract data type)03 medical and health sciencesVariable (computer science)030104 developmental biologyDimension (vector space)Feature selectionDeep learning modelsArtificial intelligenceDeep learning models Feature selection DNA sequences Epigenomic NucleosomesRepresentation (mathematics)business
researchProduct

MetaCache: context-aware classification of metagenomic reads using minhashing.

2017

Abstract Motivation Metagenomic shotgun sequencing studies are becoming increasingly popular with prominent examples including the sequencing of human microbiomes and diverse environments. A fundamental computational problem in this context is read classification, i.e. the assignment of each read to a taxonomic label. Due to the large number of reads produced by modern high-throughput sequencing technologies and the rapidly increasing number of available reference genomes corresponding software tools suffer from either long runtimes, large memory requirements or low accuracy. Results We introduce MetaCache—a novel software for read classification using the big data technique minhashing. Our…

0301 basic medicineStatistics and ProbabilityComputer scienceSequence analysisContext (language use)BiochemistryGenome03 medical and health scienceschemistry.chemical_compound0302 clinical medicineRefSeqHumansMolecular BiologyInformation retrievalShotgun sequencingHigh-Throughput Nucleotide SequencingSequence Analysis DNAComputer Science ApplicationsComputational Mathematics030104 developmental biologyComputational Theory and MathematicschemistryMetagenomicsMetagenomics030217 neurology & neurosurgeryDNAAlgorithmsSoftwareReference genomeBioinformatics (Oxford, England)
researchProduct

Towards identifying drug side effects from social media using active learning and crowd sourcing.

2019

Motivation Social media is a largely untapped source of information on side effects of drugs. Twitter in particular is widely used to report on everyday events and personal ailments. However, labeling this noisy data is a difficult problem because labeled training data is sparse and automatic labeling is error-prone. Crowd sourcing can help in such a scenario to obtain more reliable labels, but is expensive in comparison because workers have to be paid. To remedy this, semi-supervised active learning may reduce the number of labeled data needed and focus the manual labeling process on important information. Results We extracted data from Twitter using the public API. We subsequently use Ama…

0303 health sciencesFocus (computing)Information retrievalDrug-Related Side Effects and Adverse ReactionsProcess (engineering)business.industryActive learning (machine learning)Computer scienceComputational BiologyCrowdsourcing03 medical and health sciences0302 clinical medicineProblem-based learningCode (cryptography)CrowdsourcingHumansSocial media030212 general & internal medicinebusinessBaseline (configuration management)Social Media030304 developmental biologyPacific Symposium on Biocomputing. Pacific Symposium on Biocomputing
researchProduct