Search results for "NoSQL"
showing 10 items of 22 documents
Reactome graph database: Efficient access to complex pathway data
2018
Reactome is a free, open-source, open-data, curated and peer-reviewed knowledgebase of biomolecular pathways. One of its main priorities is to provide easy and efficient access to its high quality curated data. At present, biological pathway databases typically store their contents in relational databases. This limits access efficiency because there are performance issues associated with queries traversing highly interconnected data. The same data in a graph database can be queried more efficiently. Here we present the rationale behind the adoption of a graph database (Neo4j) as well as the new ContentService (REST API) that provides access to these data. The Neo4j graph database and its qu…
An Integrative Framework for the Construction of Big Functional Networks
2018
We present a methodology for biological data integration, aiming at building and analysing large functional networks which model complex genotype-phenotype associations. A functional network is a graph where nodes represent cellular components (e.g., genes, proteins, mRNA, etc.) and edges represent associations among such molecules. Different types of components may cohesist in the same network, and associations may be related to physical[biochemical interactions or functional/phenotipic relationships. Due to both the large amount of involved information and the computational complexity typical of the problems in this domain, the proposed framework is based on big data technologies (Spark a…
Clusterpoint NoSQL vaicājumu valoda
2015
Maģistra darbā ir aprakstīta Clusterpoint datubāzes vaicājumu valoda un tās salīdzinājums ar citām datubāzes vaicājumu valodām. Maģistra darba gaitā tiek aprakstīts Clusterpoint vaicājuma valodas pielāgošana strukturētai vaicājumu valodai (SQL), kas atvieglotu un paātrinātu jaunu Clusterpoint datubāzes lietotāju un citu relāciju datubāžu lietotāju Clusterpoint vaicājumu valodas apguvi. Lai gan NoSQL popularitāte turpina augt, pirmā saskare ar datubāzēm bieži vien ir tieši relāciju datubāzes tādēļ, lai izmantotu jau iegūtās zināšanas strukturētās vaicājuma valodas konstrukcijā un izmantošanā Clusterpoint piedāvā savu strukturētas vaicājuma valodas paveidu, kas ir aprakstīts autora maģistra d…
Effectively and efficiently supporting crowd-enabled databases via NoSQL paradigms
2013
In this paper we provide an overview of the Hints From the Crowd (HFC) project, whose main goal is to build a NoSQL database system for large collections of product reviews; the database is queried by expressing a natural language sentence; the result is a list of products ranked based on the relevance of reviews w.r.t. the natural language sentence. The best ranked products in the result list can be seen as the best hints for the user based on crowd opinions (the reviews). The HFC prototype has been developed as a web application, independent of the particular application domain of the collected product reviews. Queries are performed by evaluating a text-based ranking metric for sets of re…
Migration of Relational Database to Document-Oriented Database: Structure Denormalization and Data Transformation
2015
Relational databases remain the leading data storage technology. Nevertheless, many companies want to reduce operating expenses, to make scalable applications that use cloud computing technologies. Use of NoSQL database is one of the possible solutions, and it is forecasted that the NoSQL market will be growing at a CAGR of approximately 50 percent over the next five years. The paper offers a solution for quick data migration from a relational database into a document-oriented database. We have created semi-automatically two logical levels over physical data. Users can refine generated logical data model and configure data migration template for each needed document. Data migration features…
PyCellBase, an efficient python package for easy retrieval of biological data from heterogeneous sources.
2019
Background Biological databases and repositories are incrementing in diversity and complexity over the years. This rapid expansion of current and new sources of biological knowledge raises serious problems of data accessibility and integration. To handle the growing necessity of unification, CellBase was created as an integrative solution. CellBase provides a centralized NoSQL database containing biological information from different and heterogeneous sources. Access to this information is done through a RESTful web service API, which provides an efficient interface to the data. Results In this work we present PyCellBase, a Python package that provides programmatic access to the rich RESTfu…
Intelligent Cloud Storage Management for Layered Tiers
2018
Today, the cloud offers a large array of possibilities for storage, with this flexibility comes also complexity. This complexity stems from the variety of storage mediums, such as, blob storage or NoSQL tables, and also from the different cost tiers within these systems. A strategic thinking to navigate this complex cloud storage landscape is important, not only for cost saving but also for prioritizing information, this prioritization has wider implications in other domains such as the Big Data realm, especially for governance and efficiency. In this paper we propose a strategy centered around probabilistic graphical model (PGM), this heuristic oriented management and organizational strate…
Enhanced query processing for NoSQL crowdsourcing systems
2014
In this paper, we provide a novel approach for effectively and efficiently support query processing tasks in novel NoSQL crowdsourcing systems. The idea of our method is to exploit the social knowledge available from reviews about products of any kind, freely provided by customers through specialized web sites. We thus define a NoSQL database system for large collections of product reviews, where queries can be expressed in terms of natural language sentences whose answers are modeled as lists of products ranked based on the relevance of reviews w.r.t. the natural language sentences. The best ranked products in the result list can be seen as the best hints for the user based on crowd opinio…
Hints from the Crowd: A Novel NoSQL Database
2013
The crowd can be an incredible source of information. In particular, this is true for reviews about products of any kind, freely provided by customers through specialized web sites. In other words, they are social knowledge, that can be exploited by other customers. The Hints From the Crowd HFC prototype, presented in this paper, is a NoSQL database system for large collections of product reviews; the database is queried by expressing a natural language sentence; the result is a list of products ranked based on the relevance of reviews w.r.t. the natural language sentence. The best ranked products in the result list can be seen as the best hints for the user based on crowd opinions the revi…
OSINT rīku izmantošana neaizsargāto datubāžu atklāšanai un pieejamas informācijas analīzei Baltijas valstīm dažādu datubāžu kontekstā
2021
OSINT rīki IT nozarē kļūst arvien populārāki zinātnieku, testētāju un krāpnieku vidū, piedāvājot detalizētu informāciju par visām iekārtām, kas ir pieslēgtas internetam. Izmantojot divus populārus OSINT rīkus Shodan un BinaryEdge, ir dažreiz iespējams izgūt informāciju par datubāzēm, pie kurām ir iespējams pieslēgties jebkuram interesentam. Darba mērķis ir, izmantojot pašrakstīto rīku, iegūt neaizsargāto informāciju, izpētīt to un noteikt mūsdienas drošības situāciju datubāžu kontekstā trīs Baltijas valstu kontekstā. Darbā tiek apskatīti OSINT rīki Shodan un BinaryEdge, dažāda tipa datu glabātuves un to drošības līdzekļi un izpētītas neaizsargātas Latvijas, Lietuvas un Igaunijas datubāzes.