Search results for "Big data"

showing 10 items of 311 documents

Methods to Use Big Wearable Heart Rate Data for Estimation of Physical Activity in Population Level

2015

Technologies for wearable health monitoring are becoming increasingly popular and affordable. As a result, large-scale health databases from a large number of individuals are becoming available. However, analysis of these databases requires special methodology to transform available parameters into more generic ones and to manage such non-balanced data characteristics as biases and sampling issues. In this paper, we introduce a methodology for studying physical activity from big wearable heart rate (HR) data on about 5 000 working-age individuals, each measured only for a few days. Physical activity was assessed by oxygen consumption (VO2) calculated from measured HR data using a neural net…

Estimation; Sports medicine; Computer science; Data management; Big data; Linear model; Sampling (statistics); Wearable computer; Data science; Quality (business)

Statistically validated mobile communication networks: the evolution of motifs in European and Chinese data

2014

Big data open up unprecedented opportunities to investigate complex systems, including society. In particular, communication data serve as major sources for the computational social sciences, but they have to be cleaned and filtered, as they may contain spurious information due to recording errors as well as interactions, such as commercial and marketing activities, that are not directly related to the social network. The network constructed from communication data can only be considered a proxy for the network of social relationships. Here we apply a systematic method, based on multiple hypothesis testing, to statistically validate the links and then construct the corresponding Bonferroni network, gen…

FOS: Computer and information sciences; Physics - Physics and Society; Big data; FOS: Physical sciences; General Physics and Astronomy; Physics and Society (physics.soc-ph); 01 natural sciences; 010305 fluids & plasmas; 0103 physical sciences; 010306 general physics; Proxy (statistics); Social and Information Networks (cs.SI); Physics; Social network; Computer Science - Social and Information Networks; Complex network; complex networks; social systems; statistically validated networks; mobile call records; 3-motifs; Settore FIS/07 - Fisica Applicata (Beni Culturali, Ambientali, Biol. e Medicin); Bonferroni correction; Mobile phone; Mobile telephony; Data mining; Raw data

Alignment-free Genomic Analysis via a Big Data Spark Platform

2021

Motivation: Alignment-free distance and similarity functions (AF functions, for short) are a well-established alternative to pairwise and multiple sequence alignments for many genomic, metagenomic and epigenomic tasks. Due to data-intensive applications, the computation of AF functions is a Big Data problem, with the recent literature indicating that the development of fast and scalable algorithms computing AF functions is a high-priority task. Somewhat surprisingly, despite the increasing popularity of Big Data technologies in computational biology, the development of a Big Data platform for those tasks has not been pursued, possibly due to its complexity. Results: We fill this impo…

FOS: Computer and information sciences; Statistics and Probability; sequence analysis; Computer science; 0206 medical engineering; Big data; 02 engineering and technology; Machine learning; Biochemistry; 03 medical and health sciences; Spark (mathematics); MapReduce; Molecular Biology; 030304 developmental biology; 0303 health sciences; Settore INF/01 - Informatica; Bioinformatics; High Performance Computing; Compressed Data Structures; hadoop; Computer Science Applications; Computational Mathematics; Task (computing); Computer Science - Distributed Parallel and Cluster Computing; Computational Theory and Mathematics; Distributed Parallel and Cluster Computing (cs.DC); Artificial intelligence; 020602 bioinformatics

Semantic HMC for Big Data Analysis

2014

Analyzing Big Data can help corporations to improve their efficiency. In this work we present a new vision to derive Value from Big Data using a Semantic Hierarchical Multi-label Classification called Semantic HMC, based on a non-supervised Ontology learning process. We also propose a Semantic HMC process, using scalable Machine-Learning techniques and Rule-based reasoning.

FOS: Computer and information sciences; [INFO.INFO-TT] Computer Science [cs]/Document and Text Processing; multi-classify; Computer science; Computer Science - Artificial Intelligence; Big data; [INFO.INFO-WB] Computer Science [cs]/Web; semantic technologies; 02 engineering and technology; Ontology (information science); Semantic data model; [INFO.INFO-DC] Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC]; Semantic similarity; 020204 information systems; Semantic computing; 0202 electrical engineering electronic engineering information engineering; ontology; Information retrieval; Ontology learning; Ontology-based data integration; Big-Data; Artificial Intelligence (cs.AI); machine learning; Semantic technology; classification; 020201 artificial intelligence & image processing

Intelligent Cloud Storage Management for Layered Tiers

2018

Today, the cloud offers a large array of possibilities for storage, but with this flexibility comes complexity. This complexity stems from the variety of storage media, such as blob storage or NoSQL tables, and from the different cost tiers within these systems. Strategic thinking is important for navigating this complex cloud storage landscape, not only for cost saving but also for prioritizing information; this prioritization has wider implications in other domains such as the Big Data realm, especially for governance and efficiency. In this paper we propose a strategy centered around a probabilistic graphical model (PGM); this heuristic-oriented management and organizational strate…

Flexibility (engineering); 0209 industrial biotechnology; Computer science; Heuristic; Distributed computing; Big data; Probabilistic logic; Binary large object; Cloud computing; 02 engineering and technology; NoSQL; 020901 industrial engineering & automation; 0202 electrical engineering electronic engineering information engineering; 020201 artificial intelligence & image processing; Cloud storage

Automatic ontology-based user profile learning from heterogeneous web resources in a big data context

2013

The Web has developed into the biggest source of information and entertainment in the world. Through its size, adaptability and flexibility, it has challenged our current paradigms of information sharing in several areas. By offering everybody the opportunity to release their own content quickly and cheaply, the Web has already revolutionized the traditional publishing world, and it is now beginning to change the perspective on advertisements. With the possibility to adapt the contents displayed on a page dynamically based on the viewer's context, campaigns launched to target rough customer groups will become a thing of the past. However, this new ecosystem, that relates advertisements wit…

Flexibility (engineering); User profile; Digital marketing; Computer science; Information sharing; Big data; General Engineering; Context (language use); Ontology (information science); Ontology engineering; World Wide Web; Ontology; Web resource; Proceedings of the VLDB Endowment

Learning Environments in the 21st Century: A Mapping of the Literature

2020

Education has been transformed by significant breakthroughs in AI, mobile internet, cloud computing and Big Data technologies. More personalized educational settings are being developed by increasingly integrating contemporary learning environments with new technologies. However, few examples of executed AI-enabled learning interventions have been identified. Therefore, a mapping of the literature on AI-enabled learning systems was carried out. A total of 121 studies published in the last five years were analyzed. This paper discusses what AI-enabled contemporary learning environments are mainly designed to achieve. The major contribution of the study is bringing awareness to researchers and s…

Future studies; Computer science; Mobile internet; Emerging technologies; Big data; Cloud computing; Data science; GeneralLiterature_MISCELLANEOUS

Data Analytics in Healthcare: A Tertiary Study

2022

The field of healthcare has seen a rapid increase in the applications of data analytics during the last decades. By utilizing different data analytic solutions, healthcare areas such as medical image analysis, disease recognition, outbreak monitoring, and clinical decision support have been automated to various degrees. Consequently, the intersection of healthcare and data analytics has received scientific attention to the point of numerous secondary studies. We analyze studies on healthcare data analytics, and provide a wide overview of the subject. This is a tertiary study, i.e., a systematic review of systematic reviews. We identified 45 systematic secondary studies on data analy…

General Computer Science; Computer Networks and Communications; terveydenhuolto (healthcare); data-analytiikka (data analytics); tekoäly (artificial intelligence); koneoppiminen (machine learning); tiedonlouhinta (data mining); data; big data; Computer Graphics and Computer-Aided Design; Computer Science Applications; Computational Theory and Mathematics; SN Computer Science

Exploring social media network landscape of post-Soviet space

2019

The “post-Soviet space” consists of countries with a substantial fraction of the world’s population; however, unlike many other regions, its social media network landscape is still somewhat under-explored. This paper aims at filling this gap. To this purpose, we use anonymized data on user friendships at VK.com (also known as VKontakte and, informally, as “Russian Facebook”), which is the largest and most popular social media portal in the post-Soviet space with hundreds of millions of user accounts. Using the VK network snapshots from October 2015 to December 2016, we conduct a “multiscale” empirical study of this network by considering conn…

General Computer Science; Population; sosiaalinen media (social media); Context (language use); 010501 environmental sciences; Space (commercial competition); 01 natural sciences; Empirical research; big data; 0502 economics and business; Social network services; General Materials Science; Social media; Economic geography; sovellukset (tietotekniikka) (applications, information technology); 0105 earth and related environmental sciences; ta113; verkostot (networks); Modularity (networks); big data applications; Data collection; 05 social sciences; General Engineering; network theory (graphs); Scale (social sciences); lcsh:Electrical engineering. Electronics. Nuclear engineering; lcsh:TK1-9971; 050203 business & management

The composition of data economy : a bibliometric approach and TCCM framework of conceptual, intellectual and social structure

2022

Purpose: The data economy mainly relies on the surveillance capitalism business model, enabling companies to monetize their data. The surveillance allows for transforming private human experiences into behavioral data that can be harnessed in the marketing sphere. This study aims to focus on investigating the domain of data economy with the methodological lens of quantitative bibliometric analysis of published literature. Design/methodology/approach: The bibliometric analysis seeks to unravel trends and timelines for the emergence of the data economy, its conceptualization, scientific progression and thematic synergy that could predict the future of the field. A total of 591 data between 200…

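General Computer Science; big data; kaupallistaminen (commercialization); Library and Information Sciences; datatalous (data economy); yksilönsuoja (protection of the individual); avoin tieto (open data); digitaalinen markkinointi (digital marketing); yritykset (companies); käsiteanalyysi (concept analysis); bibliometriikka (bibliometrics)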