Search results for "data analysis"

Showing 10 of 231 documents

Criminal networks analysis in missing data scenarios through graph distances.

2021

Data collected in criminal investigations may suffer from: (i) incompleteness, due to the covert nature of criminal organisations; (ii) incorrectness, caused by either unintentional data collection errors or intentional deception by criminals; (iii) inconsistency, when the same information is collected into law enforcement databases multiple times, or in different formats. In this paper we analyse nine real criminal networks of different nature (i.e., Mafia networks, criminal street gangs and terrorist organizations) in order to quantify the impact of incomplete data and to determine which network type is most affected by it. The networks are first pruned following two specific methods: …

Data Analysis; FOS: Computer and information sciences; Computer and Information Sciences; Science; Intelligence; Social Sciences; Transportation; Criminology; Civil Engineering; Social Networking; Computer Science - Computers and Society; Law Enforcement; Sociology; Computers and Society (cs.CY); Psychology; Humans; Computer Networks; Social and Information Networks (cs.SI); Algorithms; Humans; Terrorism; Criminals; Data Analysis; Social Networking; Settore INF/01 - Informatica; Q; Cognitive Psychology; R; Biology and Life Sciences; Eigenvalues; Computer Science - Social and Information Networks; Criminals; Transportation Infrastructure; Police; Roads; Professions; Algebra; Linear Algebra; People and Places; Physical Sciences; Engineering and Technology; Cognitive Science; Medicine; Law and Legal Sciences; Population Groupings; Terrorism; Crime; Criminal Justice System; Mathematics; Network Analysis; Algorithms; Research Article; Neuroscience; PLoS ONE
researchProduct

Analyzing big datasets of genomic sequences: fast and scalable collection of k-mer statistics

2019

Background: Distributed approaches based on the MapReduce programming paradigm have started to be proposed in the bioinformatics domain, due to the large amount of data produced by next-generation sequencing techniques. However, the use of MapReduce and related Big Data technologies and frameworks (e.g., Apache Hadoop and Spark) does not necessarily produce satisfactory results, in terms of both efficiency and effectiveness. We discuss how the development of distributed and Big Data management technologies has affected the analysis of large datasets of biological sequences. Moreover, we show how the choice of different parameter configurations and the careful engineering of the …

Data Analysis; FOS: Computer and information sciences; Time Factors; Time Factor; Computer science; Statistics as Topic; Big data; Apache Spark; distributed computing; performance evaluation; k-mer counting; lcsh:Computer applications to medicine. Medical informatics; Biochemistry; Domain (software engineering); Databases; 03 medical and health sciences; 0302 clinical medicine; Structural Biology; Computer cluster; Statistics; Spark (mathematics); Molecular Biology; lcsh:QH301-705.5; 030304 developmental biology; 0303 health sciences; Genome; Settore INF/01 - Informatica; Base Sequence; Nucleic Acid; Apache Spark; business.industry; Research; Apache Spark; Distributed computing; k-mer counting; Performance evaluation; Algorithms; Base Sequence; Software; Time Factors; Data Analysis; Databases Nucleic Acid; Genome; Statistics as Topic; Applied Mathematics; k-mer counting; Distributed computing; Computer Science Applications; Algorithm; Data Analysi; Computer Science - Distributed Parallel and Cluster Computing; lcsh:Biology (General); 030220 oncology & carcinogenesis; Scalability; Performance evaluation; lcsh:R858-859.7; Algorithm design; Distributed Parallel and Cluster Computing (cs.DC); Databases Nucleic Acid; business; Algorithms; Software
researchProduct

MATLAB-based educational software for exploratory data analysis (EDA toolkit)

2009

This article presents educational software developed to help engineering students gain insight into data sets through exploratory data analysis (EDA). The software was developed using the MATLAB GUIDE tool. The article shows the program's suitability for teaching EDA in engineering courses related to data analysis, such as data mining or data processing courses. © 2009 Wiley Periodicals, Inc. Comput Appl Eng Educ 20: 313–320, 2012

Data processing; General Computer Science; Computer science; business.industry; General Engineering; computer.software_genre; Data science; Education; Exploratory data analysis; Software; MATLAB; Software engineering; business; computer; Educational software; computer.programming_language; Computer Applications in Engineering Education
researchProduct

Symbolic and conceptual representation of dynamic scenes: Interpreting situation calculus on conceptual spaces

2001

In (Chella et al. [1,2]) we proposed a framework for the representation of visual knowledge, with particular attention to the analysis and the representation of scenes with moving objects and people. One of our aims is a principled integration of the models developed within the artificial vision community with the propositional knowledge representation systems developed within symbolic AI. In the present note we show how the approach we adopted fits well with the representational choices underlying one of the most popular symbolic formalisms used in cognitive robotics, namely the situation calculus.

Descriptive knowledge; Knowledge representation and reasoning; Computer science; business.industry; Representation (systemics); Robotics; Conceptual space; Artificial intelligence; Situation calculus; business; Cognitive robotics; Symbolic data analysis
researchProduct

ideal: an R/Bioconductor package for interactive differential expression analysis

2020

Background: RNA sequencing (RNA-seq) is an increasingly popular tool for transcriptome profiling. A key point to make the best use of the available data is to provide software tools that are easy to use but still provide flexibility and transparency in the adopted methods. Despite the availability of many packages focused on detecting differential expression, a method to streamline this type of bioinformatics analysis in a comprehensive, accessible, and reproducible way is lacking. Results: We developed the ideal software package, which serves as a web application for interactive and reproducible RNA-seq analysis, while producing a wealth of visualizations to facilitate data interpr…

Differential expression analysis; Computer science; Shiny; Bioconductor; Interactive data analysis; lcsh:Computer applications to medicine. Medical informatics; Reproducible research; Bioconductor; Differential expression; Code (cryptography); Transcriptome profiling; Humans; RNA-Seq; Transcriptomics; lcsh:QH301-705.5; Flexibility (engineering); Ideal (set theory); Base Sequence; business.industry; Data visualization; Gene Expression Profiling; R; RNA; Reproducibility of Results; Transparency (human–computer interaction); Gene Expression Regulation; lcsh:Biology (General); Data Interpretation Statistical; Web application; lcsh:R858-859.7; Software engineering; business; Software; BMC Bioinformatics
researchProduct

Analytical properties of horizontal visibility graphs in the Feigenbaum scenario

2012

Time series are proficiently converted into graphs via the horizontal visibility (HV) algorithm, which prompts interest in its capability for capturing the nature of different classes of series in a network context. We have recently shown [1] that dynamical systems can be studied from a novel perspective via the use of this method. Specifically, the period-doubling and band-splitting attractor cascades that characterize unimodal maps transform into families of graphs that turn out to be independent of map nonlinearity or other particulars. Here we provide an in-depth description of the HV treatment of the Feigenbaum scenario, together with analytical derivations that relate to the degree di…

Dynamical systems theory; Matemáticas; General Physics and Astronomy; FOS: Physical sciences; Lyapunov exponent; Dynamical Systems (math.DS); Fixed point; 01 natural sciences; Aeronáutica; 010305 fluids & plasmas; symbols.namesake; Bifurcation theory; Oscillometry; 0103 physical sciences; Attractor; FOS: Mathematics; Entropy (information theory); Computer Simulation; Statistical physics; Mathematics - Dynamical Systems; 010306 general physics; Mathematical Physics; Mathematics; Series (mathematics); Degree (graph theory); Applied Mathematics; Statistical and Nonlinear Physics; 16. Peace & justice; Nonlinear Sciences - Chaotic Dynamics; Nonlinear Dynamics; Physics - Data Analysis Statistics and Probability; symbols; Chaotic Dynamics (nlin.CD); Algorithms; Data Analysis Statistics and Probability (physics.data-an)
researchProduct

Do firms share the same functional form of their growth rate distribution? A statistical test

2014

We introduce a new statistical test of the hypothesis that the firms in a balanced panel have the same growth rate distribution or, more generally, that they share the same functional form of growth rate distribution. We applied the test to data on publicly quoted manufacturing firms in the European Union and the US, considering functional forms belonging to the Subbotin family of distributions. While our hypotheses are rejected for the vast majority of sets at the sector level, we cannot reject them at the subsector level, indicating that homogeneous panels of firms could be described by a common functional form of growth rate distribution.

Economics and Econometrics; Control and Optimization; FOS: Physical sciences; Distribution (economics); Heterogeneous firm; EDF tests; FOS: Economics and business; Microeconomics; Growth rate distribution of individual firm; Economics; media_common.cataloged_instance; European union; Scaling; media_common; Statistical hypothesis testing; Settore SECS-S/06 - Metodi mat. dell'economia e Scienze Attuariali e Finanziarie; Statistical Finance (q-fin.ST); EDF test; business.industry; Applied Mathematics; Settore FIS/01 - Fisica Sperimentale; Quantitative Finance - Statistical Finance; Probability and statistics; Variance (accounting); Settore FIS/07 - Fisica Applicata(Beni Culturali Ambientali Biol.e Medicin); North American Industry Classification System; Heterogeneous firms; Physics - Data Analysis Statistics and Probability; Null hypothesis; business; Data Analysis Statistics and Probability (physics.data-an)
researchProduct

Factors Affecting Attrition among First Year Computer Science Students: the Case of University of Latvia

2015

The purpose of our study was to identify the reasons for the high dropout rate among students enrolled in the first year of the computer science study program, so that students who are potentially at risk can be identified. Several factors that were originally assumed to affect attrition were studied: high school grades (admission score), a compensatory course in high school mathematics, intermediate grades for core courses, and prior knowledge of programming. However, the results of our study indicate that none of the studied factors is a determinant for identifying those students who are going to abandon their studies with great precisio…

Engineering; Attrition rate; computer science education; data processing; data analysis; business.industry; Computer science; education; medicine.disease; Affect (psychology); Drop out; medicine; Mathematics education; ComputingMilieux_COMPUTERSANDEDUCATION; Attrition; business; Dropout (neural networks); Environment. Technology. Resources. Proceedings of the International Scientific and Practical Conference
researchProduct

On Detection of Yaw and Roll Angle Information for Vehicle Oblique Crash using Hough Transform

2014

When performing vehicle crash tests, it is common to capture high frame rate (HFR) video to observe the vehicle motion during the impact. Such videos contain a lot of information, especially geometric data. The yaw and roll angles are detected from the HFR video using the Hough transform and MATLAB's Image Processing Toolbox. The yaw angle measured from the HFR video is compared with real-life test data captured by a gyroscopic device inside the vehicle during the oblique vehicle impact.

Engineering; business.industry; ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION; Oblique case; Gyroscope; Crash; Image processing; law.invention; Hough transform; Euler angles; symbols.namesake; law; symbols; Computer vision; Artificial intelligence; business; MATLAB; computer; Geometric data analysis; computer.programming_language
researchProduct

THE “ZISA” IN PALERMO: BIOCLIMATIC ASPECTS OF ITS ARCHITECTURE

2010

This work examines, as an ante litteram example of a summer residence and of bioclimatic architecture, the ancient castle of the Zisa (from the Arabic 'al-aziz', meaning noble, shining, glorious), a significant work of Arab-Islamic character built at the beginning of the XII century (fig. 1). The building is of notable importance because it is one of the few examples of Norman civil architecture with elements of Arab-Islamic character and Byzantine culture, and also because it was conceived according to the most refined Islamic techniques of free cooling. The building (fig. 1) is a massive rectangular construction developed on three levels. On the architectural plan t…

Environmental condition; Settore ING-IND/11 - Fisica Tecnica Ambientale; Climatic data analysis; Bioclimatic Architecture; Historical heritage
researchProduct