Search results for "Informatics"

showing 10 items of 2542 documents

Tandem repeats lead to sequence assembly errors and impose multi-level challenges for genome and protein databases

2019

AbstractThe widespread occurrence of repetitive stretches of DNA in genomes of organisms across the tree of life imposes fundamental challenges for sequencing, genome assembly, and automated annotation of genes and proteins. This multi-level problem can lead to errors in genome and protein databases that are often not recognized or acknowledged. As a consequence, end users working with sequences with repetitive regions are faced with ‘ready-to-use’ deposited data whose trustworthiness is difficult to determine, let alone to quantify. Here, we provide a review of the problems associated with tandem repeat sequences that originate from different stages during the sequencing-assembly-annotatio…

FOS: Computer and information sciencesBioinformatics[SDV]Life Sciences [q-bio]Sequence assemblyGenomics[SDV.BC]Life Sciences [q-bio]/Cellular BiologyComputational biologyBiologyGenome03 medical and health sciencesAnnotation0302 clinical medicineTandem repeatGeneticsAnimalsSurvey and SummaryDatabases ProteinGeneComputingMilieux_MISCELLANEOUS030304 developmental biology0303 health sciencesEnd user572: BiochemieDNASequence Analysis DNAGenomics[SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM]WorkflowComputingMethodologies_PATTERNRECOGNITIONGadus morhuaTandem Repeat SequencesScientific Experimental Error[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM]Databases Nucleic Acid030217 neurology & neurosurgery
researchProduct

Extending the Tsetlin Machine With Integer-Weighted Clauses for Increased Interpretability

2020

Despite significant effort, building models that are both interpretable and accurate is an unresolved challenge for many pattern recognition problems. In general, rule-based and linear models lack accuracy, while deep learning interpretability is based on rough approximations of the underlying inference. Using a linear combination of conjunctive clauses in propositional logic, Tsetlin Machines (TMs) have shown competitive performance on diverse benchmarks. However, to do so, many clauses are needed, which impacts interpretability. Here, we address the accuracy-interpretability challenge in machine learning by equipping the TM clauses with integer weights. The resulting Integer Weighted TM (…

FOS: Computer and information sciencesBoosting (machine learning)Theoretical computer scienceinteger-weighted Tsetlin machineGeneral Computer ScienceComputer scienceComputer Science - Artificial Intelligence0206 medical engineeringNatural language understandingInference02 engineering and technologycomputer.software_genre0202 electrical engineering electronic engineering information engineeringGeneral Materials ScienceTsetlin machineVDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550InterpretabilityArtificial neural networkLearning automatabusiness.industryDeep learningGeneral Engineeringinterpretable machine learningrule-based learninginterpretable AIPropositional calculusSupport vector machineArtificial Intelligence (cs.AI)TheoryofComputation_MATHEMATICALLOGICANDFORMALLANGUAGESXAIPattern recognition (psychology)020201 artificial intelligence & image processinglcsh:Electrical engineering. Electronics. Nuclear engineeringArtificial intelligencebusinesslcsh:TK1-9971computer020602 bioinformaticsInteger (computer science)
researchProduct

Using the Tsetlin Machine to Learn Human-Interpretable Rules for High-Accuracy Text Categorization With Medical Applications

2019

Medical applications challenge today's text categorization techniques by demanding both high accuracy and ease-of-interpretation. Although deep learning has provided a leap ahead in accuracy, this leap comes at the sacrifice of interpretability. To address this accuracy-interpretability challenge, we here introduce, for the first time, a text categorization approach that leverages the recently introduced Tsetlin Machine. In all brevity, we represent the terms of a text as propositional variables. From these, we capture categories using simple propositional formulae, such as: if "rash" and "reaction" and "penicillin" then Allergy. The Tsetlin Machine learns these formulae from a labelled tex…

FOS: Computer and information sciencesComputer Science - Machine LearningGeneral Computer ScienceComputer sciencetext categorizationNatural language understandingDecision treeMachine Learning (stat.ML)02 engineering and technologyVDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550::Annen informasjonsteknologi: 559Machine learningcomputer.software_genresupervised learningMachine Learning (cs.LG)Naive Bayes classifierText miningStatistics - Machine Learning0202 electrical engineering electronic engineering information engineeringGeneral Materials ScienceTsetlin machinehealth informaticsInterpretabilityPropositional variableClassification algorithmsArtificial neural networkbusiness.industryDeep learning020208 electrical & electronic engineeringGeneral EngineeringRandom forestSupport vector machinemachine learningCategorization020201 artificial intelligence & image processingArtificial intelligencelcsh:Electrical engineering. Electronics. Nuclear engineeringbusinessPrecision and recallcomputerlcsh:TK1-9971
researchProduct

Multiscale analysis of information dynamics for linear multivariate processes.

2016

In the study of complex physical and physiological systems represented by multivariate time series, an issue of great interest is the description of the system dynamics over a range of different temporal scales. While information-theoretic approaches to the multiscale analysis of complex dynamics are being increasingly used, the theoretical properties of the applied measures are poorly understood. This study introduces for the first time a framework for the analytical computation of information dynamics for linear multivariate stochastic processes explored at different time scales. After showing that the multiscale processing of a vector autoregressive (VAR) process introduces a moving aver…

FOS: Computer and information sciencesInformation transferMultivariate statisticsMultivariate analysisComputer scienceComputer Science - Information Theory0206 medical engineeringStochastic ProcesseBiomedical EngineeringFOS: Physical sciencesInformation Storage and RetrievalHealth Informatics02 engineering and technology01 natural sciencesEntropy (classical thermodynamics)Moving average0103 physical sciencesEntropy (information theory)Computer SimulationStatistical physicsEntropy (energy dispersal)Time series010306 general physicsEntropy (arrow of time)Multivariate Analysi1707Stochastic ProcessesEntropy (statistical thermodynamics)Stochastic processInformation Theory (cs.IT)Probability and statisticsModels Theoretical020601 biomedical engineeringComplex dynamicsAutoregressive modelPhysics - Data Analysis Statistics and ProbabilitySignal ProcessingSettore ING-INF/06 - Bioingegneria Elettronica E InformaticaMultivariate AnalysisData Analysis Statistics and Probability (physics.data-an)Entropy (order and disorder)Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference
researchProduct

A Proposed Access Control-Based Privacy Preservation Model to Share Healthcare Data in Cloud

2020

Healthcare data in cloud computing facilitates the treatment of patients efficiently by sharing information about personal health data between the healthcare providers for medical consultation. Furthermore, retaining the confidentiality of data and patients' identity is a another challenging task. This paper presents the concept of an access control-based (AC) privacy preservation model for the mutual authentication of users and data owners in the proposed digital system. The proposed model offers a high-security guarantee and high efficiency. The proposed digital system consists of four different entities, user, data owner, cloud server, and key generation center (KGC). This approach makes…

FOS: Computer and information sciencesKey generationComputer Science - Cryptography and Security020205 medical informaticsbusiness.industryComputer science020206 networking & telecommunicationsAccess controlCloud computing02 engineering and technologyMutual authenticationEncryptionPublic-key cryptographyData sharingComputer Science - Computers and SocietyComputers and Society (cs.CY)0202 electrical engineering electronic engineering information engineeringSession keybusinessCryptography and Security (cs.CR)Computer network2020 16th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob)
researchProduct

Alignment-free Genomic Analysis via a Big Data Spark Platform

2021

Abstract Motivation Alignment-free distance and similarity functions (AF functions, for short) are a well-established alternative to pairwise and multiple sequence alignments for many genomic, metagenomic and epigenomic tasks. Due to data-intensive applications, the computation of AF functions is a Big Data problem, with the recent literature indicating that the development of fast and scalable algorithms computing AF functions is a high-priority task. Somewhat surprisingly, despite the increasing popularity of Big Data technologies in computational biology, the development of a Big Data platform for those tasks has not been pursued, possibly due to its complexity. Results We fill this impo…

FOS: Computer and information sciencesStatistics and Probabilitysequence analysisComputer science0206 medical engineeringBig data02 engineering and technologyMachine learningcomputer.software_genreBiochemistry03 medical and health sciencesSpark (mathematics)MapReduceMolecular Biology030304 developmental biology0303 health sciencesSettore INF/01 - Informaticabusiness.industryBioinformatics High Performance Computing Compressed Data StructuresMapReduce; hadoop; sequence analysisComputer Science ApplicationsComputational MathematicsTask (computing)Computer Science - Distributed Parallel and Cluster ComputingComputational Theory and MathematicsDistributed Parallel and Cluster Computing (cs.DC)Artificial intelligencehadoopbusinesscomputer020602 bioinformaticsBioinformatics
researchProduct

Random Walk in a N-cube Without Hamiltonian Cycle to Chaotic Pseudorandom Number Generation: Theoretical and Practical Considerations

2017

Designing a pseudorandom number generator (PRNG) is a difficult and complex task. Many recent works have considered chaotic functions as the basis of built PRNGs: the quality of the output would indeed be an obvious consequence of some chaos properties. However, there is no direct reasoning that goes from chaotic functions to uniform distribution of the output. Moreover, embedding such kind of functions into a PRNG does not necessarily allow to get a chaotic output, which could be required for simulating some chaotic behaviors. In a previous work, some of the authors have proposed the idea of walking into a $\mathsf{N}$-cube where a balanced Hamiltonian cycle has been removed as the basis o…

FOS: Computer and information sciencesUniform distribution (continuous)Computer Science - Cryptography and SecurityComputer scienceHamiltonian CycleChaoticPseudorandom Numbers GeneratorFOS: Physical sciences02 engineering and technology[INFO.INFO-SE]Computer Science [cs]/Software Engineering [cs.SE]01 natural sciencesUpper and lower bounds[INFO.INFO-IU]Computer Science [cs]/Ubiquitous Computingsymbols.namesake[INFO.INFO-MC]Computer Science [cs]/Mobile Computing[INFO.INFO-CR]Computer Science [cs]/Cryptography and Security [cs.CR]0202 electrical engineering electronic engineering information engineeringApplied mathematics[INFO.INFO-RB]Computer Science [cs]/Robotics [cs.RO]0101 mathematicsEngineering (miscellaneous)Pseudorandom number generatorChaotic IterationsBasis (linear algebra)Applied Mathematics020208 electrical & electronic engineering010102 general mathematicsRandom walkNonlinear Sciences - Chaotic DynamicsHamiltonian path[INFO.INFO-MO]Computer Science [cs]/Modeling and SimulationNonlinear Sciences::Chaotic Dynamics[INFO.INFO-MA]Computer Science [cs]/Multiagent Systems [cs.MA]Modeling and SimulationRandom Walk[NLIN.NLIN-CD]Nonlinear Sciences [physics]/Chaotic Dynamics [nlin.CD]symbolsPseudo random number generator[INFO.INFO-ET]Computer Science [cs]/Emerging Technologies [cs.ET]Chaotic Dynamics (nlin.CD)[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM][INFO.INFO-DC]Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC]Cryptography and Security (cs.CR)
researchProduct

Using the transit of Venus to probe the upper planetary atmosphere

2015

The atmosphere of a transiting planet shields the stellar radiation providing us with a powerful method to estimate its size and density. In particular, because of their high ionization energy, atoms with high atomic number (Z) absorb short-wavelength radiation in the upper atmosphere, undetectable with observations in visible light. One implication is that the planet should appear larger during a primary transit observed in high energy bands than in the optical band. The last Venus transit in 2012 offered a unique opportunity to study this effect. The transit has been monitored by solar space observations from Hinode and Solar Dynamics Observatory (SDO). We measure the radius of Venus duri…

FOS: Physical sciencesGeneral Physics and AstronomyVenusBioinformatics7. Clean energyGeneral Biochemistry Genetics and Molecular BiologyArticleAtmosphereAtmosphere of VenusPhysics and Astronomy (all)Settore FIS/05 - Astronomia E AstrofisicaPlanetAstrophysics::Solar and Stellar AstrophysicsTransit (astronomy)Earth and Planetary Astrophysics (astro-ph.EP)[PHYS]Physics [physics]PhysicsBiochemistry Genetics and Molecular Biology (all)MultidisciplinarySecondary atmospherebiologyChemistry (all)Astrophysics::Instrumentation and Methods for AstrophysicsAstronomyGeneral ChemistryRadiusbiology.organism_classificationExoplanet13. Climate actionBiochemistry Genetics and Molecular Biology (all); Chemistry (all); Physics and Astronomy (all)Physics::Space PhysicsAstrophysics::Earth and Planetary Astrophysics[PHYS.ASTR]Physics [physics]/Astrophysics [astro-ph]Astrophysics - Earth and Planetary AstrophysicsNature Communications
researchProduct

Broad spectrum of Fabry disease manifestation in an extended Spanish family with a new deletion in the GLA gene

2012

Background. Fabry disease (FD) is an X-linked inherited disease based on the absence or reduction of lysosomal-galactosidase (Gla) activity. The enzymatic defect results in progressive impairment of cerebrovascular, renal and cardiac function. Normally, female heterozygote mutation carriers are less strongly affected than male hemizygotes aggravating disease diagnosis. Method. Close examination of the patients by renal biopsy, echo- and electrocardiography and MRI. Blood work and subsequent DNA analysis were carried out utilizing approved protocols for PCR and Sequencing. MLPA analysis was done to unveil deletions within the GLA gene locus. Quantitative detection of Glycolipids in patient p…

Fabry diseaseTransplantationPathologymedicine.medical_specialtybusiness.industryOriginal ContributionsGenetic disorderLocus (genetics)Heterozygote advantageOriginal Articleslyso-Gb3multiple sclerosismedicine.diseaseBioinformaticsrenal involvementFabry diseaseExonNephrologyMedicineBiomarker (medicine)Multiplex ligation-dependent probe amplificationbusinessX-linked recessive inheritanceClinical Kidney Journal
researchProduct

Chances and Challenges of Computational Data Gathering and Analysis

2015

Digital and social media and large available data-sets generate various new possibilities and challenges for conducting research focused on perpetually developing online news ecosystems. This paper presents a novel computational technique for gathering and processing large quantities of data from Facebook. We demonstrate how to use this technique for detecting and analysing issue-attention cycles and news flows in Facebook groups and pages. Although the paper concentrates on a Finnish Facebook group as a case study, the demonstrated method can be used for gathering and analysing large sets of data from various social network sites and national contexts. The paper also discusses Facebook pla…

Facebookcomputational data gathering020205 medical informaticsComputer scienceissue-attention cycledata warehouse050801 communication & media studies02 engineering and technologynews flowsWorld Wide WebComputational Technique0508 media and communications0202 electrical engineering electronic engineering information engineeringSocial mediata518semi-public datata113hybrid news ecosystemData collectionEthical issuesSocial networkbusiness.industryCommunication05 social sciencesOnline research methodsData warehousedigital and social media researchbusinessDigital Journalism
researchProduct