0000000000164917

AUTHOR

Satu Helske

0000-0003-0532-0153

showing 9 related works from this author

Can visualization alleviate dichotomous thinking? Effects of visual representations on the cliff effect

2021

Common reporting styles for statistical results in scientific articles, such as $p$ p -values and confidence intervals (CI), have been reported to be prone to dichotomous interpretations, especially with respect to the null hypothesis significance testing framework. For example when the $p$ p -value is small enough or the CIs of the mean effects of a studied drug and a placebo are not overlapping, scientists tend to claim significant differences while often disregarding the magnitudes and absolute differences in the effect sizes. This type of reasoning has been shown to be potentially harmful to science. Techniques relying on the visual estimation of the strength of evidence have been recom…

FOS: Computer and information sciencesvisualisointiBayesian inferencetilastomenetelmätComputer Science - Human-Computer Interactiontulkinta02 engineering and technologyBayesian inferenceluottamustasotHuman-Computer Interaction (cs.HC)cliff effectData visualizationhypothesis testing0202 electrical engineering electronic engineering information engineeringStatistical inferencevisualizationconfidence intervalsStatistical hypothesis testingpäättelybusiness.industrybayesilainen menetelmäOther Statistics (stat.OT)Multilevel model020207 software engineeringtilastografiikkaComputer Graphics and Computer-Aided DesignConfidence intervalStatistics - Other StatisticsSignal ProcessingComputer Vision and Pattern RecognitionbusinessPsychologyNull hypothesisValue (mathematics)SoftwareCognitive psychologystatistical inference
researchProduct

Partnership formation and dissolution over the life course: applying sequence analysis and event history analysis in the study of recurrent events

2015

We present two types of approach to the analysis of recurrent events for discretely measured data, and show how these methods can complement each other when analysing co-residential partnership histories. Sequence analysis is a descriptive tool that gives an overall picture of the data and helps to find typical and atypical patterns in histories. Event history analysis is used to make conclusions about the effects of covariates on the timing and duration of the partnerships. As a substantive question, we studied how family background and childhood socio-emotional characteristics were related to later partnership formation and stability in a Finnish cohort born in 1959. We found that high se…

partnership formationta112H Social Sciences (General)sequence analysisevent history analysisHQ The family. Marriage. Womanpartnership dissolutionLower riskDevelopmental psychologyRepeated eventsrecurrent eventssekvenssianalyysiGeneral partnershipCohortjel:C1Life course approachHA StatisticsLife-span and Life-course StudiesPsychologypartnership formation; partnership dissolution; sequence analysis; event history analysis; recurrent eventsta515Survival analysisLongitudinal and Life Course Studies
researchProduct

Combining Sequence Analysis and Hidden Markov Models in the Analysis of Complex Life Sequence Data

2018

Life course data often consists of multiple parallel sequences, one for each life domain of interest. Multichannel sequence analysis has been used for computing pairwise dissimilarities and finding clusters in this type of multichannel (or multidimensional) sequence data. Describing and visualizing such data is, however, often challenging. We propose an approach for compressing, interpreting, and visualizing the information within multichannel sequences by finding (1) groups of similar trajectories and (2) similar phases within trajectories belonging to the same group. For these tasks we combine multichannel sequence analysis and hidden Markov modelling. We illustrate this approach with an …

longitudinal datasekvensointisequence analysisSequence analysisComputer scienceMarkovin ketjutMarkov modelspitkittäistutkimuselämänkaari01 natural sciences010104 statistics & probability03 medical and health sciencesData sequencespopulation dynamicsSannolikhetsteori och statistik0101 mathematicsfamily and work trajectoriesProbability Theory and StatisticsHidden Markov modellife course030505 public healthhidden Markov modelslatent Markov modelsbusiness.industryPattern recognitionTvärvetenskapliga studier inom samhällsvetenskaplife sequence dataLife domainLife course approachPairwise comparisonArtificial intelligenceSocial Sciences Interdisciplinary0305 other medical sciencebusinessväestötilastot
researchProduct

Statistical analysis of life sequence data

2016

latent Markov modelsequence analysisevent history analysislife course dataelinaika-analyysitilastomenetelmätMarkovin ketjutmultichannel sequencespitkittäistutkimuselämänkaarielämäntilannemultidimensional sequencessekvenssianalyysielämänmuutoksetmixture hidden Markov modelhidden Markov modeltilastolliset mallitstokastiset prosessit
researchProduct

Mixture Hidden Markov Models for Sequence Data: The seqHMM Package in R

2019

Sequence analysis is being more and more widely used for the analysis of social sequences and other multivariate categorical time series data. However, it is often complex to describe, visualize, and compare large sequence data, especially when there are multiple parallel sequences per subject. Hidden (latent) Markov models (HMMs) are able to detect underlying latent structures and they can be used in various longitudinal settings: to account for measurement error, to detect unobservable states, or to compress information across several types of observations. Extending to mixture hidden Markov models (MHMMs) allows clustering data into homogeneous subsets, with or without external covariate…

FOS: Computer and information sciencesStatistics and ProbabilityMultivariate statisticssequence analysisaikasarjatComputer sciencerMarkov modelStatistics - ComputationStatistics - Applications01 natural sciencesUnobservablecategorical time seriesR-kieli010104 statistics & probabilitymulti-channel sequences; categorical time series; visualizing sequence data; visualizing models; latent Markov models; latent class models; RCovariateApplications (stat.AP)Sannolikhetsteori och statistikComputer software0101 mathematicsTime seriesProbability Theory and StatisticsHidden Markov modelCluster analysislcsh:Statisticslcsh:HA1-4737Categorical variableComputation (stat.CO)ta112business.industryvisualizing sequence dataR (programming languages)Pattern recognitionmulti-channel sequencesvisualizing modelslatent class modelssekvenssianalyysiArtificial intelligencelatent markov modelstime seriesStatistics Probability and UncertaintybusinessSoftwareJournal of Statistical Software
researchProduct

Statistical analysis of life history calendar data

2016

The life history calendar is a data-collection tool for obtaining reliable retrospective data about life events. To illustrate the analysis of such data, we compare the model-based probabilistic event history analysis and the model-free data mining method, sequence analysis. In event history analysis, we estimate instead of transition hazards the cumulative prediction probabilities of life events in the entire trajectory. In sequence analysis, we compare several dissimilarity metrics and contrast data-driven and user-defined substitution costs. As an example, we study young adults' transition to adulthood as a sequence of events in three life domains. The events define the multistate event…

AdultMaleStatistics and ProbabilityAdolescentEpidemiologyComputer sciencedistance-based dataDisease clustercomputer.software_genre01 natural sciencesLife Change EventsYoung Adult010104 statistics & probability0504 sociologyHealth Information Managementprediction probabilityStatisticsData MiningHumansLongitudinal StudiesProspective Studieslife history calendar multidimensional sequence analysis0101 mathematicsFinlandSurvival analysisProbabilityRetrospective StudiesSequence (medicine)Complement (set theory)ta112DepressionData Collection05 social sciencesProbabilistic logic050401 social sciences methodsContrast (statistics)multistate modelMiddle ageLife course approachFemaleData mininglife history calendarlife course analysiscomputermultidimensional sequence analysisStatistical Methods in Medical Research
researchProduct

From Sequences to Variables : Rethinking the Relationship between Sequences and Outcomes

2021

Sequence analysis is increasingly used in the social sciences for the holistic analysis of life-course and other longitudinal data. The usual approach is to construct sequences, calculate dissimilarities, group similar sequences with cluster analysis, and use cluster membership as a dependent or independent variable in a regression model. This approach may be problematic, as cluster memberships are assumed to be fixed known characteristics of the subjects in subsequent analyses. Furthermore, it is often more reasonable to assume that individual sequences are mixtures of multiple ideal types rather than equal members of some group. Failing to account for uncertain and mixed memberships may l…

sequence analysisrepresentativenesslife-courseSocArXiv|Social and Behavioral Sciences|Sociology|Children and Youthbepress|Social and Behavioral Sciences|SociologySocArXiv|Social and Behavioral Sciences|Sociologyklusteritbepress|Social and Behavioral Sciences|Sociology|Family Life Course and Societysekvenssianalyysianalyysibepress|Social and Behavioral SciencesklusterianalyysiSocArXiv|Social and Behavioral Sciencestypologycluster analysis
researchProduct

Analysing Complex Life Sequence Data with Hidden Markov Modelling

2016

When analysing complex sequence data with multiple channels (dimensions) and long observation sequences, describing and visualizing the data can be a challenge. Hidden Markov models (HMMs) and their mixtures (MHMMs) offer a probabilistic model-based framework where the information in such data can be compressed into hidden states (general life stages) and clusters (general patterns in life courses). We studied two different approaches to analysing clustered life sequence data with sequence analysis (SA) and hidden Markov modelling. In the first approach we used SA clusters as fixed and estimated HMMs separately for each group. In the second approach we treated SA clusters as suggestive and …

complex sequence dataHidden Markov Modelling
researchProduct

Kompleksisen luokitellun aineiston riippuvuusrakenteen pelkistäminen : sovellus ylioppilaskirjoitusaineistoon

2009

valintaylioppilastutkintotaulukointimuuttujatulottuvuusmallitäidinkieli
researchProduct