Search results for "rekisteri"
showing 10 items of 31 documents
Newly Digitized Database Reveals the Lives and Families of Forced Migrants from Finnish Karelia
2017
Studies on displaced persons often suffer from a lack of data on the long-term effects of forced migration. A register created during the 1960s and published as a book series ‘Siirtokarjalaisten tie’ in 1970 documented the lives of individuals who fled the southern Karelian district of Finland after its first and second occupation by the Soviet Union in 1940 and 1944. To realize the potential value of these data for scientific research, we have recently scanned the register using optical character recognition (OCR) software, and developed proprietary computer code to extract these data. Here we outline the steps involved in the digitization process, and present an overview of the Migration Kare…
Register data in sample allocations for small-area estimation
2018
Inadequate control of sample sizes may occur in surveys using stratified sampling and area estimation when the overall sample size is small or auxiliary information is insufficiently used. Very small sample sizes are possible for some areas. The proposed allocation, based on multi-objective optimization, uses a small-area model, an estimation method, and annually collected empirical data. The assessment of its performance at the area and at the population levels is based on design-based sample simulations. Five previously developed allocations serve as references. The model-based estimator is more accurate than the design-based Horvitz–Thompson estimator and t…
What Can Statistical Analysis Tell Us about the Language of Social Media? [Mitä tilastollinen tarkastelu voi kertoa sosiaalisen median kielestä?]
2017
Reddit is a predominantly English-language social media site where discussion is organized around different topic areas. Statistical methods show that language use within Reddit varies with the situation, just as it does outside the site. People are skilled at using language in whatever way the situation requires. [Not peer-reviewed]
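Activation Plan Made: Long-Term Unemployment and Activation Policy in Kokkola in the Light of Register Data [Aktivointisuunnitelma tehty : Kokkolan pitkäaikaistyöttömyys ja aktivointipolitiikka rekisteritietojen valossa]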
2007
Optimal sample allocation conditioned on a small area model, estimator, and auxiliary data
2018
We have studied optimal sample allocation for small area estimation when the objective is to obtain estimates that are as accurate as possible both for the population and for the subpopulations, here called areas. This is a two-level optimization problem. The setting consists of planned areas, stratified sampling, and a small overall sample size predetermined by restricted time and budget resources. Low sample sizes are common in market surveys. In this thesis, we have developed new allocation methods based on a small area model, estimator, and auxiliary data. The final method, the three-term Pareto allocation, is based on the three terms of the mean-squared erro…
Measurement of Open Access as an Infrastructural Challenge : The Case of Finland
2017
Finland has set numeric goals for the development of open access. However, at the moment, no system is available by which this development could be monitored. Poor quality in the metadata records in universities’ research information databases prevents metadata-based analysis of open access publishing progress. This paper shows how the quality problems of Finnish publication data can be resolved through centralizing the services and processes of metadata creation and by improving the interoperability of systems involved in the processes. As a result, this study describes an environment where reliable measurement of open access is possible and presents suggested actions for improving the Fin…
Data obstacles and privacy concerns in artificial intelligence initiatives
2021
To become and remain competitive, many companies (especially large ones) are considering capitalising on data-based technologies, such as Artificial Intelligence (AI). However, whether these companies are structurally ready in terms of data collection and management remains unknown. This chapter discusses how privacy issues and reforms, such as the General Data Protection Regulation (GDPR), affect companies’ AI initiatives and processes. For this purpose, we reviewed the relevant literature and collected empirical data using in-depth interviews with AI and data industry experts in five countries. Our main findings indicated that companies are lacking sound data collection and management…