Search results for "data quality"
showing 10 items of 96 documents
Validation of HF radar sea surface currents in the Malta-Sicily Channel
2019
Abstract A network of High-Frequency radar (HFR) stations runs operationally in the Malta-Sicily Channel (MSC), Central Mediterranean Sea, providing sea surface current maps with high temporal (1 h) and spatial (3 × 3 km) resolutions since August 2012. Comparisons with surface drifter data and near-surface Acoustic Doppler Current Profiler (ADCP) observations, as well as radar site-to-site baseline analyses, provide quantitative assessments of HFR velocities accuracy. Twenty-two drifters were deployed within the HFR domain of coverage between December 2012 and October 2013. Additionally, six ADCP vertical current profiles were collected at selected positions during a dedicated field survey.…
Uneven Data Quality and the Earliest Occupation of Europe—the Case of Untermassfeld (Germany)
2017
AbstractThe database regarding the earliest occupation of Europe has increased significantly in quantity and quality of data points over the last two decades, mainly through the addition of new sites as a result of long-term systematic excavations and large-scale prospections of Early and early Middle Pleistocene exposures. The site distribution pattern suggests an ephemeral presence of hominins in the south of Europe from around one million years ago, with occasional short northward expansions along the western coastal areas when temperate conditions permitted. From around 600,000-700,000 years ago Acheulean artefacts appear in Europe and somewhat later hominin presence seems to pick up, w…
Assessing multiple sources of data to detect illegal fishing, trade and mislabelling of elasmobranchs in Greek markets
2020
Abstract Elasmobranchs, extremely charismatic and threatened animals, still are an important economic source for fishers in many parts of the world, providing significant income through trade. Even though Greek seas host at least 67 elasmobranch species, our knowledge about their biology and ecology is to a large extent unknown. In the present study the integration of conventional (legislation, official data from fisheries landings and fish market value and import/export data) and unconventional (social media) sources of data, accompanied with the use of genetics, aim at outlining the elasmobranch fisheries and trade in Greece and identifying “weak spots” that sabotage their conservation. R…
Perspective: Essential study quality descriptors for data from nutritional epidemiologic research
2017
Pooled analysis of secondary data increases the power of research and enables scientific discovery in nutritional epidemiology. Information on study characteristics that determine data quality is needed to enable correct reuse and interpretation of data. This study aims to define essential quality characteristics for data from observational studies in nutrition. First, a literature review was performed to get an insight on existing instruments that assess the quality of cohort, case-control, and cross-sectional studies and dietary measurement. Second, 2 face-to-face workshops were organized to determine the study characteristics that affect data quality. Third, consensus on the data descrip…
Integration of animal health and public health surveillance sources to exhaustively inform the risk of zoonosis: An application to echinococcosis in …
2020
The analysis of zoonotic disease risk requires the consideration of both human and animal geo-referenced disease incidence data. Here we show an application of joint Bayesian analyses to the study of echinococcosis granulosus (EG) in the province of Rio Negro, Argentina. We focus on merging passive and active surveillance data sources of animal and human EG cases using joint Bayesian spatial and spatio-temporal models. While similar spatial clustering and temporal trending was apparent, there appears to be limited lagged dependence between animal and human outcomes. Beyond the data quality issues relating to missingness at different times, we were able to identify relations between dog and …
RepeatsDB 2.0: improved annotation, classification, search and visualization of repeat protein structures
2017
RepeatsDB 2.0 (URL: http://repeatsdb.bio.unipd.it/) is an update of the database of annotated tandem repeat protein structures. Repeat proteins are a widespread class of non-globular proteins carrying heterogeneous functions involved in several diseases. Here we provide a new version of RepeatsDB with an improved classification schema including high quality annotations for ∼5400 protein structures. RepeatsDB 2.0 features information on start and end positions for the repeat regions and units for all entries. The extensive growth of repeat unit characterization was possible by applying the novel ReUPred annotation method over the entire Protein Data Bank, with data quality is guaranteed by a…
EHRtemporalVariability: delineating temporal dataset shifts in electronic health records
2020
AbstractBackgroundTemporal variability in healthcare processes or protocols is intrinsic to medicine. Such variability can potentially introduce dataset shifts, a data quality issue when reusing electronic health records (EHRs) for secondary purposes. Temporal dataset shifts can present as trends, abrupt or seasonal changes in the statistical distributions of data over time, being particularly complex to address in multi-modal and highly coded data. These changes, if not delineated, can harm population and data-driven research, such as machine learning. Given that biomedical research repositories are increasingly being populated with large historical data from EHRs, there is a need for spec…
ATLAS data quality operations and performance for 2015-2018 data-taking
2020
The ATLAS detector at the Large Hadron Collider reads out particle collision data from over 100 million electronic channels at a rate of approximately 100 kHz, with a recording rate for physics events of approximately 1 kHz. Before being certified for physics analysis at computer centres worldwide, the data must be scrutinised to ensure they are clean from any hardware or software related issues that may compromise their integrity. Prompt identification of these issues permits fast action to investigate, correct and potentially prevent future such problems that could render the data unusable. This is achieved through the monitoring of detector-level quantities and reconstructed collision ev…
Cancer in children and adolescents in Europe: Developments over 20 years and future challengers
2006
This special issue contains 18 articles describing population-based analyses of incidence and survival for cancer among children and adolescents in Europe over the period 1978-1997. The analyses were derived from the large database of the ACCIS project (Automated Childhood Cancer Information System), which was built through collaboration of 62 population-based cancer registries in 19 European countries. Data on 88,465 cancers in children and 15,369 in adolescents (age 15-19 yrs) were included in the various analyses, making this the largest database on cancer in these age-groups in the world. National data were grouped into five European regions to allow comparisons of incidence and surviva…
How does the quality of surveys for nutrient intake adequacy assessment compare across Europe? A scoring system to rate the quality of data in such s…
2009
Research was conducted within the EURopean micronutrient RECommendations Aligned (EURRECA) Network of Excellence, to find the best practice in assessing nutrient intakes. Objectives include: to search for and use data on individual nutrient intake adequacy (NIA) assessment collected in twenty-eight European countries and the four European Free Trade Association countries; to design and test innovative tools for data quality analysis. The information was obtained using the method described by Blanquer et al. in the present issue. The best-practice criteria were devised to select the most appropriate survey in each country. Then a survey quality scoring system was developed in consultation wi…