Search results for "dataset"

showing 10 items of 77 documents

A European Multi Lake Survey dataset of environmental variables, phytoplankton pigments and cyanotoxins

2018

Under ongoing climate change and increasing anthropogenic activity, which continuously challenge ecosystem resilience, an in-depth understanding of ecological processes is urgently needed. Lakes, as providers of numerous ecosystem services, face multiple stressors that threaten their functioning. Harmful cyanobacterial blooms are a persistent problem resulting from nutrient pollution and climate-change induced stressors, like poor transparency, increased water temperature and enhanced stratification. Consistency in data collection and analysis methods is necessary to achieve fully comparable datasets and for statistical validity, avoiding issues linked to disparate data sources. The Europea…

Ecologia dels llacsData DescriptorWater resourcesAquatic Ecology and Water Quality Managementthermocline010504 meteorology & atmospheric sciencesvesien tilaphytoplankton pigments010501 environmental sciences01 natural sciencesEcosystem servicesympäristön tilaBU Contaminants & ToxinsEnvironmental monitoringLimnologylakesddc:550Canvi climàticGeosciences MultidisciplinarySurveyComputingMilieux_MISCELLANEOUSddc:333.7-333.9Climate-ChangeEurope LakesEnvironmental resource management[Belirlenecek]Climate-change ecologyplanktonEutrophication6. Clean waterComputer Science ApplicationsEuropeDisparate systemdatainternationalBloomStatistics Probability and UncertaintyEuropaEnvironmental MonitoringInformation Systemsenvironmental variablesStatistics and ProbabilityBiological pigmentsFitoplànctonClimate ChangeCyanotoxinsta1172BU Contaminanten & ToxinesClimate changeobservation designLibrary and Information SciencesCyanobacteriajärvetEducationEuropean Multi Lakecyanotoxinsddc:570Life ScienceEcosystem14. Life underwaterdatabase creation objectivesyanobakteerit0105 earth and related environmental sciencesWIMEKbusiness.industrydata analysis objectivenutrientmuuttujatPigments Biological15. Life on landClimatic changesdataset ; environmental variables ; phytoplankton ; pigments ; cyanotoxinsmikrolevätAquatische Ecologie en WaterkwaliteitsbeheerEnvironmental variablesPhytoplankton pigmentsMultidisciplinär geovetenskapClimatic changeWater resourcesLakes13. Climate actionNutrient pollutionPhytoplanktonEnvironmental science[SDE.BE]Environmental Sciences/Biodiversity and EcologybusinessEutrophicationLake ecologyCanvis climàticsWatersScientific Data
researchProduct

A Stochastic Variance Factor Model for Large Datasets and an Application to S&P Data

2008

The aim of this paper is to consider multivariate stochastic volatility models for large dimensional datasets. We suggest the use of the principal component methodology of Stock and Watson [Stock, J.H., Watson, M.W., 2002. Macroeconomic forecasting using diffusion indices. Journal of Business and Economic Statistics, 20, 147–162] for the stochastic volatility factor model discussed by Harvey, Ruiz, and Shephard [Harvey, A.C., Ruiz, E., Shephard, N., 1994. Multivariate Stochastic Variance Models. Review of Economic Studies, 61, 247–264]. We provide theoretical and Monte Carlo results on this method and apply it to S&P data.

Economics and EconometricsMultivariate statisticsPrincipal componentsStochastic volatilityjel:C32jel:C33jel:G12Factor modelPrincipal component analysisEconometricsEconomicsStochastic volatility Factor models Principal componentsStochastic volatilityforecasting; stochastic volatility; large datasetFinanceFactor analysis
researchProduct

USE-Net: Incorporating Squeeze-and-Excitation blocks into U-Net for prostate zonal segmentation of multi-institutional MRI datasets

2019

Prostate cancer is the most common malignant tumors in men but prostate Magnetic Resonance Imaging (MRI) analysis remains challenging. Besides whole prostate gland segmentation, the capability to differentiate between the blurry boundary of the Central Gland (CG) and Peripheral Zone (PZ) can lead to differential diagnosis, since tumor's frequency and severity differ in these regions. To tackle the prostate zonal segmentation task, we propose a novel Convolutional Neural Network (CNN), called USE-Net, which incorporates Squeeze-and-Excitation (SE) blocks into U-Net. Especially, the SE blocks are added after every Encoder (Enc USE-Net) or Encoder-Decoder block (Enc-Dec USE-Net). This study ev…

FOS: Computer and information sciences0209 industrial biotechnologyComputer Science - Machine LearningGeneralizationComputer scienceComputer Vision and Pattern Recognition (cs.CV)Cognitive NeuroscienceComputer Science - Computer Vision and Pattern RecognitionConvolutional neural network02 engineering and technologyConvolutional neural networkMachine Learning (cs.LG)Image (mathematics)Prostate cancer020901 industrial engineering & automationArtificial IntelligenceProstate0202 electrical engineering electronic engineering information engineeringmedicineMedical imagingAnatomical MRISegmentationBlock (data storage)Prostate cancermedicine.diagnostic_testSettore INF/01 - Informaticabusiness.industryAnatomical MRI; Convolutional neural networks; Cross-dataset generalization; Prostate cancer; Prostate zonal segmentation; USE-NetINF/01 - INFORMATICAMagnetic resonance imagingPattern recognitionUSE-Netmedicine.diseaseComputer Science Applicationsmedicine.anatomical_structureCross-dataset generalizationFeature (computer vision)Prostate zonal segmentation020201 artificial intelligence & image processingConvolutional neural networksArtificial intelligencebusinessEncoder
researchProduct

An Open-set Recognition and Few-Shot Learning Dataset for Audio Event Classification in Domestic Environments

2020

The problem of training with a small set of positive samples is known as few-shot learning (FSL). It is widely known that traditional deep learning (DL) algorithms usually show very good performance when trained with large datasets. However, in many applications, it is not possible to obtain such a high number of samples. In the image domain, typical FSL applications include those related to face recognition. In the audio domain, music fraud or speaker recognition can be clearly benefited from FSL methods. This paper deals with the application of FSL to the detection of specific and intentional acoustic events given by different types of sound alarms, such as door bells or fire alarms, usin…

FOS: Computer and information sciencesComputer Science - Machine LearningSound (cs.SD)sound processingaudio datasetmachine listeningUNESCO::CIENCIAS TECNOLÓGICASComputer Science - SoundMachine Learning (cs.LG)classificationArtificial IntelligenceAudio and Speech Processing (eess.AS)Signal ProcessingFOS: Electrical engineering electronic engineering information engineeringfew-shot learningopen-set recognitionComputer Vision and Pattern RecognitionSoftwareElectrical Engineering and Systems Science - Audio and Speech Processing
researchProduct

Human experts vs. machines in taxa recognition

2020

The step of expert taxa recognition currently slows down the response time of many bioassessments. Shifting to quicker and cheaper state-of-the-art machine learning approaches is still met with expert scepticism towards the ability and logic of machines. In our study, we investigate both the differences in accuracy and in the identification logic of taxonomic experts and machines. We propose a systematic approach utilizing deep Convolutional Neural Nets with the transfer learning paradigm and extensively evaluate it over a multi-pose taxonomic dataset with hierarchical labels specifically created for this comparison. We also study the prediction accuracy on different ranks of taxonomic hier…

FOS: Computer and information sciencesComputer Science - Machine Learninghahmontunnistus (tietotekniikka)Computer scienceClassification approachTaxonomic expert02 engineering and technologyneuroverkotcomputer.software_genreConvolutional neural networkQuantitative Biology - Quantitative MethodsField (computer science)Machine Learning (cs.LG)Machine learning approachesStatistics - Machine LearningAutomated approachDeep neural networks0202 electrical engineering electronic engineering information engineeringTaxonomic rankQuantitative Methods (q-bio.QM)Classification (of information)Artificial neural networksystematiikka (biologia)Prediction accuracyIdentification (information)koneoppiminenMulti-image dataBenchmark (computing)020201 artificial intelligence & image processingConvolutional neural networksComputer Vision and Pattern RecognitionClassification errorsMachine Learning (stat.ML)Machine learningState of the artElectrical and Electronic EngineeringTaxonomySupport vector machinesLearning systemsbusiness.industryNode (networking)020206 networking & telecommunicationsComputer circuitsHierarchical classificationConvolutionSupport vector machineFOS: Biological sciencesTaxonomic hierarchySignal ProcessingBiomonitoringBenchmark datasetsArtificial intelligencebusinesscomputertaksonitSoftware
researchProduct

Fast Estimation of Diffusion Tensors under Rician noise by the EM algorithm

2016

Diffusion tensor imaging (DTI) is widely used to characterize, in vivo, the white matter of the central nerve system (CNS). This biological tissue contains much anatomic, structural and orientational information of fibers in human brain. Spectral data from the displacement distribution of water molecules located in the brain tissue are collected by a magnetic resonance scanner and acquired in the Fourier domain. After the Fourier inversion, the noise distribution is Gaussian in both real and imaginary parts and, as a consequence, the recorded magnitude data are corrupted by Rician noise. Statistical estimation of diffusion leads a non-linear regression problem. In this paper, we present a f…

FOS: Computer and information sciencesreduced computationGaussianModels NeurologicalDatasets as Topicta3112Statistics - ComputationStatistics - ApplicationsTime030218 nuclear medicine & medical imagingMethodology (stat.ME)Diffusion03 medical and health sciencessymbols.namesake0302 clinical medicineScoring algorithmRician fadingPrior probabilityExpectation–maximization algorithmImage Processing Computer-AssistedMaximum a posteriori estimationHumansApplications (stat.AP)Computer SimulationComputation (stat.CO)Statistics - MethodologyMathematicsta112Likelihood FunctionsGeneral NeuroscienceBrainEstimatormaximum likelihood estimatorFisher scoringMagnetic Resonance ImagingWhite MatterRician likelihoodDiffusion Tensor ImagingFourier transformNonlinear Dynamicssymbolsmaximum a posteriori estimatorAlgorithmAlgorithms030217 neurology & neurosurgerydata augmentation
researchProduct

Ancestry and demography and descendants of Iron Age nomads of the Eurasian Steppe

2017

During the 1st millennium before the Common Era (BCE), nomadic tribes associated with the Iron Age Scythian culture spread over the Eurasian Steppe, covering a territory of more than 3,500 km in breadth. To understand the demographic processes behind the spread of the Scythian culture, we analysed genomic data from eight individuals and a mitochondrial dataset of 96 individuals originating in eastern and western parts of the Eurasian Steppe. Genomic inference reveals that Scythians in the east and the west of the steppe zone can best be described as a mixture of Yamnaya-related ancestry and an East Asian component. Demographic modelling suggests independent origins for eastern and western g…

Gene FlowMale0301 basic medicineSteppePopulation geneticsHuman MigrationGenomic dataBiological anthropologyScience[SHS.ANTHRO-BIO]Humanities and Social Sciences/Biological anthropologyDatasets as TopicGeneral Physics and AstronomyDNA MitochondrialWhite PeopleArticleGeneral Biochemistry Genetics and Molecular BiologyRussia03 medical and health sciencesAsian Peopleddc:590HumansEast AsiaHistory AncientTransients and MigrantsModels StatisticalMultidisciplinarygeography.geographical_feature_categoryHuman migrationbusiness.industryQGenetic VariationGeneral ChemistryGrasslandKazakhstan030104 developmental biologyGeographyIron AgeEthnologybusiness
researchProduct

Characterization of a fractured basement reservoir using high-resolution 3D seismic and logging datasets: A case study of the Sab'atayn Basin, Yemen.

2018

The Sab'atayn Basin is one of the most prolific Mesozoic hydrocarbon basins located in central Yemen. It has many oil producing fields including the Habban Field with oil occurrences in fractured basement rocks. A comprehensive seismic analysis of fractured basement reservoirs was performed to identify the structural pattern and mechanism of hydrocarbon entrapment and reservoir characteristics. A 3D post-stack time migration seismic cube and logging data of 20 wells were used and several 2D seismic sections were constructed and interpreted. Depth structure maps were generated for the basement reservoir and overlying formations. The top of the basement reservoir is dissected by a set of NW-S…

Geologic SedimentsYemen010504 meteorology & atmospheric sciencesOutcropWater WellsDatasets as TopicGeographic Mappinglcsh:Medicine010502 geochemistry & geophysicsBiochemistry01 natural scienceschemistry.chemical_compoundJurassic PeriodOil and Gas FieldsPetrologylcsh:ScienceMaterialsSeismologyMineralsCretaceous PeriodMultidisciplinaryHydraulic FrackingPhysicsClassical MechanicsGeologyMineralogyLipidsPetroleum reservoirChemistryGeophysicsPetroleumBasement (geology)Source rockPhysical SciencesMesozoic EraPetroleumOrganic MaterialsPorosityGeologyResearch ArticleMaterials ScienceGraniteNatural GasStructural basinImaging Three-DimensionalEarthquakesHumans0105 earth and related environmental sciencesDamage Mechanicslcsh:RChemical CompoundsBiology and Life SciencesDrillingGeologic TimeHydrocarbonschemistryEarth SciencesGeographic Information Systemslcsh:QOilsOil shalePLoS ONE
researchProduct

Big Data in Medical Science–a Biostatistical View

2015

Big data” is a universal buzzword in business and science, referring to the retrieval and handling of ever-growing amounts of information. It can be assumed, for example, that a typical hospital generates hundreds of terabytes (1 TB = 1012 bytes) of data annually in the course of patient care (1). For instance, exome sequencing, which results in 5 gigabytes (1 GB = 109 bytes) of data per patient, is on the way to becoming routine (2). The analysis of such enormous volumes of information, i.e., organization and description of the data and the drawing of (scientifically valid) conclusions, can already hardly be accomplished with the traditional tools of computer science and statistics. For ex…

Gigabytebusiness.industrymedia_common.quotation_subjectBig dataByteCloud computingGeneral MedicineTerabyteBioinformaticsData scienceData analysisMedicinebusinessFunction (engineering)media_commonDatasets as TopicDeutsches Ärzteblatt international
researchProduct

CArDIS : A Swedish Historical Handwritten Character and Word Dataset

2022

This paper introduces a new publicly available image-based Swedish historical handwritten character and word dataset named Character Arkiv Digital Sweden (CArDIS) (https://cardisdataset.github.io/CARDIS/). The samples in CArDIS are collected from 64, 084 Swedish historical documents written by several anonymous priests between 1800 and 1900. The dataset contains 116, 000 Swedish alphabet images in RGB color space with 29 classes, whereas the word dataset contains 30, 000 image samples of ten popular Swedish names as well as 1, 000 region names in Sweden. To examine the performance of different machine learning classifiers on CArDIS dataset, three different experiments are conducted. In the …

Handwriting recognitionOptical character recognition softwareoptical character recognition (OCR)Computer SciencesCharacter recognitionold handwritten styleImage recognitionCharacter and word recognitionVDP::Teknologi: 500Datavetenskap (datalogi)Machine learningSwedish handwritten word datasetmachine learning methodsFeature extractionHidden Markov modelsSwedish handwritten character dataset
researchProduct