Search results for "Data type"
showing 10 items of 1183 documents
Bagging, bumping, multiview, and active learning for record linkage with empirical results on patient identity data
2011
Record linkage or deduplication deals with the detection and deletion of duplicates in and across files. For this task, this paper introduces and evaluates two new machine-learning methods (bumping and multiview) together with bagging, a tree-based ensemble-approach. Whereas bumping represents a tree-based approach as well, multiview is based on the combination of different methods and the semi-supervised learning principle. After providing a theoretical background of the methods, initial empirical results on patient identity data are given. In the empirical evaluation, we calibrate the methods on three different kinds of training data. The results show that the smallest training data set, …
Patterns in words and languages
2004
AbstractA word p, over the alphabet of variables E, is a pattern of a word w over A if there exists a non-erasing morphism h from E∗ to A∗ such that h(p)=w. If we take E=A, given two words u,v∈A∗, we write u⩽v if u is a pattern of v. The restriction of ⩽ to aA∗, where A is the binary alphabet {a,b}, is a partial order relation. We introduce, given a word v, the set P(v) of all words u such that u⩽v. P(v), with the relation ⩽, is a poset and it is called the pattern poset of v. The first part of the paper is devoted to investigate the relationships between the structure of the poset P(v) and the combinatorial properties of the word v. In the last section, for a given language L, we consider …
Imprinting the complex dielectric permittivity of liquids into the spintronic terahertz emission
2021
We report an approach in time-domain terahertz (THz) spectroscopy for measuring the dielectric response of liquids based on inherent properties of spintronic THz emitters (STEs). The THz electric field radiated from the STE is inversely proportional to the sum of the complex refractive indices of the media surrounding the thin metallic stack of the STE and the stack's conductivity. We demonstrate that by bringing a liquid in contact with the emitter, its complex refractive index and accordingly its dielectric response are imprinted into the radiated electromagnetic field from the emitter. We use water as the test liquid and ascertain its dielectric loss and permittivity in the range of ∼0.…
On Spaces of Bochner and Pettis Integrable Functions and Their Set-Valued Counterparts
2011
The aim of this paper is to give a brief summary of the Pettis and Bochner integrals, how they are related, how they are generalized to the set-valued setting and the canonical Banach spaces of bounded maps between Banach spaces that they generate. The main tool that we use to relate the Banach space-valued case to the set-valued case, is the R ̊adstr ̈om embedding theorem.
Drugs and Nondrugs: An Effective Discrimination with Topological Methods and Artificial Neural Networks
2003
A set of topological and structural descriptors has been used to discriminate general pharmacological activity. To that end, we selected a group of molecules with proven pharmacological activity including different therapeutic categories, and another molecule group without any activity. As a method for pharmacological activity discrimination, an artificial neural network was used, dividing molecules into active and inactive, to train the network and externally validate it. The following plot frequency distribution diagrams were used: a function of the number of drugs within a value interval, and the output value of the neural network versus these values. Pharmacological distribution diagram…
Biopharma business models in Canada.
2011
This article provides new insights into the different strategy paths or business models currently being implemented by Canadian biopharma companies. Through a case-study methodology, seven biopharma companies pertaining to three business models were analyzed, leading to a broad set of results emerging from the following areas: activity, business model and strategy; management and human resources; and RD, technology and innovation strategy. The three business models represented were: model 1 (conventional biotech oriented to new drug development, radical innovation and search for discoveries); model 2 (development of a technology platform, usually in proteomics and bioinformatics); and model…
Sur les Codes ZigZag et Leur Décidabilité
1990
AbstractThis paper deals with zigzag factorizations and zigzag codes. The language of “zigzag” over a regular language is represented by constructing a special family of two-way automata. Decidability of zigzag codes, previously shown for the finite languages, is proved here for all regular languages by the analysis of the set of “crossing sequences” produced by a two-way automation in the family. We also obtain that it is decidable whether or not a two-way automation of a certain type is non-ambiguous.RésuméDans ce papier on reprend les notions de factorisation zigzag et de code zigzag. On construit pour tout langage rationnel, une famille d'automates bilatéres lesquels reconnaissent les m…
Dynamics of fintech terms in news and blogs and specialization of companies of the fintech industry
2020
We perform a large scale analysis of a list of fintech terms in (i) news and blogs in English language and (ii) professional descriptions of companies operating in many countries. The occurrence and co-occurrence of fintech terms and locutions shows a progressive evolution of the list of fintech terms in a compact and coherent set of terms used worldwide to describe fintech business activities. By using methods of complex networks that are specifically designed to deal with heterogeneous systems, our analysis of a large set of professional descriptions of companies shows that companies having fintech terms in their description present over-expressions of specific attributes of country, muni…
A tool for filtering information in complex systems
2005
We introduce a technique to filter out complex data-sets by extracting a subgraph of representative links. Such a filtering can be tuned up to any desired level by controlling the genus of the resulting graph. We show that this technique is especially suitable for correlation based graphs giving filtered graphs which preserve the hierarchical organization of the minimum spanning tree but containing a larger amount of information in their internal structure. In particular in the case of planar filtered graphs (genus equal to 0) triangular loops and 4 element cliques are formed. The application of this filtering procedure to 100 stocks in the USA equity markets shows that such loops and cliqu…
Few Simple Rules to Fix the Dynamics of Classical Systems Using Operators
2012
We show how to use operators in the description of exchanging processes often taking place in (complex) classical systems. In particular, we propose a set of rules giving rise to an Hamiltonian operator for such a system \({\mathcal{S}}\), which can be used to deduce the dynamics of \({\mathcal{S}}\).