Search results for "text processing"
showing 10 items of 35 documents
Qui a peur du changement climatique?
2019
ABSTRACT.The French "Grand Débat National" (Great National Debate) was animportant political event in 2019. Using the online proposals collected during thisconsultation, we propose a representation of the feeling of the impact of climate changeamong the contributors to the "Grand Débat National" in Metropolitan France. Weanalyze the causes of this feeling, through a supervised analysis of the contributions(via the Great Annotation) and we show the complementary interest of an unsuper-vised analysis (by extracting keywords). We show the richness of the data set that constitutes the "Grand Débat National", and the analytical stakes around these data.We also point out some important limitation…
Editorial: Mining Scientific Papers: NLP-enhanced Bibliometrics
2019
International audience
Read&Answer, A Tool to Capture on-Line Processing of Electronic Texts
2009
This paper is aimed at presenting Read&Answer, a tool that records reading times, one of the main on-line methods employed in text processing research. Read&Answer allows the recording, analysis and interpretation of the learner processing in order to test specific hypotheses and explain final comprehension results. First, we will describe the tool, and then we will briefly explain some research studies using the tool. We will show how Read&Answer can be used in combination with another on-line method extensively employed in text processing research, i.e., verbal protocols, and we will also compare Read&Answer with eye movement tracking, a widely accepted on-line reading times technique.
Advanced Topics in Intelligent Information and Database Systems
2017
This book presents recent research in intelligent information and database systems. The carefully selected contributions were initially accepted for presentation as posters at the 9th Asian Conference on Intelligent Information and Database Systems (ACIIDS 2017) held from to 5 April 2017 in Kanazawa, Japan. While the contributions are of an advanced scientific level, several are accessible for non-expert readers. The book brings together 47 chapters divided into six main parts: • Part I. From Machine Learning to Data Mining.• Part II. Big Data and Collaborative Decision Support Systems,• Part III. Computer Vision Analysis, Detection, Tracking and Recognition,• Part IV. Data-Intensive Text P…
A Metric for Automatic Word categorization
2008
This paper presents a metric to be used by the working prototype WIH (Web Intelligent Handler). This metric (referred here as po) is designed to reflect main topic words and discriminate certain text profiles through word weightings. The actual version is designed only for Spanish web texts. Statistical analyses show that it is possible to differentiate text profiles upon po behavior. A poll is presented also, showing that it is a good main words discriminator. This paper is posted here as a new algorithm useful for Spanish text processing.
Transforming XML documents to OWL ontologies: A survey
2015
The aims of XML data conversion to ontologies are the indexing, integration and enrichment of existing ontologies with knowledge acquired from these sources. The contribution of this paper consists in providing a classification of the approaches used for the conversion of XML documents into OWL ontologies. This classification underlines the usage profile of each conversion method, providing a clear description of the advantages and drawbacks belonging to each method. Hence, this paper focuses on two main processes, which are ontology enrichment and ontology population using XML data. Ontology enrichment is related to the schema of the ontology (TBox), and ontology population is related to …
Interview with Charles Bigelow
2018
Charles Bigelows career parallels the development of digital font technology. He has designed fonts and consulted about font technology to many of the companies that created desktop publishing systems. He has also written extensively on digital font technology and taught at RISD, Stanford, and RIT.
Semantic HMC for Big Data Analysis
2014
International audience; Analyzing Big Data can help corporations to im-prove their efficiency. In this work we present a new vision to derive Value from Big Data using a Semantic Hierarchical Multi-label Classification called Semantic HMC based in a non-supervised Ontology learning process. We also proposea Semantic HMC process, using scalable Machine-Learning techniques and Rule-based reasoning.
Using skeleton and Hough transform variant to correct skew in historical documents
2020
International audience; As a main part of several document analysis systems, Skew estimation represents one of the major research challenges, particularly in case of historical documents exploration. In this paper, we propose an original skew angle detection and correction technique. Morphological Skeleton is introduced to considerably diminish the amount of data by eliminating the redundant pixels and preserving only the central curves of the image components. Next, the proposed method uses Progressive Probabilistic Hough Transform (PPHT) to find image lines. At the end, a specific procedure is applied in order to measure the global skew angle of the document image from these identified li…
An ontology change management approach for facility management
2014
International audience; Facility management (FM) or technical property management is an approach to operate, maintain, improve and adapt buildings and infrastructures of organizations. A FM project requires the cooperation of many actors from different domains so it has to be automated in a constrained collaborative environment. This paper proposes a new approach for ontology change management applied on facility management of such projects. The industrial challenge is, firstly, to ensure consistency of a FM project knowledge from the construction phase to the technical property management phase (after delivery). Secondly, it has to provide to each actor of the project a personal up-to-date…