Search results for "text processing"

showing 10 items of 35 documents

Qui a peur du changement climatique?

2019

ABSTRACT.The French "Grand Débat National" (Great National Debate) was animportant political event in 2019. Using the online proposals collected during thisconsultation, we propose a representation of the feeling of the impact of climate changeamong the contributors to the "Grand Débat National" in Metropolitan France. Weanalyze the causes of this feeling, through a supervised analysis of the contributions(via the Great Annotation) and we show the complementary interest of an unsuper-vised analysis (by extracting keywords). We show the richness of the data set that constitutes the "Grand Débat National", and the analytical stakes around these data.We also point out some important limitation…

Changement climatique[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing[SHS.GEO] Humanities and Social Sciences/GeographyGlobal warmingGrand Débat National[INFO.INFO-TT] Computer Science [cs]/Document and Text Processing[SHS.GEO]Humanities and Social Sciences/Geographyrand Débat NationalComputingMilieux_MISCELLANEOUS
researchProduct

Editorial: Mining Scientific Papers: NLP-enhanced Bibliometrics

2019

International audience

Computer science[SHS.INFO]Humanities and Social Sciences/Library and information sciencestext miningBibliometrics050905 science studiescomputer.software_genrescientific papersscientometrics[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]Bibliography. Library science. Information resourcescomputational linguistics[SHS.HISPHILSO]Humanities and Social Sciences/History Philosophy and Sociology of Sciencesnatural language processing[SHS.LANGUE]Humanities and Social Sciences/LinguisticsComputingMilieux_MISCELLANEOUScitation content analysisbusiness.industry05 social sciencesScientometrics[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing[INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]Artificial intelligence0509 other social sciencesComputational linguistics050904 information & library sciencesbusinesscomputerNatural language processingZ
researchProduct

Read&Answer, A Tool to Capture on-Line Processing of Electronic Texts

2009

This paper is aimed at presenting Read&Answer, a tool that records reading times, one of the main on-line methods employed in text processing research. Read&Answer allows the recording, analysis and interpretation of the learner processing in order to test specific hypotheses and explain final comprehension results. First, we will describe the tool, and then we will briefly explain some research studies using the tool. We will show how Read&Answer can be used in combination with another on-line method extensively employed in text processing research, i.e., verbal protocols, and we will also compare Read&Answer with eye movement tracking, a widely accepted on-line reading times technique.

Computer sciencebusiness.industrymedia_common.quotation_subjectInterpretation (philosophy)Eye movementcomputer.software_genreTest (assessment)ComprehensionText processingReading (process)Artificial intelligenceLine (text file)businessThink aloud protocolcomputerNatural language processingmedia_commonThe Ergonomics Open Journal
researchProduct

Advanced Topics in Intelligent Information and Database Systems

2017

This book presents recent research in intelligent information and database systems. The carefully selected contributions were initially accepted for presentation as posters at the 9th Asian Conference on Intelligent Information and Database Systems (ACIIDS 2017) held from to 5 April 2017 in Kanazawa, Japan. While the contributions are of an advanced scientific level, several are accessible for non-expert readers. The book brings together 47 chapters divided into six main parts: • Part I. From Machine Learning to Data Mining.• Part II. Big Data and Collaborative Decision Support Systems,• Part III. Computer Vision Analysis, Detection, Tracking and Recognition,• Part IV. Data-Intensive Text P…

Decision support systemDatabaseComputer sciencebusiness.industryBig dataComputational intelligencecomputer.software_genreData scienceResource (project management)Text processingDecision managementCollaborationThe Internetbusinesscomputer
researchProduct

A Metric for Automatic Word categorization

2008

This paper presents a metric to be used by the working prototype WIH (Web Intelligent Handler). This metric (referred here as po) is designed to reflect main topic words and discriminate certain text profiles through word weightings. The actual version is designed only for Spanish web texts. Statistical analyses show that it is possible to differentiate text profiles upon po behavior. A poll is presented also, showing that it is a good main words discriminator. This paper is posted here as a new algorithm useful for Spanish text processing.

DiscriminatorComputer sciencebusiness.industryPart of speechcomputer.software_genreText processingCategorizationStatistical analysesMetric (mathematics)Artificial intelligenceComputational linguisticsbusinesscomputerWord (computer architecture)Natural language processing
researchProduct

Transforming XML documents to OWL ontologies: A survey

2015

The aims of XML data conversion to ontologies are the indexing, integration and enrichment of existing ontologies with knowledge acquired from these sources. The contribution of this paper consists in providing a classification of the approaches used for the conversion of XML documents into OWL ontologies. This classification underlines the usage profile of each conversion method, providing a clear description of the advantages and drawbacks belonging to each method. Hence, this paper focuses on two main processes, which are ontology enrichment and ontology population using XML data. Ontology enrichment is related to the schema of the ontology (TBox), and ontology population is related to …

Document Structure Description[ INFO.INFO-IR ] Computer Science [cs]/Information Retrieval [cs.IR][ INFO.INFO-TT ] Computer Science [cs]/Document and Text ProcessingComputer scienceEfficient XML Interchange[ INFO.INFO-WB ] Computer Science [cs]/WebLibrary and Information SciencesOntology (information science)[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]XML Schema EditorStreaming XMLRELAX NG[ INFO.INFO-AI ] Computer Science [cs]/Artificial Intelligence [cs.AI]computer.programming_languageOWLInformation retrievalOntology[INFO.INFO-WB]Computer Science [cs]/WebACM[INFO.INFO-LO]Computer Science [cs]/Logic in Computer Science [cs.LO]Web Ontology LanguageXML validationcomputer.file_formatXML[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing[INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]ComputingMethodologies_DOCUMENTANDTEXTPROCESSING[ INFO.INFO-LO ] Computer Science [cs]/Logic in Computer Science [cs.LO]computerInformation Systems
researchProduct

Interview with Charles Bigelow

2018

Charles Bigelows career parallels the development of digital font technology. He has designed fonts and consulted about font technology to many of the companies that created desktop publishing systems. He has also written extensively on digital font technology and taught at RISD, Stanford, and RIT.

EngineeringComputingMilieux_THECOMPUTINGPROFESSIONGeneral Computer Sciencebusiness.industry05 social sciences050905 science studiescomputer.software_genreDesktop publishingVisual artsText processingHistory and Philosophy of ScienceFontComputingMilieux_COMPUTERSANDEDUCATION0509 other social sciencesbusinesscomputerParallelsIEEE Annals of the History of Computing
researchProduct

Semantic HMC for Big Data Analysis

2014

International audience; Analyzing Big Data can help corporations to im-prove their efficiency. In this work we present a new vision to derive Value from Big Data using a Semantic Hierarchical Multi-label Classification called Semantic HMC based in a non-supervised Ontology learning process. We also proposea Semantic HMC process, using scalable Machine-Learning techniques and Rule-based reasoning.

FOS: Computer and information sciences[ INFO.INFO-TT ] Computer Science [cs]/Document and Text Processingmulti-classifyComputer scienceComputer Science - Artificial IntelligenceBig data[ INFO.INFO-WB ] Computer Science [cs]/Websemantic technologies02 engineering and technologyOntology (information science)Semantic data model[ INFO.INFO-DC ] Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC]Semantic similarity020204 information systemsSemantic computing0202 electrical engineering electronic engineering information engineeringontologyInformation retrievalOntology learningbusiness.industryOntology-based data integration[INFO.INFO-WB]Computer Science [cs]/WebBig-Data[INFO.INFO-TT]Computer Science [cs]/Document and Text ProcessingArtificial Intelligence (cs.AI)machine learningOntologySemantic technologyIndex Terms—classification020201 artificial intelligence & image processing[INFO.INFO-DC]Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC]business
researchProduct

Using skeleton and Hough transform variant to correct skew in historical documents

2020

International audience; As a main part of several document analysis systems, Skew estimation represents one of the major research challenges, particularly in case of historical documents exploration. In this paper, we propose an original skew angle detection and correction technique. Morphological Skeleton is introduced to considerably diminish the amount of data by eliminating the redundant pixels and preserving only the central curves of the image components. Next, the proposed method uses Progressive Probabilistic Hough Transform (PPHT) to find image lines. At the end, a specific procedure is applied in order to measure the global skew angle of the document image from these identified li…

General Computer ScienceHorizontal and verticalMorphological skeletonComputer scienceSkew estimationComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONDocument image analysis010103 numerical & computational mathematics02 engineering and technologySkeleton (category theory)01 natural sciencesMeasure (mathematics)Theoretical Computer ScienceHough transformlaw.inventionImage (mathematics)lawMorphological skeleton0202 electrical engineering electronic engineering information engineering[INFO.INFO-DL]Computer Science [cs]/Digital Libraries [cs.DL]0101 mathematicsNumerical AnalysisPixelbusiness.industryApplied MathematicsProgressive probabilistic Hough transformSkew[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]Pattern recognitionSkew correction[INFO.INFO-TT]Computer Science [cs]/Document and Text ProcessingModeling and Simulation020201 artificial intelligence & image processingArtificial intelligencebusinessMathematics and Computers in Simulation
researchProduct

An ontology change management approach for facility management

2014

International audience; Facility management (FM) or technical property management is an approach to operate, maintain, improve and adapt buildings and infrastructures of organizations. A FM project requires the cooperation of many actors from different domains so it has to be automated in a constrained collaborative environment. This paper proposes a new approach for ontology change management applied on facility management of such projects. The industrial challenge is, firstly, to ensure consistency of a FM project knowledge from the construction phase to the technical property management phase (after delivery). Secondly, it has to provide to each actor of the project a personal up-to-date…

Information managementEngineering[ INFO.INFO-IR ] Computer Science [cs]/Information Retrieval [cs.IR][ INFO.INFO-TT ] Computer Science [cs]/Document and Text ProcessingProcess managementKnowledge managementGeneral Computer Sciencebusiness.industryOntology-based data integrationProcess ontology[INFO.INFO-WB]Computer Science [cs]/WebGeneral Engineering[ INFO.INFO-WB ] Computer Science [cs]/Web[INFO.INFO-LO]Computer Science [cs]/Logic in Computer Science [cs.LO]Ontology (information science)Change management (ITSM)[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI][INFO.INFO-TT]Computer Science [cs]/Document and Text ProcessingFacility management[INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]Upper ontology[ INFO.INFO-LO ] Computer Science [cs]/Logic in Computer Science [cs.LO]business[ INFO.INFO-AI ] Computer Science [cs]/Artificial Intelligence [cs.AI]Project management triangle
researchProduct