Search results for "dictionary"

showing 10 items of 79 documents

On parsing optimality for dictionary-based text compression—the Zip case

2013

Dictionary-based compression schemes are the most commonly used data compression schemes since they appeared in the foundational paper of Ziv and Lempel in 1977, and generally referred to as LZ77. Their work is the base of Zip, gZip, 7-Zip and many other compression software utilities. Some of these compression schemes use variants of the greedy approach to parse the text into dictionary phrases; others have left the greedy approach to improve the compression ratio. Recently, two bit-optimal parsing algorithms have been presented filling the gap between theory and best practice. We present a survey on the parsing problem for dictionary-based text compression, identifying noticeable results …

Theoretical computer scienceComputer scienceData_CODINGANDINFORMATIONTHEORYTop-down parsingcomputer.software_genreTheoretical Computer ScienceParsing optimalityCompression (functional analysis)Discrete Mathematics and CombinatoricsLossless compressionParsingLZ77 algorithmSettore INF/01 - InformaticaDeflate algorithmbusiness.industryDictionary-based text compressionComputational Theory and MathematicsData compressionDEFLATECompression ratioArtificial intelligencebusinesscomputerNatural language processingBottom-up parsingData compressionJournal of Discrete Algorithms
researchProduct

The rightmost equal-cost position problem.

2013

LZ77-based compression schemes compress the input text by replacing factors in the text with an encoded reference to a previous occurrence formed by the couple (length, offset). For a given factor, the smallest is the offset, the smallest is the resulting compression ratio. This is optimally achieved by using the rightmost occurrence of a factor in the previous text. Given a cost function, for instance the minimum number of bits used to represent an integer, we define the Rightmost Equal-Cost Position (REP) problem as the problem of finding one of the occurrences of a factor whose cost is equal to the cost of the rightmost one. We present the Multi-Layer Suffix Tree data structure that, for…

FOS: Computer and information sciencesOffset (computer science)Computer scienceSuffix treeComputer Science - Information Theorylaw.inventionCombinatoricslawLog-log plotComputer Science - Data Structures and AlgorithmsCompression schemetext compressiondictionary text compressionData Structures and Algorithms (cs.DS)LZ77 compressiondata compressionLossless compressionfull text indexSuffix Tree Data StructuresSettore INF/01 - InformaticaInformation Theory (cs.IT)Data structurePrefixCompression ratioCompression scheme; Constant time; Suffix Tree Data StructuresAlgorithmData compressionConstant time
researchProduct

Dictionary-symbolwise flexible parsing

2012

AbstractLinear-time optimal parsing algorithms are rare in the dictionary-based branch of the data compression theory. A recent result is the Flexible Parsing algorithm of Matias and Sahinalp (1999) that works when the dictionary is prefix closed and the encoding of dictionary pointers has a constant cost. We present the Dictionary-Symbolwise Flexible Parsing algorithm that is optimal for prefix-closed dictionaries and any symbolwise compressor under some natural hypothesis. In the case of LZ78-like algorithms with variable costs and any, linear as usual, symbolwise compressor we show how to implement our parsing algorithm in linear time. In the case of LZ77-like dictionaries and any symbol…

Theoretical computer scienceComputer science[INFO.INFO-DS]Computer Science [cs]/Data Structures and Algorithms [cs.DS][INFO.INFO-DS] Computer Science [cs]/Data Structures and Algorithms [cs.DS]Data_CODINGANDINFORMATIONTHEORY0102 computer and information sciences02 engineering and technologycomputer.software_genre01 natural sciencesDirected acyclic graphTheoretical Computer ScienceConstant (computer programming)020204 information systemsEncoding (memory)Optimal parsing0202 electrical engineering electronic engineering information engineeringDiscrete Mathematics and CombinatoricsStringologySymbolwise text compressionTime complexityLossless compressionParsingSettore INF/01 - InformaticaDictionary-based compressionOptimal Parsing Lossless Data Compression DAGDirected acyclic graphPrefixComputational Theory and MathematicsText compression010201 computation theory & mathematicsAlgorithmcomputerBottom-up parsingData compressionJournal of Discrete Algorithms
researchProduct

Tożsamość górnośląska wyrażona w regionalnej kuchni (na przykładzie słownika zup)

2018

Górny ŚląskcuisinesoupskulinariazupyUpper SilesiatożsamośćsłownikidentitydictionaryStudia Śląskie
researchProduct

An Online Multilingual Medical Vocabulary/Thesaurus/Dictionary (MED-VTD) for Facilitating Understanding of Medical Texts

Medical texts (e.g., reports and medicine leaflets) are usually written by professionals (physicians, medical researchers, etc.) who use their own language and communication style. On the other hand, they are often read by health consumers or other medical professionals who do not have the same vocabularies and can have difficulties in text comprehension. Thus, to help a generic user in understanding a medical text, it would be desirable to have an online medical vocabulary/thesaurus/dictionary that he/she can easily look for finding the plain equivalent of any medical (technical) term and a definition of the term with the same kind of language. In this work, we present an online multilingu…

Settore INF/01 - InformaticaE-Health Patient Empowerment Plain Language Medical Vocabulary Medical Dictionary Consumer Health Vocabulary Electronic Health Record Personal Health Record HL7.
researchProduct

How to combine tools and methods in practice— a field study

1990

In spring 1989 we surveyed the experiences of some Finnish companies in methodology modelling (metamodelling) and adaptation of tools and methodologies to each other (methodology adaptation). The companies represented software production, banking, wood and metal industry, and wholesale trade. The study was carried out as a field study where we interviewed method developers, systems analysts and their supervisors. The goal of the survey was to find out whether there was need for metamodelling or methodology adaptation in general and how this need had been satisfied. The study shows that a little experience had been gained in adapting data dictionaries to methodologies but no such attempts ha…

Process managementKnowledge managementbusiness.industryComputer sciencemedia_common.quotation_subjectUsabilityData dictionaryAdaptabilityMetamodelingInformation systemUser interfacebusinessAdaptation (computer science)Computer-aided software engineeringmedia_common
researchProduct

Neology and lemmatization: clippings and acronyms in spanish dictionaries

2020

espanolEsta investigacion presenta tres objetivos: describir el tratamiento lexicografico de los acortamientos y siglas en diccionarios actuales del espanol; caracterizar las nuevas unidades lexicas formadas por reduccion en cotextos de uso reales (periodisticos y digitales); y disenar una propuesta de criterios para integrar de un modo coherente estas unidades en los diccionarios. Para alcanzar estos objetivos, en primer lugar se describen los acortamientos y siglas detectados en la nomenclatura del DLE (2017) y en el vaciado de dos repertorios de neologismos (Neologismos del espanol actual y Corpus Obneo) y del Corpus del espanol. En segundo lugar, a partir de estos datos, se configura un…

NeologyNomenclatureDiccionarioOrganic ChemistryAcortamientosSiglasAcronymsCorpusClippingsBiochemistryDictionaryCastellà Termes i locucionsNomenclaturaNeología
researchProduct

Progetti di lessicografia onomastica dell’Atlante Linguistico della Sicilia

2022

L’Atlante Linguistico della Sicilia ha adottato, già una decina di anni fa, lo strumento del dizionario-atlante. Quando si è cominciato ad indagare il ricco universo onomastico, se ne è valutata la spendibilità anche su tre diversi corpora orali: uno relativo alle forme popolari del patrimonio toponomastico (DAToS, Dizionario-atlante toponomastico della Sicilia), uno riguardante l’inventario plurale dei soprannomi etnici (DASES, Dizionario-atlante dei soprannomi etnici in Sicilia), l’ultimo afferente al patrimonio antroponomastico popolare, individuale e familiare (DASS, Dizionario-atlante dei soprannomi di/in Sicilia). Il contributo fornisce lo stato dell’arte dei tre progetti e ne descriv…

About ten years ago the Linguistic Atlas of Sicily (ALS) adopted the dictionary-atlas tool. When the rich onomastic universe began to be investigated its use was also evaluated on three different oral corpora: one relating to the popular forms of the toponymic heritage (DAToS Dictionary-toponymic atlas of Sicily) one concerning the plural inventory of ethnic nicknames (DASES Dictionary-atlas of ethnic nicknames in Sicily) the latest afferent to the popular individual and family anthroponomastic heritage (DASS Dictionary-atlas of the nicknames of / in Sicily). The contribution provides the state of the art of the three projects and describes their methods objectives and dissemination tools.Settore L-FIL-LET/12 - Linguistica Italiana
researchProduct

Słownik gwar śląskich

2016

Słownik gwar Śląskich [Dictionary of Silesian Dialects] contains vocabulary from all of the region’s dialects. The dictionary is based on rich lexical materiał, which has been collected sińce the 1950s. The first volume was published in 2000, sińce then thirteen volumes have appeared (containing vocabulary starting with letters from A to J). The core of the materiał is constituted by 20th-eentury vocabulary; however, there are many components of older origin. The aim of the editorial board is the archiving of the Silesian glossary and the structure of the dictionary is subordinated to this purpose. The article presents the structure of the entries, the rules of semantic description of the l…

gwary śląskiesłownikSilesian dialectsdictionary
researchProduct

Corpo/corporeità

2021

A reflection on body and corporeity in a key of reading centered on the sociological analysis of the person

Settore SPS/07 - Sociologia GeneralePerson sociology dictionary
researchProduct