Search results for " Compression"

showing 10 items of 400 documents

Boosting Textual Compression in Optimal Linear Time

2005

We provide a general boosting technique for Textual Data Compression. Qualitatively, it takes a good compression algorithm and turns it into an algorithm with a better compression performance guarantee. It displays the following remarkable properties: (a) it can turn any memoryless compressor into a compression algorithm that uses the “best possible” contexts; (b) it is very simple and optimal in terms of time; and (c) it admits a decompression algorithm again optimal in time. To the best of our knowledge, this is the first boosting technique displaying these properties.Technically, our boosting technique builds upon three main ingredients: the Burrows--Wheeler Transform, the Suffix Tree d…

Theoretical computer scienceBurrows–Wheeler transformSuffix treeString (computer science)Data_CODINGANDINFORMATIONTHEORYBurrows-Wheeler transformSubstringArithmetic codinglaw.inventionLempel-Ziv compressorsArtificial IntelligenceHardware and ArchitectureControl and Systems Engineeringlawtext compressionempirical entropyArithmetic codingGreedy algorithmTime complexityAlgorithmSoftwareInformation SystemsMathematicsData compression

researchProduct

On parsing optimality for dictionary-based text compression—the Zip case

2013

Dictionary-based compression schemes are the most commonly used data compression schemes since they appeared in the foundational paper of Ziv and Lempel in 1977, and generally referred to as LZ77. Their work is the base of Zip, gZip, 7-Zip and many other compression software utilities. Some of these compression schemes use variants of the greedy approach to parse the text into dictionary phrases; others have left the greedy approach to improve the compression ratio. Recently, two bit-optimal parsing algorithms have been presented filling the gap between theory and best practice. We present a survey on the parsing problem for dictionary-based text compression, identifying noticeable results …

Theoretical computer scienceComputer scienceData_CODINGANDINFORMATIONTHEORYTop-down parsingcomputer.software_genreTheoretical Computer ScienceParsing optimalityCompression (functional analysis)Discrete Mathematics and CombinatoricsLossless compressionParsingLZ77 algorithmSettore INF/01 - InformaticaDeflate algorithmbusiness.industryDictionary-based text compressionComputational Theory and MathematicsData compressionDEFLATECompression ratioArtificial intelligencebusinesscomputerNatural language processingBottom-up parsingData compressionJournal of Discrete Algorithms

researchProduct

Dictionary-symbolwise flexible parsing

2012

AbstractLinear-time optimal parsing algorithms are rare in the dictionary-based branch of the data compression theory. A recent result is the Flexible Parsing algorithm of Matias and Sahinalp (1999) that works when the dictionary is prefix closed and the encoding of dictionary pointers has a constant cost. We present the Dictionary-Symbolwise Flexible Parsing algorithm that is optimal for prefix-closed dictionaries and any symbolwise compressor under some natural hypothesis. In the case of LZ78-like algorithms with variable costs and any, linear as usual, symbolwise compressor we show how to implement our parsing algorithm in linear time. In the case of LZ77-like dictionaries and any symbol…

Theoretical computer scienceComputer science[INFO.INFO-DS]Computer Science [cs]/Data Structures and Algorithms [cs.DS][INFO.INFO-DS] Computer Science [cs]/Data Structures and Algorithms [cs.DS]Data_CODINGANDINFORMATIONTHEORY0102 computer and information sciences02 engineering and technologycomputer.software_genre01 natural sciencesDirected acyclic graphTheoretical Computer ScienceConstant (computer programming)020204 information systemsEncoding (memory)Optimal parsing0202 electrical engineering electronic engineering information engineeringDiscrete Mathematics and CombinatoricsStringologySymbolwise text compressionTime complexityLossless compressionParsingSettore INF/01 - InformaticaDictionary-based compressionOptimal Parsing Lossless Data Compression DAGDirected acyclic graphPrefixComputational Theory and MathematicsText compression010201 computation theory & mathematicsAlgorithmcomputerBottom-up parsingData compressionJournal of Discrete Algorithms

researchProduct

Text Compression Using Antidictionaries

1999

International audience; We give a new text compression scheme based on Forbidden Words ("antidictionary"). We prove that our algorithms attain the entropy for balanced binary sources. They run in linear time. Moreover, one of the main advantages of this approach is that it produces very fast decompressors. A second advantage is a synchronization property that is helpful to search compressed data and allows parallel compression. Our algorithms can also be presented as "compilers" that create compressors dedicated to any previously fixed source. The techniques used in this paper are from Information Theory and Finite Automata.

Theoretical computer scienceFinite-state machineComputer science[INFO.INFO-DS]Computer Science [cs]/Data Structures and Algorithms [cs.DS]010102 general mathematicsforbidden wordData_CODINGANDINFORMATIONTHEORY0102 computer and information sciencesInformation theory01 natural sciencesfinite automatonParallel compressionpattern matching010201 computation theory & mathematicsEntropy (information theory)Pattern matching0101 mathematicsTime complexityAlgorithmdata compressioninformation theoryData compression

researchProduct

Kolmogorov superposition theorem for image compression

2012

International audience; The authors present a novel approach for image compression based on an unconventional representation of images. The proposed approach is different from most of the existing techniques in the literature because the compression is not directly performed on the image pixels, but is rather applied to an equivalent monovariate representation of the wavelet-transformed image. More precisely, the authors have considered an adaptation of Kolmogorov superposition theorem proposed by Igelnik and known as the Kolmogorov spline network (KSN), in which the image is approximated by sums and compositions of specific monovariate functions. Using this representation, the authors trad…

Theoretical computer scienceImage compressionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION02 engineering and technologySuperposition theoremE.4. CODING AND INFORMATION THEORY01 natural sciencesWavelet[ INFO.INFO-TI ] Computer Science [cs]/Image Processing0202 electrical engineering electronic engineering information engineering0101 mathematicsElectrical and Electronic EngineeringMathematicsPixel010102 general mathematicsWavelet transformcomputer.file_formatSpline (mathematics)[INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV]Signal ProcessingJPEG 2000Kolmogorov superposition theorem020201 artificial intelligence & image processingComputer Vision and Pattern RecognitionAlgorithmcomputerSoftwareData compressionImage compression

researchProduct

The Burrows-Wheeler Transform between Data Compression and Combinatorics on Words

2013

The Burrows-Wheeler Transform (BWT) is a tool of fundamental importance in Data Compression and, recently, has found many applications well beyond its original purpose. The main goal of this paper is to highlight the mathematical and combinatorial properties on which the outstanding versatility of the $BWT$ is based, i.e. its reversibility and the clustering effect on the output. Such properties have aroused curiosity and fervent interest in the scientific world both for theoretical aspects and for practical effects. In particular, in this paper we are interested both to survey the theoretical research issues which, by taking their cue from Data Compression, have been developed in the conte…

Theoretical computer scienceSettore INF/01 - InformaticaBurrows–Wheeler transformmedia_common.quotation_subjectTheoretical researchContext (language use)Data_CODINGANDINFORMATIONTHEORYBurrows Wheeler transform; Clustering effect; Combinatorial propertiesCombinatorial propertiesBurrows Wheeler transformCombinatorics on wordsClustering effectBWT balancing optimal partitioning text-compressionCuriosityArithmeticCluster analysisFocus (optics)media_commonData compressionMathematics

researchProduct

Mesh connectivity compression using convection reconstruction

2007

International audience; During a highly productive period running from 1995 to about 2002, the research in lossless compression of 3D meshes mainly consisted in a hard battle for the best bitrates. But for a few years, compression rates seem stabilized around 1.5 bit per vertex for the connectivity coding of usual meshes, and more and more work is dedicated to remeshing, lossy compression, or gigantic mesh compression, where memory and CPU optimizations are the new priority. However, the size of 3D models keeps growing, and many application fields keep requiring lossless compression. In this paper, we present a new contribution for single-rate lossless connectivity compression, which first …

Theoretical computer scienceTexture compressionLossless[ MATH.MATH-IT ] Mathematics [math]/Information Theory [math.IT]02 engineering and technologyLossy compression[INFO.INFO-CG]Computer Science [cs]/Computational Geometry [cs.CG][MATH.MATH-IT] Mathematics [math]/Information Theory [math.IT][ INFO.INFO-IT ] Computer Science [cs]/Information Theory [cs.IT]I.3.5 [Computing Methodologies]: Computer Graphics--Computational Geometry and Object Modeling0202 electrical engineering electronic engineering information engineeringPolygon meshComputingMethodologies_COMPUTERGRAPHICSMathematicsMeshConnected componentLossless compressionConnectivityDelaunay triangulationCompression[MATH.MATH-IT]Mathematics [math]/Information Theory [math.IT]020207 software engineering[INFO.INFO-CG] Computer Science [cs]/Computational Geometry [cs.CG][INFO.INFO-IT]Computer Science [cs]/Information Theory [cs.IT][ INFO.INFO-CG ] Computer Science [cs]/Computational Geometry [cs.CG]020201 artificial intelligence & image processing[INFO.INFO-IT] Computer Science [cs]/Information Theory [cs.IT]ReconstructionAlgorithmImage compressionData compressionProceedings of the 2007 ACM symposium on Solid and physical modeling

researchProduct

New Representations for Multidimensional Functions Based on Kolmogorov Superposition Theorem. Applications on Image Processing

2012

Mastering the sorting of the data in signal (nD) can lead to multiple applications like new compression, transmission, watermarking, encryption methods and even new processing methods for image. Some authors in the past decades have proposed to use these approaches for image compression, indexing, median filtering, mathematical morphology, encryption. A mathematical rigorous way for doing such a study has been introduced by Andrei Nikolaievitch Kolmogorov (1903-1987) in 1957 and recent results have provided constructive ways and practical algorithms for implementing the Kolmogorov theorem. We propose in this paper to present those algorithms and some preliminary results obtained by our team…

Theoretical computer science[ INFO.INFO-TS ] Computer Science [cs]/Signal and Image Processingbusiness.industry[INFO.INFO-TS] Computer Science [cs]/Signal and Image ProcessingSortingimage progressive transmissionImage processingimage encryption[ SPI.SIGNAL ] Engineering Sciences [physics]/Signal and Image processingMathematical morphologyEncryptionimage watermarkingimage compressionImage (mathematics)multi-variables function representation[INFO.INFO-TS]Computer Science [cs]/Signal and Image ProcessingKolmogorov superposition theoremMedian filterbusinessDigital watermarkingAlgorithm[SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processingImage compressionMathematics[SPI.SIGNAL] Engineering Sciences [physics]/Signal and Image processing

researchProduct

Learning Similarity Scores by Using a Family of Distance Functions in Multiple Feature Spaces

2017

There exist a large number of distance functions that allow one to measure similarity between feature vectors and thus can be used for ranking purposes. When multiple representations of the same object are available, distances in each representation space may be combined to produce a single similarity score. In this paper, we present a method to build such a similarity ranking out of a family of distance functions. Unlike other approaches that aim to select the best distance function for a particular context, we use several distances and combine them in a convenient way. To this end, we adopt a classical similarity learning approach and face the problem as a standard supervised machine lea…

Training setbusiness.industryFeature vectorSimilarity heuristicPattern recognition02 engineering and technologyMachine learningcomputer.software_genreSemantic similarityArtificial Intelligence020204 information systemsNormalized compression distance0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingComputer Vision and Pattern RecognitionArtificial intelligenceJaro–Winkler distancebusinesscomputerClassifier (UML)SoftwareSimilarity learningMathematicsInternational Journal of Pattern Recognition and Artificial Intelligence

researchProduct

Use of Machine Learning and Artificial Intelligence to Drive Personalized Medicine Approaches for Spine Care

2020

Personalized medicine is a new paradigm of healthcare in which interventions are based on individual patient characteristics rather than on “one-size-fits-all” guidelines. As epidemiological datasets continue to burgeon in size and complexity, powerful methods such as statistical machine learning and artificial intelligence (AI) become necessary to interpret and develop prognostic models from underlying data. Through such analysis, machine learning can be used to facilitate personalized medicine via its precise predictions. Additionally, other AI tools, such as natural language processing and computer vision, can play an instrumental part in personalizing the care provided to patients with …

Traumatic spinal cord injuryPrognosiPsychological interventionPatient characteristicsDiseaseSpinal cord injuryMachine learningcomputer.software_genreSpinal DiseaseMachine Learning03 medical and health sciences0302 clinical medicineArtificial IntelligenceHealth careFunctional StatuMedicineHumansSpine carePrecision MedicineDegenerative cervical myelopathyPrognostic modelsSpinal Cord InjuriesNatural Language ProcessingSpinal Cord Injuriebusiness.industryPrognosisPersonalized medicineFunctional Status030220 oncology & carcinogenesisSurgeryFunctional statusSpinal DiseasesNeurology (clinical)Personalized medicineArtificial intelligenceSpondylosisbusinesscomputerSpinal Cord Compression030217 neurology & neurosurgeryHuman

researchProduct