Search results for "Perplexity"

showing 5 items of 5 documents

Combining Machine Translated Sentence Chunks from Multiple MT Systems

2018

This paper presents a hybrid machine translation (HMT) system that pursues syntactic analysis to acquire phrases of source sentences, translates the phrases using multiple online machine translation (MT) system application program interfaces (APIs) and generates output by combining translated chunks to obtain the best possible translation. The aim of this study is to improve translation quality of English – Latvian texts over each of the individual MT APIs. The selection of the best translation hypothesis is done by calculating the perplexity for each hypothesis using an n-gram language model. The result is a phrase-based multi-system machine translation system that allows to improve MT out…

060201 languages & linguisticsParsingPerplexityPhraseMachine translationComputer sciencebusiness.industry06 humanities and the arts02 engineering and technologyHybrid machine translationcomputer.software_genre0602 languages and literatureChunking (psychology)0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingLanguage modelArtificial intelligencebusinesscomputerNatural language processingSentence
researchProduct

An LP-based hyperparameter optimization model for language modeling

2018

In order to find hyperparameters for a machine learning model, algorithms such as grid search or random search are used over the space of possible values of the models hyperparameters. These search algorithms opt the solution that minimizes a specific cost function. In language models, perplexity is one of the most popular cost functions. In this study, we propose a fractional nonlinear programming model that finds the optimal perplexity value. The special structure of the model allows us to approximate it by a linear programming model that can be solved using the well-known simplex algorithm. To the best of our knowledge, this is the first attempt to use optimization techniques to find per…

FOS: Computer and information sciencesMathematical optimizationPerplexityLinear programmingComputer scienceMachine Learning (stat.ML)02 engineering and technology010501 environmental sciences01 natural sciencesTheoretical Computer ScienceNonlinear programmingMachine Learning (cs.LG)Random searchSimplex algorithmSearch algorithmStatistics - Machine Learning0202 electrical engineering electronic engineering information engineeringFOS: MathematicsMathematics - Optimization and Control0105 earth and related environmental sciencesHyperparameterComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Computer Science - LearningHardware and ArchitectureOptimization and Control (math.OC)Hyperparameter optimization020201 artificial intelligence & image processingLanguage modelSoftwareInformation Systems
researchProduct

Multi-system machine translation using online APIs for English-Latvian

2015

This paper describes a hybrid machine translation (HMT) system that employs several online MT system application program interfaces (APIs) forming a MultiSystem Machine Translation (MSMT) approach. The goal is to improve the automated translation of English – Latvian texts over each of the individual MT APIs. The selection of the best hypothesis translation is done by calculating the perplexity for each hypothesis. Experiment results show a slight improvement of BLEU score and WER (word error rate).

PerplexityMachine translationComputer sciencebusiness.industryWord error rateLatvianHybrid machine translationcomputer.software_genreTranslation (geometry)language.human_languagelanguageComputer-assisted translationArtificial intelligencebusinesscomputerSelection (genetic algorithm)Natural language processingProceedings of the Fourth Workshop on Hybrid Approaches to Translation (HyTra)
researchProduct

K-Translate - Interactive Multi-system Machine Translation

2016

The tool described in this article has been designed to help machine translation (MT) researchers to combine and evaluate various MT engine outputs through a web-based graphical user interface using syntactic analysis and language modelling. The tool supports user provided translations as well as translations from popular online MT system application program interfaces (APIs). The selection of the best translation hypothesis is done by calculating the perplexity for each hypothesis. The evaluation panel provides sentence tree graphs and chunk statistics. The result is a syntax-based multi-system translation tool that shows an improvement of BLEU scores compared to the best individual baseli…

PerplexityParsingMachine translationComputer sciencebusiness.industryHybrid machine translationTransfer-based machine translationcomputer.software_genreExample-based machine translationComputer-assisted translationArtificial intelligenceComputational linguisticsbusinesscomputerNatural language processing
researchProduct

Teaching GP to program like a human software developer

2019

Program synthesis is one of the relevant applications of GP with a strong impact on new fields such as genetic improvement. In order for synthesized code to be used in real-world software, the structure of the programs created by GP must be maintainable. We can teach GP how real-world software is built by learning the relevant properties of mined human-coded software - which can be easily accessed through repository hosting services such as GitHub. So combining program synthesis and repository mining is a logical step. In this paper, we analyze if GP can write programs with properties similar to code produced by human software developers. First, we compare the structure of functions generat…

Perplexitybusiness.industryProgramming languageComputer scienceInitializationGenetic programming0102 computer and information sciences02 engineering and technologycomputer.software_genre01 natural sciencesSoftware010201 computation theory & mathematicsGrammatical evolution0202 electrical engineering electronic engineering information engineeringCode (cryptography)020201 artificial intelligence & image processingLanguage modelbusinesscomputerProgram synthesisProceedings of the Genetic and Evolutionary Computation Conference
researchProduct