[PDF] Top 20 A Unified Approach to Minimum Risk Training and Decoding

A Unified Approach to Minimum Risk Training and Decoding

... iterations), after which performance goes up to reach a peak (45.2 BLEU ) higher than that without the prior (44.2 BLEU ), before steadily declining. The entropic prior encourages diversity among the sample set, ... See full document

10

Minimum Risk Annealing for Training Log Linear Models

... the risk or expected error—a continuous function that can be derived by combining the likelihood with any eval- uation metric ...complex risk surface ( § ...to training log-linear combinations of ... See full document

8

Minimum Risk Training of Approximate CRF Based NLP Systems

... two training regimens perform similarly on the overall ...these training procedures try to approximately maximize conditional likelihood, whereas we will aim to mini- mize the empirical loss of the ... See full document

11

First and Second Order Expectation Semirings with Applications to Minimum Risk Training on Translation Forests

... Given a hypergraph, we are often interested in computing some quantities over it using dynamic programming algorithms. For example, we may want to run the Viterbi algorithm to find the most probable derivation tree in ... See full document

12

Consensus Training for Consensus Decoding in Machine Translation

... We propose a novel objective function for dis- criminatively tuning log-linear machine translation models. Our objective explicitly op- timizes the BLEU score of expected n-gram counts, the same quantities that arise ... See full document

10

Fluency Constraints for Minimum Bayes Risk Decoding of Statistical Machine Translation Lattices

... sensus decoding and system combination for SMT (Matusov et ...alternative approach to improving specific por- tions of translation ...their approach, our approach is able to exploit large ... See full document

9

Mixture Model based Minimum Bayes Risk Decoding using Multiple Machine Translation Systems

... Bayes Risk (MMMBR) decoding, an approach that makes use of multiple SMT systems to improve translation ac- ...MBR decoding methods defined on the basis of single SMT systems, an MMMBR decoder ... See full document

9

Efficient Path Counting Transducers for Minimum Bayes Risk Decoding of Statistical Machine Translation Lattices

... Shankar Kumar, Wolfgang Macherey, Chris Dyer, and Franz Och. 2009. Efficient minimum error rate training and minimum bayes-risk decoding for translation hypergraphs and lattices. In ... See full document

6

Minimum Imputed Risk: Unsupervised Discriminative Training for Machine Translation

... our approach is demon- strated by replacing a key supervised discriminative training step in the development of large MT systems — learning the log-linear combination of sev- eral component model scores ... See full document

10

Probabilistic Models of Nonprojective Dependency Trees

... likelihood training (as here) to the averaged perceptron and a max- imum margin model trained using exponentiated- gradient (Bartlett et ...(including minimum Bayes- risk decoding) and give ... See full document

9

Efficient Minimum Error Rate Training and Minimum Bayes Risk Decoding for Translation Hypergraphs and Lattices

... Lattice MBR decoding is obtained under a linear approximation to BLEU, where the weights are obtained using n-gram precisions derived from development data. This may not be optimal in practice for unseen test ... See full document

9

Minimum Risk Training for Neural Machine Translation

... propose minimum risk training for end-to-end neural machine ...estimation, minimum risk training is ca- pable of optimizing model parameters di- rectly with respect to arbitrary ... See full document

10

Minimum Bayes Risk Decoding for Statistical Machine Translation

... related training and search procedures for NLP that explicitly take into consideration task- specific performance ...a training procedure that incorporates various MT evalua- tion criteria in the ... See full document

8

Minimum Distance Decoding of Redundant Residue Number System Codes

... 9ùæ5êeí«ñ9÷é îïè¸ê«ñ,íWbõÆû,ù9ûGöSè füè½èéÉñiýsé9öSúí eè½è féÉþPé9fíí «ìfêÆîóô}è½ð}énè èíeôõéÉð{òïõ ù9çè½öSò ñiêÆì½ð}ê«ô}ñÉñ9îóò è½zé êfõn¥ð}° òï¯Cõ ì½êÆðÃè 6è½ì¤ë.[r] ... See full document

5

How to Speak a Language without Knowing It

... The word-based model can only decode 29 of the 65 test utterances, because wFST E fails if an ut- terance contains a new English word type, previously unseen in training. The phoneme-based models are more ... See full document

5

Differentiable Scheduled Sampling for Credit Assignment

... this approach is not a fully continuous approximation to the sampling operation, but it does result in much more informative gradients com- pared to naive scheduled sampling ... See full document

6

Lexicons and Minimum Risk Training for Neural Machine Translation: NAIST CMU at WAT2016

... 2M training sentences, training is stopped, and re-started using the previously saved model with a halved learning rate ...Once training converges for a learning rate of ... See full document

7

Minimum Core Genome Sequence Typing of Bacterial Pathogens: a Unified Approach for Clinical and Public Health Microbiology

... Phylogenetic analysis was also performed using both the neigh- bor-joining algorithm and the minimum evolution algorithm (33, 34). The clustering of the isolates was largely consistent with the Structure analysis ... See full document

11

Therapeutical approach to plasma homocysteine and cardiovascular risk reduction

... cardiovascular risk fac- tors and correction for regression diluition bias, a 25% lower usual homocysteine level was associated with about an 11% lower IHD and about a 19% lower stroke ...the risk of IHD ... See full document

6

Solving ECDLP via List Decoding

... the minimum distance of C is closely related to the solution of the ECDLP over E ...of minimum distance for the elliptic code over ...the minimum distance of a linear code is one of the fundamental ... See full document

23