[PDF] Top 20 A Unified Approach to Minimum Risk Training and Decoding

Our website has 10000 documents matching "A Unified Approach to Minimum Risk Training and Decoding". Below are the top 20 results.

A Unified Approach to Minimum Risk Training and Decoding

... iterations), after which performance goes up to reach a peak (45.2 BLEU) higher than that without the prior (44.2 BLEU), before steadily declining. The entropic prior encourages diversity among the sample set, ...

10

Minimum Risk Annealing for Training Log Linear Models

... the risk or expected error, a continuous function that can be derived by combining the likelihood with any evaluation metric ... complex risk surface (§ ... to training log-linear combinations of ...

8

Minimum Risk Training of Approximate CRF Based NLP Systems

... two training regimens perform similarly on the overall ... these training procedures try to approximately maximize conditional likelihood, whereas we will aim to minimize the empirical loss of the ...

11

First and Second Order Expectation Semirings with Applications to Minimum Risk Training on Translation Forests

... Given a hypergraph, we are often interested in computing some quantities over it using dynamic programming algorithms. For example, we may want to run the Viterbi algorithm to find the most probable derivation tree in ...

12

Consensus Training for Consensus Decoding in Machine Translation

... We propose a novel objective function for discriminatively tuning log-linear machine translation models. Our objective explicitly optimizes the BLEU score of expected n-gram counts, the same quantities that arise ...

10

Fluency Constraints for Minimum Bayes Risk Decoding of Statistical Machine Translation Lattices

... sensus decoding and system combination for SMT (Matusov et ... alternative approach to improving specific portions of translation ... their approach, our approach is able to exploit large ...

9

Mixture Model based Minimum Bayes Risk Decoding using Multiple Machine Translation Systems

... Bayes Risk (MMMBR) decoding, an approach that makes use of multiple SMT systems to improve translation ac- ... MBR decoding methods defined on the basis of single SMT systems, an MMMBR decoder ...

9

Efficient Path Counting Transducers for Minimum Bayes Risk Decoding of Statistical Machine Translation Lattices

... Shankar Kumar, Wolfgang Macherey, Chris Dyer, and Franz Och. 2009. Efficient minimum error rate training and minimum bayes-risk decoding for translation hypergraphs and lattices. In ...

6

Minimum Imputed Risk: Unsupervised Discriminative Training for Machine Translation

... our approach is demonstrated by replacing a key supervised discriminative training step in the development of large MT systems, learning the log-linear combination of several component model scores ...

10

Probabilistic Models of Nonprojective Dependency Trees

... likelihood training (as here) to the averaged perceptron and a maximum margin model trained using exponentiated-gradient (Bartlett et ... (including minimum Bayes-risk decoding) and give ...

9

Efficient Minimum Error Rate Training and Minimum Bayes Risk Decoding for Translation Hypergraphs and Lattices

... Lattice MBR decoding is obtained under a linear approximation to BLEU, where the weights are obtained using n-gram precisions derived from development data. This may not be optimal in practice for unseen test ...

9

Minimum Risk Training for Neural Machine Translation

... propose minimum risk training for end-to-end neural machine ... estimation, minimum risk training is capable of optimizing model parameters directly with respect to arbitrary ...

10

Minimum Bayes Risk Decoding for Statistical Machine Translation

... related training and search procedures for NLP that explicitly take into consideration task-specific performance ... a training procedure that incorporates various MT evaluation criteria in the ...

8

Minimum Distance Decoding of Redundant Residue Number System Codes

5

How to Speak a Language without Knowing It

... The word-based model can only decode 29 of the 65 test utterances, because wFST E fails if an utterance contains a new English word type, previously unseen in training. The phoneme-based models are more ...

5

Differentiable Scheduled Sampling for Credit Assignment

... this approach is not a fully continuous approximation to the sampling operation, but it does result in much more informative gradients compared to naive scheduled sampling ...

6

Lexicons and Minimum Risk Training for Neural Machine Translation: NAIST CMU at WAT2016

... 2M training sentences, training is stopped, and re-started using the previously saved model with a halved learning rate ... Once training converges for a learning rate of ...

7

Minimum Core Genome Sequence Typing of Bacterial Pathogens: a Unified Approach for Clinical and Public Health Microbiology

... Phylogenetic analysis was also performed using both the neighbor-joining algorithm and the minimum evolution algorithm (33, 34). The clustering of the isolates was largely consistent with the Structure analysis ...

11

Therapeutical approach to plasma homocysteine and cardiovascular risk reduction

... cardiovascular risk factors and correction for regression dilution bias, a 25% lower usual homocysteine level was associated with about an 11% lower IHD and about a 19% lower stroke ... the risk of IHD ...

6

Solving ECDLP via List Decoding

... the minimum distance of C is closely related to the solution of the ECDLP over E ... of minimum distance for the elliptic code over ... the minimum distance of a linear code is one of the fundamental ...

23