• No results found

n-gram model smoothing

A Generalized Language Model as the Combination of Skipped n grams and Modified Kneser Ney Smoothing

A Generalized Language Model as the Combination of Skipped n grams and Modified Kneser Ney Smoothing

... We introduce a novel approach for build- ing language models based on a system- atic, recursive exploration of skip n-gram models which are interpolated using modi- fied Kneser-Ney smoothing. Our ...

10

Smooth Bilingual N Gram Translation

Smooth Bilingual N Gram Translation

... translation model. Reliable es- timation of unseen n-grams is very important in this translation ...data. N-gram hit rates are re- ported in the results section of this ...new smoothing ...

9

An Unsupervised Parameter Estimation Algorithm for a Generative Dependency N gram Language Model

An Unsupervised Parameter Estimation Algorithm for a Generative Dependency N gram Language Model

... bi-gram model achieves the same, or sometimes better perfor- mance of the original N-gram language ...dependency model does not pro- duce much improvement for the reasons we de- scribed ...

9

Self Organizing n gram Model for Automatic Word Spacing

Self Organizing n gram Model for Automatic Word Spacing

... -gram model is that it suffers from data sparseness however large the corpus ...many smoothing techniques have been proposed for construction of -gram models (Chen and Goodman, ...a ...

8

Chinese Spelling Check System Based on N gram Model

Chinese Spelling Check System Based on N gram Model

... yet. N-gram language modeling (LM) is widely used in CSC, since its simplicity and ...a model based on joint bi-gram and tri- gram LM and Chinese word ...employ smoothing ...

9

Non-Linear Recommender System using Stochastic Chaining and N-Gram Model

Non-Linear Recommender System using Stochastic Chaining and N-Gram Model

... last n − 1 ...previous n − 1. An estimate model is built using skipping and then linked with the normal n-gram ...scarcity. Smoothing is a normal name for samples that modify the ...

10

N gram based Machine Translation

N gram based Machine Translation

... translation model that has been derived from the finite-state perspective—more specifically, from the work of Casacuberta (2001) and Casacuberta and Vidal ...translation model is implemented by using a ...

24

Subcellular localization for Gram positive and Gram negative bacterial proteins using linear interpolation smoothing model

Subcellular localization for Gram positive and Gram negative bacterial proteins using linear interpolation smoothing model

... the Gram positive and Gram negative dataset ...and model n speci- fies the probabilistic models with dependency ...prediction model by the means of computing consensus and using the ...

17

The Operation Sequence Model—Combining N Gram Based and Phrase Based Statistical Machine Translation

The Operation Sequence Model—Combining N Gram Based and Phrase Based Statistical Machine Translation

... Our model, like the reordering models (Tillmann and Zhang 2005; Galley and Manning 2008) used in phrase-based decoders, is ...our model has richer conditioning as it considers both translation and ...

30

Modeling of term distance and term occurrence information for improving n gram language model performance

Modeling of term distance and term occurrence information for improving n gram language model performance

... conventional smoothing techniques (Chen & Goodman 1996), which rely mainly on the count- of-count statistics for re-estimating and smooth- ing the original ...

5

Subcellular localization for Gram Positive and Gram Negative Bacterial Proteins using Linear Interpolation Smoothing Model

Subcellular localization for Gram Positive and Gram Negative Bacterial Proteins using Linear Interpolation Smoothing Model

... dependency model n ¼ 0 has 20 unique probabilities per class, n ¼ 1 has 400 unique probabilities per class and n ¼ 2 has 8000 unique probabilities per ...20 n þ 1 Þ ...every ...

9

Approximating Style by N gram based Annotation

Approximating Style by N gram based Annotation

... token n-grams, measured by Fleiss’ Kappa (Fleiss, ...distinctive n-grams are more frequent in linguistics and literary stud- ies, ...the n-grams in the sample are more frequent in literary studies ...

11

Improvements to the Bayesian Topic N Gram Models

Improvements to the Bayesian Topic N Gram Models

... higher-order n-grams have not yet been sufficiently studied, which we investigate in this ...Wallach’s model to a higher-order: sparseness caused by di- viding all n-grams into exclusive topics, and ...

11

Word like character n gram embedding

Word like character n gram embedding

... character n-grams in a raw corpus are counted for selecting the K-most frequent n-grams as the n-gram vocabulary in ...determining n-gram vocabulary can also be found in Wieting ...

5

Segmentation free compositional n gram embedding

Segmentation free compositional n gram embedding

... sub-character n-grams of ...length n = 3 of the word where are extracted as <wh, whe, her, ere, re>, where “<”,“>” are special symbols added to the original word to represent its left and right ...

9

Faster and Smaller N Gram Language Models

Faster and Smaller N Gram Language Models

... language model queries issued by the Joshua de- coder (Li et ...language model queries in a cache should be effective at reducing overall language model ...given n-gram and its fully ...

10

Federated Learning of N Gram Language Models

Federated Learning of N Gram Language Models

... language model to discriminate between viable ...on n-grams and do not exceed ten megabytes. A language model (LM) is a prob- abilistic model on ...

10

Continuous N gram Representations for Authorship Attribution

Continuous N gram Representations for Authorship Attribution

... For all datasets, early stopping was used on the development sets and models trained with the Adam update rule (Kingma and Ba, 2015). Since none of the datasets have a standard develop- ment set, we randomly selected 10% ...

7

An Efficient Indexer for Large N Gram Corpora

An Efficient Indexer for Large N Gram Corpora

... other N-gram ...an N-gram file with 10,000,000 records would have 8,000 leaf nodes on level 3, 40 inter- nal nodes on level 2, and the root node on level ...given N-gram quickly, ...

6

N gram language models for massively parallel devices

N gram language models for massively parallel devices

... A GPU consists of many simple computational cores, which have neither complex caches nor branch predictors to hide latencies. Because they have far fewer circuits than CPU cores, GPU cores are much smaller, and many more ...

10

Show all 10000 documents...

Related subjects