• No results found

[PDF] Top 20 On the Use of Comparable Corpora to Improve SMT performance

Has 10000 "On the Use of Comparable Corpora to Improve SMT performance" found on our website. Below are the top 20 most common "On the Use of Comparable Corpora to Improve SMT performance".

On the Use of Comparable Corpora to Improve SMT performance

On the Use of Comparable Corpora to Improve SMT performance

... who use cross-language information retrieval techniques and dynamic programming to extract sentences from an English-Japanese comparable ...and use DP to find the least-cost alignment over the doc- ... See full document

8

Learning the Optimal Use of Dependency parsing Information for Finding Translations with Comparable Corpora

Learning the Optimal Use of Dependency parsing Information for Finding Translations with Comparable Corpora

... Finding new translations of single words using com- parable corpora is a promising method, for exam- ple, to assist the creation and extension of bilin- gual dictionaries. The basic idea is to first create context ... See full document

9

Parallel and comparable corpora: What are they up to?

Parallel and comparable corpora: What are they up to?

... parallel corpora for contrastive ...parallel corpora alone, for example, McEnery & Xiao (2002) would have come to the misleading conclusion that aspect markers occurred only infrequently in ...Parallel ... See full document

13

Identifying Comparable Corpora Using LDA

Identifying Comparable Corpora Using LDA

... parallel corpora is exceedingly expensive, requiring highly skilled an- notators or professional ...parallel corpora, and extracted parallel segments can be used to, for example, augment ma- chine ... See full document

5

Set Theoretic Alignment for Comparable Corpora

Set Theoretic Alignment for Comparable Corpora

... mining comparable corpora usually requires the use of seed translation knowledge extracted from a domain that differs from the one being mined, default tables with wide lexical coverage can be built ... See full document

10

Named Entity Transliteration with Comparable Corpora

Named Entity Transliteration with Comparable Corpora

... of the English phone string. For training data we have a small list of 721 names in Roman script and their Chinese equivalent. 3 Pronunciations for En- glish words are obtained using the Festival text-to- speech system ... See full document

8

Sentence Alignment for Monolingual Comparable Corpora

Sentence Alignment for Monolingual Comparable Corpora

... A potentially fruitful way to do so is to take ad- vantage of the topical structure of texts. In a given domain and genre, while the texts relate different subjects, they all use a limited set of topics to con- ... See full document

8

Bilingual Lexicon Extraction from Comparable Corpora Enhanced with Parallel Corpora

Bilingual Lexicon Extraction from Comparable Corpora Enhanced with Parallel Corpora

... specialized comparable corpora are gener- ally constructed via the consultation of specialized Web ...2002) use CISMeF 1 for building the French part of their comparable corpora and ... See full document

8

Clustering Comparable Corpora For Bilingual Lexicon Extraction

Clustering Comparable Corpora For Bilingual Lexicon Extraction

... yields corpora of higher quality in terms of comparability scores, and (b) whether the bilingual lexicons extracted from such corpora are of higher ...Several corpora were used in our experiments: ... See full document

6

Synonym Acquisition Using Bilingual Comparable Corpora

Synonym Acquisition Using Bilingual Comparable Corpora

... additional use of bilingual (or multilingual) resources for synonym acquisition is also con- sidered in (Van der Plas and Tiedemann, 2006) and (Wu and Zhou, ... See full document

5

Bootstrapping Entity Translation on Weakly Comparable Corpora

Bootstrapping Entity Translation on Weakly Comparable Corpora

... news corpora are not parallel but comparable corpora, with asymmetry of entities and relationship as the asymmetry in the number of documents also ...leverages comparable parts from the ... See full document

10

Extracting bilingual terminologies from comparable corpora

Extracting bilingual terminologies from comparable corpora

... parallel corpora tagged with ...parallel corpora. In addition to statistical methods Daille et al. use word trans- lation information between two words within the extracted terms as a further ... See full document

10

Image Image Search for Comparable Corpora Construction

Image Image Search for Comparable Corpora Construction

... Figure 2 shows the framework of CMIR. We also provide that of CLIR for comparison. For our method, i.e., CMIR, we collect the texts which summarize the main contents in images, and map the texts to the images in a ... See full document

10

A Factory of Comparable Corpora from Wikipedia

A Factory of Comparable Corpora from Wikipedia

... in-domain evaluation we build the test and devel- opment sets in a semiautomatic way. We depart from the parallel corpora gathered in Section 4 from which sentences with more than four tokens and beginning with a ... See full document

11

Improving MT System Using Extracted Parallel Fragments of Text from Comparable Corpora

Improving MT System Using Extracted Parallel Fragments of Text from Comparable Corpora

... There has been a growing interest in approaches focused on extracting word translations from comparable corpora (Fung and McKeown, 1997; Fung and Yee, 1998; Rapp, 1999; Chiao and Zweigenbaum, 2002; Dejean ... See full document

8

Automatic Building and Using Parallel Resources for SMT from Comparable Corpora

Automatic Building and Using Parallel Resources for SMT from Comparable Corpora

... Comparable corpora have been used in many research areas in NLP, especially in machine ...the use of comparable corpora in machine ... See full document

10

Proceedings of the 4th Workshop on Building and Using Comparable Corpora: Comparable Corpora and the Web

Proceedings of the 4th Workshop on Building and Using Comparable Corpora: Comparable Corpora and the Web

... Comparable corpora are collections of documents that are comparable in content and form in various degrees and ...multilingual corpora, but also sets of monolingual corpora that are ... See full document

10

Improved Machine Translation Performance via Parallel Sentence Extraction from Comparable Corpora

Improved Machine Translation Performance via Parallel Sentence Extraction from Comparable Corpora

... [r] ... See full document

8

Comparison of SMT and NMT trained with large Patent Corpora: Japio at WAT2017

Comparison of SMT and NMT trained with large Patent Corpora: Japio at WAT2017

... The SMT tools are a phrase-based SMT toolkit licensed by NICT (Utiyama and Sumita, 2014), and Moses (Koehn et ...to improve trans- lation into ... See full document

6

Adapting Translation Models to Translationese Improves SMT

Adapting Translation Models to Translationese Improves SMT

... allel corpora; such corpora are manually translated, but the direction of translation is usually unknown, and is consequently ig- ...parallel corpora translated in the same direction as the ... See full document

11

Show all 10000 documents...