[PDF] Top 20 Extracting Parallel Fragments from Comparable Corpora for Data to text Generation
Has 10000 "Extracting Parallel Fragments from Comparable Corpora for Data to text Generation" found on our website. Below are the top 20 most common "Extracting Parallel Fragments from Comparable Corpora for Data to text Generation".
Extracting Parallel Fragments from Comparable Corpora for Data to text Generation
... a data field and word cooccur if they are present in the same pair of data fields and sentence (as identified by the method described in Section ...each data field, and the number of times it occurs ... See full document
5
Improving MT System Using Extracted Parallel Fragments of Text from Comparable Corpora
... Recently comparable corpora have got great at- tention in the field of ...NLP. Extracting parallel fragments of texts, paraphrases or sentences from comparable ... See full document
8
A Generative Model for Extracting Parallel Fragments from Comparable Documents
... butional profiles, the distributional profile of their corresponding translation should also be correlated in a comparable corpus. The optimiza- tion phase of the model that is based on gradient descent is very ... See full document
9
Extracting Parallel Phrases from Comparable Data
... two comparable documents have few or no parallel sentence pairs, there could still be paral- lel sub-sentential fragments, including word transla- tion pairs, named entities, and long phrase ... See full document
8
Identifying Parallel Documents from a Large Bilingual Collection of Texts: Application to Parallel Article Extraction in Wikipedia
... of parallel or closely parallel document ...of extracting parallel ...that extracting parallel sentences from a parallel corpus is something we do well, while ... See full document
9
Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies
... corpus data, and they often favor the use of parallel corpora for ob- vious practical ...The parallel corpora used in this context are consequently bilingual (sometimes trilingual), and ... See full document
6
Extracting Parallel Sub Sentential Fragments from Non Parallel Corpora
... good-quality parallel sentence pairs can be automatically extracted from comparable corpora, and used to improve the per- formance of machine translation (MT) ...large parallel ... See full document
8
Automatic Building and Using Parallel Resources for SMT from Comparable Corpora
... Building parallel resources for corpus based machine translation, especially Statistical Machine Translation (SMT), from comparable corpora has recently received wide attention in the field ... See full document
10
Phrase based Parallel Fragments Extraction from Comparable Corpora
... Parallel corpora are valuable resources for train- ing a statistical translation ...lel corpora. However, the parallel corpora only exist in particular domains for a few number of ... See full document
5
Multimodal Comparable Corpora as Resources for Extracting Parallel Data: Parallel Phrases Extraction
... lel corpora called bitexts for training the transla- tion model and monolingual data to build the tar- get language ...Unfortunately, parallel texts are a limited resource and they are often not ... See full document
7
Learning Sentential Paraphrases from Bilingual Parallel Corpora for Text to Text Generation
... extracted from bilingual parallel ...for extracting more sophisticated sen- tential paraphrases, which are more obviously learnable from monolingual parallel ...many text gener- ... See full document
12
Extracting Bilingual Lexica from Comparable Corpora Using Self Organizing Maps
... We extracted nouns from these corpora for our test sets as well as input data. We considered only nouns to reduce the sizes of the dimensions of ei- ther synonym vectors or SOMs. Thus, we finally ... See full document
6
PEXACC: A Parallel Sentence Mining Algorithm from Comparable Corpora
... collected from the ...for parallel sentences in ...in parallel texts (paragraphs and sentences) which constitute a natural pruning technique when searching for parallel sentences in ...a ... See full document
8
Parallel and comparable corpora: What are they up to?
... English-Chinese parallel corpus of healthcare, found that the ratio of overt/covert marking of aspectual meanings was exceptionally low in Chinese ...the data in this case. To test this hypothesis, they ... See full document
13
Multext East: Parallel and Comparable Corpora and Lexicons for Six Central and Eastern European Languages
... Multext East Parallel and Comparable Corpora and Lexicons for Six Central and Eastern European Languages Multext East Parallel and Comparable Corpora and Lexicons for Six Central and Eastern European[.] ... See full document
5
Self Supervised Neural Machine Translation
... Data. We use Wikipedia (WP) dumps 1 in En- glish (en) and French (f r), and pre-process the articles and split the text into sentences using the Wikitailor toolkit 2 (Barr´on-Cede˜no et al., 2015). We ... See full document
7
Extracting Transfer Rules for Multiword Expressions from Parallel Corpora
... For each of these phrase table entries, we looked up the lexemes on either side in the Jacy/ERG lexi- cons, and represented them with the semantic predi- cate (and their syntactic category). 5 Ambiguous lex- emes were ... See full document
9
Chinese–Japanese Parallel Sentence Extraction from Quasi–Comparable Corpora
... on extracting parallel sentences from noisy parallel or compara- ble ...Chinese–Japanese parallel sentences from quasi–comparable corpora, which are available in ... See full document
9
Parallel Sentence Extraction from Comparable Corpora with Neural Network Features
... Ref: Therefore, it is considered that until now in the field of education there is no time to teach from this aspect and the development of teaching materials is also being delayed. We can see that the word ... See full document
5
Improving Machine Translation Performance by Exploiting Non Parallel Corpora
... instances from a differ- ent parallel corpus (also around 5,000 sentence pairs) and checking how many of these instances are correctly ...test parallel corpus and applying the word-overlap filter (we ... See full document
28
Related subjects