[PDF] Top 20 Vocabulary-Based Language Similarity using Web Corpora
Has 10000 "Vocabulary-Based Language Similarity using Web Corpora" found on our website. Below are the top 20 most common "Vocabulary-Based Language Similarity using Web Corpora".
Vocabulary-Based Language Similarity using Web Corpora
... Rank-based comparison leads to a correct solution containing only North Germanic languages. The use of the Dice coefficient leads to a cluster erroneously including English and an English-based Pidgin ... See full document
6
The Challenges and Joys of Analysing Ongoing Language Change in Web based Corpora: a Case Study
... In times of heated discussion about data protec- tion, it is easily understood that members of a web- site or forum wish to remain anonymous. On the website used for the present investigation, mem- bers can ... See full document
8
A Web-based Text Corpora Development System
... The core task in part-of-speech tagging (or disambigua- tion) is choosing the most likely tag for each word in a con- text, given a set of possible tags (Armstrong et al., 1996). In our system, part-of-speech tagging is ... See full document
6
The Language Application Grid Web Service Exchange Vocabulary
... Perhaps more problematically, sources that do specify relations among concepts, such as the various UIMA type systems and GATE’s schemas, vary widely in their choices of what is an object and what is a feature; for ... See full document
10
An Integrated Approach to Measuring Semantic Similarity between Words Using Information Available on the Web
... semantic similarity between words is vital for various applications in natural language processing, such as language modeling, information retrieval, and document ...the Web to measure ... See full document
8
Similarity Based Alignment of Monolingual Corpora for Text Simplification Purposes
... our web page. The web page presented the RS randomly to annotators followed by the (randomly ordered) aligned ...ES. Similarity judgements were made by rating the pair on a scale 0-4, corresponding ... See full document
10
Readability of written medicine information materials in Arabic language: expert and consumer evaluation
... difficult based on the vocabulary, structure, and overall appearance ...the vocabulary difficulty, as well as the sentence ...Arabic language, which is one of the most spoken language ... See full document
7
Web Clustering Based On Tag Set Similarity
... novel web clustering method to compute the similarities between tag sets based on tag similarity ...tag vocabulary and compute a tag matrix with a simplified set-base vector space ...a ... See full document
8
Iranian language learners' attitude, motivasion and performance toward learning vocabulary using computer assisted language learning
... when using computers in particular online English Websites, in learning the English language ...and web-based resources offered teachers and learners vast resources and opportunities for ... See full document
30
Paraphrasing Predicates from Written Language to Spoken Language Using the Web
... written language and spoken lan- ...ken language into suitable ...distinguished based on the occurrence probability in written and spoken language corpora which are automatically col- ... See full document
8
Investigating the distribution of some (but not all ) implicatures using corpora and web-based methods
... Unfortunately, the data to support the categorization of scalar implica- tures as GCIs — or indeed, the categorization of any sort of implicature as a GCI or PCI — have thus far consisted entirely of linguists’ ... See full document
55
Proceedings of the Second Workshop on Hybrid Approaches to Translation
... The aim of the HyTra workshop series is to bring together and share ideas among MT researchers who combine data-driven statistical approaches with linguistic knowledge models. We open the floor for researchers and groups ... See full document
14
Proceedings of the 4th Workshop on Building and Using Comparable Corpora: Comparable Corpora and the Web
... Comparable corpora are collections of documents that are comparable in content and form in various degrees and ...multilingual corpora, but also sets of monolingual corpora that are used for ... See full document
10
The TALP–UPC Spanish–English WMT Biomedical Task: Bilingual Embeddings and Char based Neural Language Model Rescoring in a Phrase based System
... The first thing to notice is that the best trans- lation is obtained when only in-domain data are used to build the translation model. This is true in both directions. When going from Spanish into English, we obtain 0.45 ... See full document
6
The Acquisition of Formulaic Sequences in High-Intermediate ESL Learners
... that language learners overuse some FSs but do not employ a wide range of FSs as native speakers ...non-‐native language lacks formulaicity for failing to take account of frequency information and ... See full document
192
Exploring the feasibility of a classroom-based vocabulary intervention for mainstream secondary school students with language disorder
... have language disorder either as their primary area of need, or in association with another condition such as a learning or medical need (Norbury, Gooch, et ...term language disorder as a generic term to ... See full document
28
Building a Korean Web Corpus for Analyzing Learner Language
... two corpora pub- licly available right now, the Penn Korean Treebank (Han et ...Korean Language, 2007), with tens of millions of ...every language has such resources, and we want to work towards a ... See full document
9
Scaling Distributional Similarity to Large Corpora
... In Figure 1, the filled nodes demonstrate a search for the near-neighbours of some node q, us- ing k = 2. Our search begins with the root node A. As we are using k = 2, we must find the two nearest children of A ... See full document
8
Studies on Research and Development in Web Mining
... 3.2.2 Web Query System Natural language processing ,web structural information and standard database query language as SQL are used by different Web based query systems and languages for[r] ... See full document
5
Effects of spacing techniques on EFL learners’ recognition and production of lexical collocations
... For each collocation, one PowerPoint slide was designed with the same font and background color. On each slide, the first language (Persian) equivalent was presented on the left side of the screen. After two ... See full document
9
Related subjects