• No results found

[PDF] Top 20 SINICA CORPUS : Design Methodology for Balanced Corpora

Has 10000 "SINICA CORPUS : Design Methodology for Balanced Corpora" found on our website. Below are the top 20 most common "SINICA CORPUS : Design Methodology for Balanced Corpora".

SINICA CORPUS : Design Methodology for Balanced Corpora

SINICA CORPUS : Design Methodology for Balanced Corpora

20

Survey of Conversational Behavior: Towards the Design of a Balanced Corpus of Everyday Japanese Conversation

Survey of Conversational Behavior: Towards the Design of a Balanced Corpus of Everyday Japanese Conversation

... require corpora containing various kinds of everyday ...several corpora of Japanese conversations have been developed, most of them are biased towards conversations among friends or families on the ... See full document

6

A parametric prosody coding approach for Mandarin speech using a hierarchical prosodic model

A parametric prosody coding approach for Mandarin speech using a hierarchical prosodic model

... demia Sinica Balanced Corpus of Modern Chinese (ASBC) ...speech corpus and their statistics. The Treebank speech corpus and the TCC300 were digitally recorded in forms of 20 kHz ... See full document

24

Sunnah Arabic Corpus: Design and Methodology.

Sunnah Arabic Corpus: Design and Methodology.

... Annotated corpora, as one example of LR, used to perform statistical analysis, hypothesis testing, verifying grammar within a language domain and for building statistical computational ...annotated corpora ... See full document

7

Unit 2 Representativeness, balance and sampling pdf

Unit 2 Representativeness, balance and sampling pdf

... Like corpus representativeness, balance is an important issue for corpus creators, corpus users and readers of corpus-based studies ...a corpus defines representativeness. If one wants ... See full document

8

The Role of Sketch Engine in Multiple Types of Corpora

The Role of Sketch Engine in Multiple Types of Corpora

... parallel corpora is not required by Sketch Engine ...parallel corpus as users are required to map file through sentence identifiers and finally linking this file in the configuration (Benko, ...for ... See full document

5

Constructing a Japanese Basic Named Entity Corpus of Various Genres

Constructing a Japanese Basic Named Entity Corpus of Various Genres

... this corpus for evaluating annotation perfor- mance and annotation methods on different genres of ...a corpus via non-expert annotators for named entity (NE) recognition ... See full document

6

Towards a Balanced Named Entity Corpus for Dutch

Towards a Balanced Named Entity Corpus for Dutch

... entity corpus for ...annotated corpus to be trained on. Such corpora exist for English, but not for ...reference corpus of written Dutch, with four semantic annotation layers: named entities, ... See full document

7

Challenges in Automating Maze Detection

Challenges in Automating Maze Detection

... the corpora included with the SALT software to train maze ...the corpora that the software uses to compute refer- ence ...These corpora share several charac- teristics we expect to be typical of ... See full document

9

WikiWars: A New Corpus for Research on Temporal Expressions

WikiWars: A New Corpus for Research on Temporal Expressions

... TimeBank v1.2 is a revised and improved version of TimeBank 1.1 resulting in a number of errors fixed and inconsistencies removed (see (Boguraev et al., 2007)). Unfortunely, this corpus has the same lim- itations ... See full document

10

Parallel Corpus of Croatian Italian Administrative Texts

Parallel Corpus of Croatian Italian Administrative Texts

... A bilingual dictionary is automatically generat- ed using the GIZA++ tool (Och and Ney, 2003), similarly to Aker et al. (2014). One of the major drawbacks of the tool, as the authors in Aker et al. (2014) point out, is ... See full document

8

FEATURE ANALYSIS OF PUBLIC COMPLAINT HANDLING APPLICATION USING FODA

FEATURE ANALYSIS OF PUBLIC COMPLAINT HANDLING APPLICATION USING FODA

... recommended design patterns using the proposed ...All design problem scenarios of failed cases were reviewed and it was noted that either these cases do not include descriptive words of the pattern, or are ... See full document

11

Commercialization and the decline of joint liability microcredit

Commercialization and the decline of joint liability microcredit

... Strongly balanced refers to a balanced dataset for which lending methodology data is available for the period 2008-2011, while weakly balanced includes only MFIs that report this informa[r] ... See full document

56

Identification of research hypotheses and new knowledge from scientific literature

Identification of research hypotheses and new knowledge from scientific literature

... both corpora and association types (events and ...GENIA-MK corpus (events) outper- formed those for the EU-ADR corpus ...EU-ADR corpus and only 150 abstracts from GENIA-MK indicates that event ... See full document

13

Time Expressions in Mental Health Records for Symptom Onset Extraction

Time Expressions in Mental Health Records for Symptom Onset Extraction

... As regards the IAA, we obtained a value of 60%/78% (strict/partial) for the average of precision and recall, and a value of 60%/77% (strict/partial) for the F1 score. Although these re- sults are lower in comparison to ... See full document

10

Chinese Grammatical Error Diagnosis System Based on Hybrid Model

Chinese Grammatical Error Diagnosis System Based on Hybrid Model

... The goal of this shared task, i.e. Chinese Grammatical Error Diagnosis (CGED) task for CFL is developing the computer-assisted tools to diagnose several kinds of grammatical errors, i.e., redundant word, missing word, ... See full document

9

Adapting Multilingual Parsing Models to Sinica Treebank

Adapting Multilingual Parsing Models to Sinica Treebank

... However, after adapting those parsers to Tra- ditional Chinese, we still find that probabilistic parsing was not efficient enough to provide accu- rate parsing result for Sinica Treebank compared to the work done ... See full document

5

USING CORPORA TO AID QUALITATIVE TEXT ANALYSIS

USING CORPORA TO AID QUALITATIVE TEXT ANALYSIS

... I would like to illustrate the use of this tool with an analysis of Crystal Jeans’s (2016) novel Vegetarian Tigers of Paradise. The author sets out to describe her childhood, her adolescent years and adulthood in ... See full document

11

When Embodiment Meets Generative Lexicon: The Human Body Part Metaphors in Sinica Corpus

When Embodiment Meets Generative Lexicon: The Human Body Part Metaphors in Sinica Corpus

... a balanced corpus, we have demonstrated that the visibility and the telic role of the body part are two major reasons constraining the selection of body parts for metaphorical ... See full document

8

The Sinica Sense Management System: Design and Implementation

The Sinica Sense Management System: Design and Implementation

... A sense-based lexical knowledgebase is a core foundation for language engineering. Two important criteria must be satisfied when constructing a knowledgebase: linguistic felicity and data cohesion. In this paper, we ... See full document

14

Show all 10000 documents...