• No results found

Corpus Creation

Towards Continuous Dialogue Corpus Creation: writing to corpus and generating from it

Towards Continuous Dialogue Corpus Creation: writing to corpus and generating from it

... The corpus development is performed following the ISO linguistic annotation framework and primary data encoding ...Dialogue Corpus Creation (D3C) methodology is proposed, where a corpus is ...

8

Corpus Creation and Emotion Prediction for Hindi English Code Mixed Social Media Text

Corpus Creation and Emotion Prediction for Hindi English Code Mixed Social Media Text

... comments. They annotated a corpus and achieved an accuracy of 95.76% using statistical models with monolingual dictionaries. (Raghavi et al., 2015) developed a Question Classification sys- tem for Hindi-English ...

8

Issues in Corpus Creation and Distribution: The Evolution of the Linguistic Data Consortium

Issues in Corpus Creation and Distribution: The Evolution of the Linguistic Data Consortium

... large-scale corpus creation, these groups typically lack the necessary distribution infrastructure; resources created at considerable cost in those environments are seldom shared outside the immediate ...

8

Corpus Creation and Initial SMT Experiments between Spanish and Shipibo-konibo

Corpus Creation and Initial SMT Experiments between Spanish and Shipibo-konibo

... In this context, it may seem that a SMT is not an appropriate approach for minority languages or with scarce digital resources. However, there have been different studies trying to take advan- tage of any small ...

7

Annotation Tool Development for Large-Scale Corpus Creation Projects at the Linguistic Data Consortium

Annotation Tool Development for Large-Scale Corpus Creation Projects at the Linguistic Data Consortium

... data creation efforts for these projects, creating tools and technical infrastructures for all aspects of data creation projects: data scouting, data collection, data selection, annotation, search, data ...

5

Enabling Code Mixed Translation: Parallel Corpus Creation and MT Augmentation Approach

Enabling Code Mixed Translation: Parallel Corpus Creation and MT Augmentation Approach

... parallel corpus of code-mixed English-Hindi and ...parallel corpus, and 4 human trans- lators, fluent in both English and Hindi, translated the 6,096 code-mixed English-Hindi sentences into ...parallel ...

10

Corpus Creation for New Genres: A Crowdsourced Approach to PP Attachment

Corpus Creation for New Genres: A Crowdsourced Approach to PP Attachment

... We previously evaluated crowdsourced PP attach- ment annotation by using MTurk workers to repro- duce PP attachments from the Wall Street Journal corpus (Rosenthal et al., 2010). The results demon- strated that ...

8

An Empirical Assessment of Contemporary Online Media in Ad Hoc Corpus Creation for Social Events

An Empirical Assessment of Contemporary Online Media in Ad Hoc Corpus Creation for Social Events

... external corpus to extract and quantify such se- mantic ...for creation of the ex- ternal ...which corpus is deemed to be best for the ...ad-hoc corpus, namely contempo- rary online ...

8

Corpus Creation and Analysis for Named Entity Recognition in Telugu English Code Mixed Social Media Data

Corpus Creation and Analysis for Named Entity Recognition in Telugu English Code Mixed Social Media Data

... this corpus, Singh et ...a corpus for NER in Hindi-English Code-Mixed along with experi- ments on their machine learning ...the corpus we created is the first Telugu-English code-mixed corpus ...

7

The Uppsala Corpus of Student Writings: Corpus Creation, Annotation, and Analysis

The Uppsala Corpus of Student Writings: Corpus Creation, Annotation, and Analysis

... Furthermore, we presented preliminary results from a quan- titative study on a pilot data set of student writings to illus- trate the potential of our corpus data. While the data set used is small and does not ...

8

ANC2Go: A Web Application for Customized Corpus Creation

ANC2Go: A Web Application for Customized Corpus Creation

... possible corpus options: the OANC, MASC, and the corpus of WordNet sense annota- tions, each of which has its particular set of ...the corpus is identified, the user can choose to pro- cess the ...

6

Creation and Analysis of a New Bangla Text Corpus BDNC01

Creation and Analysis of a New Bangla Text Corpus BDNC01

... computerized corpus of transcribed spoken language was constructed in 1971 by the Montreal French Project, containing one million words ...of corpus linguistics, corpora of other languages, either ...

9

From archive to corpus: transcription and annotation in the creation of signed language corpora

From archive to corpus: transcription and annotation in the creation of signed language corpora

... the creation of signed language corpora as corpora in the modern sense involves more than recording, digitising, editing, cataloguing and archiving video ...the creation of reference corpora for signed ...

14

The SALSA Corpus: a German Corpus Resource for Lexical Semantics

The SALSA Corpus: a German Corpus Resource for Lexical Semantics

... In standard annotation cases, there is a strong parallelism between syntactic and semantic structure: a single-word predicate lexically introduces a frame, whose frame ele- ments are syntactic arguments (i.e. ...

6

The COPLE2 corpus: a learner corpus for Portuguese

The COPLE2 corpus: a learner corpus for Portuguese

... COPLE2 corpus, a learner corpus of Portuguese that includes written and spoken texts produced by learners of Portuguese as a second or foreign ...The corpus includes at the moment a total of 182,474 ...

8

CORPUS

CORPUS

... Starting from the opposition introduced by Adrienne Rich, I intend to analyze and spotlight this perspective in plays written by women in the twentieth century, by subdividing my literary corpus according to it. ...

156

Job creation, firm creation, and de novo entry

Job creation, firm creation, and de novo entry

... Firm turnover and growth recorded in administrative data sets differ from underlying firm dynamics. By tracing the employment history of the workforce of new and disappearing administrative firm identifiers, we can ...

53

Nexing Corpus: a corpus of verbal protocols on syllogistic reasoning

Nexing Corpus: a corpus of verbal protocols on syllogistic reasoning

... Nexing Corpus and report on the tools implemented and the tasks undertaken for its ...Nexing Corpus includes (i) a collection of written transcriptions of verbal data elicited during a psycholinguistic ...

7

Corpus-Induced Corpus Clean-up

Corpus-Induced Corpus Clean-up

... Discussion of non-words in English and Dutch In their essence, the two tables of statistics on non-words in two non-trivial corpora – one English, the other Dutch – have a very similar story to tell. In both corpora ...

6

What kind of corpus is a web corpus?

What kind of corpus is a web corpus?

... The word dass 'loo' is very colloquial and it is not among the 6000 most frequent words in the written corpus. A closer look at the hits in NoWaC reveals that some of the occurrences refer to the surname of the ...

8

Show all 5135 documents...

Related subjects