• No results found

corpus data

KorAP Architecture ― Diving in the Deep Sea of Corpus Data

KorAP Architecture ― Diving in the Deep Sea of Corpus Data

... eral corpus query ...the corpus analysis platform that KorAP is designed to ...protecting corpus data against the requirements of a large annotated text corpus such as D E R E K O ...

6

Learning the Countability of English Nouns from Corpus Data

Learning the Countability of English Nouns from Corpus Data

... relative corpus occurrence of those features for each ...from corpus data, we need the basic phrase structure, and partic- ularly noun phrase structure, of the source ...tagged data, chunked ...

8

Taiwan Child Language Corpus: Data Collection and Annotation

Taiwan Child Language Corpus: Data Collection and Annotation

... In order to speed up the building of the corpus, a word auto-segmentation program is necessary. Yet, when the program is segmenting words from the text, it can also deal with some related problems at the same ...

8

Predicting Strong Associations on the Basis of Corpus Data

Predicting Strong Associations on the Basis of Corpus Data

... The median ranks of the strong associations for all models are plotted in Figure 1. The means show the same pattern, but give a less clear indication of the number of associations that were suggested in the top n most ...

9

The Metalogue Debate Trainee Corpus: Data Collection and Annotations

The Metalogue Debate Trainee Corpus: Data Collection and Annotations

... tracking data from motion and speech capturing devices and semantic annotations - dialogue acts - as defined in ISO 24617-2 and discourse relations as defined in ISO ...The corpus comes with a manual ...

7

Linked Open Data and Web Corpus Data for noun compound bracketing

Linked Open Data and Web Corpus Data for noun compound bracketing

... open data resource brings a completely new dimension, as we now work with enti- ties and entity names instead of surface strings as for the frequency-based ...

8

Cross Corpus Data Augmentation for Acoustic Addressee Detection

Cross Corpus Data Augmentation for Acoustic Addressee Detection

... ing a t-test with a significance level of 0.05. First, we analyse the sensitivity of our neural networks to acoustic context length variations. This hy- perparameter was shown to be critical for par- alinguistic problems ...

10

Assessing the Potential of Metaphoricity of verbs using corpus data

Assessing the Potential of Metaphoricity of verbs using corpus data

... In this wok we introduce a method to define the POM of a verb based on its distributional behaviour. We follow (Hanks, 2006) and conjecture that verbs that occur with high frequency in many contexts (e.g. ’take a ...

5

Equivalent Malay-Arabic Data Corpus Collection

Equivalent Malay-Arabic Data Corpus Collection

... collect data corpus with high impact titles from the press and must be able to enlarge the scope of study as stated by Maia ...2014. Data search is by using Webcorp engine ...

9

‘I Agree With You’ – A Corpus-based Study Of Agreement

‘I Agree With You’ – A Corpus-based Study Of Agreement

... examined corpus data to critically evaluate existing language textbooks and to inform their production �Tribble & �ones, textbooks and to inform their production �Tribble & �ones, and to inform ...

27

PACE Corpus: a multilingual corpus of Polarity-annotated textual data from the domains Automotive and CEllphone

PACE Corpus: a multilingual corpus of Polarity-annotated textual data from the domains Automotive and CEllphone

... evaluation corpus for phrase-level Sentiment Analysis that can be used to evaluate real world applications in an industrial ...This corpus contains data from English and German Internet forums (1000 ...

6

Enriching a Lexicon of Discourse Connectives with Corpus based Data

Enriching a Lexicon of Discourse Connectives with Corpus based Data

... real corpus data, thus allowing us to both extend the resource and validate the ...a corpus of news annotated with ex- plicit and implicit discourse contrast relations for Italian ac- cording to the ...

6

Beyond Generic Summarization: A Multi faceted Hierarchical Summarization Corpus of Large Heterogeneous Data

Beyond Generic Summarization: A Multi faceted Hierarchical Summarization Corpus of Large Heterogeneous Data

... the corpus from 4,820 documents with 628,026 sen- tences to 3,984 documents with 171,976 ...our corpus, we have selected ten of those broad topic ...’06 data, we sample three large (> 125,000 ...

8

langid py: An Off the shelf Language Identification Tool

langid py: An Off the shelf Language Identification Tool

... tification in a traditional text categorization setting, where we have in-domain training data. The task be- comes much harder when trying to perform domain adaptation, that is, trying to use model parameters ...

6

Mainstreaming August Strindberg with Text Normalization

Mainstreaming August Strindberg with Text Normalization

... ern Swedish. This means that they are a bit more modern than the data Pettersson et al. (2012) used when they began to research the different approaches to text normalization. There is even some shifts in spelling ...

5

Quran question and answer corpus for data mining with WEKA

Quran question and answer corpus for data mining with WEKA

... solve data mining problems. WEKA has become one of the most widely used data mining systems while it offers many powerful features ...different data mining tasks such as data preprocessing and ...

7

Transcriptome signatures in Helicobacter pylori-infected mucosa identifies acidic mammalian chitinase loss as a corpus atrophy marker

Transcriptome signatures in Helicobacter pylori-infected mucosa identifies acidic mammalian chitinase loss as a corpus atrophy marker

... this data, we observed that the ...the corpus was associated with ...the corpus mucosa of both Hp + and Atr groups, revealed an equally high association to genes up-regulated in both SPEM and IM (p = ...

14

Multi Site Data Collection for a Spoken Language Corpus

Multi Site Data Collection for a Spoken Language Corpus

... Multi Site Data Collection for a Spoken Language Corpus M u l t i S i t e D a t a C o l l e c t i o n for a S p o k e n L a n g u a g e C o r p u s M A D C O W * Contact Lynette Hirschman NE43 643 Spo[.] ...

8

The Productivity of Urdu Affixes in Newspapers: A Corpus Driven Research

The Productivity of Urdu Affixes in Newspapers: A Corpus Driven Research

... from the emigration, half a century ago, of the Muhajirs, the Muslims who left what is now India to settle in Pakistan after Partition. Due to this migration, different dialects are produced and they influence on other ...

18

Proceedings of the 10th Web as Corpus Workshop

Proceedings of the 10th Web as Corpus Workshop

... Steffen Remus, Gerold Hintz, Chris Biemann, Christian M. Meyer, Darina Benikova, Judith Eckle- Kohler, Margot Mieskes and Thomas Arnold . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ...

10

Show all 10000 documents...

Related subjects