[PDF] Top 20 NoWaC: a large web based corpus for Norwegian
NoWaC: a large web based corpus for Norwegian
... writing noWaC is in the process of being ... handle large text ... a large list of tagged lemmas to be used with ... the corpus will be available in the next few weeks (in any case, before the ...
Test-retest reproducibility of a food frequency questionnaire (FFQ) and estimated effects on disease risk in the Norwegian Women and Cancer Study (NOWAC)
... the NOWAC web-site [8]. NOWAC includes the Norwegian sub-cohort in the European Prospective Investigation into Cancer and Nutrition ... for NOWAC and the Norwegian part of the ...
High coffee consumption and different brewing methods in relation to postmenopausal endometrial cancer risk in the Norwegian Women and Cancer Study: a population-based prospective study
... the NOWAC Study [25] and other studies [31,32] have shown a satisfactory reproducibility and validity of information on coffee consumption, various biases may still arise when self-reported FFQs are ... the ...
Each pregnancy linearly changes immune gene expression in the blood of healthy women compared with breast cancer patients
... a large body of evidence demonstrating long-lasting protective effect of each full-term pregnancy (FTP) on the development of breast cancer (BC) later in life, a phenomenon that could be related to both hormonal ...
What kind of corpus is a web corpus?
... the Norwegian NoWaC corpus. We have compared this web corpus with one corpus of spoken language and one of written ... the web corpus sides with the written ...
A Web-based Advanced and User Friendly System: The Oslo Corpus of Tagged Norwegian Texts
... For most linguists, this kind of search is not straightforward. But this example is still fairly simple: With a little bit of training, it’s possible for anybody to use regular expressions. However, once the user needs ...
Large-Scale Noun Compound Interpretation Using Bootstrapping and the Web as a Corpus
... a large number of semantically interpreted noun compounds from a small number of ... filtered based on their semantic similarity with the original ... a large number of noun compounds without ...
The Web as a Parallel Corpus
... the Web in order to extract the parallel text it ... the Web for bilingual text (STRAND) (Resnik 1998, 1999), incorporating new work on content-based detection of translations (Smith 2001, 2002), ...
A Semantic Feature for Relation Recognition Using a Web based Corpus
... RDC corpus, five relation types, AT, NEAR, PART, ROLE, and SOC, are defined; each relation type has extended ... RDC corpus for ACE 2003. Based on Table 2, we find that the distribution of the ...
CSNIPER Annotation by query for Non canonical Constructions in Large Corpora
... a web-based multi-user annotation scenario in which linguists formulate and refine queries that identify a given linguistic construction in a corpus and assess the query results to distinguish ...
Annotated Web as corpus
... on corpus annotation has utilised either manual coding or automated software tagging systems, or else a semi-automatic combination of the two approaches ... of web-based or email services (CLAWS 4, ...
Toward a Web based Speech Corpus for Algerian Dialectal Arabic Varieties
... very few attempts have considered Algerian Arabic dialect. Which, make us affirm that the Algerian dialect and its varieties are considered as under-resourced language. In this paper, we tend to fill this gap by ...
NoReC: The Norwegian Review Corpus
... 2.2. Converting content to canonical HTML The raw data dumps from the sources are mostly in HTML format, but may also be e.g. JSON objects, and have different conventions for document structuring and use of mark-up. ...
Automatic Evaluation of Relation Extraction Systems on Large scale
... works based on realistic-sized ... for large-scale evaluation of relation extraction systems based on an automatic annotator that uses a public online database and a large web ...
The development of a web corpus of Hindi language and corpus based comparative studies to Japanese
... a large corpus such as COSH make to linguistic studies? With this corpus, we can observe instances of actual use, based on the word itself or combinations of other POS and the ... vaalaa. ...
Inforex – a web-based tool for text corpus management and semantic annotation
... Anaphora is a kind of relation that connect two elements. In general, anaphora could be annotated using general mechanism for relations. However, the number of operations required to create an anaphora relation is ...
A Probabilistic Approach to Study Features in Opinion Mining using Fuzzy Selection
... The proposed model is a novel approach to the identification of such features from unstructured textual reviews. Supervised learning model may be tuned to work well in a given domain, but the model must be analyzed if it ...
Large Corpus based Semantic Feature Extraction for Pronoun Coreference
... In order to remove noise, we only keep contextual compatibility patterns that appear more than 5 times; and only keep role pair patterns which appear more than 15 times, and appear in more than three different years to ...
RUNDKAST: an Annotated Norwegian Broadcast News Speech Corpus
... the Norwegian broadcast news speech corpus RUNDKAST. The corpus contains recordings of approximately 77 hours of broadcast news shows from the Norwegian broadcasting company ... The ...
Crowdsourcing Language Generation Templates for Dialogue Systems
... dialogue corpus contains phrases the system has generated, and crowd-workers construct alternates for these phrases, which can be plugged back into the system as crowd ...