• No results found

[PDF] Top 20 Character encoding in corpus construction

Has 10000 "Character encoding in corpus construction" found on our website. Below are the top 20 most common "Character encoding in corpus construction".

Character encoding in corpus construction

Character encoding in corpus construction

... in character encoding is the Baudot code, invented by Frenchman Jean-Maurice-Émile Baudot (1845-1903) for teleprinters in ...5-bit character code that uses a succession of “on” and “off” codes as ... See full document

11

Construction and Analysis of a Large Vietnamese Text Corpus

Construction and Analysis of a Large Vietnamese Text Corpus

... a corpus consisting of news- papers coming from two news sources collected within 6 months in ...This corpus was annotated with entity classes such as person, location, organization for named entity ...a ... See full document

5

Corpus Exploitation from Wikipedia for Ontology Construction

Corpus Exploitation from Wikipedia for Ontology Construction

... Ontology construction usually requires a domain-specific corpus for building corresponding concept ...domain corpus must have a good coverage of domain ...domain corpus resource in ontology ... See full document

8

Construction and Annotation of a French Folkstale Corpus

Construction and Annotation of a French Folkstale Corpus

... tales corpus - which is to our knowledge the only French tales corpus available and classified according to the Aarne&Thompson classification - composed of historical texts (with old French ...this ... See full document

6

Construction of a Multilingual Corpus Annotated with Translation Relations

Construction of a Multilingual Corpus Annotated with Translation Relations

... Translation relations, which distinguish literal translation from other translation techniques, con- stitute an important subject of study for human translators (Chuquet and Paillard, 1989). How- ever, automatic ... See full document

10

New tools for the encoding of lexical data extracted from corpus

New tools for the encoding of lexical data extracted from corpus

... the encoding of such a word, the goal of the lexicographer who has to compare empirical evidences, the linguistic cues with the syntactic features that a given class is supposed to ... See full document

6

Thai Broadcast News Corpus Construction and Evaluation

Thai Broadcast News Corpus Construction and Evaluation

... the construction and evaluation of the first Thai broadcast news speech and text ...speech corpus contains about 17 hours of speech data while the text corpus was transcribed from around 35 hours of ... See full document

6

Construction of the Turkish National Corpus (TNC)

Construction of the Turkish National Corpus (TNC)

... Optical Character Recognition (OCR) process. A web-based corpus management system was developed to process and monitor data coming from the OCR, keyboarding, existing electronic texts or speech ...made ... See full document

5

Feature-based Encoding and Querying Language Resources with Character Semantics

Feature-based Encoding and Querying Language Resources with Character Semantics

... of character features pertaining to written language resources, which we argue are critically necessary in the long term of archiving language ...the corpus itself; however it is generally accepted that ... See full document

6

Sentimental Corpus Construction of Chinese Online Reviews

Sentimental Corpus Construction of Chinese Online Reviews

... of corpus annotation. The corpus annotation standard refers to the annotation degree and content of the ...the corpus in ...of corpus as well as the characteristics of the sentiment ... See full document

6

Incremental Construction of an Associative Network from a Corpus

Incremental Construction of an Associative Network from a Corpus

... WAS (Steyvers, Shiffrin & Nelson, in press) is not based on a corpus but on association norms providing associates for 5,000 words. The authors applied scaling methods to these data in order to assign a  ... See full document

6

Construction of Structurally Annotated Spoken Dialogue Corpus

Construction of Structurally Annotated Spoken Dialogue Corpus

... In this project, a system was specially built in a Data Collection Vehicle (DCV), shown in Fig- ure 1, and was used for the synchronous recording of multi-channel audio data, multi-channel video data, and vehicle related ... See full document

8

Multilingual document clustering : state of the art (Construction de corpus multilingues : état de l’art) [in French]

Multilingual document clustering : state of the art (Construction de corpus multilingues : état de l’art) [in French]

... Partant de l’hypothèse selon laquelle le sens d’un mot se détermine en contexte, Mohammad et al. (2007) proposent une méthode de calcul de la distance sémantique translingue des mots à travers la comparaison de leurs ... See full document

13

Temporal Evaluation

Temporal Evaluation

... Next addition is to optimize Timegraph construction. For each relation we have to make sure all constraints are met. The easiest and best way to approach this is to consider all relations together. For example, ... See full document

6

Proceedings of the 10th Web as Corpus Workshop

Proceedings of the 10th Web as Corpus Workshop

... With WAC-X, the series of WAC workshops continues its successful tradition going back to 2005. Thematically, the WAC workshops have always been positioned between computational linguistics and theoretically oriented ... See full document

10

Construction of a Free Large Part of Speech Annotated Corpus in French (Construction d’un large corpus écrit libre annoté morpho syntaxiquement en français) [in French]

Construction of a Free Large Part of Speech Annotated Corpus in French (Construction d’un large corpus écrit libre annoté morpho syntaxiquement en français) [in French]

... autres corpus disponibles réunis. En pratique ce corpus se compose d’articles journalistiques issus du journal Le Monde écrits dans les années 90, soit plus de 500 000 mots ...du corpus P7T dans une ... See full document

14

A Phonetic Based Approach to Chinese Chat Text Normalization

A Phonetic Based Approach to Chinese Chat Text Normalization

... To address the sparse data problem and dynamic problem in Chinese chat text normalization, the phonetic mapping models are proposed in this paper to represent mappings between chat terms and standard words. Different ... See full document

8

The Italian Particle “ne”: Corpus Construction and Analysis

The Italian Particle “ne”: Corpus Construction and Analysis

... Although syntactic aspects of ne have been studied inten- sively (Belletti and Rizzi, 1981; Burzio, 1986; Sorace, 2000), this particle has received very little attention from a semantic and discourse perspective. ... See full document

5

Emotion Cause Events: Corpus Construction and Analysis

Emotion Cause Events: Corpus Construction and Analysis

... Most theories of emotion treat recognition of a triggering cause event as an integral part of emotion (Descartes 1649, James 1884, Plutchik 1962, Wierzbicka 1996). In this study, cause events refer to the explicitly ... See full document

8

Corpus, Lexicon, and Construction: A Quantitative Corpus Approach to Mandarin Possessive Construction

Corpus, Lexicon, and Construction: A Quantitative Corpus Approach to Mandarin Possessive Construction

... traditional corpus linguistics has made a step further in contributing a great deal to the linguistic theorizing in general, such an approach does not typically produce data which are interpretable and usable by ... See full document

36

Show all 10000 documents...