Word Boundaries

Word Boundaries in French: Evidence from Large Speech Corpora

... how word boundaries may be inferred from the acoustic signal by human listen- ...human word segmentation reveals two main tendencies: (i) the word segmentation problem can be – at least partly ...

On the Importance of Word Boundaries in Character level Neural Machine Translation

... of word boundaries in character-level NMT is advantageous for capturing longer-term contextual dependencies and generalizing to morphological variations in the target ...

Character Level Machine Translation Evaluation for Languages with Ambiguous Word Boundaries

... Ambiguous word Boundaries) for automatic machine translation eval- ...and word boundaries are often fuzzy, TESLA-CELAB acknowledges the advantage of character-level evaluation over ...

N th Order Ergodic Multigram HMM for Modeling of Languages without Marked Word Boundaries

... N th Order Ergodic Multigram HMM for Modeling of Languages without Marked Word Boundaries ...

Word boundaries in the Old Phrygian Germanos inscription

... a word boundary. And indeed, wherever we are able to determine word boundaries on combinatoric grounds, we find new words at the beginning of a line, ...a word boundary, ...

Greek Word Segmentation Using Minimal Information

... removed. Word boundaries are represented by the symbol #, utterance boundaries by $, following Brent ...represent word boundaries without comment or correction; however, it is worth noting ...

A Stochastic Finite State Word Segmentation Algorithm for Chinese

... Most languages that use Roman, Greek, Cyrillic, Armenian, or Semitic scripts, and many that use Indian-derived scripts, mark orthographic word boundaries; however, languages written i...

A compression based algorithm for Chinese word segmentation

... To infer word boundaries, a general adaptive text compression technique is used that predicts upcoming characters on the basis of their preceding context. Spaces are inserted into posit...

Character Cluster Based Segmentation using Monolingual and Bilingual Information for Statistical Machine Translation

... where word boundaries are not obviously marked by using both monolingual and bilingual information on English-Thai language pair and demonstrate that (1) unsegmented corpus is able to provide the nearly ...

Learning Words and Their Meanings from Unsegmented Child directed Speech

... a word learning task when the meaning bearing words were consistently placed at the end of ...two word boundaries is already known (the utterance boundary ...

Toward Better Chinese Word Segmentation for SMT via Bilingual Constraints

... language word- ...extract word boundary distributions for character-level trigrams (types) from the “chars-to-word” ...these word boundaries are encoded into a graph propagation (GP) ...

Syllables as Linguistic Units?

... The morpheme-level syntax is hard to distinguish from word-level syntax (is “was clean” two words and “cleaned” only one?). Thus, linguists prefer to look at morphosyntax rather than word-level syntax ...

A Statistically Emergent Approach for Language Processing: Application to Modeling Context Effects in Ambiguous Chinese Word Boundary Perception

... A computer program that tests this model on the task of capturing the effect of context on the perception of ambiguous word boundaries in Chinese sentences is presented. The program ado...

Lattice Based Word Identification in CLARE

... I argue that because of spelling and typing errors and other properties of typed text, the identification of words and word boundaries in general requires syntactic and semantic knowledg...

‘Indicatements’ that character language models learn English morpho syntactic units and regularities

... Character language models have access to surface morphological patterns, but it is not clear whether or how they learn abstract morphological regularities. We instrument a character language model with several ...

A Classical Chinese Corpus with Nested Part of Speech Tags

... Second, word segmentation sets boundaries for automatic word ...preceding word, and men is the first character of the following one. In a word study on 本 覺 ben jue ‘original ...

The Role of Prosody and Speech Register in Word Segmentation: A Computational Modelling Perspective

... Infants start learning their native language even before birth and, already during their first year of life, they succeed in acquiring linguistic structure at several levels, including phonetic and lexical knowledge. One ...

A Melody Conditioned Lyrics Language Model

... This paper presents a novel, data-driven language model that produces entire lyrics for a given input melody. Previously proposed models for lyrics generation suffer from the inability of capturing the relationship ...

Density Based Traffic Signal System Using Matlab

... To count the objects present in the image, the close boundaries of the objects are identified. The exterior boundaries of the objects as well as the boundaries of holes inside...

Critical Tokenization and its Properties

... The main results are as follows: (1) Critical points are all and only unambiguous token boundaries for any character string on a complete dictionary; (2) Any critically tokenized word stri...
