• No results found

Extending Neural Machine Translation to Documents and Signed Languages

N/A
N/A
Protected

Academic year: 2021

Share "Extending Neural Machine Translation to Documents and Signed Languages"

Copied!
197
0
0

Loading.... (view fulltext now)

Full text

(1)Extending Neural Machine Translation to Documents and Signed Languages Kayo Yin Talk @ University of Pittsburgh October 7 2021. 1.

(2) Introduction ➔. Neural Machine Translation is the state-of-the-art in automated translation. 2.

(3) Introduction ➔. Neural Machine Translation is the state-of-the-art in automated translation. ➔. However, they are usually limited to sentence-level translations. 3.

(4) Introduction ➔. Neural Machine Translation is the state-of-the-art in automated translation. ➔. However, they are usually limited to sentence-level translations. ➔. Current NLP systems also cannot process signed languages. 4.

(5) Today’s Agenda ➔. Do context-aware machine translation models pay the right attention?. 5.

(6) Today’s Agenda ➔. Do context-aware machine translation models pay the right attention?. ➔. When does translation require context?. 6.

(7) Today’s Agenda ➔. Do context-aware machine translation models pay the right attention?. ➔. When does translation require context?. ➔. How do we resolve coreference in signed languages? 7.

(8) Do Context-Aware Translation Models Pay the Right Attention? Kayo Yin, Patrick Fernandes, Danish Pruthi, Aditi Chaudhary André F.T. Martins, Graham Neubig (ACL 2021). 8.

(9) Why is Context Important for Translation?. We’ll have to get rid of that mole.. 9.

(10) Why is Context Important for Translation?. Things could start to get dangerous if the ministers find out. We’ll have to get rid of that mole.. 10.

(11) Why is Context Important for Translation?. Things could start to get dangerous if the ministers find out. We’ll have to get rid of that mole.. 11.

(12) Why is Context Important for Translation?. Could it be anything serious, Doctor? We’ll have to get rid of that mole.. 12.

(13) Why is Context Important for Translation?. Could it be anything serious, Doctor? We’ll have to get rid of that mole.. 13.

(14) Why is Context Important for Translation? English: Things could start to get dangerous if the ministers find out. We’ll have to get rid of that mole. French: Les choses pourraient commencer à devenir dangereuses si les ministres le découvraient. Nous devrons nous débarrasser de cette taupe. 14.

(15) Why is Context Important for Translation? English: Could it be anything serious, Doctor? We’ll have to get rid of that mole. French: Serait-ce quelque chose de grave, docteur ? Nous devrons nous débarrasser de cette taupe. cet grain de beauté 15.

(16) Why is Context Important for Translation? English: So you see how bad the implications are. Yes, they are quite devastating. French: Vous voyez donc à quel point les implications sont mauvaises. Oui, ils sont assez dévastateurs. 16.

(17) Why is Context Important for Translation? English: So you see how bad the implications are. Yes, they are quite devastating. French: Vous voyez donc à quel point les implications sont mauvaises. Oui, ils sont assez dévastateurs. elles dévastatrices. 17.

(18) Context-Aware NMT ● Many approaches have been proposed for context-aware machine translation. 18.

(19) Context-Aware NMT ● Many approaches have been proposed for context-aware machine translation ○ Concatenation, Multi-Encoder, Cache-Based, Hierarchical…. 19.

(20) Context-Aware NMT ● Many approaches have been proposed for context-aware machine translation ○ Concatenation, Multi-Encoder, Cache-Based, Hierarchical…. ● Most of these approaches perform poorly on document-level translation 20.

(21) Context-Aware NMT. Have we got her report? Source input Yes, it’s in the infirmary already. Context-aware NMT output. On dispose de son rapport? Oui, elle est déjà à l’infirmerie.. 21.

(22) Context-Aware NMT. Have we got her report? Source input Yes, it’s in the infirmary already. Context-aware NMT output. On dispose de son rapport? Oui, elle est déjà à l’infirmerie.. 22.

(23) Context-Aware NMT. Have we got her report? Source input Yes, it’s in the infirmary already. Context-aware NMT output. On dispose de son rapport? Oui, elle est déjà à l’infirmerie.. 23.

(24) Context-Aware NMT. Have we got her report? Source input Yes, it’s in the infirmary already. Context-aware NMT output. On dispose de son rapport? Oui, elle est déjà à l’infirmerie.. 24.

(25) Outline 1.. What context is useful during ambiguous translations?. 2.. Are models paying attention to this context or not?. 3.. If not, can we encourage them to do so?. 25.

(26) Outline 1.. What context is useful during translation?. 2.. Are models paying attention to this context or not?. 3.. If not, can we encourage them to do so?. 26.

(27) User Study. 27.

(28) User Study. 28.

(29) User Study. 29.

(30) User Study. 30.

(31) What Context do Human Translators Use?. 31.

(32) What Context do Human Translators Use?. 32.

(33) What Context do Human Translators Use? (Pronoun Anaphora Resolution). 33.

(34) What Context do Human Translators Use? (Pronoun Anaphora Resolution). 34.

(35) What Context do Human Translators Use? (Pronoun Anaphora Resolution). Have we got her report? Yes, it’s in the infirmary already. On dispose de son rapport? Oui, [il / elle] est à l’infirmière.. 35.

(36) What Context do Human Translators Use? (Pronoun Anaphora Resolution). Have we got her report? Yes, it’s in the infirmary already. On dispose de son rapport? Oui, [il / elle] est à l’infirmière.. 36.

(37) What Context do Human Translators Use? (Pronoun Anaphora Resolution). Have we got her report? It’s important. Yes, it’s in the infirmary already. On dispose de son rapport? Il est important. Oui, [il / elle] est à l’infirmière.. 37.

(38) What Context do Human Translators Use? (Pronoun Anaphora Resolution). Have we got her report? It’s important. Yes, it’s in the infirmary already. On dispose de son rapport? Il est important. Oui, [il / elle] est à l’infirmière.. 38.

(39) What Context do Human Translators Use? (Pronoun Anaphora Resolution). Have we got her report? It’s important. Yes, it’s in the infirmary already. On dispose de son rapport? Il est important. Oui, [il / elle] est à l’infirmière.. 39.

(40) What Context do Human Translators Use? (Pronoun Anaphora Resolution). Have we got her report? It’s important. Yes, it’s in the infirmary already. On dispose de son rapport? Il est important. Oui, [il / elle] est à l’infirmière.. 40.

(41) What Context do Human Translators Use? (Word Sense Disambiguation). 41.

(42) What Context do Human Translators Use? (Word Sense Disambiguation). 42.

(43) What Context do Human Translators Use? (Word Sense Disambiguation) Your charm is only exceeded by your frankness. Ton [charme / portebonheur] n’a d’égal que ta franchise.. 43.

(44) What Context do Human Translators Use? (Word Sense Disambiguation) nsubj:pass. obl:agent. Your charm is only exceeded by your frankness. VERB NOUN . Ton [charme / portebonheur] n’a d’égal que ta franchise.. 44.

(45) What Context do Human Translators Use? (Word Sense Disambiguation) nsubj:pass. obl:agent. Your charm is only exceeded by your frankness. VERB NOUN . Ton [charme / portebonheur] n’a d’égal que ta franchise.. 45.

(46) What Context do Human Translators Use? (Word Sense Disambiguation) nsubj:pass. obl:agent. Your charm is only exceeded by your frankness. VERB NOUN . Ton [charme / portebonheur] n’a d’égal que ta franchise.. SCAT: Supporting Context for Ambiguous Translations dataset (14K). 46.

(47) Outline 1.. What context is useful during translation?. 2.. Are models paying attention to this context or not?. 3.. If not, can we encourage them to do so?. 47.

(48) Model. Encoder Current source sentence. Decoder Current target sentence. 48.

(49) Model 5 previous source sentences. 5 previous target sentences. Encoder. Current source sentence. Decoder. Current target sentence. 49.

(50) Model. Encoder. Decoder. 50.

(51) Quantifying Human-Model Alignment with SCAT En Have we got her report? Yes, it’s in the infirmary already. Fr. On dispose de son rapport? Oui, il est à l’infirmière.. En Have we got her report? Yes, it’s in the infirmary already. Fr. On dispose de son rapport? Oui, il est à l’infirmière.. 51.

(52) Quantifying Human-Model Alignment with SCAT En Have we got her report? Yes, it’s in the infirmary already. Fr. En Have we got her report? Yes, it’s in the infirmary already.. On dispose de son rapport? Oui, il est à l’infirmière.. ... s di. Fr. On dispose de son rapport? Oui, il est à l’infirmière.. ... se o p. t de son por p ra. ?. i, u O. il s di. se o p. t de son por p ra. ?. i, u O. il 52.

(53) Quantifying Human-Model Alignment with SCAT. .5. .1. .3. .1. 53.

(54) Quantifying Human-Model Alignment with SCAT. Sort .5. .1. .3. .5. .3. .1. .1. .1. 54.

(55) Quantifying Human-Model Alignment with SCAT. Sort .5. .1. .3. 1. 2. .5. .3. .1. .1. .1. 55.

(56) Alignment Results. 56.

(57) Alignment Results. 57.

(58) Alignment Results. 58.

(59) Alignment Results. 59.

(60) Outline 1.. What context is useful during translation?. 2.. Are models paying attention to this context or not?. 3.. If not, can we encourage them to do so?. 60.

(61) Attention Regularization. OpenSubtitles18. Context-aware MT Model. 61.

(62) Attention Regularization. OpenSubtitles18. Context-aware MT Model. SCAT 62.

(63) Evaluation ● BLEU ● COMET. 63.

(64) Evaluation ● BLEU ● COMET ● Pronouns F-measure. 64.

(65) Evaluation ● BLEU ● COMET ● Pronouns F-measure ● Contrastive Evaluation. Oui, il est déjà à l’infirmerie.. Oui, elle est déjà à l’infirmerie. 65.

(66) Results. 66.

(67) Results. 67.

(68) Results +9 +8. 68.

(69) Results. 69.

(70) Results. 70.

(71) Results. Have we got her report? Yes, it’s in the infirmary already. Baseline. On dispose de son rapport? Oui, elle est déjà à l’infirmerie.. Have we got her report? Yes, it’s in the infirmary already. Attention-Reg. On dispose de son rapport? Oui, il est déjà à l’infirmerie. 71.

(72) Results. Have we got her report? Yes, it’s in the infirmary already. More experiments Baseline & results in paper: On dispose de son rapport? ➔ Increased usage of supporting Oui, elle est déjà à l’infirmerie. context ➔ Regularizing encoder self-attention contributes the most Have we got her report? ➔ Yes, it’s in the infirmary already. Attention-Reg On dispose de son rapport? Oui, il est déjà à l’infirmerie. 72.

(73) Results. Have we got her report? Yes, it’s in the infirmary already. More experiments Baseline & results in paper: On dispose de son rapport? ➔ Increased usage of supporting Oui, elle est déjà à l’infirmerie. context ➔ Regularizing encoder self-attention contributes the most Have we got her report? ➔ Yes, it’s in the infirmary already. Attention-Reg On dispose de son rapport? Oui, il est déjà à l’infirmerie. 73.

(74) Results. Have we got her report? Yes, it’s in the infirmary already. More experiments Baseline & results in paper: On dispose de son rapport? ➔ Increased usage of supporting Oui, elle est déjà à l’infirmerie. context ➔ Regularizing encoder self-attention contributes the most Have we got her report? ➔ Little difference in WSD performance Yes, it’s in the infirmary already. Attention-Reg On dispose de son rapport? Oui, il est déjà à l’infirmerie. 74.

(75) When Does Translation Require Context? A Data-driven, Multilingual Exploration Kayo Yin*, Patrick Fernandes*, André Martins, Graham Neubig (Ongoing work). *Equal contribution.

(76) Evaluating Document-Level Machine Translation. ➔. In machine translation (MT), context is crucial to translate certain discourse phenomena. 76.

(77) Evaluating Document-Level Machine Translation. ➔. In machine translation (MT), context is crucial to translate certain discourse phenomena. ➔. However these phenomena represent only a small portion of the words in natural language data. 77.

(78) Evaluating Document-Level Machine Translation. ➔. In machine translation (MT), context is crucial to translate certain discourse phenomena. ➔. However these phenomena represent only a small portion of the words in natural language data. ➔. Common translation metrics don’t provide a clear picture of performance in these. 78.

(79) Evaluating Document-Level Machine Translation. ➔. Recent work on context-aware MT side-steps this by using contrastive datasets. 79.

(80) Evaluating Document-Level Machine Translation. ➔. Recent work on context-aware MT side-steps this by using contrastive datasets. ➔. However the availability of these datasets is limited. 80.

(81) Evaluating Document-Level Machine Translation. ➔. Recent work on context-aware MT side-steps this by using contrastive datasets. ➔. However the availability of these datasets is limited. ➔. Also this type of evaluation does not measure translation performance directly. 81.

(82) Evaluating Document-Level Machine Translation. ➔. In this work, we propose data-driven, semi-automatic methodology for identifying salient phenomena. 82.

(83) Evaluating Document-Level Machine Translation. ➔. In this work, we propose data-driven, semi-automatic methodology for identifying salient phenomena. ➔. We create a first-of-its-kind multilingual benchmark testing these discourse phenomena. 83.

(84) Evaluating Document-Level Machine Translation. ➔. In this work, we propose data-driven, semi-automatic methodology for identifying salient phenomena. ➔. We create a first-of-its-kind multilingual benchmark testing these discourse phenomena. ➔. We evaluate multiple CAMT models, both trained by us and commercially available, on this benchmark. 84.

(85) Measuring Context Usage. ➔. Previously, we proposed conditional cross-mutual information (CXMI). 85.

(86) Measuring Context Usage. ➔. Previously, we proposed conditional cross-mutual information (CXMI). ➔. This is corpus-level metric that tells us how well the context helps modelling a dataset. 86.

(87) Measuring Context Usage. ➔. We propose a sentence-level extension, Pointwise Cross Mutual Information (P-CXMI). 87.

(88) Measuring Context Usage. ➔. We propose a sentence-level extension, Pointwise Cross Mutual Information (P-CXMI). ➔. It can also be extended to word-level. 88.

(89) Which Translation Phenomena Benefit from Context?. 89.

(90) Which Translation Phenomena Benefit from Context?. ➔. Look at POS tags with high mean P-CXMI. 90.

(91) Which Translation Phenomena Benefit from Context?. ➔. Look at POS tags with high mean P-CXMI. ➔. Look at vocabulary items with high mean P-CXMI. 91.

(92) Which Translation Phenomena Benefit from Context?. ➔. Look at POS tags with high mean P-CXMI. ➔. Look at vocabulary items with high mean P-CXMI. ➔. Look at individual tokens with high P-CXMI. 92.

(93) Which Translation Phenomena Benefit from Context?. ➔. ~120k parallel sentences from TED talk transcripts. 93.

(94) Which Translation Phenomena Benefit from Context?. ➔. ~120k parallel sentences from TED talk transcripts. ➔. 14 language pairs: English → Arabic, German, Spanish, French, Hebrew, Italian, Japanese, Korean,. Dutch, Portuguese, Romanian, Russian, Turkish and Mandarin Chinese. 94.

(95) Which Translation Phenomena Benefit from Context?. 95.

(96) Which Translation Phenomena Benefit from Context?. 96.

(97) Which Translation Phenomena Benefit from Context?. 97.

(98) Which Translation Phenomena Benefit from Context?. 98.

(99) Which Translation Phenomena Benefit from Context?. 99.

(100) Which Translation Phenomena Benefit from Context?. 100.

(101) Which Translation Phenomena Benefit from Context?. 101.

(102) Which Translation Phenomena Benefit from Context?. 102.

(103) Which Translation Phenomena Benefit from Context?. 103.

(104) Which Translation Phenomena Benefit from Context?. 104.

(105) Which Translation Phenomena Benefit from Context?. 105.

(106) Multilingual Discourse-Aware (MuDA) Benchmark. 106.

(107) Multilingual Discourse-Aware (MuDA) Benchmark. ➔. Lexical Cohesion: tag target words y if the aligned source and target words pair (x,y) appears at least 3 times in the document. 107.

(108) Multilingual Discourse-Aware (MuDA) Benchmark. ➔. Lexical Cohesion: tag target words y if the aligned source and target words pair (x,y) appears at least 3 times in the document. ➔. Formality: tag target words that are T-V pronouns/verbs or honorific terms. 108.

(109) Multilingual Discourse-Aware (MuDA) Benchmark. ➔. Lexical Cohesion: tag target words y if the aligned source and target words pair (x,y) appears at least 3 times in the document. ➔. Formality: tag target words that are T-V pronouns/verbs or honorific terms. ➔. Pronoun choice: tag target pronouns if the corresponding source pronoun has multiple possible translations. 109.

(110) Multilingual Discourse-Aware (MuDA) Benchmark. ➔. Lexical Cohesion: tag target words y if the aligned source and target words pair (x,y) appears at least 3 times in the document. ➔. Formality: tag target words that are T-V pronouns/verbs or honorific terms. ➔. Pronoun choice: tag target pronouns if the corresponding source pronoun has multiple possible translations. ➔. Verb form: tag target verbs if it has a verb form such that the corresponding source verb form has multiple possible translations. 110.

(111) Multilingual Discourse-Aware (MuDA) Benchmark. ➔. Lexical Cohesion: tag target words y if the aligned source and target words pair (x,y) appears at least 3 times in the document. ➔. Formality: tag target words that are T-V pronouns/verbs or honorific terms. ➔. Pronoun choice: tag target pronouns if the corresponding source pronoun has multiple possible translations. ➔. Verb form: tag target verbs if it has a verb form such that the corresponding source verb form has multiple possible translations. ➔. Ellipsis: tag target verbs, nouns and pronouns if the source sentence contains an ellipsis and the target word is not aligned to any source word 111.

(112) Multilingual Discourse-Aware (MuDA) Benchmark. 112.

(113) Multilingual Discourse-Aware (MuDA) Benchmark. 113.

(114) A Cross-lingual, Cross-Model Exploration of Context-aware MT. ➔. We evaluate a sentence-level MT model and context-aware MT model on our system ◆. We use a transformer small. ◆. For the context-aware method, we prepend the previous target context sentences to the current target. 114.

(115) A Cross-lingual, Cross-Model Exploration of Context-aware MT. 115.

(116) A Cross-lingual, Cross-Model Exploration of Context-aware MT. ➔. To evaluate more powerful models, we also finetune a large, pretrained model on this task ◆. We do this for DE, FR, JA and ZH. ◆. We use a transformer large. ◆. We pretrain on Paracrawl, JParacrawl and Backtranslated News. 116.

(117) A Cross-lingual, Cross-Model Exploration of Context-aware MT. 117.

(118) A Cross-lingual, Cross-Model Exploration of Context-aware MT. ➔. Finally we consider two commercial engines and evaluate them on our benchmark ◆. the Google Cloud Translation v2 API. ◆. the DeepL v2 API. 118.

(119) A Cross-lingual, Cross-Model Exploration of Context-aware MT. 119.

(120) Signed Coreference Resolution Kayo Yin, Kenneth DeHaan, Malihe Alikhani (EMNLP 2021). 120.

(121) Coreference Resolution.

(122) Coreference Resolution.

(123) Signed Coreference Resolution.

(124) Signed Coreference Resolution.

(125) Signed Coreference Resolution ➔ Novel challenges in modeling discourse and spatial context.

(126) Signed Coreference Resolution ➔ Novel challenges in modeling discourse and spatial context ➔ Better understanding of grounding in different forms of communication.

(127) Signed Coreference Resolution ➔ Novel challenges in modeling discourse and spatial context ➔ Better understanding of grounding in different forms of communication ➔ Broaden the scope of NLP to multiple modalities.

(128) Signed Coreference Resolution ➔ Novel challenges in modeling discourse and spatial context ➔ Better understanding of grounding in different forms of communication ➔ Broaden the scope of NLP to multiple modalities ➔ Enable Sign Language Processing technologies.

(129) Outline 1. Pronominal Pointing Signs 2. Signed Coreference Resolution 3. Unsupervised Continuous Multigraph 4. Results & Discussion.

(130) Outline 1. Pronominal Pointing Signs 2. Signed Coreference Resolution 3. Unsupervised Continuous Multigraph 4. Results & Discussion.

(131) Pronominal Pointing Signs ➔ Pointing signs with a pronominal function. ➔ Referents are established in the signing space. ➔ Point to the actual location of the referent. ➔ Assign a locus to the referent.

(132) Pronominal Pointing Signs ➔ Pointing signs with a pronominal function. ➔ Referents are established in the signing space. ➔ Point to the actual location of the referent. ➔ Assign a locus to the referent.

(133) Pronominal Pointing Signs ➔ Pointing signs with a pronominal function. ➔ Referents are established in the signing space. ➔ Point to the actual location of the referent.

(134) Pronominal Pointing Signs ➔ Pointing signs with a pronominal function. ➔ Referents are established in the signing space. ➔ Point to the actual location of the referent. ➔ Assign a locus to the referent.

(135) Pronominal Pointing Signs.

(136) Pronominal Pointing Signs.

(137) Complexities of Pointing Signs ➔ Pointing signs can serve other functions.

(138) Complexities of Pointing Signs ➔ Pointing signs can serve other functions. ➔ Difficult to distinguish between different pointing signs based solely on local visual features.

(139) Complexities of Pointing Signs English Pronouns + Carry some meaning on its own. ASL Pointing Signs.

(140) Complexities of Pointing Signs English Pronouns. ASL Pointing Signs. + Carry some meaning on its own. - Use the same handshape, harder to distinguish on its own.

(141) Complexities of Pointing Signs English Pronouns. ASL Pointing Signs. + Carry some meaning on its own - The same word can refer to multiple entities at once. - Use the same handshape, harder to distinguish on its own.

(142) Complexities of Pointing Signs English Pronouns. ASL Pointing Signs. + Carry some meaning on its own - The same word can refer to multiple entities at once. - Use the same handshape, harder to distinguish on its own + 1 locus = 1 referent.

(143) Complexities of Pointing Signs English Pronouns. ASL Pointing Signs. + Carry some meaning on its own - The same word can refer to multiple entities at once. - Use the same handshape, harder to distinguish on its own + 1 locus = 1 referent - Loci can be reassigned to different referents.

(144) Complexities of Pointing Signs English Pronouns. ASL Pointing Signs. + Carry some meaning on its own - The same word can refer to multiple entities at once. - Use the same handshape, harder to distinguish on its own + 1 locus = 1 referent - Loci can be reassigned to different referents - Referents can be assigned multiple loci.

(145) Why study Signed Coreference Resolution in NLP? ➔ Theories of coreference in spoken languages may be extended to signed languages.

(146) Why study Signed Coreference Resolution in NLP? ➔ Theories of coreference in spoken languages may be extended to signed languages ◆ Discourse Representation Theory (Kamp et al., 2011; Steinbach and Onea 2016).

(147) Why study Signed Coreference Resolution in NLP? ➔ Theories of coreference in spoken languages may be extended to signed languages ◆ Discourse Representation Theory (Kamp et al., 2011; Steinbach and Onea 2016) ◆ First mention effect (Gernsbacher and Hargreaves, 1988; Wienholz et al., 2020).

(148) Why study Signed Coreference Resolution in NLP? ➔ Theories of coreference in spoken languages may be extended to signed languages ◆ Discourse Representation Theory (Kamp et al., 2011; Steinbach and Onea 2016) ◆ First mention effect (Gernsbacher and Hargreaves, 1988; Wienholz et al., 2020) ➔ It can help us better understand multimodal communication.

(149) Why study Signed Coreference Resolution in NLP? ➔ Theories of coreference in spoken languages may be extended to signed languages ◆ Discourse Representation Theory (Kamp et al., 2011; Steinbach and Onea 2016) ◆ First mention effect (Gernsbacher and Hargreaves, 1988; Wienholz et al., 2020) ➔ It can help us better understand multimodal communication ◆ Spatial iconicity and situated referents in signed languages.

(150) Why study Signed Coreference Resolution in NLP? ➔ Theories of coreference in spoken languages may be extended to signed languages ◆ Discourse Representation Theory (Kamp et al., 2011; Steinbach and Onea 2016) ◆ First mention effect (Gernsbacher and Hargreaves, 1988; Wienholz et al., 2020) ➔ It can help us better understand multimodal communication ◆ Spatial iconicity and situated referents in signed languages ➔ Widen the accessibility of language technologies.

(151) Outline 1. Pronominal Pointing Signs 2. Signed Coreference Resolution 3. Unsupervised Continuous Multigraph 4. Results & Discussion.

(152) Signed Coreference Resolution.

(153) Signed Coreference Resolution. 1. Mention Detection.

(154) Signed Coreference Resolution. 2. Coreference Resolution.

(155) DGS-Coref Dataset. Public DGS Corpus (Hanke et al., 2020).

(156) DGS-Coref Dataset.

(157) DGS-Coref Dataset ➔ 16m30s of signing ➔ 3 conversations ➔ 5 different signers ➔ 288 signed sentences ➔ 1,457 glosses ◆ 95 <I> signs ◆ 8 <YOU> signs ◆ 93 <INDEX> signs. A: WITH TRIP INDEX SHIP INDEX A: We went there with an excursion boat..

(158) Outline 1. Pronominal Pointing Signs 2. Signed Coreference Resolution Task & Data 3. Unsupervised Continuous Multigraph 4. Results & Discussion.

(159) Unsupervised Continuous Multigraph.

(160) Unsupervised Continuous Multigraph.

(161) Unsupervised Continuous Multigraph.

(162) Unsupervised Continuous Multigraph.

(163) Positive Relations 1. I and I.

(164) Positive Relations 1. I and I 2. You and You.

(165) Positive Relations 1. I and I 2. You and You 3. I and You.

(166) Positive Relations 1. 2. 3. 4.. I and I You and You I and You Temporally Close Index.

(167) Positive Relations 1. 2. 3. 4. 5.. I and I You and You I and You Temporally Close Index Noun Phrase.

(168) Positive Relations 1. 2. 3. 4. 5. 6.. I and I You and You I and You Temporally Close Index Noun Phrase Spatially Close Index.

(169) Negative Relations 1. I and I.

(170) Negative Relations 1. I and I 2. You and You.

(171) Negative Relations 1. I and I 2. You and You 3. I and You.

(172) Negative Relations 1. 2. 3. 4.. I and I You and You I and You Different Person.

(173) Negative Relations 1. 2. 3. 4.. I and I You and You I and You Spatially Far Index.

(174) Weight Assignment Positive Relations. Negative Relations. 1. I and I. 1. I and I. 2. You and You. 2. You and You. 3. I and You. 3. I and You. 4. Temporally Close Index. 4. Spatially Far Index. 5. Noun Phrase 6. Spatially Close Index.

(175) Weight Assignment Positive Relations. Negative Relations. 1. I and I. 1. I and I. 2. You and You. 2. You and You. 3. I and You. 3. I and You. 4. Temporally Close Index. 4. Spatially Far Index. 5. Noun Phrase 6. Spatially Close Index. -∞.

(176) Weight Assignment Positive Relations 1. I and I 2. You and You. Negative Relations 1. I and I. +0.5. 2. You and You. 3. I and You. 3. I and You. 4. Temporally Close Index. 4. Spatially Far Index. 5. Noun Phrase 6. Spatially Close Index. -∞.

(177) Weight Assignment Positive Relations 1. I and I 2. You and You. Negative Relations 1. I and I. +0.5. 2. You and You. 3. I and You. 3. I and You. 4. Temporally Close Index. 4. Spatially Far Index. 5. Noun Phrase 6. Spatially Close Index. +(10-t)/20. -∞.

(178) Weight Assignment Positive Relations. Negative Relations. 1. I and I 2. You and You. 1. I and I. +0.5. 2. You and You. 3. I and You. 3. I and You. 4. Temporally Close Index. 4. Spatially Far Index. 5. Noun Phrase 6. Spatially Close Index. +(10-t)/20 +(50-s)/50. -∞.

(179) Clustering.

(180) Clustering.

(181) Outline 1. Pronominal Pointing Signs 2. Signed Coreference Resolution Task & Data 3. Unsupervised Continuous Multigraph 4. Results & Discussion.

(182) Results.

(183) Results.

(184) Results.

(185) Examples. TO-SEE YOU GOOD YOU I think you could do a good job there.. GEST-DECLINE I CAN NOT TO-SAY TO-HOLD-ON I I can’t keep that promise.

(186) Examples. P_IAndYou. TO-SEE YOU GOOD YOU I think you could do a good job there.. P_YouAndYou. GEST-DECLINE I CAN NOT TO-SAY TO-HOLD-ON I I can’t keep that promise. P_IAndI.

(187) Examples. STUTTGART NUM-1 NAME INDEX NUM-1 FREIBURG Once we were in Stuttgart, once in Ingolstadt and once in Freiburg..

(188) Examples. P_NounPhrase. STUTTGART NUM-1 NAME INDEX NUM-1 FREIBURG Once we were in Stuttgart, once in Ingolstadt and once in Freiburg..

(189) Examples. WITH TRIP INDEX SHIP INDEX We went there with an excursion boat..

(190) Examples. P_TemporallyCloseIndex P_SpatiallyCloseIndex. WITH TRIP INDEX SHIP INDEX We went there with an excursion boat..

(191) Examples. I TO-LEARN INDEX HAMBURG INDEX I learned it in Hamburg..

(192) Examples. P_TemporallyCloseIndex P_SpatiallyCloseIndex. I TO-LEARN INDEX HAMBURG INDEX I learned it in Hamburg..

(193) Summary. 193.

(194) Today’s Agenda ➔. Do context-aware machine translation models pay the right attention? ◆. No, but attention regularization on human rationales can encourage them to do so!. ➔. When does translation require context? ◆. ➔. Ambiguous pronouns, lexical cohesion, verb forms, formality, ellipsis. How do we resolve coreference in signed languages? ◆. Linguistically-informed heuristics and unsupervised multigraph 194.

(195) Today’s Agenda ➔. Do context-aware machine translation models pay the right attention? ◆. No, but attention regularization on human rationales can encourage them to do so!. ➔. When does translation require context? ◆. ➔. Ambiguous pronouns, lexical cohesion, verb forms, formality, ellipsis. How do we resolve coreference in signed languages? ◆. Linguistically-informed heuristics and unsupervised multigraph 195.

(196) Today’s Agenda ➔. Do context-aware machine translation models pay the right attention? ◆. No, but attention regularization on human rationales can encourage them to do so!. ➔. When does translation require context? ◆. ➔. Ambiguous pronouns, lexical cohesion, verb forms, formality, ellipsis. How do we resolve coreference in signed languages? ◆. Linguistically-informed heuristics and unsupervised multigraph 196.

(197) Today’s Agenda ➔. Do context-aware machine translation models pay the right attention? ◆. No, but attention regularization on human rationales can encourage them to do so!. ➔. When does translation require context? ◆. ➔. Ambiguous pronouns, lexical cohesion, verb forms, formality, ellipsis. How do we resolve coreference in signed languages? ◆. Linguistically-informed heuristics and unsupervised multigraph 197.

(198)

References

Related documents

In comparison to the wild-type results, URA5 from msh2 Δ, mlh1 Δ, and pms1 Δ 5-FOA-resistant strains showed a reduction in the number of insertion, deletion, and transversion

A Hadoop cluster using the Cloudera Distribution of Hadoop (CDH) consisting of one NameNode and six DataNodes was set up for the purpose of determining the benefits of using

Three categories of performance are practised within most commercial cruise ships: performances for guests, performances that engage passengers and staff in collaborative play

Chiropractic patients with MRI confirmed lumbar disc herniations (n = 139) and patients treated for neck pain (n = 742) showed no significant differences in baseline NRS scores, 1

and VII cranial nerves can be involved: Ipsilateral sclera, conjunctiva, cornea, iris, fundus, optic nerve papilla, the retroorbital fat, the external eye muscles, the lacrimal

We report herein the quantitative MR analyses in the first 180 patients with AD, VaD, and other clin- ical disorders, as well as in the 24 cognitively nor- mal subjects, all with

Since in both the Actinomycin D and Anisomycin experiments the number of attempted openings in the memory test session was statistically as large as in the initial training session