
Memory-augmented Neural Machine Translation
Shiyue Zhang
NLP Group, CSLT, Tsinghua University
Joint work with Yang Feng, Dong Wang, Andi Zhang
EMNLP’17 (Submitted)

Outline
- Introduction
- Attention-based NMT
- Memory-augmented NMT
- Experiments
- Conclusions
- Future work
- Reference

Introduction
Statistical Machine Translation (SMT)
- Phrase-based machine translation (Moses, Koehn et al. 2007)
- Phrase table + language model
- An example: 什么是成人高考 ||| 成人高考簡介 ("what is the adult college entrance exam" ||| "introduction to the adult college entrance exam")
- Phrase table: 什么是 => 簡介, 成人高考 => 成人高考
- The language model guides the order (a toy sketch follows below).
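To make the phrase-table-plus-language-model idea concrete, here is a minimal Python sketch; the toy phrase table and bigram scores below are hypothetical illustrations of the example above, not Moses internals.

from itertools import permutations

# Toy phrase table built from the example above (hypothetical entries).
phrase_table = {
    "什么是": "簡介",         # "what is" => "introduction"
    "成人高考": "成人高考",    # kept unchanged
}

# Toy bigram "language model": it prefers 成人高考 before 簡介 (hypothetical scores).
bigram_scores = {("成人高考", "簡介"): 0.9, ("簡介", "成人高考"): 0.1}

def lm_score(tokens):
    score = 1.0
    for a, b in zip(tokens, tokens[1:]):
        score *= bigram_scores.get((a, b), 0.5)
    return score

def translate(source_phrases):
    # Translate each phrase with the table, then let the LM pick the best order.
    translated = [phrase_table[p] for p in source_phrases]
    return max(permutations(translated), key=lm_score)

print(translate(["什么是", "成人高考"]))   # ('成人高考', '簡介')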

Neural Machine Translation (NMT)
- Has achieved significant success; especially when the dataset is big enough, NMT performs considerably better than SMT.

Introduction
An interesting insight: suppose we have a zh-en translation task and the training set contains 150,000 distinct Chinese words. In SMT, the vocabulary size is 150,000, and OOV (out-of-vocabulary) words only appear in the test set. In NMT, since word embeddings are trained along with the model, the vocabulary size typically has to be limited to ~30,000; the remaining 120,000 words are all labeled with a single token, "UNK". So the OOV problem is dramatically aggravated in NMT. But, surprisingly, NMT is still better than SMT. Why? NMT is very good at reasoning!
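A minimal sketch of the vocabulary truncation described above; the corpus and the 30,000 cut-off are placeholders.

from collections import Counter

def build_vocab(corpus_tokens, max_size=30000):
    # Keep only the most frequent word types; everything else becomes "UNK".
    counts = Counter(corpus_tokens)
    return {w for w, _ in counts.most_common(max_size)}

def map_to_vocab(tokens, vocab):
    return [t if t in vocab else "UNK" for t in tokens]

# With ~150,000 distinct source words and a 30,000-word vocabulary,
# the remaining ~120,000 word types all collapse into the single token "UNK".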

Introduction
- NMT overfits to frequent observations while overlooking special cases.
- NMT gives a reasonable translation, but the meaning drifts away.
- An experiment: after decoding the training set, the 30,000-word English vocabulary shrinks to 26,911 distinct words in the output.

Introduction
- Our aim: to address the rare and unknown word problems.
- Our method: augment NMT with a memory component that memorizes source-target word pairs (a rough sketch follows). It is like equipping a translator with a dictionary.
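The model itself is shown later in figures; purely to illustrate the dictionary intuition, here is a rough sketch that assumes (hypothetically) the memory stores source-target word pairs and that its scores are interpolated with the NMT softmax via the attention weights. The function, the fixed gate, and the toy numbers are illustrative, not the exact M-NMT formulation.

import numpy as np

def memory_augmented_probs(nmt_probs, attn, src_words, memory, tgt_index, gate=0.5):
    # Hypothetical sketch: route the attention weight of each source word to the
    # target word stored for it in the memory, then mix with the NMT distribution.
    mem_probs = np.zeros_like(nmt_probs)
    for weight, src in zip(attn, src_words):
        tgt = memory.get(src)                    # e.g. {"感冒": "alzheimer"}
        if tgt is not None and tgt in tgt_index:
            mem_probs[tgt_index[tgt]] += weight
    if mem_probs.sum() > 0:
        mem_probs /= mem_probs.sum()             # renormalize the memory distribution
    return (1 - gate) * nmt_probs + gate * mem_probs

# Toy usage:
tgt_index = {"currently": 0, "alzheimer": 1, "UNK": 2}
nmt_probs = np.array([0.5, 0.1, 0.4])            # NMT softmax at one decoding step
attn = np.array([0.1, 0.8, 0.1])                 # attention over three source words
src_words = ["目前", "感冒", "措施"]
memory = {"感冒": "alzheimer"}
print(memory_augmented_probs(nmt_probs, attn, src_words, memory, tgt_index))
# -> approximately [0.25, 0.55, 0.20]: "alzheimer" now dominates instead of "UNK"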

Outline: Introduction / Attention-based NMT / Memory-augmented NMT / Experiments / Conclusions / Future work

Attention-based NMT
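The attention mechanism is only shown in figures at this point; as a reference, here is a minimal numpy sketch of Bahdanau-style additive attention (Bahdanau et al.); the weight names and sizes are illustrative.

import numpy as np

def additive_attention(prev_state, enc_states, W, U, v):
    # Score each source annotation against the previous decoder state,
    # softmax the scores, and build the context vector as a weighted sum.
    scores = np.array([v @ np.tanh(W @ prev_state + U @ h) for h in enc_states])
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()                        # attention distribution over source positions
    context = (weights[:, None] * enc_states).sum(axis=0)
    return weights, context

# Toy usage: 4 source positions, hidden size 8 (shapes are illustrative).
rng = np.random.default_rng(0)
enc_states = rng.normal(size=(4, 8))
prev_state = rng.normal(size=8)
W, U, v = rng.normal(size=(8, 8)), rng.normal(size=(8, 8)), rng.normal(size=8)
alpha, context = additive_attention(prev_state, enc_states, W, U, v)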

Outline: Introduction / Attention-based NMT / Memory-augmented NMT / Experiments / Conclusions / Future work

Memory-augmented NMT

Memory-augmented NMT: OOV treatment
- Main idea: represent an OOV word by a similar word that is in the vocabulary.
- An example:
  Src: 目前沒有治愈阿爾茲海默癥的措施 ("there is currently no cure for Alzheimer's disease")
  Word mapping: <阿爾茲海默癥 – alzheimer>; 阿爾茲海默癥 is UNK, so it is replaced by its similar in-vocabulary word 感冒, which is not UNK, giving <感冒 – alzheimer>.
  Res: Currently there is no cure for alzheimer's disease
- Note that similar words can either be defined by humans or selected based on word-vector similarity (sketched below).
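For the "selected based on word vector similarity" option, a minimal sketch; it assumes word vectors (e.g. trained on extra monolingual data) that cover the OOV word, and the names are illustrative.

import numpy as np

def most_similar_in_vocab(oov_word, vectors, vocab):
    # Return the in-vocabulary word whose embedding is closest (by cosine
    # similarity) to the OOV word's embedding, e.g. 阿爾茲海默癥 -> 感冒.
    def cos(a, b):
        return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))
    query = vectors[oov_word]
    candidates = (w for w in vocab if w in vectors)
    return max(candidates, key=lambda w: cos(query, vectors[w]))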

Outline: Introduction / Attention-based NMT / Memory-augmented NMT / Experiments / Conclusions / Future work

Experiments (zh-en)
Data:
- IWSLT: 44K sentence pairs in the training set, ~13,000 zh words, ~9,500 en words.
- NIST: 1M sentence pairs in the training set, ~190,000 zh words, ~100,000 en words.
Systems:
- SMT: Moses
- NMT
- NMT-L (Arthur, P. et al. 2016)
- NMT-PL (Minh-Thang Luong et al. 2015)
- M-NMT
Evaluation metrics:
- BLEU: the geometric mean of the 1- to 4-gram precisions multiplied by a brevity penalty (see the sketch after this list)
- Translation baseline
- OOV baseline
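A simplified sketch of the BLEU score described above (geometric mean of the 1- to 4-gram precisions times a brevity penalty); it handles a single sentence pair and a single reference, so it omits corpus-level aggregation and multi-reference clipping.

import math
from collections import Counter

def bleu(candidate, reference, max_n=4):
    # Clipped n-gram precisions, combined as a geometric mean and
    # multiplied by the brevity penalty.
    precisions = []
    for n in range(1, max_n + 1):
        cand = Counter(tuple(candidate[i:i + n]) for i in range(len(candidate) - n + 1))
        ref = Counter(tuple(reference[i:i + n]) for i in range(len(reference) - n + 1))
        overlap = sum(min(c, ref[g]) for g, c in cand.items())
        precisions.append(overlap / max(sum(cand.values()), 1))
    bp = 1.0 if len(candidate) > len(reference) else math.exp(1 - len(reference) / max(len(candidate), 1))
    if min(precisions) == 0:
        return 0.0
    return bp * math.exp(sum(math.log(p) for p in precisions) / max_n)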

Experiments (zh-en)
Two observations:
- M-NMT performs best.
- M-NMT brings more improvement on the IWSLT corpus.
Two conclusions:
- M-NMT is effective.
- M-NMT is robust.

Experiments (zh-en)
M-NMT recalls more OOV words.

Experiments (zh-uy)
Data: 180k sentence pairs, ~170,000 Uyghur words, ~130,000 Chinese words.
Performance:

  Systems            SMT      NMT      M-NMT
  1-gram BLEU        54.5     57.7     58.8
  2-gram BLEU        34.6     39.8     40.8
  3-gram BLEU        26.6     31.9     32.4
  4-gram BLEU        22.1     27.0     27.1
  Brevity penalty    1.000    0.939    0.968
  BLEU               32.44    35.24    36.88

  Systems    Recalled words in test
  SMT        3680/6666
  NMT        3509/6666
  M-NMT      3560/6666
  *6666 is the number of words in the reference.
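The "recalled words" numbers above can be approximated by counting reference words that also appear in the system output; the exact definition used in the paper may differ, so the sketch below (per-sentence word types) is only an assumption.

def recalled_words(hypotheses, references):
    # hypotheses / references: lists of tokenized sentences.
    recalled = total = 0
    for hyp, ref in zip(hypotheses, references):
        hyp_set = set(hyp)
        for w in set(ref):
            total += 1
            recalled += w in hyp_set
    return recalled, total   # e.g. 3560/6666 for M-NMT on the zh-uy test set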

Outline: Introduction / Attention-based NMT / Memory-augmented NMT / Experiments / Conclusions / Future work

Conclusions
- M-NMT alleviates the rare-word and under-translation problems in NMT.
- M-NMT provides a way to address the OOV problem.
- So far, M-NMT brings at least a 1.6 BLEU improvement on different datasets.

Outline: Introduction / Attention-based NMT / Memory-augmented NMT / Experiments / Conclusions / Future work

Future work
- Better OOV treatment? Ideally, no similar-word replacement would be needed.
- Apply it to the whole dataset.
- Phrase-based memory?

Reference
Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., et al. (2007). Moses: open source toolkit for statistical machine translation. In Proceedings of the Association for Computational Linguistics (ACL'07), 177-180.
Bahdanau, D., Cho, K., & Bengio, Y. (2014). Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473.
Arthur, P., Neubig, G., & Nakamura, S. (2016). Incorporating discrete translation lexicons into neural machine translation. In Proceedings of EMNLP.
