Statistical Machine Translation: Decoding
Matthias Huck (slides credits: Ales Tamchyna)
LMU Munich
May 31, 2017
Outline
- What features are used in PBMT?
- How to compute the score of a translation?
- Search for the best translation: decoding.
- Overview of the translation process.
- Making decoding tractable: beam search.
Log-Linear Model
We know how to score a full translation hypothesis:

$$P(e, a \mid f) \propto \exp \sum_i \lambda_i f_i(e, a, f)$$

$\lambda_i$ ... feature weights
$f_i$ ... feature functions
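As a minimal sketch of this scoring rule, the model score of a hypothesis can be computed as a weighted sum of its feature values. The feature names, values, and weights below are invented for illustration and do not come from the slides. Because exp is monotonic, ranking hypotheses only needs the weighted sum, not the normalized probability.

```python
import math


def loglinear_score(features, weights):
    """Return sum_i lambda_i * f_i for one hypothesis (an unnormalized log-score)."""
    return sum(weights[name] * value for name, value in features.items())


# Hypothetical feature weights (the lambdas); real systems tune these on held-out data.
weights = {"tm": 1.0, "lm": 0.5, "word_penalty": -0.2}

# Hypothetical feature values for two competing hypotheses:
# log-probabilities for the models, a word count for the penalty.
hyp_a = {"tm": math.log(0.4), "lm": math.log(0.01), "word_penalty": 5}
hyp_b = {"tm": math.log(0.2), "lm": math.log(0.05), "word_penalty": 4}

scores = {
    "a": loglinear_score(hyp_a, weights),
    "b": loglinear_score(hyp_b, weights),
}

# The best hypothesis is the one with the highest weighted sum.
best = max(scores, key=scores.get)
print(best, scores)
```

In this toy setting hypothesis "b" wins: its weaker translation-model score is outweighed by a better language-model score and a smaller word penalty.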
Log-Linear Model: Features
Typical baseline feature set for PBMT:
- Phrase translation probability, both direct and inverse: $P_{TM}(e \mid f)$, $P_{TMinv}(f \mid e)$
- Lexical translation probability (direct and inverse): $P_{lex}(e \mid f)$, $P_{lexinv}(f \mid e)$
- Language model probability: $P_{LM}(e)$
- Phrase penalty.
- Word penalty.
- Distortion penalty.
Lexical Weights ($P_{lex}$)
The problem: many extracted phrases are rare. (Esp. long phrases might only be seen once in the parallel corpus.)
P("modrý autobus přistál na Marsu" | "a blue bus lands on Mars") = 1
P("a blue bus lands on Mars" | "modrý autobus přistál na Marsu") = 1
Is that a reliable probability estimate?
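To see where a probability of 1 comes from, here is a small sketch that assumes phrase translation probabilities are estimated by relative frequency over extracted phrase pairs (a common choice, assumed here rather than stated on the slide). All counts are hypothetical.

```python
from collections import Counter


def relative_frequency(pair_counts):
    """Estimate P(e|f) = count(f, e) / count(f) for every extracted pair (f, e)."""
    source_totals = Counter()
    for (f, _e), count in pair_counts.items():
        source_totals[f] += count
    return {
        (f, e): count / source_totals[f]
        for (f, e), count in pair_counts.items()
    }


# Hypothetical extraction counts: one long phrase pair seen once,
# one short source phrase seen often with two different translations.
pair_counts = Counter({
    ("modrý autobus přistál na Marsu", "a blue bus lands on Mars"): 1,
    ("autobus", "bus"): 90,
    ("autobus", "coach"): 10,
})

probs = relative_frequency(pair_counts)
singleton = probs[("modrý autobus přistál na Marsu", "a blue bus lands on Mars")]
frequent = probs[("autobus", "bus")]
print(singleton, frequent)
```

The pair seen once receives probability 1.0 from a single observation, while the well-attested pair receives 0.9 from a hundred observations, so the higher estimate is the less trustworthy one.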
P("; distortion carried - over" | "; zkreslení") = 1
P("; zkreslení" | "; distortion carried - over") = 1
Data from the "wild" are noisy. Word alignment contains errors. This is a real phrase pair from our best English-Czech system. Both $P_{TM}(e \mid f)$ and $P_{TMinv}(f \mid e)$ say that this is a perfect translation.
Lexical Weights ($P_{lex}$)
Decompose the phrase pair into word pairs. Look at the word-level translation probabilities.
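The exact formula is not given in this excerpt, so the sketch below assumes one widely used formulation of lexical weighting: for each target word, average the word translation probabilities over the source words it is aligned to (or use a NULL entry if it is unaligned), then multiply across target words. The function name, the alignment, and every probability in the table are hypothetical.

```python
def lexical_weight(e_words, f_words, alignment, w):
    """Product over target words of the mean word translation probability.

    alignment: set of (i, j) pairs linking e_words[i] to f_words[j].
    w: dict mapping (target_word, source_word) to a probability;
       source_word None stands for NULL (unaligned target word).
    """
    score = 1.0
    for i, e in enumerate(e_words):
        linked = [f_words[j] for (ei, j) in alignment if ei == i]
        if not linked:
            linked = [None]  # unaligned target words are scored against NULL
        score *= sum(w.get((e, f), 0.0) for f in linked) / len(linked)
    return score


# The noisy phrase pair from the slides, with a made-up alignment.
e_words = [";", "distortion", "carried", "-", "over"]
f_words = [";", "zkreslení"]
alignment = {(0, 0), (1, 1), (2, 1), (3, 1), (4, 1)}

# Made-up word translation probabilities.
w = {
    (";", ";"): 0.9,
    ("distortion", "zkreslení"): 0.5,
    ("carried", "zkreslení"): 0.001,
    ("-", "zkreslení"): 0.001,
    ("over", "zkreslení"): 0.001,
}

lw = lexical_weight(e_words, f_words, alignment, w)
print(lw)
```

With these illustrative numbers the lexical weight is tiny even though the phrase-level probability is 1, which is the kind of signal that lets the model discount unreliable phrase pairs.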