Model in Word

The portion of the corpus used to train our Spanish-English system contains 208,000 sentence pairs, while the portion used for English-Spanish contains 183,000. English sentences in the entire corpus average 14.1 words and the vocabulary size (number of unique word tokens) is 41,834, indicating a fairly substantial domain.
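As a rough illustration of how statistics like the average sentence length and vocabulary size above might be computed, here is a minimal Python sketch. It assumes whitespace tokenization and case folding, neither of which is stated in the text, and the function name `corpus_stats` and the sample sentences are hypothetical, not taken from the corpus.

```python
def corpus_stats(sentences):
    """Return (average words per sentence, vocabulary size).

    Assumptions (not from the source): words are split on whitespace
    and lowercased before counting unique words.
    """
    tokenized = [s.split() for s in sentences]
    total_words = sum(len(tokens) for tokens in tokenized)
    vocabulary = {word.lower() for tokens in tokenized for word in tokens}
    average_length = total_words / len(tokenized) if tokenized else 0.0
    return average_length, len(vocabulary)


# Hypothetical toy data, used only to show the call.
sample = [
    "the cat sat on the mat",
    "the dog barked",
]
average_length, vocabulary_size = corpus_stats(sample)
print(average_length, vocabulary_size)
```

A different tokenizer or casing policy would change both figures, so numbers produced this way are only comparable to the reported ones if the preprocessing matches.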
