Corpora. The various techniques presented here rely on three types of data: monolingual tree- banks, for training baseline monolingual parsers; parallel corpora, for training baseline un- supervised word aligners and also for training and evaluating machine translation systems; and bilingual treebanks, for training and evaluating the bilingual parsing models.
Appears in 2 contracts
Sources: Dissertation, Dissertation