Corpus Collection. Two bilingual corpora will be collected using the parallel web crawler. They should be in a specialized domain, such as software manuals. Each corpus should be large enough to meet the requirements of the different extraction tools, so that the tools can show their full capacity. The languages will be chosen according to the progress of the different PANACEA tools, but two different language pairs will be involved.
Source: Grant Agreement