Common use of Europarl Tools Clause in Contracts

Europarl Tools. Sentence splitting, tokenization and lowercasing The Europarl tools11 were developed to process the proceedings of the European Parliament, in order to derive parallel corpora suitable for training Statistical Machine Translation systems. The tools that have been integrated in PANACEA are the sentence-splitter, the tokeniser and the lowercaser. The sentence-splitter and the tokeniser are based on a set of regular expressions (independent of the language) and use optionally a list of language-dependent abbreviations. The lowercaser uses Perl's lc function. These webservices can be accessed and integrated via the information from Table 1, Table 2, and Table 3. URL xxxx://xxx.xxxx.xx/panacea-soaplab2- axis/#panacea.europarl_sentence_splitter_row WSDL xxxx://xxx.xxxx.xx/panacea-soaplab2- axis/typed/services/panacea.europarl_sentence_splitter?wsdl PANACEA Catalogue Entry xxxx://xxxxxxxx.xxxx.org/services/76 PANACEA MyExperiment Workflow(s) using the WS xxxx://xxxxxxxxxxxx.xxxx.org/workflows/7 Table 1 WS Details for Europarl sentence-splitter URL xxxx://xxx.xxxx.xx/panacea-soaplab2- axis//#panacea.europarl_tokeniser_row WSDL xxxx://xxx.xxxx.xx/panacea-soaplab2- axis/typed/services/panacea.europarl_tokeniser?wsdl PANACEA Catalogue Entry xxxx://xxxxxxxx.xxxx.org/services/77 PANACEA MyExperiment Workflow(s) using the WS xxxx://xxxxxxxxxxxx.xxxx.org/workflows/7 Table 2 WS Details for Europarl tokeniser URL xxxx://xxx.xxxx.xx/panacea-soaplab2- axis//#panacea.europarl_lowercase_row WSDL xxxx://xxx.xxxx.xx/panacea-soaplab2- axis/typed/services/panacea.europarl_lowercase?wsdl PANACEA Catalogue Entry xxxx://xxxxxxxx.xxxx.org/services/75 Table 3 WS Details for Europarl lowercaser 11 xxxx://xxx.xxxxxx.xxx/europarl/

Appears in 2 contracts

Samples: repositori.upf.edu, cordis.europa.eu

AutoNDA by SimpleDocs

Europarl Tools. Sentence splitting, tokenization and lowercasing The Europarl tools11 tools16 were developed to process the proceedings of the European Parliament, in order to derive parallel corpora suitable for training Statistical Machine Translation systems. The tools that have been integrated in PANACEA are the sentence-splitter, the tokeniser and the lowercaser. The sentence-splitter and the tokeniser are based on a set of regular expressions (independent of the language) and use optionally a list of language-dependent abbreviations. The lowercaser uses Perl's lc function. These webservices can be accessed and integrated via the information from Table 1, Table 2, and Table 3. URL xxxx://xxx.xxxx.xx/panacea-soaplab2- axis/#panacea.europarl_sentence_splitter_row WSDL xxxx://xxx.xxxx.xx/panacea-soaplab2- axis/typed/services/panacea.europarl_sentence_splitter?wsdl PANACEA Catalogue Entry xxxx://xxxxxxxx.xxxx.org/services/76 PANACEA MyExperiment Workflow(s) using the WS xxxx://xxxxxxxxxxxx.xxxx.org/workflows/7 Table 1 WS Details for Europarl sentence-splitter URL xxxx://xxx.xxxx.xx/panacea-soaplab2- axis//#panacea.europarl_tokeniser_row WSDL xxxx://xxx.xxxx.xx/panacea-soaplab2- axis/typed/services/panacea.europarl_tokeniser?wsdl PANACEA Catalogue Entry xxxx://xxxxxxxx.xxxx.org/services/77 PANACEA MyExperiment Workflow(s) using the WS xxxx://xxxxxxxxxxxx.xxxx.org/workflows/7 Table 2 WS Details for Europarl tokeniser URL xxxx://xxx.xxxx.xx/panacea-soaplab2- axis//#panacea.europarl_lowercase_row WSDL xxxx://xxx.xxxx.xx/panacea-soaplab2- axis/typed/services/panacea.europarl_lowercase?wsdl PANACEA Catalogue Entry xxxx://xxxxxxxx.xxxx.org/services/75 Table 3 WS Details for Europarl lowercaser 11 16 xxxx://xxx.xxxxxx.xxx/europarl/

Appears in 2 contracts

Samples: cordis.europa.eu, www.panacea-lr.eu

AutoNDA by SimpleDocs
Time is Money Join Law Insider Premium to draft better contracts faster.