Existing Tools Clause Samples
Existing Tools. A. Corpus normalization
enca(1) is a Linux/Unix command used to detect and convert encoding of text files. It supports all known text encodings and languages.
B. Language identification
C. Corpus cleaning
D. Duplicate detection
Existing Tools. This subsection includes short descriptions of available open source tools for crawling. An overview of their functionalities is presented in Table 1. A comparison of these tools in terms of licensing, languages supported and availability as web services is illustrated in Table 2. Furthermore, we compare tools according to the usability requirements described in D8.
Existing Tools. In this section we summarize the results of a survey on several aspects of TPC tools already available to PANACEA partners. The tools surveyed include both tools developed by consortium partners, and open source tools developed by third parties and adapted/extended by partners. The survey was conducted via a template that is included in Appendix A. Overall, descriptions for 33 tools or pipelines of tools were submitted by PANACEA partners for all languages addressed by the project. As examples of the descriptions provided by partners, we also include in the Appendix descriptions for two tools: a) the Decomposer for German Compounds developed by Linguatec, and b) the ILSP FBT POS Tagger. An overview of all documented tools classified according to their functionalities is presented in Table 5 below. Technologies and tools for corpus creation, normalization and annotation English French German Spanish Italian Greek Sentence splitting EuroParl tools ▇▇▇▇▇▇▇▇ ▇▇-SentenceSegmentiser EuroParl tools EuroParl tools LT- SentenceSegmentiser EuroParl tools ▇▇▇▇ Processing Tool FreeLing EuroParl tools FreeLing Syn SG EuroParl tools ILSP SST POS tagging Charniak, Bikel, Berkeley, ▇▇▇▇▇▇▇▇, ▇▇-Tagger, RASP C&C, ▇▇▇▇▇▇▇▇ ▇▇▇▇▇ BitPar LT-Tagger Berkeley Morfette ▇▇▇▇ POS Tagger FreeLing FreeLing Syn SG ILSP FBT Parsing Charniak, Bikel, Berkeley, ▇▇▇▇▇▇▇▇, ▇▇-Parser, RASP, C&C, ▇▇▇▇▇▇▇▇ ▇▇▇▇▇ Berkeley BitPar Berkeley LT-Parser ▇▇▇▇▇ FreeLing Function labeling FunTag Labeller FunTag Lemmatization XLE grammar lexicon, ▇▇▇▇▇▇▇▇, ▇▇-Lemmatiser, RASP, C&C LT-Lemmatiser TreeTagger LT-Lemmatiser Morfette ▇▇▇▇▇▇▇▇ ▇▇-Lemmatiser Syn ▇▇ ▇▇▇▇▇▇▇▇ LT-Lemmatiser ILSP Lemmatizer LFG parsing LFG AA LFG AA LFG AA LFG AA Dependency parsing Maltparser, MSTparser, FreeLing FreeLing Syn SG Chunking Syn SG ILSP Chunker Named Entity Recognition ▇▇▇▇▇▇▇▇, ▇▇-Namer, ILSP MENER, C&C LT-Namer LT-Namer ▇▇▇▇▇▇▇▇ ▇▇-Namer ▇▇▇▇▇▇▇▇ Syn SG LT-Namer ILSP MENER Multiword Expression Extraction FreeLing FreeLing FreeLing Word Sense Disambiguation FreeLing FreeLing FreeLing Other LT-TopicIdentifier LT-Decomposer LM- MonoTermExtract
5.3.1 Availability and licensing
Existing Tools. West shall perform an MCR on tools that are in existence as of the date of this Agreement that are used by West to manufacture Products hereunder on a [**]. BD shall be liable for the expense of any maintenance and/or repairs to the tooling. When appropriate, West shall provide BD with a proposed development agreement to address the manufacture of replacement tooling for those tools that warrant replacement.
Existing Tools. Asynchronous Broadcast: This primitive, called A-cast, was introduced and implemented by ▇▇▇▇▇▇ [6] with 3t + 1 parties. Formally, A-cast is defined as follows: Definition 7 (A-cast [8]) Let π be an asynchronous proto- col initiated by a Sender , having an input m (the mes- sage to be broadcast). We say that π is an A-cast protocol if the following requirements hold, for every possible Adv:
Existing Tools. A- cast : In our protocols, we use the asynchronous broad- cast primitive, called A-cast, which was introduced and ele-
