MEANING data flow Sample Clauses

MEANING data flow. Major milestones and output results during this development regard the delivery of the software tools and resources, whose three version submissions will be at 12, 21 and 30 months. The end of this phase will coincide with the starting of the validation activities on the demonstration. Milestones: First release of Linguistic Processors (English, Italian, Spanish, Catalan and Basque); First release of Multilingual Central Repository Software ACQ0: First acquisition process WSD0: First Word Sense Disambiguation process PORT0: First upload and porting processes Second release of Linguistic Processors (English, Italian, Spanish, Catalan and Basque); Second release of Multilingual Central Repository Software ACQ1: Second acquisition process. WSD1: Second Word Sense Disambiguation process PORT1: Second upload and porting processes Final release of Linguistic Processors (English, Italian, Spanish, Catalan and Basque); Final release of Multilingual Central Repository Software ACQ2: Final acquisition process. WSD2: Final Word Sense Disambiguation process. PORT2: Final upload and porting processes Next sections will provide for each component, details on the different (sub)problems and proposed solutions for each component. We will re-engineer and scale up existing robust language processing software tools (for English, Italian, Spanish, Catalan and Basque) in accordance with the WP1 assessment of the software requirements for MEANING. The software tools will form part of the systems developed for acquisition (WP5) and word sense disambiguation (WP6). The functionality of these tools include: tokenisation and sentence boundary detection lemmatisation part of speech tagging noun group chunking robust shallow parsing named-entity recognition and categorisation (e.g. into location, company or product names) keyword, topic and terminology detection text classification (e.g. ECONOMIC, SPORT domains) We will direct further development and refinement effort via assessment of accuracy and speed of the tools within the context of WP5 and WP6. This workpackage will produce five software tools: ELP (English language processor), ILP (Italian language processor), SLP (Spanish language processor), CLP (Catalan language processor) and BLP (Basque Language processor). Immediately on project start-up we will carry out such engineering actions as are necessary to equip each partner with fast Internet access, and sufficient processing power and storage space. Due to the amount of data...