VIII UNL School: Difference between revisions
From UNLwiki
				
				
				Jump to navigationJump to search
				
				
imported>Martins No edit summary  | 
				imported>Martins No edit summary  | 
				||
| Line 125: | Line 125: | ||
*[http://www.unlweb.net.br/resources/geneva2012/Dictionary2.pdf UNL-NL Dictionary]  | *[http://www.unlweb.net.br/resources/geneva2012/Dictionary2.pdf UNL-NL Dictionary]  | ||
*[http://www.unlweb.net.br/resources/geneva2012/Grammar.pdf Grammar]  | *[http://www.unlweb.net.br/resources/geneva2012/Grammar.pdf Grammar]  | ||
== Other material ==  | |||
*[http://www.unlweb.net.br/resources/geneva2012/corpus_dic.txt English analysis dictionary]  | |||
== Participants ==  | == Participants ==  | ||
Revision as of 12:10, 13 February 2012
Goals
- To build the basic modules of a NL-UNL (analysis) grammar
 - To build the basic modules of a UNL-UNL (generation) grammar
 
Corpus
- Reference corpus in English (500 sentences), to be manually translated to the target languages, in order to be used as the input for IAN
 - Reference corpus in UNL (500 graphs), to be used as the input for EUGENE
 - Reference corpus according to the complexity of the graphs (the same as above, but split in different files)
 
Deliverables
- ANALYSIS (IAN)
 
- The manual translated version of the 500 sentences of the reference corpus (corpus_LID.txt)
 - The analysis dictionary used to analyze those 500 sentences (ana_dic_LID.txt)
 - The analysis grammar used to analyze those 500 sentences (ana_gra_LID.txt)
 - The analysis disambiguation grammar, if any, used to analyze those 500 sentences (ana_dis_LID.txt)
 - The UNL output for those 500 sentences generated from the dictionary and grammars above (ana_out_LID.txt)
 
- GENERATION (EUGENE)
 
- The generation dictionary used to generate the reference corpus onto natural language (gen_dic_LID.txt)
 - The generation grammar, including inflectional paradigms, used to generate the reference corpus onto natural language (gen_gra_LID.txt)
 - The generation disambiguation grammar used to generate the reference corpus onto natural language (gen_dis_LID.txt)
 - The natural language output generated from the dictionary and grammars above (gen_out_LID.txt)
 
- LID is to be replaced by the ISO639-2 two-character code of the language (en = English, el = Greek, etc.)
 
Presentations
Other material
Participants
- Carolin ARNOLD (German)
 - Ewa CZAJKOWSKA (Polish)
 - Grega MILHARCIC (Slovenian)
 - Luisa GOUVEIA (Portuguese)
 - Martin LUTS (Estonian)
 - Mihaela ILIOAIA (Romanian)
 - Ofelia HOVHANNISYAN (Armenian)
 - Olga VARTZIOTI (Greek)
 - Polina LENKOVA (Russian)
 - Ronaldo MARTINS (UNL)
 - Sameh ALANSARY (Arabic)
 - Sara STYMNE (Swedish)
 - Yordanka STANCHEVA (Bulgarian)
 
Schedule
- Feb 06th, 2012 - Monday
 - 09:00-10:00 Introduction
 - 10:00-12:00 I – Corpus
 - 14:00-17:00 II – UNL-NL dictionary
 - Feb 07th, 2012 - Tuesday
 - 09:00-12:00 III – Morphology (inflectional paradigms)
 - 14:00-17:00 IV – NL dictionary
 - Feb 08th, 2012- Wednesday
 - 09:00-12:00 V – UNL-NL grammar (I)
 - 14:00-17:00 V – UNL-NL grammar (II)
 - Feb 09th, 2012 - Thursday
 - 09:00-12:00 VI – NL-UNL grammar (I)
 - 14:00-17:00 VI – NL-UNL grammar (II)
 - Feb 10th, 2012 - Friday
 - 09:00-12:00 Evaluation
 - 14:00-17:00 Discussion