AEPC 2011 - Second Workshop on Annotation and Exploitation of Parallel Corpora (AEPC 2)
Topics/Call fo Papers
Second Workshop on Annotation and Exploitation of Parallel Corpora
(AEPC 2)
RANLP 2011 Workshop
15th or 16th September, Hissar, Bulgaria
This workshop is a follow-up of the First Workshop on Annotation and Exploitation of Parallel Corpora.
The creation of parallel corpora has been very active especially since 90s. The globalization, the extension of EU with new countries as well as the availability of open-source places for information, such as Wikipedia, DBPeadia, etc. required a multilingual approach towards the interpersonal and official communication. This status quo produced a lot of parallel data ? especially administrative and political documents in several languages (EuroParl), but also news (SETIMES) and texts on various topics (wikipedia, bi- and multilingual web sites). However, the fast compilation of large amounts of data very often compromised in lower quality of paralleling texts. Here comes the challenge to discover the inconsistencies in these huge quantities of parallel data, to process them in adequate ways, and to exploit them for various applications: QA, Information Retrieval, Machine Translation, etc. The parallel corpora go beyond word-to-word alignments. They rely on dependency, constituent or semantic pairings. There appeared guidelines and tools for aligning linguistic structures, which raised up the issue of transferability of aligning schemes from one language to another, and also for the compatibility among various resources.
Topics
The topics of interest include, but are not limited to:
Strategies for creation of annotated parallel corpora
Annotation guidelines for alignment
Annotation alignment transfer over languages
Tools for manual and automatic processing and exploitation of parallel corpora
Problems in manual and automatic alignment
Syntax-based and semantic-based approaches to using parallel corpora in MT
Parallel Grammars
Parallel Statistical Parsing
Usability of the existing parallel resources for various applications
Invited speaker
TBA
Organizers
Petya Osenova (Sofia University “St. Kl. Ohridski” and IICT Bulgarian Academy of Sciences)
Kiril Simov (IICT, Bulgarian Academy of Sciences)
Radovan Garabik (JÚĽŠ, Slovak Academy of Sciences)
Jörg Tiedemann (Uppsala University)
(AEPC 2)
RANLP 2011 Workshop
15th or 16th September, Hissar, Bulgaria
This workshop is a follow-up of the First Workshop on Annotation and Exploitation of Parallel Corpora.
The creation of parallel corpora has been very active especially since 90s. The globalization, the extension of EU with new countries as well as the availability of open-source places for information, such as Wikipedia, DBPeadia, etc. required a multilingual approach towards the interpersonal and official communication. This status quo produced a lot of parallel data ? especially administrative and political documents in several languages (EuroParl), but also news (SETIMES) and texts on various topics (wikipedia, bi- and multilingual web sites). However, the fast compilation of large amounts of data very often compromised in lower quality of paralleling texts. Here comes the challenge to discover the inconsistencies in these huge quantities of parallel data, to process them in adequate ways, and to exploit them for various applications: QA, Information Retrieval, Machine Translation, etc. The parallel corpora go beyond word-to-word alignments. They rely on dependency, constituent or semantic pairings. There appeared guidelines and tools for aligning linguistic structures, which raised up the issue of transferability of aligning schemes from one language to another, and also for the compatibility among various resources.
Topics
The topics of interest include, but are not limited to:
Strategies for creation of annotated parallel corpora
Annotation guidelines for alignment
Annotation alignment transfer over languages
Tools for manual and automatic processing and exploitation of parallel corpora
Problems in manual and automatic alignment
Syntax-based and semantic-based approaches to using parallel corpora in MT
Parallel Grammars
Parallel Statistical Parsing
Usability of the existing parallel resources for various applications
Invited speaker
TBA
Organizers
Petya Osenova (Sofia University “St. Kl. Ohridski” and IICT Bulgarian Academy of Sciences)
Kiril Simov (IICT, Bulgarian Academy of Sciences)
Radovan Garabik (JÚĽŠ, Slovak Academy of Sciences)
Jörg Tiedemann (Uppsala University)
Other CFPs
- Workshop: Biomedical Natural Language Processing
- Workshop on Language Technologies for Digital Humanities and Cultural Heritage
- Workshop on Information Extraction and Knowledge Acquisition
- Workshop on Robust Unsupervised and Semisupervised Methods in Natural Language Processing
- AST 2011 6th International Workshop on Applications of Semantic Technologies
Last modified: 2011-04-14 20:46:04