Text-To-Text 2011 - Workshop on Monolingual Text-To-Text Generation
Topics/Call fo Papers
Workshop on Monolingual
Text-To-Text Generation
Co-located with ACL-HLT 2011 in Portland, OR, 24 June 2011
Call for papers
The ability to perform monolingual text-to-text generation is an important step in solving many natural language processing problems. For example, when generating novel text at the sentence-level, abstractive summarization systems may need to compress sentences or fuse multiple sentences together; the evaluation of translation systems may require additional paraphrases to use as reference gold standards; and answers to questions may need to be generated automatically from extracted sentences.
The community of researchers examining monolingual text-to-text generation has grown steadily in recent years, introducing the need for a focused venue to communicate results in this area. As tools and approaches are developed, it is important that our community shares its experiences and its resources.
This workshop will solicit work describing the use of data-oriented text-to-text generation methods, where the generation process begins with some source text as input. As such, it complements existing events such as GenChal'11 at ENLG 2011, which will have a focus on data-to-text surface realisation methods.
This year, the workshop will focus on work describing the generation of novel sentences, with preference given to submissions that describe how the proposed text-to-text generation model has an impact on content selection and/or issues of grammaticality at the sentence level. Submissions can describe work-in-progress, resources, position papers as well as traditional unpublished work.
Suggested topics for this workshop include (but are not limited to):
Sentence fusion
Sentence compression
Sentence-level paraphrase generation
Answer generation for questions
Sentence simplification
Evaluating novel sentence generation
Semantics and world knowledge for sentence generation
Content planning issues in text-to-text generation
Incorporating user preferences for text-to-text generation
Discourse-level constraints for novel sentence generation
Descriptions of new monolingual text-to-text generation problems
Applications of monolingual text-to-text generation
List of useful resources.
Submit a paper
TBA
Important dates
Dec 18 Workshop CFP
Apr 01 Paper due date
Apr 25 Notification of acceptance
May 06 Camera-ready deadline
Jun 24 Workshop
Program & steering committee
Anja Belz
Bernd Bohnet
Aoife Cahill
Chris Callison-Burch
Robert Dale
Mark Dras
Michel Galley
Kevin Knight
Emiel Krahmer
Mirella Lapata
Nitin Madnani
Erwin Marsi
Ryan McDonald
Cécile Paris
Michael Strube
Michael White
David Zajic
Steering committee:
Chris Callison-Burch
Mirella Lapata
Erwin Marsi
Organizers & contact information
Katja Filippova (katjaf at google dot com)
Stephen Wan (first name dot last name at csiro dot au)
Useful resources
Links to compression and paraphrase corpora in English.
Paraphrase resources (English).
Microsoft paraphrase corpus.
Paraphrase data sets (English).
Dutch fusion data.
A German comparable corpus of biographies.
Sets of related sentences from reviews (English).
Parallel monolingual treebank for Dutch.
Text-To-Text Generation
Co-located with ACL-HLT 2011 in Portland, OR, 24 June 2011
Call for papers
The ability to perform monolingual text-to-text generation is an important step in solving many natural language processing problems. For example, when generating novel text at the sentence-level, abstractive summarization systems may need to compress sentences or fuse multiple sentences together; the evaluation of translation systems may require additional paraphrases to use as reference gold standards; and answers to questions may need to be generated automatically from extracted sentences.
The community of researchers examining monolingual text-to-text generation has grown steadily in recent years, introducing the need for a focused venue to communicate results in this area. As tools and approaches are developed, it is important that our community shares its experiences and its resources.
This workshop will solicit work describing the use of data-oriented text-to-text generation methods, where the generation process begins with some source text as input. As such, it complements existing events such as GenChal'11 at ENLG 2011, which will have a focus on data-to-text surface realisation methods.
This year, the workshop will focus on work describing the generation of novel sentences, with preference given to submissions that describe how the proposed text-to-text generation model has an impact on content selection and/or issues of grammaticality at the sentence level. Submissions can describe work-in-progress, resources, position papers as well as traditional unpublished work.
Suggested topics for this workshop include (but are not limited to):
Sentence fusion
Sentence compression
Sentence-level paraphrase generation
Answer generation for questions
Sentence simplification
Evaluating novel sentence generation
Semantics and world knowledge for sentence generation
Content planning issues in text-to-text generation
Incorporating user preferences for text-to-text generation
Discourse-level constraints for novel sentence generation
Descriptions of new monolingual text-to-text generation problems
Applications of monolingual text-to-text generation
List of useful resources.
Submit a paper
TBA
Important dates
Dec 18 Workshop CFP
Apr 01 Paper due date
Apr 25 Notification of acceptance
May 06 Camera-ready deadline
Jun 24 Workshop
Program & steering committee
Anja Belz
Bernd Bohnet
Aoife Cahill
Chris Callison-Burch
Robert Dale
Mark Dras
Michel Galley
Kevin Knight
Emiel Krahmer
Mirella Lapata
Nitin Madnani
Erwin Marsi
Ryan McDonald
Cécile Paris
Michael Strube
Michael White
David Zajic
Steering committee:
Chris Callison-Burch
Mirella Lapata
Erwin Marsi
Organizers & contact information
Katja Filippova (katjaf at google dot com)
Stephen Wan (first name dot last name at csiro dot au)
Useful resources
Links to compression and paraphrase corpora in English.
Paraphrase resources (English).
Microsoft paraphrase corpus.
Paraphrase data sets (English).
Dutch fusion data.
A German comparable corpus of biographies.
Sets of related sentences from reviews (English).
Parallel monolingual treebank for Dutch.
Other CFPs
- EACL 2012 workshop on: Language Technology for Cultural Heritage, Social Sciences, and Humanities
- The 7th Workshop on the Innovative Use of NLP for Building Educational Applications
- 5th Workshop on Building and Using Comparable Corpora
- 5th Linguistics Annotation Workshop (The LAW V)
- CoNLL-2012 16th Conference on Computational Natural Language Learning
Last modified: 2011-01-31 23:23:51