Resources

CODA Tools software Release 1.1 February 20, 2012. This release contains 1) software for converting text parsed with RST relations into dialogue and 2) an annotation tool for annotating dialogue and translating it into monologue (used for creating CODA corpus).

CODA corpus Release 1.0 July 16, 2010. This release contains approximately 700 turns of human-authored expository dialogue (by Mark Twain and George Berkeley) which has been aligned with monologue that expresses the same information as the dialogue. The monologue side is annotated with Coherence Relations (RST), and the dialogue side with Dialogue Act tags.

Annotation Scheme for Authored Dialogues (CODA annotation manual includes documentation of the distributed data file formats) Svetlana Stoyanchev and Paul Piwek Open University Technical Report 2010

QGSTEC 2010 Generating Questions from Sentences Corpus December 21, 2010. A corpus of over 1000 questions (both human and machine generated). The automatically generated questions have been rated by several raters according to five criteria (relevance, question type, syntactic correctness and fluency, ambiguity, and variety).

QGSTEC+ 2016 New annotations for the QGSTEC corpus (with higher inter-rater reliability) as described in Godwin and Piwek (2016).