3 Data Report of the Database
This data report serves the purpose of documenting the variables and their linkages of the Open Discourse corpus. This report is supplementary to a soon to be published data paper.
The Open Discourse Corpus consists of five main tables. This report provides information about the contents of these tables and the meaning of the respective variables. A detailed documentation about the procedures used to provide this corpus can be found in aforementioned data paper soon.
Furthermore the codebase developed and used to create the corpus can be retrieved from GitHub. This open source codebase can be used to recreate the database from scratch and to contribute to the repository to further improve the quality of the data.
A current data dump can be found at the associated Dataverse. Also, a full text search engine for researching the corpus can be found on the Open Discourse Website (currently only available in German).