A historical corpus of Portuguese theater plays

PorThea is a historical corpus of European and Brazilian Portuguese theater plays, collected for my habilitation thesis on "Variation and Change in Romance wh-interrogatives". The corpus comprises over 400 plays dated between 1733 and 2016 and contains over 3.3 million words. For copyright reasons, the corpus is currently not available online. Corrections and comments are welcome.


Summary statistics

Country            18th    19th    20th    21st
Brazil    Words    175891  787015  747110  948485
          Plays    9       82      66      157
Portugal  Words    52642   316914  212552  140188
          Plays    14      27      27      23

List of plays

A list of the included plays can be downloaded here.