A historical corpus of Portuguese theater plays
PorThea is a historical corpus of European and Brazilian Portuguese theater plays, collected for my habilitation thesis on "Variation and Change in Romance wh-interrogatives". The corpus consists of over 400 plays dated between 1733 and 2016, containing over 3,3 million words. Due to copyright reasons, the corpus is currently not available online. Any correction or comment is welcome.
Summary statistics
Country | 18th | 19th | 20th | 21st | |
Brazil | Words | 175891 | 787015 | 747110 | 948485 |
Plays | 9 | 82 | 66 | 157 | |
Portugal | Words | 52642 | 316914 | 212552 | 140188 |
Plays | 14 | 27 | 27 | 23 |
List of plays
A list of the included plays can be downloaded here.