(updated in June 2024)
Participants
The interviewees in the Czech subcorpus are twenty-five secondary-school English teachers (19 female, 6 male; mean age=40.2 years) based in different regions of the Czech Republic. Their self-assessed proficiency was B2 (n=1), C1 (n=9), C1+ (n=5) and C2 (n=10).
The speakers in the parallel native-speaker corpus are fifteen English teachers (8 female, 7 male; mean age=32.6 years) from the US, UK, and South Africa, currently teaching in the Czech Republic. A range of metadata was collected through a speaker-profile form, in which the speakers’ informed consent to the use of the data for research purposes was signed.
Corpus size
Length in tokens | ||||
---|---|---|---|---|
A & B turns | B turns only | Mean | SD | |
Czechs | 76,122 | 68,323 | 3,044 | 591 |
Natives | 31,898 | 27,694 | 2,127 | 397 |
Duration (hh:mm:ss) | ||||
---|---|---|---|---|
A & B turns | B turns only | Mean | SD | |
Czechs | 09:05:47 | 08:04:23 | 21:50 | 4:15 |
Natives | 03:34:52 | 03:02:18 | 14:19 | 2:05 |