Descriptive statistics


(updated in June 2024)


Participants


The interviewees in the Czech subcorpus are twenty-five secondary-school English teachers (19 female, 6 male; mean age=40.2 years) based in different regions of the Czech Republic. Their self-assessed proficiency was B2 (n=1), C1 (n=9), C1+ (n=5) and C2 (n=10). 

The speakers in the parallel native-speaker corpus are fifteen English teachers (8 female, 7 male; mean age=32.6 years) from the US, UK, and South Africa, currently teaching in the Czech Republic. A range of metadata was collected through a speaker-profile form, in which the speakers’ informed consent to the use of the data for research purposes was signed.


Corpus size


Length in tokens
A & B turns B turns only Mean SD
Czechs 76,122 68,323 3,044 591
Natives 31,898 27,694 2,127 397


Duration (hh:mm:ss)
A & B turns B turns only Mean SD
Czechs 09:05:47 08:04:23 21:50 4:15
Natives 03:34:52 03:02:18 14:19 2:05