[bnc] Creation stage - Designing and Creating the BNC

Creation stage

Once the basic design principles had been agreed on, creation of the BNC began in 1991. The chief stages in the process are:

Permissions clearance
Collection of texts
Encoding of texts
Linguistic annotation of texts
Storage and documentation of texts

Once permission had been obtained for a text to be included in the corpus, it was converted to machine readable form by one of the commercial partners (OUP, Longman or Chambers). The resulting text was then converted to the standard project encoding format at OUCS, where its accuracy and internal consistency was also validated. The text was then passed to UCREL, where word class tagging was automatically added, and returned to OUCS for documentation and accession into the corpus. Each stage of corpus processing was recorded in a database maintained at OUCS.

Up: Contents

Creating the BNC
Creation stage
Permissions Clearance
Collection of Texts
Making electronic texts
Encoding of Texts
Linguistic Annotation of Texts ("tagging")
Storage and Documentation of Texts