Once the basic design principles had been agreed on, creation of the BNC began in 1991.
The chief stages in the process were as follows.
Once permission had been obtained for a text to be included in the corpus, it was
converted to machine-readable form by one of the commercial partners (OUP, Longman or
Chambers). The resulting text was then converted to the standard project encoding format
at OUCS, where its accuracy and internal consistency were also validated. The text was then
passed to UCREL, where word-class tagging was added automatically, and returned to OUCS
for documentation and accession into the corpus. Each stage of corpus processing was
recorded in a database maintained at OUCS.