The Bergen Corpus of London Teenage Language (COLT) is the first large English corpus focusing on the speech of teenagers. It was collected in 1993 and consists of the spoken language of 13- to 17-year-olds from different boroughs of London. The complete corpus, comprising half a million words, has been orthographically transcribed and word-class tagged, and is a constituent of the British National Corpus.
At present, CoRD provides descriptions of a large number of corpora, subcorpora and databases. Many more are forthcoming, and we encourage all compilers of English-language corpora to submit a description.