Comparable corpora
C-CT
This is a corpus of English and Spanish clinical trials. It consists of twenty-five clinical trials written in English (74,488 words) and another twenty-five written in Spanish (92,604 words). It was compiled to analyse and contrast the rhetorical and lexico-grammatical structure of the genre.
C-FIVI
This comparable corpus of wine tasting notes, drawn from wineries in Spain, Canada, the USA, South Africa, Australia and New Zealand, comprises 715 notes in English (54,408 words) and 750 in Spanish (52,413 words). It was compiled to analyse and contrast the rhetorical and lexico-grammatical structure of the genre.
C-FMR
This small comparable corpus of football match reports, consisting of 40 texts in European Spanish (30,986 words) and 40 in British English (30,153 words) drawn from the most widely read dailies, was compiled to analyse and contrast the rhetorical and lexico-grammatical structure of the genre.
C-GAC
This corpus of medical abstracts comprises two sub-corpora. The first is a tagged comparable sub-corpus of 50 English abstracts (14,484 words) and 50 Spanish abstracts (15,113 words). The second is a parallel translation sub-corpus of 50 other abstracts translated into Spanish and another 50 translated into English. The corpus was compiled to analyse and contrast the rhetorical and lexico-grammatical structure of the genre.
C-GARE
This is a comparable corpus of English and Spanish minutes of meetings. It contains 100 texts in each language, totalling 174,347 words in English and 139,919 words in Spanish. It was compiled to analyse and contrast the rhetorical and lexico-grammatical structure of the genre.
C-GDPE
This small comparable corpus of football match reports, consisting of 40 texts in European Spanish (30,986 words) and 40 in British English (30,153 words) drawn from the most widely read dailies, was compiled to analyse and contrast the rhetorical and lexico-grammatical structure of the genre.
C-GIT
This corpus, which consists of 80 home appliance instruction manuals, 40 in English (200,000 words) and 40 in Spanish (170,000 words), was compiled to analyse and contrast the rhetorical and lexico-grammatical structure of the genre.
C-GITEC
This is a corpus of English and Spanish informative and promotional technical reports, consisting of 40 texts written in English (54,940 words) and 40 texts written in Spanish (28,844 words). It was compiled to analyse and contrast the rhetorical and lexico-grammatical structure of the genre.
C-OPRES
This is a comparable corpus of English and Spanish newspaper opinion discourse. It consists of approximately one million words in each language (English: 1,007,384 words; Spanish: 1,007,414 words) and includes editorials (non-bylined), bylined opinion columns and comment articles. C-OPRES was compiled to analyse and contrast the language of evaluation.