Web interface: User manual

To search and download the corpus, click on the tab ‘Search/Download’, where you can do a ‘simple search’ or an ‘advanced search’. The web interface has been designed so as to be as simple and intuitive as possible. Next to each search element, you will see a question mark symbol like this: Click on it to get help about how to use each element. Additionally, the search interface is divided into coloured sections to help you visualize the different sections.

SIMPLE SEARCH: Use this to look for a word (or a combination of words in the corpus). The result will be a concordance.

In the blue area “Corpus to search” you can select the subcorpus to search (learners of L2 English or natives). You can also select the mother tongue (L1) of the learners.
In the yellow area “Words (optional)”, you can type in the word(s) to search. Click on the question mark symbol for further details and examples. You can optionally leave this search box blank so as to browse (and download) the entire corpus.
In the pink area “Result (Output)”, select the size of your output, i.e., how many concordance lines you want per page (50, 100, 500 or 1000).

ADVANCED SEARCH: Use this to look for words, grammatical categories, lemmas, frequencies, concordances, etc. You can apply several filters to restrict your search. You can also download the corpus texts here. We recommend you click on the question marks for additional help and examples.

In the blue area “Corpus to search”, you can select the subcorpus you are interested in (learners of L2 English or natives). You can also select the mother tongue (L1) of the learners.
In the pink area “Result (Output)”, select the type of result type you need: concordances (KWIC), texts, simple frequency and full frequency. You can also select the result subtype (words, grammatical element, proximal words and proximal grammatical elements). You can sort your output by customising it. Finally, you can select the size of your output, i.e., the number of concordance lines per page (50, 100, 500 or 1000).
In the pale yellow area “Sensitivity”, you can choose whether your search will be case and accent sensitive or not. Click on the question mark for additional help and examples.
In the green area “Filters”, you can filter or narrow down your search according to a series of variables. Click on the question mark symbol for further details. For example, you can filter the output according to: the speakers’ mother tongue (L1), the medium (written or spoken), the learners’ proficiency level(s), their sex, their placement test score (in %), the proficiency score that they gave themselves (self-assessment), the title of the task or composition, the filename, the age of the speaker, the age at which the learnerers were first exposed to English, the years the learners have been learning English, and the months the learners have stayed in an English-speaking country.

In the yellow area at the bottom, you will see different options depending on the subresult type you have chosen:

WORDS: It searches for normal word(s) or combinations of words. For example: are, table, there is, I like, in spite of, … You can also use wildcards: * (asterisk for any number of characters), ? (for 1 character),

(for either one word or the other). For example:

writ ⇒ concordances of words beginning with ‘writ’ and followed by any number of characters: *write, writer, writes, writing, written,** …
*stand ⇒ concordances of words beginning with any number of characters and ending in ‘stand’: misunderstand, stand, understand, withstand.
?at ⇒ concordances of words beginning with a character and followed by ‘at’: hat, sat, cat, eat, fat, …
da?e ⇒concordances of words starting with ‘da’ and ending with ‘e’ with one character in between: dame, dare, date, …
am are ⇒ concordances containing either ‘am’ or ‘are’.

GRAMMATICAL ELEMENTS: It can search for two types of elements:
1. grammatical words or parts of speech (POS) like Noun, Adjective, Verb, Adverb, etc. You can subspecify the type of grammatical word (e.g., noun plural; verb past)
2. lemmas provide all the word forms of a particular lemma (e.g., BE= am, are, is, was, were, be, being, been).
WORDS PROXIM: It searches for a first word separated X words from a second word (example: “he 2 then” = he will then, he was then, he decides then,, …).
GRAMMATICAL ELEMENTS PROXIM: It searches for a first grammatical word separated X words from a second grammatical word (example: “PRONOUN-PERSONAL 2 VERB-PARTICIPLE” = he was admitted, he has attempted, he has come, he is discovered, he had found, …; “PRONOUN-PERSONAL 3 VERB-PARTICIPLE” = he must have gotten, he has apparently collected, he was not allowed, I have recently watched, …).

Technical cookies		So that our website can work. Activated by default.
Technical cookies are strictly necessary for our website to work and for you to navigate through it. These types of cookies are those that, for example, allow us to identify you, give you access to certain restricted parts of the website if necessary, or remember different options or services already selected by you, such as your privacy preferences. Therefore, they are activated by default and your authorization is not necessary. Through the configuration of your browser, you can block or alert the presence of this type of cookies, although such blocking will affect the proper functioning of the different functionalities of our website.
Analysis cookies		To allow us to know how our web is being used. You can enable or disable them.
Analysis cookies allow us to study the navigation of the users of our website in general (for example, which sections of the site are the most visited, which services are used most and if they work correctly, etc.). From this statistical information about navigation on our website, we can improve both the operation of the site itself and the different services it offers. Therefore, these cookies do not have an advertising purpose, but only serve to make our website work better, adapting to our users in general. By activating them you will contribute to this continuous improvement. You can activate or deactivate these cookies by changing the corresponding sliders.

COREFL: Corpus of English as a Foreign Language

COREFL

Web interface: User manual