FROM THE EDITOR
In this Issue
Contributors
Letters to the Editor
Write to Us
Spread the Word
Back Issues
Index

COLUMNS

Focus on Phrasal
Verbs:
Introduction
Metaphor and phrasal verbs


New word of the month
New words in
Australian English

Corpora tips
Digging deeper with WebCorp:
collocation tools


Games
Sporting vocabulary

American football,
baseball and golf

Survey
Take part in our online
dictionary survey
and use MED online
free for a year!


Corpora tips
Digging deeper with WebCorp:
collocation tools
by Mairi MacDonald

Next in a series of articles looking at ways in which teachers and students can explore words using free corpus resources on the Web.

Last month's article touched on how a simple left or right word sort using WebCorp can generate a selection of collocates. For example, a left word sort on the word meeting gives us board meeting, cabinet meeting, crisis meeting, crunch meeting, crucial meeting, emergency meeting and summit meeting.

However, not all collocates appear to the immediate left or right of the search term. WebCorp also offers a valuable collocation tool that goes beyond examining compound nouns and produces a table of data that enables you to further explore the relationship between your search word and the surrounding text.

How to generate a collocation table

To start your investigation of a particular word or phrase, you need to generate a collocation table.

1 Go to the Advanced search page at http://www.webcorp.org.uk/wcadvanced.html.
2 Type in your search word or phrase.
3 Select a newspaper domain e.g. UK broadsheets as this tends to speed up the search.
4

Scroll down to the collocation section and select External collocates and Exclude Stopwords. Stopwords are high frequency words such as a and the that can distort results. Don't exclude stopwords if you want to look at prepositions.

5 Select Exclude link text and Exclude wildcard match to e-mail address.
Note: A detailed explanation of the Collocation search is available at http://www.webcorp.org.uk/guide/advanced5.html
6 Click on Submit.
7 Once you have your search results scroll down to the foot of the page for the collocation table which will look something like this:

Word
Total
L4
L3
L2
L1
R1
R2
R3
R4
Left
Total
Right
Total
42 5 6 3 15 3 5 5 14 28
31 3 1 14 4 8 4 27
said 21 1 2 7 3 5 1 2 10 11
section 15 7 3 5 15 0
Blair 13 1 3 2 1 1 2 3 7 6
Downing 10 6 1 1 1 1 6 4
called 9 1 2 2 2 2 5 4
cabinet 9 2 1 6 9 0
Street 8 6 1 1 6 2
attended 8 2 1 3 1 1 3 5
first 8 2 3 2 1 7 1
board 7 1 6 7 0
held 7 1 3 1 1 1 1 6
chaired 6 2 4 2 4
annual 5 1 4 5 0
people 5 1 2 1 1 3 2
minister 5 2 3 2 3
Staff 5 4 1 0 5
Bush 5 1 1 2 1 2 3
today's 5 1 1 3 5 0

This is also accompanied by a selection of Key Phrases, in this instance:

Key Phrases:   Street meeting   board meeting   crisis meeting   summit meeting  cabinet meeting   today's meeting   meeting   
meeting chaired

Investigating the table

This grid of numbers and words provides the key to a thorough investigation of the word meeting. The table lists the most frequent words found up to 4 places to the left or right of your search term.

From the table, compound nouns such as cabinet meeting, board meeting and annual meeting are immediately obvious as these are the examples that have the most appearances in column L1 (one position to the left of the search word).

Called is one of the most frequent verbs to occur within 4 spaces of meeting.

Word
Total
L4
L3
L2
L1
R1
R2
R3
R4
Left
Total
Right
Total
called 9 1 2 2 2 2 5 4

A left word sort, position 2 shows examples of call, called, calling and calls.

According to the table, call is found both to the left and right of the word, suggesting that it is used more or less equally in both the active and passive. Other verbs in the table – attended, chaired and held appear more frequently to the right of the search term, implying that these are more normally found in the passive.

Being critical - don't believe everything you read!

It is important that students have an awareness of how the computer can skew results and give misleading information. This is one of the many things lexicographers have to consider when interpreting corpora search results.

Street occurs frequently immediately to the left of meeting. It is also given as one of the Key Phrases – it sounds like a plausible compound, after all we can have a street party and a street protest so why not a street meeting? A left word sort in position 1 reveals that this 'collocate' is only a part of the phrase "Downing Street meeting". In the Key Phrases section, WebCorp helpfully provides a link to a Google search and clicking on this we find examples such as Arch Street Meeting House, Logging on to Wall Street: Meeting materials – further evidence that street meeting is a false collocate.

Knowing which results to ignore is also an important skill when assessing corpora search results. A quick sort, using the table as a guide, will establish what is useful and what isn't. According to the table, section frequently occurs 4 places to the left but again this is not a collocate of meeting as the following extract shows.

It's easy for native-speakers to run their eye down an alphabetically sorted list of search results and pick out the collocates. Equally, it can be tempting for students to just glance at the words in blue and red and accept what they read. Therefore it is important to encourage students to examine the results closely, investigate what they see and ignore them if necessary.

Using the Word Filter

To generate more meaningful results, the filter on the Advanced Search page allows you to exclude certain words. By typing -street into the filter, you can cancel out the misleading "street meeting".

This also has the effect of cutting out unhelpful examples such as Blair and Downing leaving you with more 'true' collocates to work with.

Top external collocates of "meeting" (excluding stopwords)
Word
Total
L4
L3
L2
L1
R1
R2
R3
R4
Left
Total
Right
Total
  37 2 3 1 14 3 9 5 6 31
  34 1 2 1 15 4 11 4 30
said 21 1 2 8 3 3 1 3 11 10
section 16 7 3 6   16 0
first 12 3 3 4 2 10 2
cabinet 7 1 2 1 3 7 0
today's 1 2 4 7 0
people 6 1 2 2 1 3 3
called 6 1 1 2 1 1 2 4
targets 6 1 4 1 0 6
EU 6 1 1 2 2 4 2
Bush 6 1 1 1 2 1 2 4
emergency 6 6 6 0
planned 5 1 2 1 1 4 1
told 5 1 3 1 4 1
G8 5 3 1 1 3 2
held 5 2 1 1 1 2 3
Today's 5 1 4 5 0
" 5 2 2 1 2 3
Iraq 5 2 1 1 1 4 1

Key Phrases: emergency meeting   key meeting   Today's meeting
crisis meeting   summit meeting   first meeting   today's meeting
meeting today

A new entry in this improved version of the collocation table for meeting is targets with a total of 6 appearances, mainly occurring 2 places to the right:

Word
Total
L4
L3
L2
L1
R1
R2
R3
R4
Left
Total
Right
Total
targets 6 1 4 1 0 6

However, the corresponding sort (right, position: 2) reveals that this belongs to the verb meet rather than the noun meeting.

Thinking about the evidence – some points for discussion

The collocation tables raise a lot of questions. Sorting the search results provides many of the answers. In this way, WebCorp is a valuable resource for analysing relationships between words while exposing students to a considerable amount of authentic material.

The following tasks are aimed at familiarising students with the collocation table and encouraging them to think about their existing knowledge of the word.

1 Show the students a collocation table of the word you want to study and introduce the subject by asking the students to guess what that word might be.
2 Ask students to discuss the following questions:
Are there any obvious compound nouns?
Which words could be ignored? Why do you think so?
What verbs are used frequently with the search term? Do they occur more frequently in the active or passive?
3 Get students to investigate their assumptions by sorting the search results according to the location of the collocate they want to research.
4 To follow up, discuss the following points:
What words would you choose to filter out? Why?
Are there any additional collocates that weren't included in the table?
Did you learn anything new about the word?

Next in the series

The next article will look at search facilities available on Web Concordancer and possible classroom applications.