Hello fellow applied linguists.

This page is the Appendix to my paper for the 2009 Temple University Applied Linguistics Colloquium and will describe the following resources.

  1. Antconc Concordancer
  2. Compleat Lexical Tutor
  3. David Lee’s Devoted to Corpora

Antconc Concordancer

To start, the one tool that I use for most of my analysis is Antconc Concordance program developed by Laurence Anthony of Waseda University in Tokyo, Japan.

This is a freeware program, which is extremely handy because it can be opened without installing it on your computer. You can simply keep it on your flash memory drive and use it on any computer. It can be run on Windows 98, Me, 2000, NT, and XP, though I am not certain about Vista at the moment. It can also be used on Macintosh and Linux OS as well.

Also it contains various tools as well as a concordancer. According to Professor Anthony’s description it contains the following:

  • Concordance
  • Concordance Plot
  • File View
  • Clusters
  • N-Grams
  • Collocates
  • Word List
  • Keyword list

I will only explain a few of these,  but I recommend downloading the program and reading the explanation text that accompanies it.

The tool I use is the “word list” which takes the words in the corpus and places them in a ranked order based on the most frequent.  In Parise’s Building and Investigating a Written Corpus of Japanese Junior High School Learners (in press), the word lists were produced by this tool.

The “concordance” function takes the words, like the ones that could be found in the word list, can be then traced back to its original context on a sentence level using this function. With the “concordance plot ” function we are able to see where the words and concordances occur within the text itself. This way we can get a sense of the distribution of the concordance. Is it generally used or in isolated sections?

Antconc Concordance tool

Antconc Concordance tool

Finally, “file view” allows the researcher to see the word or concordance in its “natural habitat” the original text in order to re-contextualize how it is used.

Here is the link to Laurence Anthony’s site for more information about this tool. http://www.antlab.sci.waseda.ac.jp/

 

Another tool that I strongly recommend which is probably already well known among  applied and corpus linguistic circles is Tom Cobb’s  ”Compleat Lexical Tutor”  an  Internet site devoted to “data-driven learning”. The very nice aspect of this site is the fact that it provides tools not only for corpus building and research, but for teachers and students.

Compleat Lexical Tutor

Compleat Lexical Tutor

The most helpful tools on this site for my own teaching are the concordance and the vocabulary profile tools.
The concordance tool (highlighted in yellow) allow the user to search various corpora( Brown, BNC spoken and

http://www.lextutor.ca/

written, US TV talk, Learner corpora, etc. ) the concordance allows you to look at instances of language in their native context, allowing the researcher, or student to access collocational information.

For example, if you want to see how the word enjoy is collocated in US English, you type the word and select US TV Talk. You also have the option of choosing to highlight the left or right of the word.  Then push submit and then you have your concordances.

Concordance tool screen

Concordance tool screen

Result concordance of "enjoy"

Result concordance of "enjoy"

As for the vocabulary profiles, rather than reinvent the wheel, I have an excellent PDF by a fellow teacher, Jean-Pierre Richard who wrote about the vocabulary profile tool for his colleagues.

Please look here for the PDF: English in Action at Ichikawa

The final part of this guide is an introduction to a main resource for Corpus Linguistics, and this is David Lees’ Bookmarks for Corpus Based Linguists.

http://personal.cityu.edu.hk/~davidlee/devotedtocorpora/CBLLinks.htm

Lee offers excellent commentaries along with lists of corpora, collections, data archives, multilingual corpora and parallel-corpora, some of which are freely available to download, or for purchase. He also lists sofware tools, frequency lists,  and ranks them according to user friendliness. Visitors will also find relevant references, journals, and papers-some of which are accessible as e-journals. There is a section for teaching as well. The downside, and Lee admits this in the opening page; is that the site needs maintenance some of the links may be dead-ends.  Regardless, it is one of the best sites on the subject and a great start for those who want a general view of what is available

2 Responses to “A (brief) Guide to Corpus Analysis Tools”

  1. Fantastic, I hadn’t heard about this topic up to the present. Thx!

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Connecting to %s

Follow

Get every new post delivered to your Inbox.