bootcat:tutorials:basic_1

This is an old revision of the document!


BootCaT front-end tutorial

(by Eros Zanchetta and Federico Gaspari)

Welcome to the BootCat front-end tutorial!

This short and simple guide assumes no prior knowledge of the tool, and will walk you through the process of creating your own web corpus using the BootCaT front-end.

The tutorial is based on version 0.6 of the BootCaT front-end

This is the welcome screen, where you'll find some basic information about the BootCaT method for creating a web corpus.

Click “Next”.

:!: when you run BootCaT front-end for the first time a new folder named “BootCaT Corpora” is created in your home folder (or your “My documents” folder if you're on Windows). You can see all your corpora by clicking on “My corpora” in the File menu.

Here you have to choose a name for your new corpus. For example, insert the name “dogs” in the box. You can also choose a language/country combination, if you select “Unspecified” the language will be automatically guessed by the system based on the seeds you'll provide in the next step.

For now let's not specify a language, choose “Unspecified” from the drop-down list and click “Next” to move on to the next step.

:!: A note about languages: in this list you'll find only the “Markets” officially supported by the Bing API (please note the “API” part); this means that even if a language is supported by the Bing search engine, it may not necessarily be supported by the API. Also, BootCaT doesn't support multibyte character encoding yet (for instance, Chinese, Japanese and most Russian pages are *not* supported).


Next page

  • bootcat/tutorials/basic_1.1338885647.txt.gz
  • Last modified: 2012/06/05 08:40
  • by eros