tutorials:basic_2

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
tutorials:basic_2 [2011/12/14 15:33] erostutorials:basic_2 [2012/05/30 15:22] (current) – removed eros
Line 1: Line 1:
-====== BootCaT front-end tutorial - Part 2 ====== 
  
-[[tutorials:basic_1|{{:buttons:previous.png|Previous page}}]] 
-[[tutorials:basic_3|{{ :buttons:next.png|Tutorial part 3}}]] 
- 
-==== Providing your chosen seeds ==== 
- 
-This is the most important step in the corpus creation process: here you provide the seeds that will be used to generate the queries that will be submitted to the search engine. 
- 
-Type (or copy/paste) the seeds that you choose into the text box (one seed per line, multi-word seeds go on the same line, quotes are not necessary), as in the example provided: 
- 
-<file> 
-dog 
-Fido 
-food hygiene 
-leash 
-breeds 
-pet 
-</file> 
- 
-The minimum number of seeds you must provide is 5; here for the purposes of illustration we used 6. 
- 
-{{:tutorials:basic_steps:003.png|}} 
- 
-Once you have provided the seeds of your choice, check the "I'm done editing seeds" box and click "Next". 
- 
-{{:tutorials:basic_steps:004.png|}} 
-==== Tuple generation ==== 
- 
-The seeds you provided in the previous step will be randomly grouped to form [[wp>tuples]] (a variety of combinations of your seeds). These tuples will be submitted as queries to the search engine. 
- 
-{{:tutorials:basic_steps:005.png|}} 
- 
-You can choose the number of tuples to be generated; of course the number of possible random combinations is finite and depends on how many seeds you provided. The maximum number of tuples you can generate is shown in parentheses. Since we provided 6 seeds, we can generate a maximum of 20 tuples. We choose to generate 15 tuples. 
- 
-You can also alter the **length** of the tuple (i.e. the number of seeds forming it); typical values for this option are: 
- 
-  * 3 if you want to build a specialized corpus 
-  * 2 if you are creating a general language corpus and are using general language words 
- 
-We'll use a length of 3 and recommend that you do the same. 
- 
-Once you're finished setting the options, click on "Generate tuples" 
- 
-{{:tutorials:basic_steps:006.png|}} 
- 
-Here you can also unselect individual tuples if you think that they will not yield interesting results. 
- 
-:!: Notice how "food hygiene" has been automatically surrounded with quotes. The tuples in which this seed appears are 4 **words** long but only 3 **seeds** long since "food hygiene" counts as a single seed. 
- 
-Click "Next" to proceed to the next step. 
- 
-[[tutorials:basic_1|{{:buttons:previous.png|Previous page}}]] 
-[[tutorials:basic_3|{{ :buttons:next.png|Tutorial part 3}}]] 
  • tutorials/basic_2.1323873208.txt.gz
  • Last modified: 2011/12/14 15:33
  • by eros