bootcat:tutorials:basic_3

Differences

This shows you the differences between two versions of the page.

bootcat:tutorials:basic_3 [2018/02/07 13:50] eros
bootcat:tutorials:basic_3 [2022/10/07 10:58] – [Saving query results] eros
Line 8:
 It's time to generate the queries that will be sent to the search engine (i.e. Google) using the tuples we generated earlier. The queries we generate here will be used in the next step to open a browser and save results.
  
-A number of parameters can be specified here, but, for the purposes of this tutorial, we'll just accept the default values and click on "Next".
+A number of parameters can be specified here, but, for the purposes of this tutorial, we'll just accept the default values and click on "Generate Queries".
  
-{{bootcat:tutorials:basic_steps:0065.png?nolink|}}
+{{ bootcat:tutorials:basic_steps:0065.png?nolink |}}
  
-==== Collecting URLs ====
+==== Saving query results ====
  
-What happens here is that we open each of the queries generated in the previous step in a web browser. Each query consists of the tuples (combinations of our seeds) we generated earlier. This identifies texts that are relevant to the more or less specific corpus (domain) in which we are interested, based on how specialized or general the seeds are.
+What happens here is that we open each of the queries generated in the previous step in a web browser. Each tuple (combinations of our seeds) generated earlier becomes a query. This method allows us to identify texts that are relevant to the more or less specific corpus (domain) in which we are interested, based on how specialized or general the seeds are.
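How a tuple of seeds could become a search query can be sketched as follows. This is only an illustration, not BootCaT's own code: the function name `tuple_to_query_url`, the example seeds and the choice to quote multi-word seeds are all assumptions made for the example.

```python
from urllib.parse import urlencode


def tuple_to_query_url(seed_tuple, base="https://www.google.com/search"):
    """Build a search URL from one tuple of seeds (hypothetical helper)."""
    # Assumption: multi-word seeds are quoted so they are searched as phrases.
    terms = [f'"{seed}"' if " " in seed else seed for seed in seed_tuple]
    # urlencode takes care of escaping spaces and quotation marks.
    return f"{base}?{urlencode({'q': ' '.join(terms)})}"


# Example tuples, invented for illustration only.
tuples = [("dog", "leash", "puppy"), ("dog food", "kennel", "breed")]
query_urls = [tuple_to_query_url(t) for t in tuples]

for url in query_urls:
    print(url)
```

More specialized seeds give narrower queries, so the pages returned tend to be closer to the domain of interest.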
    
-{{bootcat:tutorials:basic_steps:008.png?nolink|}}
+{{ bootcat:tutorials:basic_steps:008.png?nolink |}}
  
 Click on "Open in browser", a message will appear explaining what's about to happen and the folder where you'll need to save the results page. You can also open the folder by clicking on "Open folder".
  
-{{bootcat:tutorials:basic_steps:0085.png?nolink|}}
+{{ bootcat:tutorials:basic_steps:0085.png?nolink |}}
  
 Once you click on "OK" your default Web browser will open and you'll see the results of the query, the page will look something like this:
  
-{{bootcat:tutorials:basic_steps:0087.png?nolink|}}
+{{ bootcat:tutorials:basic_steps:0087.png?nolink |}}
  
-Now you need to save the page by using the "Save page" function of your browser (on Windows you can just press CTRL-S, on MacOS CMD-S), a dialog box will appear asking you where you want to save the page. You need to select the folder **BootCaT Corpora -> dogs -> queries**:
+Now you need to save the page by using the "Save page" function of your browser (on Windows you can just press CTRL-S, on MacOS press CMD-S), a dialog box will appear asking you where you want to save the page. You need to select the folder '''BootCaT Corpora -> dogs -> queries'''.
  
-{{bootcat:tutorials:basic_steps:0088.png?nolink|}}
+**NB**: make sure you're saving the queries either as "Web page, Complete", "Web page, HTML only" or "page Source", basically you need to save them in HTML format (and not in MHTML or some other compressed format).
+
+{{ bootcat:tutorials:basic_steps:0088.png?nolink |}}
+
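The NB above asks for plain HTML rather than MHTML. A quick way to check a folder of saved pages might look like the sketch below. It is a heuristic, not part of BootCaT: the function names are hypothetical, and the check relies on the assumption that an MHTML file either has an `.mht`/`.mhtml` extension or declares `multipart/related` near its start.

```python
from pathlib import Path


def looks_like_mhtml(path):
    """Heuristic: MHTML archives declare a multipart/related MIME type."""
    head = Path(path).read_bytes()[:2048].lower()
    return b"multipart/related" in head


def find_non_html(folder):
    """Return names of files in `folder` that appear to be MHTML."""
    flagged = []
    for entry in sorted(Path(folder).iterdir()):
        # "Web page, Complete" also creates a resources subfolder; skip it.
        if not entry.is_file():
            continue
        if entry.suffix.lower() in {".mht", ".mhtml"} or looks_like_mhtml(entry):
            flagged.append(entry.name)
    return flagged
```

Calling `find_non_html` on the queries folder should return an empty list if every page was saved as HTML; any name it returns is a page worth saving again.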
+==== Collecting URLs ====
  
-Once you're done saving the results of all queries, click on "Collect URLs":
+Once you're done saving the results of all queries, click on "Collect URLs" and you'll be taken to the next step:
  
-{{bootcat:tutorials:basic_steps:0089.png?nolink|}}
+{{ bootcat:tutorials:basic_steps:0089.png?nolink |}}
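Conceptually, collecting URLs means reading the links out of the saved result pages. The sketch below shows one way this could be done with Python's standard library. It does not reproduce what BootCaT does internally: the class and function names are hypothetical, and skipping hosts that contain `google.` is an assumption made to drop the search engine's own links.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class LinkCollector(HTMLParser):
    """Collect absolute http(s) links from <a href="..."> tags."""

    def __init__(self):
        super().__init__()
        self.urls = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        # Relative links (no scheme) belong to the results page itself.
        if href and urlparse(href).scheme in ("http", "https"):
            self.urls.append(href)


def collect_urls(html_text, skip_domains=("google.",)):
    """Return unique external URLs in order of first appearance."""
    parser = LinkCollector()
    parser.feed(html_text)
    seen, result = set(), []
    for url in parser.urls:
        host = urlparse(url).netloc.lower()
        if url in seen or any(d in host for d in skip_domains):
            continue
        seen.add(url)
        result.append(url)
    return result
```

Each saved page would be read as text and passed to `collect_urls`; merging the lists from all pages gives the candidate URLs for the corpus.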
  
 :!:: you can choose to click on "Open All in Browser" to send all queries to the browser with a single click, but this sometimes results in Google blocking the operation.
  • bootcat/tutorials/basic_3.txt
  • Last modified: 2022/10/07 10:58
  • by eros