bootcat:help:corpus_creation_mode

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revisionBoth sides next revision
bootcat:help:corpus_creation_mode [2019/11/05 13:09] – [Local files (advanced)] erosbootcat:help:corpus_creation_mode [2019/11/08 09:08] – [Custom URLs (advanced)] eros
Line 52: Line 52:
 </file> </file>
  
 +NB: up to version 1.21, BootCaT does not accept URLs lists encoded as "UTF8 **with BOM**", the issue will be solved in future versions of BootCaT.
 ===== Local files (advanced) ===== ===== Local files (advanced) =====
  
Line 57: Line 58:
  
 Most common file formats are supported, including ''html'', ''pdf'' and ''doc'' files. Most common file formats are supported, including ''html'', ''pdf'' and ''doc'' files.
 +
 +===== Local queries (advanced) =====
 +
 +Using this mode, you can query Google normally using a web browser and save the result pages to a folder. Then you can tell BootCaT where this folder is and it will extract the URLs from the queries you saved.
 +
  • bootcat/help/corpus_creation_mode.txt
  • Last modified: 2023/04/19 10:56
  • by eros