bootcat:help:corpus_creation_mode

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revisionBoth sides next revision
bootcat:help:corpus_creation_mode [2013/06/15 20:37] – [Custom URLs (advanced)] erosbootcat:help:corpus_creation_mode [2015/01/27 15:08] – [Custom URLs (advanced)] eros
Line 19: Line 19:
 ===== Custom tuples (advanced) ===== ===== Custom tuples (advanced) =====
  
-In this mode you skip the seed selection steps and directly provide a list of tuples: a window will open and you'll be able to type in the tuples.+In this mode you skip the seed selection step and directly provide a list of tuples: a window will open and you'll be able to type in the tuples.
  
 Remember that each line will become a single query to the search engine, therefore phrases should be enclosed in quotes. You tuples should look like this: Remember that each line will become a single query to the search engine, therefore phrases should be enclosed in quotes. You tuples should look like this:
Line 50: Line 50:
 </file> </file>
  
-**N.B.**: only URLs pointing to HTML files will be downloaded (typical extensions for such files are ''.htm'', ''.html'', ''.php'', ''.asp''), if the list you provide contains URLs ending in PDF, DOC, DOCX etc. BootCaT will display an error and will refuse to proceed. In order to continue you'll have to remove the links to illegal files from the list.+**N.B.**: only URLs pointing to HTML files will be downloaded (typical extensions for such files are ''.htm'', ''.html'', ''.php'', ''.asp''), if the list you provide contains URLs ending in PDF, DOC, DOCX etc. BootCaT will display an error and will refuse to proceed. In order to continue you'll have to remove the links to unsupported file formats from the list.
  • bootcat/help/corpus_creation_mode.txt
  • Last modified: 2023/04/19 10:56
  • by eros