Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
bootcat:help:corpus_creation_mode [2015/01/27 15:08]
eros [Custom URLs (advanced)]
bootcat:help:corpus_creation_mode [2016/11/14 12:29] (current)
eros
Line 8: Line 8:
   * [[bootcat:​help:​corpus_creation_mode#​custom_tuples_advanced|Custom tuples]] (advanced)   * [[bootcat:​help:​corpus_creation_mode#​custom_tuples_advanced|Custom tuples]] (advanced)
   * [[bootcat:​help:​corpus_creation_mode#​custom_urls_advanced|Custom URLs]] (advanced)   * [[bootcat:​help:​corpus_creation_mode#​custom_urls_advanced|Custom URLs]] (advanced)
 +  * [[bootcat:​help:​corpus_creation_mode#​local_files|Local files]] (advanced)
  
 {{:​bootcat:​help:​corpus_creation_modes.png?​nolink|}} {{:​bootcat:​help:​corpus_creation_modes.png?​nolink|}}
Line 51: Line 52:
  
 **N.B.**: only URLs pointing to HTML files will be downloaded (typical extensions for such files are ''​.htm'',​ ''​.html'',​ ''​.php'',​ ''​.asp''​),​ if the list you provide contains URLs ending in PDF, DOC, DOCX etc. BootCaT will display an error and will refuse to proceed. In order to continue you'll have to remove the links to unsupported file formats from the list. **N.B.**: only URLs pointing to HTML files will be downloaded (typical extensions for such files are ''​.htm'',​ ''​.html'',​ ''​.php'',​ ''​.asp''​),​ if the list you provide contains URLs ending in PDF, DOC, DOCX etc. BootCaT will display an error and will refuse to proceed. In order to continue you'll have to remove the links to unsupported file formats from the list.
 +
 +===== Local files (advanced) =====
 +
 +Using this mode BootCaT will process all files contained in a folder (and its subfolders) on your computer. Files will be cleaned and a single text file will be created.
  • bootcat/help/corpus_creation_mode.txt
  • Last modified: 2016/11/14 12:29
  • by eros