Version 1.21
- NEW (feature): pseudo-XML versions of the extracted plain text files are now created in the
xml_corpus folder; a single corpus.xml file is also created, containing the merged version of the pseudo-XML corpus;
- NEW (feature): in the “Project Definition” step, you can now add up to three XML attributes to the XML version of the corpus;
 
- NEW (feature): a random string is now appended to the names of downloaded files, individual corpus text files and XML corpus files; this makes it possible to easily merge different corpora in the same folder; file names still start with a sequential number;
 
- BUGFIX: fixed a bug that prevented the download timeout from working properly, causing BootCaT to wait forever for certain URLs to download.
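
To illustrate why the random string in file names makes merging corpora safe, here is a minimal Python sketch. It is not BootCaT's actual code: the function name corpus_file_name, the underscore separator, the four-digit padding and the eight-character hexadecimal suffix are all assumptions made for illustration, since the exact naming format is not stated in these notes.

```python
import secrets


def corpus_file_name(index: int, extension: str = "txt", suffix_len: int = 8) -> str:
    """Build a file name that starts with a number and ends with a random string.

    Hypothetical scheme, for illustration only: the real format used by
    BootCaT may differ.
    """
    # token_hex(n) returns 2*n hexadecimal characters.
    random_part = secrets.token_hex(suffix_len // 2)
    return f"{index:04d}_{random_part}.{extension}"


# Two separate corpora both number their files 1, 2, 3, ...
corpus_a = [corpus_file_name(i) for i in range(1, 4)]
corpus_b = [corpus_file_name(i) for i in range(1, 4)]

# Names from both corpora share the same numeric prefixes, but the random
# suffixes make a name clash in a shared folder very unlikely.
for name in sorted(corpus_a + corpus_b):
    print(name)
```

Because the number still comes first, sorting the merged folder by name keeps files with the same position in their respective corpora next to each other.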