This is an old revision of the document!
Version 1.21
- NEW (feature): pseudo-XML versions of the extracted plain text files are now created in the
xml_corpus
folder;
- NEW (feature): two new files are created,
corpus.txt
andcorpus.xml
containing the merged versions of the plain text and pseudo-XML version of the corpus;
- BUGFIX : fixed a bug that prevented download timeout to work properly, resulting in BootCaT to wait forever for certain URLs to download