This shows you the differences between two versions of the page.
bootcat:release_notes:1.21 [2019/07/15 12:14] – eros | bootcat:release_notes:1.21 [2019/10/29 14:47] (current) – eros |
---|---|
====== Version 1.21 ====== | ====== Version 1.21 ====== |
| |
| * **NEW (feature)**: Windows and Mac users no longer need to install Java for BootCaT to work, because Java is now included in the distribution package; |
| |
* **NEW (feature)**: pseudo-XML versions of the extracted plain text files are now created in the ''xml_corpus'' folder; a single ''corpus.xml'' file is also created, containing the merged version of the pseudo-XML corpus; the XML version of the corpus contains more metadata than the plain text version: | * **NEW (feature)**: pseudo-XML versions of the extracted plain text files are now created in the ''xml_corpus'' folder; a single ''corpus.xml'' file is also created, containing the merged version of the pseudo-XML corpus; the XML version of the corpus contains more metadata than the plain text version: |
* ''id'' (the URL of the original file), | * ''id'', a unique identifier for the document consisting of the corpus name followed by a number, |
* ''content_type'' of the original file, | * ''filename'' of the downloaded file (essentially the ''id'' plus the file extension), |
* ''filename'' of the downloaded file; | * ''uri'', the URI of the original file, |
| * ''content_type'' of the original file; |
| |
* **NEW (feature)**: in the "Project Definition" step, you can now add up to three user-defined XML attributes to the XML version of the corpus; | * **NEW (feature)**: in the "Project Definition" step, you can now add up to three user-defined XML attributes to the XML version of the corpus; |