This shows you the differences between two versions of the page.
| |
bootcat:release_notes:toolkit:0.18 [2013/06/17 18:00] – created eros | bootcat:release_notes:toolkit:0.18 [2014/09/09 15:25] (current) – eros |
---|
====== Version 0.18 (TBA) ====== | ====== Version 0.18 (2014-09-10) ====== |
| |
* **New tool**: ''BootCaTExtractor.jar'' performs the same task as ''retrieve_and_clean_pages_from_url_list.pl'' but, unlike the Perl script, supports UTF-8 , language filtering and document size filtering; | * **New tool**: ''BootCaTExtractor.jar'' performs the same task as ''retrieve_and_clean_pages_from_url_list.pl'' but, unlike the Perl script, supports UTF-8 , language filtering and document size filtering; |
* ''UrlCollector.jar'' does not require the "market" parameter anymore; | * ''UrlCollector.jar'' does not require the "market" parameter anymore; |
| |