Just released the first version of my new Weka package for natural language processing (NLP): https://github.com/fracpete/nlp-weka-package At the moment, it contains only some filters (ChangeCase, PartOfSpeechTagging) and tokenizers (WhiteSpaceTokenizer, PTBTokenizer). It uses the Stanford parser for the NLP heavy lifting.
Tag Archive: gpl3
I always wanted to be able to visualize large confusion matrices as a heatmap. Making it easier to visualize where misclassifications hot spots are. Hence I started another plugin project for the Weka Explorer https://github.com/fracpete/confusionmatrix-weka-package It offers, at the moment,…
Last year, while working on a consulting project, I had to export lots of screenshots from ADAMS. I got so annoyed at constantly having to click through my directory hierarchy, that I implemented a little accessory component for the JFileChooser…
Also created another Weka package for the PTStemmer developed by Pedro Oliveira: https://github.com/fracpete/ptstemmer-weka-package You can download package archives ready to install from the release section: https://github.com/fracpete/ptstemmer-weka-package/releases
Just created a new Weka package for the snowball stemmers: https://github.com/fracpete/snowball-stemmers-weka-package You can download Weka packages from the release section of that github repository: https://github.com/fracpete/snowball-stemmers-weka-package/releases