Just released the first version of my new Weka package for natural language processing (NLP): https://github.com/fracpete/nlp-weka-package At the moment, it contains only some filters (ChangeCase, PartOfSpeechTagging) and tokenizers (WhiteSpaceTokenizer, PTBTokenizer). It uses the Stanford parser for the NLP heavy lifting.
Tag Archive: package
Petr came across a bug that affected the output of the predictions generated from the test set. It worked fine for cross-validation, but not for the Random split and Unlabeled/Test set modes. I’ve committed a fix and made a new…
Just pushed out a new release of the collective-classification Weka package, incorporating feedback from the Weka mailing list. Changes: the Explorer panel now offers loading of an unlabeled/test set in Unlabeled/test set mode; classifiers now create copies of train/test sets in build…
A long time ago, I added a Weka meta-classifier for parameter optimization called MultiSearch. In contrast to GridSearch, which forces you to optimize two parameters (hence grid), this scheme allows you to optimize an arbitrary number of parameters. However, it…
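To illustrate the idea of searching over an arbitrary number of parameters rather than a fixed two-dimensional grid, here is a minimal Python sketch. It is not the MultiSearch implementation and does not use Weka; the names `multi_search`, `toy_score` and `space` are hypothetical, and the exhaustive strategy shown is only an assumption made for the example.

```python
from itertools import product


def multi_search(param_space, score):
    """Evaluate every combination of parameter values and keep the best one.

    param_space: dict mapping a parameter name to a list of candidate values
                 (any number of parameters, not just two).
    score:       function taking a dict of parameter values and returning a
                 number, where higher is better.
    """
    names = list(param_space)
    best_params, best_score = None, float("-inf")
    # The Cartesian product generalizes a 2-D grid to N dimensions.
    for values in product(*(param_space[name] for name in names)):
        candidate = dict(zip(names, values))
        current = score(candidate)
        if current > best_score:
            best_params, best_score = candidate, current
    return best_params, best_score


# Hypothetical objective with three parameters; it peaks at a=2, b=0.5, c=10.
def toy_score(p):
    return -((p["a"] - 2) ** 2 + (p["b"] - 0.5) ** 2 + (p["c"] - 10) ** 2)


space = {"a": [1, 2, 3], "b": [0.1, 0.5, 1.0], "c": [1, 10, 100]}
best, value = multi_search(space, toy_score)
print(best, value)
```

In practice, the scoring function would be something like a cross-validated evaluation of a classifier configured with the candidate values. The number of combinations grows multiplicatively with each added parameter, so the candidate lists should be kept short.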
Just made a new maintenance release available for the collective-classification project: it now works with Weka 3.7.11. You can download the Weka package from here: https://drive.google.com/folderview?id=0B4q6REcT3R4WcmN0bElLRHJUbHc&usp=sharing
Also created another Weka package for the PTStemmer developed by Pedro Oliveira: https://github.com/fracpete/ptstemmer-weka-package You can download package archives ready to install from the release section: https://github.com/fracpete/ptstemmer-weka-package/releases
Just created a new Weka package for the snowball stemmers: https://github.com/fracpete/snowball-stemmers-weka-package You can download Weka packages from the release section of that github repository: https://github.com/fracpete/snowball-stemmers-weka-package/releases