snowball stemmers release
Just made a new release of the Snowball stemmers Weka package available, version 1.0.1, which is just a minor release: depends on Weka 3.7.12 now; fixed Maven integration; added unit tests for all stemmers.
Open-Source and related Stuff
Released a new version of the python-weka-wrapper library. It’s just a point-release with the following minor enhancements: the packages parameter of the weka.core.jvm.start() function can now also be used for specifying an alternative Weka home directory; added train_test_split method to…
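To illustrate the packages parameter mentioned above, here is a minimal sketch rather than anything taken from the release itself. The helper names jvm_start_options and run_with_weka_home, and any directory passed in, are made up for this example; the only library calls used are weka.core.jvm.start() and weka.core.jvm.stop().

```python
import os


def jvm_start_options(weka_home=None):
    """Build keyword arguments for weka.core.jvm.start().

    weka_home is a hypothetical, user-chosen directory. When it is omitted,
    packages=True asks the wrapper to load packages from the default Weka home.
    """
    if weka_home is None:
        return {"packages": True}
    # Expand "~" and make the path absolute before handing it to the wrapper.
    return {"packages": os.path.abspath(os.path.expanduser(weka_home))}


def run_with_weka_home(weka_home=None):
    """Start the JVM with package support, then shut it down again."""
    # Imported lazily so jvm_start_options() works without the library installed.
    import weka.core.jvm as jvm

    jvm.start(**jvm_start_options(weka_home))
    try:
        pass  # load data, build classifiers, etc.
    finally:
        jvm.stop()
```

Calling run_with_weka_home("~/wekafiles-alt") would then start the JVM against that (assumed) alternative home directory, while run_with_weka_home() falls back to the default one.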
Duh! Of course, there had to be a lurking bug that I only encountered when I was about to create a What’s new? video… A workaround would have been possible for the glitch with the Breakpoint control actor, but it…
Finally, finally, it’s here… The 0.4.9 release of ADAMS! Some highlights: 19 new actors; 1 new conversion; Flow editor debugging framework overhaul, allows step-by-step debugging now; Flow editor now offers Find usages of variables, storage items, callable actors; added support…
Despite WEKA offering attribute and instance weights, you could only set them programmatically or by manually fiddling with ARFF/XRFF/JSON files. This Weka list post prompted me today to quickly hack together a WEKA package with filters that allow setting the weights…
Mainly a release with added support for parameter optimization and some tools for making life easier when dealing with options. Here is the detailed list of changes since the 0.3.0 release: added get_tags class method to Tags class for easier…
It’s been a while since the last release, and there were quite a number of bugfixes and additions this time (e.g., database access, text mining), so it is well worth the upgrade. A major addition is the workflow component, encapsulating a lot…
Just released a new version of my new Weka package for natural language processing (NLP): https://github.com/fracpete/nlp-weka-package Changes: added example parser model (wekafiles/packages/nlp/models/englishPCFG.ser.gz); added Explorer tab for experimenting with parser setups and visualizing the parse trees. Here is a screenshot of…
Something that ADAMS’ Preview Browser has had for years, I’ve now added to Weka as a standalone tab in the Explorer: displaying the content of serialized model files. It allows the user to load a serialized model file (or actually…
Just released the first version of my new Weka package for natural language processing (NLP): https://github.com/fracpete/nlp-weka-package At the moment, it contains only some filters (ChangeCase, PartOfSpeechTagging) and tokenizers (WhiteSpaceTokenizer, PTBTokenizer). It uses the Stanford parser for the NLP heavy lifting.