Pentaho Data Mining
Pentaho Data Mining, based on the Weka project, is a comprehensive set of tools for machine learning and data mining. Its broad suite of classification, regression, association-rule and clustering algorithms can help you understand your business better and improve future performance through predictive analytics.
Recent News and Releases
- 06/05/09 Weka 3.7.0 is now available.
- 06/05/09 Weka 3.6.1 is now available.
- 06/05/09 Weka 3.4.15 is now available.
- 06/05/09 English documentation for Weka 3.7.0 is now available.
- 06/05/09 English documentation for Weka 3.6.1 is now available.
- 12/19/08 Weka 3.4.14 is now available.
- 12/19/08 English documentation for Weka 3.6.0 is now available.
- 12/19/08 English documentation for Weka 3.4.14 is now available.
- 12/15/08 National Health Service Islington Selects Pentaho Business Intelligence to Improve Patient Services (press release).
- 09/11/08 Support for importing PMML models into Weka (press release).
- 12/06/07 Weka Plugins for Pentaho Data Integration 3.0 are now available.
- 12/06/07 Pentaho streamlines delivery of predictive analytics (press release).
Stable

Weka 3.4.15 (GA) (Release Notes)
This is a patch release to Weka 3.4 containing a number of bug fixes. For a detailed list of improvements, please refer to the release notes.

New Features since 3.2
- ARFF Viewer
- General purpose graph visualizer
- Improvements to Knowledge Flow including support for ROC curves, CSV data
  sink, clustering, database connectivity and a prediction appender step
- 10 new algorithms
- XML serialization support
- Click here for a detailed list of new features

Weka 3.6.1 (GA) (Release Notes)
This is a stable version created from the head of the development code line. The 3.6 code line will receive bug-fixes only (development of new features continues in 3.7). For a detailed list of improvements, please refer to the release notes.

New Features since 3.4
- 35 new learning schemes
- 17 new filters
- Grouping of steps (MetaBean) in Knowledge Flow
- New SQL viewer and visualization plugin support in Explorer
- Area under ROC (AUC) evaluation type
- Relation-valued attributes (supports multi-instance learning)
- Support for incremental clusterers
- XML format for instances
- Text directory to ARFF tool
- Several new data generators
In Development

Weka 3.7
This is the new development branch of Weka. It continues from 3.5.8 and will include new features as well as bug fixes.

Planned Features
- Final committed feature list TBD
How to Contribute
You can participate by contributing new code, reporting bugs, testing new releases, answering questions and more. Email us your proposed contribution and any other relevant details. Welcome to the team.
- Write a tech tip
- Report a bug in JIRA
- Answer posts on the forums
- Write some code
What's Next
To suggest a new feature or view our roadmap, click here.

Major features planned in future releases:
- Further PMML support (import/export)
- Pluggable estimators in EM
- Execution of Kettle transforms in KnowledgeFlow
- KnowledgeFlow plugin for Kettle
- Data mining component for the BI Platform
- Enhancements to the Kettle Scoring Plugin, including:
  • Support for training/updating incremental Weka models
  • Support for PMML models

Additional Information

Also Check Out:

Pentaho Data Integration
Pentaho Analysis Services - Mondrian Project
Pentaho Reporting
 
© 2009, Pentaho Corporation