Friday, October 29, 2010

Updated Stacking Metaclassifier

Since hydrat was reworked to generalize 'CrossValidation' and 'TrainTest' into a DataSet-level declaration of a split, the Stacking metaclassifier had been unusable, because it relied on the CrossValidation TaskSet subclass. This is now fixed: Stacking performs its crossvalidation using the same machinery that generates a crossvalidation taskset from a multi-fold split. Crossvalidation is currently hardcoded into the stacking metaclassifier, but it might be possible to generalize this further by implementing functionality similar to splits.
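To make the role of the internal crossvalidation concrete, here is a minimal sketch of how stacking can build its meta-level features from held-out predictions. It is not hydrat code: `make_folds`, `stacked_features`, the `train(X, y) -> predict(x)` learner interface and the two toy learners are all hypothetical names invented for this illustration.

```python
from collections import Counter


def make_folds(n_items, n_folds):
    """Assign item indices to folds round-robin; returns a list of index lists."""
    return [list(range(k, n_items, n_folds)) for k in range(n_folds)]


def stacked_features(X, y, folds, learners):
    """Build meta-level features from out-of-fold base-learner predictions.

    `learners` is a list of functions train(X, y) -> predict(x). This interface
    is an assumption made for the sketch, not hydrat's API.
    Each returned row holds every base learner's prediction for one item, made
    by a model that did not see that item during training.
    """
    meta = [[None] * len(learners) for _ in X]
    for held_out in folds:
        held = set(held_out)
        train_idx = [i for i in range(len(X)) if i not in held]
        train_X = [X[i] for i in train_idx]
        train_y = [y[i] for i in train_idx]
        for j, train in enumerate(learners):
            predict = train(train_X, train_y)
            for i in held_out:
                meta[i][j] = predict(X[i])
    return meta


def majority_learner(X, y):
    """Toy base learner: always predicts the most common training label."""
    label = Counter(y).most_common(1)[0][0]
    return lambda x: label


def nearest_neighbour_learner(X, y):
    """Toy base learner: 1-nearest-neighbour on a single numeric feature."""
    def predict(x):
        best = min(range(len(X)), key=lambda i: abs(X[i] - x))
        return y[best]
    return predict


X = [0.0, 0.1, 0.2, 1.0, 1.1, 1.2]
y = ["a", "a", "a", "b", "b", "b"]
folds = make_folds(len(X), 3)
meta = stacked_features(X, y, folds, [majority_learner, nearest_neighbour_learner])
```

The rows of `meta`, paired with `y`, would then be the training data for the meta-level classifier. Training the base learners only on the other folds is what keeps the meta-level learner from seeing predictions made on items the base learners were trained on.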

The new Stacking is also not sequence-aware. More work is required to determine exactly how sequence information should interact with stacking. For example, the internal crossvalidation would need to respect sequence boundaries, so that a single sequence is never divided across folds.
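One possible reading of "respecting sequence boundaries" is to assign whole sequences to folds rather than individual items. The sketch below illustrates only that idea; `sequence_folds` and the representation of a sequence as a list of item indices are assumptions made for the example, not anything hydrat provides, and the greedy balancing is just one simple choice.

```python
def sequence_folds(sequences, n_folds):
    """Distribute whole sequences over folds so no sequence is split.

    `sequences` is a list of lists of item indices (hypothetical format).
    Longest sequences are placed first, each into the currently smallest
    fold, which keeps fold sizes roughly even.
    """
    folds = [[] for _ in range(n_folds)]
    for seq in sorted(sequences, key=len, reverse=True):
        smallest = min(folds, key=len)
        smallest.extend(seq)
    return folds


sequences = [[0, 1, 2], [3, 4], [5], [6, 7]]
seq_folds = sequence_folds(sequences, 2)
```

Folds built this way can differ in size when sequence lengths vary a lot, which is one of the questions that still needs working out.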
