Weka (KnowledgeFlow) - Part III: Descriptor scaling

Scale the training set

  1. Put ArffLoader component (DataSources) to layout area and configure it to load a training set from a file.
  2. Put ClassAssigner component (Filters) to layout area and connect the dataSet connection from the ArffLoader component to it.
    • Configure it by setting the classIndex to the class column.
  3. Put Normalize component (Filters) layout area and connect the dataSet connection from the ClassAssigner component to it.
  4. Run.

It is easy to use KnowledgeFlow to scale descriptors in the training set. However, it seems that there is no option to save the parameters used to scale the descriptors in the training set and then apply them on a testing set. This would make it difficult to assess the performance of a model on an independent validation set.

Share This

Leave a Reply


Close
E-mail It