org.apache.spark.mllib.feature
Train an entropy minimization discretizer given an RDD of LabeledPoints.
RDD of LabeledPoints.
Indexes of the features to be discretized. If not provided, the algorithm selects the features with more than 256 (byte range) distinct values.
Maximum number of thresholds to select per feature.
Maximum number of elements per partition.
A DiscretizerModel with the resulting thresholds.
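
To illustrate what "entropy minimization" means for a single feature, here is a minimal sketch in plain Scala collections (no Spark). It picks the one cut point that minimizes the weighted class entropy of the two resulting partitions. The names EntropyCutSketch, entropy and bestThreshold are hypothetical and are not part of this package; the actual discretizer works on an RDD, can select several thresholds per feature, and may differ in how it chooses candidates and when it stops.

```scala
// Illustrative sketch only: none of these names belong to the library.
object EntropyCutSketch {

  // Shannon entropy (base 2) of a sequence of class labels.
  def entropy(labels: Seq[Double]): Double = {
    val n = labels.size.toDouble
    if (n == 0) 0.0
    else labels.groupBy(identity).values.map { group =>
      val p = group.size / n
      -p * math.log(p) / math.log(2)
    }.sum
  }

  // points are (label, featureValue) pairs.
  // Returns the cut point that minimizes the weighted class entropy of the
  // two partitions, or None if the feature has fewer than two distinct values.
  def bestThreshold(points: Seq[(Double, Double)]): Option[Double] = {
    val sorted = points.sortBy(_._2)
    val n = sorted.size.toDouble

    // Candidate cut points: midpoints between consecutive distinct values.
    val candidates = sorted.map(_._2).distinct.sliding(2).collect {
      case Seq(a, b) => (a + b) / 2
    }.toSeq

    if (candidates.isEmpty) None
    else Some(candidates.minBy { t =>
      val (left, right) = sorted.partition(_._2 <= t)
      (left.size / n) * entropy(left.map(_._1)) +
        (right.size / n) * entropy(right.map(_._1))
    })
  }
}
```

For example, with the four points (label 0.0, value 1.0), (0.0, 2.0), (1.0, 3.0) and (1.0, 4.0), bestThreshold returns Some(2.5), since cutting there leaves both partitions with a single class.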