wlys

    xiaoxiao2021-12-14  20

    a code testing in csdn blog

    content

    content

    import org.apache.spark.ml.feature.VectorIndexer val data = spark.read.format("libsvm").load("data/mllib/sample_libsvm_data.txt") val indexer = new VectorIndexer() .setInputCol("features") .setOutputCol("indexed") .setMaxCategories(10) val indexerModel = indexer.fit(data) val categoricalFeatures: Set[Int] = indexerModel.categoryMaps.keys.toSet println(s"Chose ${categoricalFeatures.size} categorical features: " + categoricalFeatures.mkString(", ")) // Create new column "indexed" with categorical values transformed to indices val indexedData = indexerModel.transform(data) indexedData.show()

    转载请注明原文地址: https://ju.6miu.com/read-964775.html

    最新回复(0)