/Spark-ml-algo-lib

Primary LanguageScalaApache License 2.0Apache-2.0

Spark-ml-algo-lib

Introduction

MACHINE LEARNING Algorithm library, running on KUNPENG chipset, is an accelerated library that provides a rich set of higher-level tools for machine learning algorithms. It is based on the original APIs from Apache Spark version 2.3.2. The accelerated library for performance optimization greatly improves the computational performance of big data algorithm scenarios.

The current version provides three common learning algorithms: Support Vector Machine (SVM) Algorithm, Random Forest Classifier (RFC) algorithm, and Gradient Boosting Decision Tree (GBDT) algorithm. You can find the latest documentation, including a programming guide, on the project web page. This README file only contains basic setup instructions.

Building

(1) Create a lib file under Spark-ml-algo-lib/ml-accelerator :

mkdir Spark-ml-algo-lib/ml-accelerator/lib

(2) Download the “ml-kernel.jar”, put into the lib file you just created

(3) Compile your project under “Spark-ml-algo-lib/”

cd Spark-ml-algo-lib/

sbt package

Get ml-core.jar under “ml-core/target/scala-2.11”

Get ml-acc.jar under “ml-accelerator/target/scala-2.11”

Contact

Spark is distributed through GitHub. For the latest version, a bug tracker, and other information, see

http://spark.apache.org/

or the repository at

https://github.com/apache/spark/tree/v2.3.2