All Projects → rapidsai → spark-examples

rapidsai / spark-examples

Licence: Apache-2.0 license
[ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples

Programming Languages

Jupyter Notebook
11667 projects
shell
77523 projects
Dockerfile
14818 projects

Please note that this repo has been moved to the new repo spark-xgboost-examples.

This repo provides docs and example applications that demonstrate the RAPIDS.ai GPU-accelerated XGBoost-Spark project.

Examples

Getting Started Guides

Try one of the Getting Started guides below. Please note that they target the Mortgage dataset as written, but with a few changes to EXAMPLE_CLASS, trainDataPath, and evalDataPath, they can be easily adapted to the Taxi or Agaricus datasets.

You can get a small size datasets for each example in the datasets folder. These datasets are only provided for convenience. In order to test for performance, please prepare a larger dataset by following Preparing Datasets. We also provide a larger dataset: Morgage Dataset (1 GB uncompressed), which is used in the guides below.

These examples use default parameters for demo purposes. For a full list please see Supported XGBoost Parameters for Scala or Python

XGBoost-Spark API

Advanced Topics

Contact Us

Please see the RAPIDS website for contact information.

License

This content is licensed under the Apache License 2.0

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].