All Projects β†’ picnicml β†’ Doddle Model

picnicml / Doddle Model

Licence: apache-2.0
🍰 doddle-model: machine learning in Scala.

Programming Languages

scala
5932 projects

Projects that are alternatives of or similar to Doddle Model

2016 Ml Contest
Machine learning contest - October 2016 TLE
Stars: ✭ 135 (-4.93%)
Mutual labels:  data-science
Datasciencecoursera
Data Science Repo and blog for John Hopkins Coursera Courses. Please let me know if you have any questions.
Stars: ✭ 1,928 (+1257.75%)
Mutual labels:  data-science
Nlpaug
Data augmentation for NLP
Stars: ✭ 2,761 (+1844.37%)
Mutual labels:  data-science
Accelerator
The Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (-3.52%)
Mutual labels:  data-science
Traffic
A toolbox for processing and analysing air traffic data
Stars: ✭ 138 (-2.82%)
Mutual labels:  data-science
Toma
Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory
Stars: ✭ 139 (-2.11%)
Mutual labels:  data-science
Qlik Py Tools
Data Science algorithms for Qlik implemented as a Python Server Side Extension (SSE).
Stars: ✭ 135 (-4.93%)
Mutual labels:  data-science
Coffee Quality Database
Building the Coffee Quality Institute Database
Stars: ✭ 141 (-0.7%)
Mutual labels:  data-science
Python For Data Science
A blog for data analytics using data science technologies
Stars: ✭ 139 (-2.11%)
Mutual labels:  data-science
Python Cheat Sheet
Python Cheat Sheet NumPy, Matplotlib
Stars: ✭ 1,739 (+1124.65%)
Mutual labels:  data-science
Youtube Like Predictor
YouTube Like Count Predictions using Machine Learning
Stars: ✭ 137 (-3.52%)
Mutual labels:  data-science
Scilab
Free and Open Source software for numerical computation providing a powerful computing environment for engineering and scientific applications.
Stars: ✭ 138 (-2.82%)
Mutual labels:  data-science
Interactive machine learning
IPython widgets, interactive plots, interactive machine learning
Stars: ✭ 140 (-1.41%)
Mutual labels:  data-science
Data Science Wg
SF Brigade's Data Science Working Group.
Stars: ✭ 135 (-4.93%)
Mutual labels:  data-science
Matrixprofile
A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, accessible to everyone.
Stars: ✭ 141 (-0.7%)
Mutual labels:  data-science
Pandasschema
A validation library for Pandas data frames using user-friendly schemas
Stars: ✭ 135 (-4.93%)
Mutual labels:  data-science
Complete Life Cycle Of A Data Science Project
Complete-Life-Cycle-of-a-Data-Science-Project
Stars: ✭ 140 (-1.41%)
Mutual labels:  data-science
Raspberryturk
The Raspberry Turk is a robot that can play chessβ€”it's entirely open source, based on Raspberry Pi, and inspired by the 18th century chess playing machine, the Mechanical Turk.
Stars: ✭ 140 (-1.41%)
Mutual labels:  data-science
Book
This book serves as an introduction to a whole new way of thinking systematically about geographic data, using geographical analysis and computation to unlock new insights hidden within data.
Stars: ✭ 141 (-0.7%)
Mutual labels:  data-science
Ripser.py
A Lean Persistent Homology Library for Python
Stars: ✭ 139 (-2.11%)
Mutual labels:  data-science
doddle-model
Latest Release Build Status Coverage Code Quality License Chat
latest release build status coverage code quality license chat

doddle-model is an in-memory machine learning library that can be summed up with three main characteristics:

How does it compare to existing solutions?

doddle-model takes the position of scikit-learn in Scala and as a consequence, it's much more lightweight than e.g. Spark ML. Fitted models can be deployed anywhere, from simple applications to concurrent, distributed systems built with Akka, Apache Beam or a framework of your choice. Training of estimators happens in-memory, which is advantageous unless you are dealing with enormous datasets that absolutely cannot fit into RAM.

Installation

The project is published for Scala versions 2.11, 2.12 and 2.13. Add the dependency to your SBT project definition:

libraryDependencies  ++= Seq(
  "io.github.picnicml" %% "doddle-model" % "<latest_version>",
  // add optionally to utilize native libraries for a significant performance boost
  "org.scalanlp" %% "breeze-natives" % "1.0"
)

Note that the latest version is displayed in the Latest Release badge above and that the v prefix should be removed from the SBT definition.

Getting Started

For a complete list of code examples see doddle-model-examples.

Contributing

Want to help us? πŸ™Œ We have a document that will make deciding how to do that much easier.

Performance

Performance of implementations is described here. Also, take a peek at what's written in that document if you encounter java.lang.OutOfMemoryError: Java heap space.

Core Maintainers

This is a collaborative project which wouldn't be possible without all the awesome contributors. The core team currently consists of the following developers:

Resources

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].