All Projects → Speedml → Speedml

Speedml / Speedml

Licence: mit
Speedml is a Python package to speed start machine learning projects.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Speedml

Collapse
Advanced and Fast Data Transformation in R
Stars: ✭ 184 (-4.17%)
Mutual labels:  data-science
Virgilio
Virgilio is developed and maintained by these awesome people. You can email us virgilio.datascience (at) gmail.com or join the Discord chat.
Stars: ✭ 13,200 (+6775%)
Mutual labels:  data-science
Free Ai Resources
🚀 FREE AI Resources - 🎓 Courses, 👷 Jobs, 📝 Blogs, 🔬 AI Research, and many more - for everyone!
Stars: ✭ 192 (+0%)
Mutual labels:  data-science
Pca Magic
PCA that iteratively replaces missing data
Stars: ✭ 185 (-3.65%)
Mutual labels:  data-science
Lgo
Interactive Go programming with Jupyter
Stars: ✭ 2,225 (+1058.85%)
Mutual labels:  data-science
Gradio
Create UIs for your machine learning model in Python in 3 minutes
Stars: ✭ 4,358 (+2169.79%)
Mutual labels:  data-science
Awesome R Learning Resources
A curated collection of free resources to help deepen your understanding of the R programming language. Updated regularly. Contributions encouraged via pull request (see contributing.md).
Stars: ✭ 181 (-5.73%)
Mutual labels:  data-science
Uci Ml Api
Simple API for UCI Machine Learning Dataset Repository (search, download, analyze)
Stars: ✭ 190 (-1.04%)
Mutual labels:  data-science
Pytorch Lightning
The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
Stars: ✭ 16,641 (+8567.19%)
Mutual labels:  data-science
Delbot
It understands your voice commands, searches news and knowledge sources, and summarizes and reads out content to you.
Stars: ✭ 191 (-0.52%)
Mutual labels:  data-science
Dataaspirant codes
Complete machine learning model codes
Stars: ✭ 185 (-3.65%)
Mutual labels:  data-science
Dtale
Visualizer for pandas data structures
Stars: ✭ 2,864 (+1391.67%)
Mutual labels:  data-science
Observations
Tools for loading standard data sets in machine learning
Stars: ✭ 190 (-1.04%)
Mutual labels:  data-science
Homlr
Supplementary material for Hands-On Machine Learning with R, an applied book covering the fundamentals of machine learning with R.
Stars: ✭ 185 (-3.65%)
Mutual labels:  data-science
Klib
Easy to use Python library of customized functions for cleaning and analyzing data.
Stars: ✭ 192 (+0%)
Mutual labels:  data-science
Pywarm
A cleaner way to build neural networks for PyTorch.
Stars: ✭ 184 (-4.17%)
Mutual labels:  data-science
Vec4ir
Word Embeddings for Information Retrieval
Stars: ✭ 188 (-2.08%)
Mutual labels:  data-science
Pydbgen
Random dataframe and database table generator
Stars: ✭ 191 (-0.52%)
Mutual labels:  data-science
Zebras
Data analysis library for JavaScript built with Ramda
Stars: ✭ 192 (+0%)
Mutual labels:  data-science
Deon
A command line tool to easily add an ethics checklist to your data science projects.
Stars: ✭ 191 (-0.52%)
Mutual labels:  data-science

====================================== Speedml Machine Learning Speed Start

Speedml is a Python package for speed starting machine learning projects.

The speedml.com_ website documents API use cases, behind-the-scenes implementation, features, best practices, and demos in detail.

To see Speedml in action run or download the notebook_ Titanic Solution Using Speedml which walks through an end-to-end machine learning solution documenting features of the Speedml API.

Latest Speedml release is always available on the PyPi_ website.

Install Speedml

On a clean system start by installing the awesome Anaconda Python distribution_.

XGBoost is one of the most popular frameworks used by winning solutions solving for Machine Learning competitions on Kaggle. Speedml makes it easy to get started with XGBoost::

conda install -c https://conda.anaconda.org/akode xgboost

Install Speedml package using pip like so::

pip install speedml

Docs and Demos

We manage the project on GitHub.

  • Demo notebooks_
  • GitHub repo_
  • Project roadmap_
  • Issues_ tracking

We are authoring Speedml API with four goals in mind.

Popular. Best packages together

Speedml already imports and properly initializes the popular ML packages including pandas, numpy, sklearn, xgboost, and matplotlib. All you need to do is import speedml to get started::

from speedml import Speedml

Rapid. Machine learning speed start

Coding is up to 3X faster when using Speedml because of (1) iterative development, (2) linear workflow, and (3) component-based API.

These three lines of Speedml code (a) load the training, test datasets, (b) define the target and unique id features, (c) plot the feature correlation matrix heatmap for numerical features, (d) perform a detailed EDA returning 10-15 observations and next steps for making the datasets model ready::

sml = Speedml('train.csv', 'test.csv', target='Survived', uid='PassengerId') sml.plot.correlate() sml.eda()

Easy. Concise commands with sensible defaults

A notebook using Speedml reduces coding required by up to 70%. Speedml API implements methods requiring zero to minimal number of parameters, working on sensible defaults.

Call to this single method replaces empty values in the entire dataframe with median value for numerical features and most common values for text features::

sml.feature.impute()

Productive. Intuitive linear workflow

Understanding machine learning fundamentals is a breeze with Speedml as we have designed the API to follow a linear workflow with sensible prerequisites and intuitive next steps.

These three lines of Speedml code perform feature engineering by replacing null values, extracting a new feature matching a regular expression, and dropping a feature that is no longer required::

sml.feature.fillna(a='Cabin', new='Z') sml.feature.extract(new='Deck', a='Cabin', regex='([A-Z]){1}') sml.feature.drop(['Cabin'])

Hope you enjoy using Speedml in your projects. Watch this space as we intend to update Speedml frequently with more cool features.

.. _PyPi: https://pypi.python.org/pypi/speedml .. _documentation: http://pythonhosted.org/speedml/ .. _speedml.com: https://speedml.com .. _repo: https://github.com/Speedml/speedml .. _roadmap: https://github.com/Speedml/speedml/projects/1 .. _notebooks: https://github.com/Speedml/notebooks .. _Issues: https://github.com/Speedml/speedml/issues .. _notebook: https://github.com/Speedml/notebooks/blob/master/titanic/titanic-solution-using-speedml.ipynb .. _distribution: https://www.anaconda.com/download/

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].