All Projects → kaggledatasets → kaggledatasets

kaggledatasets / kaggledatasets

Licence: Apache-2.0 license
Collection of Kaggle Datasets ready to use for Everyone (Looking for contributors)

Programming Languages

python
139335 projects - #7 most used programming language
shell
77523 projects

Projects that are alternatives of or similar to kaggledatasets

Pins
Pin, Discover and Share Resources
Stars: ✭ 149 (+238.64%)
Mutual labels:  kaggle, datasets
PracticalMachineLearning
A collection of ML related stuff including notebooks, codes and a curated list of various useful resources such as books and softwares. Almost everything mentioned here is free (as speech not free food) or open-source.
Stars: ✭ 60 (+36.36%)
Mutual labels:  scikit-learn, kaggle
Healthcheck
Health Check ✔ is a Machine Learning Web Application made using Flask that can predict mainly three diseases i.e. Diabetes, Heart Disease, and Cancer.
Stars: ✭ 35 (-20.45%)
Mutual labels:  kaggle, datasets
kaggle-code
A repository for some of the code I used in kaggle data science & machine learning tasks.
Stars: ✭ 100 (+127.27%)
Mutual labels:  kaggle, datasets
Hungabunga
HungaBunga: Brute-Force all sklearn models with all parameters using .fit .predict!
Stars: ✭ 614 (+1295.45%)
Mutual labels:  scikit-learn, kaggle
How-to-score-0.8134-in-Titanic-Kaggle-Challenge
Solution of the Titanic Kaggle competition
Stars: ✭ 114 (+159.09%)
Mutual labels:  scikit-learn, kaggle
data-science-learning
📊 All of courses, assignments, exercises, mini-projects and books that I've done so far in the process of learning by myself Machine Learning and Data Science.
Stars: ✭ 32 (-27.27%)
Mutual labels:  scikit-learn, kaggle
Fraud Detection
Credit Card Fraud Detection using ML: IEEE style paper + Jupyter Notebook
Stars: ✭ 58 (+31.82%)
Mutual labels:  scikit-learn, kaggle
Data Science Ipython Notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+50009.09%)
Mutual labels:  scikit-learn, kaggle
Machinejs
[UNMAINTAINED] Automated machine learning- just give it a data file! Check out the production-ready version of this project at ClimbsRocks/auto_ml
Stars: ✭ 412 (+836.36%)
Mutual labels:  scikit-learn, kaggle
Machinelearningcourse
A collection of notebooks of my Machine Learning class written in python 3
Stars: ✭ 35 (-20.45%)
Mutual labels:  scikit-learn, kaggle
Artificial Intelligence Deep Learning Machine Learning Tutorials
A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep Learning / Machine Vision / NLP and industry specific areas such as Climate / Energy, Automotives, Retail, Pharma, Medicine, Healthcare, Policy, Ethics and more.
Stars: ✭ 2,966 (+6640.91%)
Mutual labels:  scikit-learn, kaggle
torchgeo
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Stars: ✭ 1,125 (+2456.82%)
Mutual labels:  datasets
open2ch-dialogue-corpus
おーぷん2ちゃんねるをクロールして作成した対話コーパス
Stars: ✭ 65 (+47.73%)
Mutual labels:  datasets
DiscEval
Discourse Based Evaluation of Language Understanding
Stars: ✭ 18 (-59.09%)
Mutual labels:  datasets
Data-Science-Articles
A collection of my blogs on Data Science and Machine learning.
Stars: ✭ 66 (+50%)
Mutual labels:  kaggle
mlhandbook
My textbook for teaching Machine Learning
Stars: ✭ 23 (-47.73%)
Mutual labels:  scikit-learn
rs datasets
Tool for autodownloading recommendation systems datasets
Stars: ✭ 22 (-50%)
Mutual labels:  datasets
Thirukkural-Tamil-Dataset
திருக்குறள் by திருவள்ளுவர்.
Stars: ✭ 44 (+0%)
Mutual labels:  datasets
Google-Playstore-Dataset
Google PlayStore App dataset. (2.3 million App Data) and 24 attributes
Stars: ✭ 27 (-38.64%)
Mutual labels:  datasets

kaggledatasets

Collection of Kaggle Datasets ready to use for Everyone

License Release Platform Support

System Python 3.5 Python 3.6 Python 3.7
Linux Build Status Build Status Build Status
macOS Build Status Build Status
Windows Build Status Build Status Build Status

More About Kaggle Datasets

import kaggledatasets as kd

heart_disease = kd.structured.HeartDiseaseUCI(download=True)

# Returns the pandas data frame to be used in Scikit Learn or any other framework
df = heart_disease.data_frame()

# Returns the tensorflow dataset type compatible with TF 2.0
dataset = heart_disease.load()
for batch, label in dataset.take(1):
    for key, value in batch.items():
        ...

# Returns the data loader for PyTorch
# Work in progress for PyTorch support

Installation

Binaries

Commands to install from binaries via Conda or pip wheels are on our website: https://kaggledatasets.github.io

From Source

Get the kaggledatasets Source

git clone --recursive https://github.com/kaggledatasets/kaggledatasets
cd kaggledatasets

Install Dependencies

pip install -r requirements.txt

Install kaggledatasets

python setup.py install

Getting Started

Communication

  • GitHub Issues: bug reports, feature requests, dataset requests, install issues, help wanted, thoughts, etc.
  • Slack: The Kaggle Datasets Slack hosts a primary audience of moderate to experienced Kaggle Datasets users and developers for general chat, online discussions, collaboration etc. If you need a slack invite, please visit: http://bit.ly/kdslack

Releases and Contributing

kaggledatasets is expecting to have a 30 day release cycle (major releases). Please let us know if you encounter a bug by filing an issue.

We appreciate all contributions and make sure you go through our Contributing Guide. If you are planning to contribute back bug-fixes, please do so without any further discussion.

If you plan to contribute new features, new datasets, utility functions or extensions to the core, please first open an issue and discuss the feature with us. Sending a PR without discussion might end up resulting in a rejected PR, because we might be taking kaggledatasets in a different direction than you might be aware of.

License

kaggledatasets is Apache-2.0 licensed, as found in the LICENSE file.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].