Datasist A Python library for easy data analysis, visualization, exploration and modeling
Stars: ✭ 123 (-23.6%)
Benchm MlA minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
Stars: ✭ 1,835 (+1039.75%)
Data Analysis主要是爬虫与数据分析项目总结,外加建模与机器学习,模型的评估。
Stars: ✭ 142 (-11.8%)
CleanlabThe standard package for machine learning with noisy labels, finding mislabeled data, and uncertainty quantification. Works with most datasets and models.
Stars: ✭ 2,526 (+1468.94%)
ImageclusterCluster images based on image content using a pre-trained deep neural network, optional time distance scaling and hierarchical clustering.
Stars: ✭ 122 (-24.22%)
RaspberryturkThe Raspberry Turk is a robot that can play chess—it's entirely open source, based on Raspberry Pi, and inspired by the 18th century chess playing machine, the Mechanical Turk.
Stars: ✭ 140 (-13.04%)
Spark AlchemyCollection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Stars: ✭ 122 (-24.22%)
Labeled Tweet GeneratorSearch for tweets and download the data labeled with its polarity in CSV format
Stars: ✭ 111 (-31.06%)
GmvaeImplementation of Gaussian Mixture Variational Autoencoder (GMVAE) for Unsupervised Clustering
Stars: ✭ 111 (-31.06%)
Ml Hub🧰 Multi-user development platform for machine learning teams. Simple to setup within minutes.
Stars: ✭ 148 (-8.07%)
Autoregressive Predictive CodingAutoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning
Stars: ✭ 138 (-14.29%)
Spark R Notebooks R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-32.3%)
PyftsAn open source library for Fuzzy Time Series in Python
Stars: ✭ 154 (-4.35%)
Ml CheatsheetA constantly updated python machine learning cheatsheet
Stars: ✭ 136 (-15.53%)
Learn PythonPython Top 45 Articles of 2017
Stars: ✭ 148 (-8.07%)
Awesome Embedding ModelsA curated list of awesome embedding models tutorials, projects and communities.
Stars: ✭ 1,486 (+822.98%)
SfmlearnerAn unsupervised learning framework for depth and ego-motion estimation from monocular videos
Stars: ✭ 1,661 (+931.68%)
Pyspark Cheatsheet🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-32.92%)
2016 Ml ContestMachine learning contest - October 2016 TLE
Stars: ✭ 135 (-16.15%)
Scikit Learnscikit-learn: machine learning in Python
Stars: ✭ 48,322 (+29913.66%)
Fasttext4j Implementing Facebook's FastText with java
Stars: ✭ 148 (-8.07%)
AllennlpAn open-source NLP research library, built on PyTorch.
Stars: ✭ 10,699 (+6545.34%)
KateCode & data accompanying the KDD 2017 paper "KATE: K-Competitive Autoencoder for Text"
Stars: ✭ 135 (-16.15%)
Py QuantmodPowerful financial charting library based on R's Quantmod | http://py-quantmod.readthedocs.io/en/latest/
Stars: ✭ 155 (-3.73%)
WooeyA Django app that creates automatic web UIs for Python scripts.
Stars: ✭ 1,680 (+943.48%)
TflearnDeep learning library featuring a higher-level API for TensorFlow.
Stars: ✭ 9,573 (+5845.96%)
OneshottranslationPytorch implementation of "One-Shot Unsupervised Cross Domain Translation" NIPS 2018
Stars: ✭ 135 (-16.15%)
EvalmlEvalML is an AutoML library written in python.
Stars: ✭ 145 (-9.94%)
ArflowThe official PyTorch implementation of the paper "Learning by Analogy: Reliable Supervision from Transformations for Unsupervised Optical Flow Estimation".
Stars: ✭ 134 (-16.77%)
TextclfTextClf :基于Pytorch/Sklearn的文本分类框架,包括逻辑回归、SVM、TextCNN、TextRNN、TextRCNN、DRNN、DPCNN、Bert等多种模型,通过简单配置即可完成数据处理、模型训练、测试等过程。
Stars: ✭ 105 (-34.78%)
HermioneML made simple
Stars: ✭ 135 (-16.15%)
Back2future.pytorchUnsupervised Learning of Multi-Frame Optical Flow with Occlusions
Stars: ✭ 104 (-35.4%)
ClusteringClustering / Subspace Clustering Algorithms on MATLAB
Stars: ✭ 134 (-16.77%)
Python Data Science HandbookA Chinese translation of Jake Vanderplas' "Python Data Science Handbook". 《Python数据科学手册》在线Jupyter notebook中文翻译
Stars: ✭ 102 (-36.65%)
Data Science Stack Cookiecutter🐳📊🤓Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyster, Superset, Postgres, Minio, AirFlow & API Star)
Stars: ✭ 153 (-4.97%)
Ml LibAn extensive machine learning library, made from scratch (Python).
Stars: ✭ 102 (-36.65%)
Datasciencera curated list of R tutorials for Data Science, NLP and Machine Learning
Stars: ✭ 1,727 (+972.67%)
DatacompyPandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (-8.7%)
ModelchimpExperiment tracking for machine and deep learning projects
Stars: ✭ 121 (-24.84%)
Book This book serves as an introduction to a whole new way of thinking systematically about geographic data, using geographical analysis and computation to unlock new insights hidden within data.
Stars: ✭ 141 (-12.42%)
Pandas VideosJupyter notebook and datasets from the pandas Q&A video series
Stars: ✭ 1,716 (+965.84%)
VarietyA schema analyzer for MongoDB
Stars: ✭ 1,592 (+888.82%)
Nlp researchNLP research:基于tensorflow的nlp深度学习项目,支持文本分类/句子匹配/序列标注/文本生成 四大任务
Stars: ✭ 141 (-12.42%)
Rectorchrectorch is a pytorch-based framework for state-of-the-art top-N recommendation
Stars: ✭ 121 (-24.84%)
ScattertextBeautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+969.57%)
Auto ml[UNMAINTAINED] Automated machine learning for analytics & production
Stars: ✭ 1,559 (+868.32%)
LatentspacevisualizationVisualization techniques for the latent space of a convolutional autoencoder in Keras
Stars: ✭ 155 (-3.73%)