Optimus
🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+1087.95%)
Ntds 2017
Material for the EPFL master course "A Network Tour of Data Science", edition 2017.
Stars: ✭ 61 (-26.51%)
Paralleldist
R Package: Parallel Distance Matrix Computation using Multiple Threads
Stars: ✭ 37 (-55.42%)
Mit Deep Learning
Tutorials, assignments, and competitions for MIT Deep Learning related courses.
Stars: ✭ 8,912 (+10637.35%)
Bubbly
A Python package for plotting animated and interactive bubble charts using Plotly
Stars: ✭ 37 (-55.42%)
Janitor
Simple tools for data cleaning in R
Stars: ✭ 981 (+1081.93%)
Tsv Utils
eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.
Stars: ✭ 1,215 (+1363.86%)
Interactive
.NET Interactive takes the power of .NET and embeds it into your interactive experiences. Share code, explore data, write, and learn across your apps in ways you couldn't before.
Stars: ✭ 978 (+1078.31%)
Machinelearningcourse
A collection of notebooks from my Machine Learning class, written in Python 3
Stars: ✭ 35 (-57.83%)
Blenderdatavis
Data visualisation addon for Blender
Stars: ✭ 72 (-13.25%)
Dvc
🦉 Data Version Control | Git for Data & Models | ML Experiments Management
Stars: ✭ 9,004 (+10748.19%)
Eventkit
A template conference app, featuring real-time schedule and data changes & running on Realm 🚀
Stars: ✭ 59 (-28.92%)
Freeml
A List of Data Science/Machine Learning Resources (Mostly Free)
Stars: ✭ 974 (+1073.49%)
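Gopup
Data interfaces: Baidu, Google, Toutiao, and Weibo indexes; macroeconomic data; interest rate data; currency exchange rates; "Qianlima" (high-growth) and unicorn companies; Xinwen Lianbo news transcripts; film and TV box office data; university lists; epidemic data…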
Stars: ✭ 1,229 (+1380.72%)
Javascript React Chat App
Open-source Voice & Video Calling and Text Chat App for React (JavaScript/Web)
Stars: ✭ 59 (-28.92%)
Feagen
(deprecated) A fast and memory-efficient Python data engineering framework for machine learning.
Stars: ✭ 33 (-60.24%)
Dream3d
Data analysis program and framework for materials science data analytics, based on the SIMPL managing framework.
Stars: ✭ 73 (-12.05%)
Openrefine
OpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (+10178.31%)
Simple Sh Datascience
A collection of Bash scripts and Dockerfiles to install data science tools, libraries, and applications
Stars: ✭ 32 (-61.45%)
Learning python
Source material for Python Like You Mean It
Stars: ✭ 78 (-6.02%)
Page clustering
A simple algorithm for clustering web pages, suitable for crawlers
Stars: ✭ 30 (-63.86%)
Arcgis Python Api
Documentation and samples for ArcGIS API for Python
Stars: ✭ 954 (+1049.4%)
Gorilla Notebook
A Clojure/ClojureScript notebook application/library based on Gorilla-REPL
Stars: ✭ 73 (-12.05%)
Python for ml
Brief introduction to Python for machine learning
Stars: ✭ 29 (-65.06%)
Drake Examples
Example workflows for the drake R package
Stars: ✭ 57 (-31.33%)
Mlnet Workshop
ML.NET Workshop to predict car sales prices
Stars: ✭ 29 (-65.06%)
Databench
Data analysis tool.
Stars: ✭ 82 (-1.2%)
Ds and ml projects
Data Science & Machine Learning projects and tutorials in Python from beginner to advanced level.
Stars: ✭ 56 (-32.53%)
Intro Python
Python for Statistics and Data Science -- Syntax, Data Munging, Graphs, Programming, Learning
Stars: ✭ 21 (-74.7%)
Asne
A sparsity aware and memory efficient implementation of "Attributed Social Network Embedding" (TKDE 2018).
Stars: ✭ 73 (-12.05%)
Crime Analysis
Association Rule Mining from Spatial Data for Crime Analysis
Stars: ✭ 20 (-75.9%)
Pydataset
Instant access to many datasets in Python.
Stars: ✭ 880 (+960.24%)
Pulsar Spark
When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-33.73%)
Bayeslite
BayesDB on SQLite. A Bayesian database table for querying the probable implications of data as easily as SQL databases query the data itself.
Stars: ✭ 877 (+956.63%)
Allstate capstone
Allstate Kaggle Competition ML Capstone Project
Stars: ✭ 72 (-13.25%)
Neural Image Captioning
Implementation of Neural Image Captioning model using Keras with Theano backend
Stars: ✭ 12 (-85.54%)
Awesome Google Colab
Google Colaboratory Notebooks and Repositories (by @firmai)
Stars: ✭ 863 (+939.76%)
Dataflowjavasdk
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+928.92%)
Fasttext
Unofficial implementation of the paper "Bag of Tricks for Efficient Text Classification" by Joulin et al.
Stars: ✭ 53 (-36.14%)
React.london
🌟 react.london conference & community website 🌟
Stars: ✭ 9 (-89.16%)
Ugfraud
An Unsupervised Graph-based Toolbox for Fraud Detection
Stars: ✭ 38 (-54.22%)
25daysinmachinelearning
A repository, updated over time, for learning machine learning with Python, including statistics content and materials
Stars: ✭ 53 (-36.14%)
Dltk
Deep Learning Toolkit for Medical Image Analysis
Stars: ✭ 1,249 (+1404.82%)
Confcitizens
Open-source and crowd-sourced conference speakers website
Stars: ✭ 83 (+0%)
Dex
Dex: The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (+1391.57%)
Sayn
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Stars: ✭ 79 (-4.82%)
Tsrepr
TSrepr: R package for time series representations
Stars: ✭ 75 (-9.64%)