All Projects → W2v → Similar Projects or Alternatives

7314 Open source projects that are alternatives of or similar to W2v

Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+1990.63%)
Optimus
🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+1440.63%)
Mydatascienceportfolio
Applying Data Science and Machine Learning to Solve Real World Business Problems
Stars: ✭ 227 (+254.69%)
Mutual labels:  jupyter-notebook, data-science, spark
kafka-compose
🎼 Docker compose files for various kafka stacks
Stars: ✭ 32 (-50%)
Mutual labels:  twitter, spark, pyspark
Sparkmagic
Jupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (+1390.63%)
Mutual labels:  jupyter-notebook, spark, pyspark
Pyspark Cheatsheet
🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (+68.75%)
Mutual labels:  data-science, spark, pyspark
Handyspark
HandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (+146.88%)
Mutual labels:  jupyter-notebook, spark, pyspark
Pysparkgeoanalysis
🌐 Interactive Workshop on GeoAnalysis using PySpark
Stars: ✭ 63 (-1.56%)
Mutual labels:  jupyter-notebook, spark, pyspark
Pyspark Example Project
Example project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+889.06%)
Mutual labels:  data-science, spark, pyspark
Azure Cosmosdb Spark
Apache Spark Connector for Azure Cosmos DB
Stars: ✭ 165 (+157.81%)
Mutual labels:  jupyter-notebook, spark, pyspark
Python Bigdata
Data science and Big Data with Python
Stars: ✭ 112 (+75%)
Mutual labels:  jupyter-notebook, data-science, spark
Pyspark Learning
Updated repository
Stars: ✭ 147 (+129.69%)
Mutual labels:  jupyter-notebook, spark, pyspark
Scalable Data Science Platform
Content for architecting a data science platform for products using Luigi, Spark & Flask.
Stars: ✭ 158 (+146.88%)
Mutual labels:  jupyter-notebook, data-science, spark
Pixiedust
Python Helper library for Jupyter Notebooks
Stars: ✭ 998 (+1459.38%)
Mutual labels:  jupyter-notebook, data-science, spark
Spark Practice
Apache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (+212.5%)
Mutual labels:  jupyter-notebook, spark, pyspark
Spark Tdd Example
A simple Spark TDD example
Stars: ✭ 23 (-64.06%)
Mutual labels:  jupyter-notebook, spark, pyspark
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+545.31%)
Mutual labels:  jupyter-notebook, data-science, spark
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+134.38%)
Mutual labels:  jupyter-notebook, spark, pyspark
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+8737.5%)
Mutual labels:  jupyter-notebook, data-science, spark
Data Science Cookbook
🎓 Jupyter notebooks from UFC data science course
Stars: ✭ 60 (-6.25%)
Mutual labels:  jupyter-notebook, data-science, spark
Mds
Modern Data Science
Stars: ✭ 19 (-70.31%)
Mutual labels:  jupyter-notebook, data-science
Spark Scala Tutorial
A free tutorial for Apache Spark.
Stars: ✭ 907 (+1317.19%)
Mutual labels:  jupyter-notebook, spark
4th Place Home Credit Default Risk
Codes and dashboards for 4th place solution for Kaggle's Home Credit Default Risk competition
Stars: ✭ 23 (-64.06%)
Mutual labels:  jupyter-notebook, data-science
Python Introducing Pandas
Introduction to pandas Treehouse course
Stars: ✭ 24 (-62.5%)
Mutual labels:  jupyter-notebook, data-science
Har Keras Coreml
Human Activity Recognition (HAR) with Keras and CoreML
Stars: ✭ 23 (-64.06%)
Mutual labels:  jupyter-notebook, data-science
Twitter sentiment analysis word2vec convnet
Twitter Sentiment Analysis with Gensim Word2Vec and Keras Convolutional Network
Stars: ✭ 24 (-62.5%)
Mutual labels:  jupyter-notebook, twitter
Resources
PyMC3 educational resources
Stars: ✭ 930 (+1353.13%)
Mutual labels:  jupyter-notebook, data-science
Spring2017 proffosterprovost
Introduction to Data Science
Stars: ✭ 18 (-71.87%)
Mutual labels:  jupyter-notebook, data-science
Yandex Big Data Engineering
Stars: ✭ 17 (-73.44%)
Mutual labels:  jupyter-notebook, spark
Lambdaschooldatascience
Completed assignments and coding challenges from the Lambda School Data Science program.
Stars: ✭ 22 (-65.62%)
Mutual labels:  jupyter-notebook, data-science
Skdata
Python tools for data analysis
Stars: ✭ 16 (-75%)
Mutual labels:  jupyter-notebook, data-science
Pyspark Setup Demo
Demo of PySpark and Jupyter Notebook with the Jupyter Docker Stacks
Stars: ✭ 24 (-62.5%)
Mutual labels:  jupyter-notebook, pyspark
Kubeflow Data Science On Steroids
The blog post about Kubeflow, including all materials
Stars: ✭ 25 (-60.94%)
Mutual labels:  jupyter-notebook, data-science
Python Machine Learning Book 2nd Edition
The "Python Machine Learning (2nd edition)" book code repository and info resource
Stars: ✭ 6,422 (+9934.38%)
Mutual labels:  jupyter-notebook, data-science
Sparkling Titanic
Training models with Apache Spark, PySpark for Titanic Kaggle competition
Stars: ✭ 12 (-81.25%)
Mutual labels:  spark, pyspark
Data Science On Gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+1250%)
Mutual labels:  jupyter-notebook, data-science
Awesome Google Colab
Google Colaboratory Notebooks and Repositories (by @firmai)
Stars: ✭ 863 (+1248.44%)
Mutual labels:  jupyter-notebook, data-science
Live log analyzer spark
Spark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.
Stars: ✭ 14 (-78.12%)
Mutual labels:  spark, pyspark
Pandas Profiling
Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+12914.06%)
Mutual labels:  jupyter-notebook, data-science
Tedsds
Apache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Stars: ✭ 14 (-78.12%)
Mutual labels:  jupyter-notebook, spark
Mlnet Workshop
ML.NET Workshop to predict car sales prices
Stars: ✭ 29 (-54.69%)
Mutual labels:  jupyter-notebook, data-science
Intro Python
Python pour Statistique et Science des Données -- Syntaxe, Trafic de Données, Graphes, Programmation, Apprentissage
Stars: ✭ 21 (-67.19%)
Mutual labels:  jupyter-notebook, data-science
Python for ml
brief introduction to Python for machine learning
Stars: ✭ 29 (-54.69%)
Mutual labels:  jupyter-notebook, data-science
Coursera
Quiz & Assignment of Coursera
Stars: ✭ 774 (+1109.38%)
Mutual labels:  jupyter-notebook, data-science
Tiledb Vcf
Efficient variant-call data storage and retrieval library using the TileDB storage library.
Stars: ✭ 26 (-59.37%)
Mutual labels:  data-science, spark
Crime Analysis
Association Rule Mining from Spatial Data for Crime Analysis
Stars: ✭ 20 (-68.75%)
Mutual labels:  jupyter-notebook, data-science
Docker Iocaml Datascience
Dockerfile of Jupyter (IPython notebook) and IOCaml (OCaml kernel) with libraries for data science and machine learning
Stars: ✭ 30 (-53.12%)
Mutual labels:  jupyter-notebook, data-science
Python Training
Python training for business analysts and traders
Stars: ✭ 972 (+1418.75%)
Mutual labels:  jupyter-notebook, data-science
Ds Take Home
My solution to the book A Collection of Data Science Take-Home Challenges
Stars: ✭ 1,004 (+1468.75%)
Mutual labels:  jupyter-notebook, data-science
Computervision Recipes
Best Practices, code samples, and documentation for Computer Vision.
Stars: ✭ 8,214 (+12734.38%)
Mutual labels:  jupyter-notebook, data-science
Minerva Training Materials
Learn advanced data science on real-life, curated problems
Stars: ✭ 37 (-42.19%)
Mutual labels:  jupyter-notebook, data-science
Machine Learning From Scratch
Succinct Machine Learning algorithm implementations from scratch in Python, solving real-world problems (Notebooks and Book). Examples of Logistic Regression, Linear Regression, Decision Trees, K-means clustering, Sentiment Analysis, Recommender Systems, Neural Networks and Reinforcement Learning.
Stars: ✭ 42 (-34.37%)
Mutual labels:  jupyter-notebook, data-science
Data Science Lunch And Learn
Resources for weekly Data Science Lunch & Learns
Stars: ✭ 49 (-23.44%)
Mutual labels:  jupyter-notebook, data-science
Numerical Linear Algebra
Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course
Stars: ✭ 8,263 (+12810.94%)
Mutual labels:  jupyter-notebook, data-science
Presentations
Talks & Workshops by the CODAIT team
Stars: ✭ 50 (-21.87%)
Mutual labels:  jupyter-notebook, data-science
Ppd599
USC urban data science course series with Python and Jupyter
Stars: ✭ 1,062 (+1559.38%)
Mutual labels:  jupyter-notebook, data-science
25daysinmachinelearning
I will update this repository to learn Machine learning with python with statistics content and materials
Stars: ✭ 53 (-17.19%)
Mutual labels:  jupyter-notebook, data-science
Mlj.jl
A Julia machine learning framework
Stars: ✭ 982 (+1434.38%)
Mutual labels:  jupyter-notebook, data-science
Mckinsey Smartcities Traffic Prediction
Adventure into using multi attention recurrent neural networks for time-series (city traffic) for the 2017-11-18 McKinsey IronMan (24h non-stop) prediction challenge
Stars: ✭ 49 (-23.44%)
Mutual labels:  jupyter-notebook, data-science
Metrotwitter
What Twitter reveals about the differences between cities and the monoculture of the Bay Area
Stars: ✭ 52 (-18.75%)
Mutual labels:  jupyter-notebook, twitter
1-60 of 7314 similar projects