All Projects → Handyspark → Similar Projects or Alternatives

6686 Open source projects that are alternatives of or similar to Handyspark

Geopython
Notebooks and libraries for spatial/geo Python explorations
Stars: ✭ 268 (+69.62%)
Mutual labels:  jupyter-notebook, pandas
Helk
The Hunting ELK
Stars: ✭ 3,097 (+1860.13%)
Mutual labels:  jupyter-notebook, spark
Spark Syntax
This is a repo documenting the best practices in PySpark.
Stars: ✭ 412 (+160.76%)
Mutual labels:  jupyter-notebook, pyspark
Devops Python Tools
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (+156.96%)
Mutual labels:  spark, pyspark
Pytablewriter
pytablewriter is a Python library to write a table in various formats: CSV / Elasticsearch / HTML / JavaScript / JSON / LaTeX / LDJSON / LTSV / Markdown / MediaWiki / NumPy / Excel / Pandas / Python / reStructuredText / SQLite / TOML / TSV.
Stars: ✭ 422 (+167.09%)
Mutual labels:  jupyter-notebook, pandas
basin
Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser
Stars: ✭ 25 (-84.18%)
Mutual labels:  spark, pyspark
Justenoughscalaforspark
A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.
Stars: ✭ 538 (+240.51%)
Mutual labels:  jupyter-notebook, spark
Data Science Your Way
Ways of doing Data Science Engineering and Machine Learning in R and Python
Stars: ✭ 530 (+235.44%)
Bamboolib
bamboolib - a GUI for pandas DataFrames
Stars: ✭ 622 (+293.67%)
Mutual labels:  jupyter-notebook, pandas
Or Pandas
【运筹OR帷幄|数据科学】pandas教程系列电子书
Stars: ✭ 492 (+211.39%)
Mutual labels:  jupyter-notebook, pandas
Jdata
京东JData算法大赛-高潜用户购买意向预测入门程序(starter code)
Stars: ✭ 662 (+318.99%)
Mutual labels:  jupyter-notebook, pandas
Just Pandas Things
An ongoing list of pandas quirks
Stars: ✭ 660 (+317.72%)
Mutual labels:  jupyter-notebook, pandas
Elasticsearch Spark Recommender
Use Jupyter Notebooks to demonstrate how to build a Recommender with Apache Spark & Elasticsearch
Stars: ✭ 707 (+347.47%)
Mutual labels:  jupyter-notebook, spark
prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (-65.82%)
Mutual labels:  spark, pandas
Quickviz
Visualize a pandas dataframe in a few clicks
Stars: ✭ 18 (-88.61%)
Mutual labels:  jupyter-notebook, pandas
Yandex Big Data Engineering
Stars: ✭ 17 (-89.24%)
Mutual labels:  jupyter-notebook, spark
Lux
Python API for Intelligent Visual Data Discovery
Stars: ✭ 787 (+398.1%)
Pbpython
Code, Notebooks and Examples from Practical Business Python
Stars: ✭ 1,724 (+991.14%)
Mutual labels:  jupyter-notebook, pandas
Sparkling Titanic
Training models with Apache Spark, PySpark for Titanic Kaggle competition
Stars: ✭ 12 (-92.41%)
Mutual labels:  spark, pyspark
Crime Analysis
Association Rule Mining from Spatial Data for Crime Analysis
Stars: ✭ 20 (-87.34%)
Mutual labels:  jupyter-notebook, pandas
Spark Movie Lens
An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (+371.52%)
Mutual labels:  jupyter-notebook, spark
Pandas basics
basic pandas tutorials
Stars: ✭ 34 (-78.48%)
Mutual labels:  jupyter-notebook, pandas
Machine Learning Alpine
Alpine Container for Machine Learning
Stars: ✭ 30 (-81.01%)
Mutual labels:  jupyter-notebook, pandas
Gdeltpyr
Python based framework to retreive Global Database of Events, Language, and Tone (GDELT) version 1.0 and version 2.0 data.
Stars: ✭ 124 (-21.52%)
Mutual labels:  jupyter-notebook, pandas
Jupyter Datatables
Jupyter Notebook extension leveraging pandas DataFrames by integrating DataTables and ChartJS.
Stars: ✭ 127 (-19.62%)
Mutual labels:  jupyter-notebook, pandas
Ds and ml projects
Data Science & Machine Learning projects and tutorials in python from beginner to advanced level.
Stars: ✭ 56 (-64.56%)
Mutual labels:  jupyter-notebook, pandas
Data Science Complete Tutorial
For extensive instructor led learning
Stars: ✭ 1,027 (+550%)
Mutual labels:  jupyter-notebook, pandas
data-algorithms-with-spark
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
Stars: ✭ 34 (-78.48%)
Mutual labels:  spark, pyspark
Machine Learning Projects
This repository consists of all my Machine Learning Projects.
Stars: ✭ 135 (-14.56%)
Mutual labels:  jupyter-notebook, pandas
Big Data Engineering Coursera Yandex
Big Data for Data Engineers Coursera Specialization from Yandex
Stars: ✭ 71 (-55.06%)
Mutual labels:  jupyter-notebook, spark
Disease Prediction From Symptoms
Disease Prediction based on Symptoms.
Stars: ✭ 70 (-55.7%)
Mutual labels:  jupyter-notebook, pandas
Hops Examples
Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops
Stars: ✭ 84 (-46.84%)
Mutual labels:  jupyter-notebook, spark
Pandas Tutorial
Tutorial on Using Pandas
Stars: ✭ 66 (-58.23%)
Mutual labels:  jupyter-notebook, pandas
Udacity Data Engineering
Udacity Data Engineering Nano Degree (DEND)
Stars: ✭ 89 (-43.67%)
Mutual labels:  jupyter-notebook, spark
Spark Nlp Models
Models and Pipelines for the Spark NLP library
Stars: ✭ 88 (-44.3%)
Mutual labels:  jupyter-notebook, spark
Alphalens
Performance analysis of predictive (alpha) stock factors
Stars: ✭ 2,130 (+1248.1%)
Mutual labels:  jupyter-notebook, pandas
Pydata Pandas Workshop
Material for my PyData Jupyter & Pandas Workshops, I'm also available for personal in-house trainings on request
Stars: ✭ 65 (-58.86%)
Mutual labels:  jupyter-notebook, pandas
Relation extraction
Relation Extraction using Deep learning(CNN)
Stars: ✭ 96 (-39.24%)
Mutual labels:  spark, pyspark
Data Science For Marketing Analytics
Achieve your marketing goals with the data analytics power of Python
Stars: ✭ 127 (-19.62%)
Mutual labels:  jupyter-notebook, pandas
Practical Machine Learning With Python
Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
Stars: ✭ 1,868 (+1082.28%)
Mutual labels:  jupyter-notebook, pandas
Data Analysis
主要是爬虫与数据分析项目总结,外加建模与机器学习,模型的评估。
Stars: ✭ 142 (-10.13%)
Mutual labels:  jupyter-notebook, pandas
Maps Location History
Get, Concatenate and Process you location history from Google Maps TimeLine
Stars: ✭ 99 (-37.34%)
Mutual labels:  jupyter-notebook, pandas
Data Mining Python
《python数据分析与挖掘实战》项目实践及拓展
Stars: ✭ 92 (-41.77%)
Mutual labels:  jupyter-notebook, pandas
Pyspark Cheatsheet
🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-31.65%)
Mutual labels:  spark, pyspark
Hnswlib
Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (-31.65%)
Mutual labels:  spark, pyspark
Python Bigdata
Data science and Big Data with Python
Stars: ✭ 112 (-29.11%)
Mutual labels:  jupyter-notebook, spark
Python
Jupyter notebooks and datasets for the interesting pandas/python/data science video series.
Stars: ✭ 65 (-58.86%)
Mutual labels:  jupyter-notebook, pandas
Pandas Videos
Jupyter notebook and datasets from the pandas Q&A video series
Stars: ✭ 1,716 (+986.08%)
Mutual labels:  jupyter-notebook, pandas
Eat pyspark in 10 days
pyspark🍒🥭 is delicious,just eat it!😋😋
Stars: ✭ 116 (-26.58%)
Mutual labels:  spark, pyspark
Cape Python
Collaborate on privacy-preserving policy for data science projects in Pandas and Apache Spark
Stars: ✭ 125 (-20.89%)
Mutual labels:  spark, pandas
Ibis
A pandas-like deferred expression system, with first-class SQL support
Stars: ✭ 1,630 (+931.65%)
Mutual labels:  pandas, spark
Repo 2019
BERT, AWS RDS, AWS Forecast, EMR Spark Cluster, Hive, Serverless, Google Assistant + Raspberry Pi, Infrared, Google Cloud Platform Natural Language, Anomaly detection, Tensorflow, Mathematics
Stars: ✭ 133 (-15.82%)
Mutual labels:  jupyter-notebook, pyspark
Jupyter notebooks
Collection of jupyter notebooks
Stars: ✭ 127 (-19.62%)
Mutual labels:  jupyter-notebook, pandas
Data science blogs
A repository to keep track of all the code that I end up writing for my blog posts.
Stars: ✭ 139 (-12.03%)
Mutual labels:  jupyter-notebook, spark
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+859.49%)
Mutual labels:  jupyter-notebook, pandas
Opendatawrangling
공공데이터 분석
Stars: ✭ 148 (-6.33%)
Mutual labels:  jupyter-notebook, pandas
Stock Price Predictor
This project seeks to utilize Deep Learning models, Long-Short Term Memory (LSTM) Neural Network algorithm, to predict stock prices.
Stars: ✭ 146 (-7.59%)
Mutual labels:  jupyter-notebook, pandas
visions
Type System for Data Analysis in Python
Stars: ✭ 136 (-13.92%)
Mutual labels:  spark, pandas
spark-extension
A library that provides useful extensions to Apache Spark and PySpark.
Stars: ✭ 25 (-84.18%)
Mutual labels:  spark, pyspark
Seaborn Tutorial
This repository is my attempt to help Data Science aspirants gain necessary Data Visualization skills required to progress in their career. It includes all the types of plot offered by Seaborn, applied on random datasets.
Stars: ✭ 114 (-27.85%)
Mutual labels:  jupyter-notebook, pandas
61-120 of 6686 similar projects