All Projects → Pyspark Cheatsheet → Similar Projects or Alternatives

3255 Open source projects that are alternatives of or similar to Pyspark Cheatsheet

Datacompy
Pandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (+36.11%)
Mutual labels:  data-science, spark, data
Pico8 Api
Unofficial PICO-8 API with a lovely design ! ::
Stars: ✭ 115 (+6.48%)
Mutual labels:  documentation, docs, cheatsheet
Github Template Guidelines
Guidelines for building GitHub templates.
Stars: ✭ 137 (+26.85%)
Mutual labels:  documentation, docs, reference
Python Cheatsheet
Basic Cheat Sheet for Python (PDF, Markdown and Jupyter Notebook)
Stars: ✭ 1,334 (+1135.19%)
Mutual labels:  cheatsheet, reference, cheatsheets
Doc
🦋 Raku documentation (tools and docs)
Stars: ✭ 259 (+139.81%)
Mutual labels:  documentation, docs, reference
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+282.41%)
Mutual labels:  data-science, spark, data
Bad Data Guide
An exhaustive reference to problems seen in real-world data along with suggestions on how to resolve them.
Stars: ✭ 3,862 (+3475.93%)
Mutual labels:  documentation, data, guide
W2v
Word2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-40.74%)
Mutual labels:  data-science, spark, pyspark
C Sharp Cheatsheet
C# Cheatsheet
Stars: ✭ 111 (+2.78%)
Mutual labels:  cheat, cheatsheet, cheatsheets
react-cheatsheets
Create and generate cheat sheets using React
Stars: ✭ 21 (-80.56%)
Mutual labels:  cheatsheet, cheat, cheatsheets
yii2-manual-chm
Yii 2 Guide/API/Docs compiled in various formats
Stars: ✭ 63 (-41.67%)
Mutual labels:  docs, reference, guide
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+1138.89%)
Mutual labels:  data-science, spark, pyspark
Pandocs
The infamous Pan Docs historical document: the single, most comprehensive Game Boy technical reference.
Stars: ✭ 158 (+46.3%)
Mutual labels:  documentation, cheatsheet, reference
Py2rs
A quick reference guide for the Pythonista in the process of becoming a Rustacean
Stars: ✭ 690 (+538.89%)
Mutual labels:  cheatsheet, guide, reference
Visual Scala Reference
Visual Scala Reference
Stars: ✭ 198 (+83.33%)
Mutual labels:  cheatsheet, guide, reference
Guides
Documentation guides and tutorials for Clojure. Various authors.
Stars: ✭ 361 (+234.26%)
Mutual labels:  documentation, docs, guides
Cheatsheets
Community-sourced cheatsheets
Stars: ✭ 430 (+298.15%)
Mutual labels:  cheat, reference, cheatsheets
Javascripter
Helping junior developers navigate the complex world of software engineering without experiencing information overload.
Stars: ✭ 203 (+87.96%)
Mutual labels:  cheatsheet, reference, cheatsheets
Docma
A powerful tool to easily generate beautiful HTML documentation from JavaScript (JSDoc), Markdown and HTML files.
Stars: ✭ 287 (+165.74%)
Mutual labels:  documentation, docs, reference
Javascript Cheatsheet
Basic Javascript Cheat Sheet
Stars: ✭ 262 (+142.59%)
Mutual labels:  cheat, cheatsheet, reference
Pyspark Example Project
Example project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+486.11%)
Mutual labels:  data-science, spark, pyspark
Optimus
🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+812.96%)
Mutual labels:  data-science, spark, pyspark
Pycm
Multi-class confusion matrix library in Python
Stars: ✭ 1,076 (+896.3%)
Mutual labels:  data-science, data
Rumble
⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (-46.3%)
Mutual labels:  data-science, spark
Awesome Bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 10,478 (+9601.85%)
Mutual labels:  data-science, data
Data Science Cookbook
🎓 Jupyter notebooks from UFC data science course
Stars: ✭ 60 (-44.44%)
Mutual labels:  data-science, spark
Pulsar Spark
When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-49.07%)
Mutual labels:  data-science, spark
Parse Comments
Parse JavaScript code comments. Works with block and line comments, and should work with CSS, LESS, SASS, or any language with the same comment formats.
Stars: ✭ 53 (-50.93%)
Mutual labels:  documentation, docs
Go Compression.github.io
The Hitchhiker's Guide to Compression
Stars: ✭ 106 (-1.85%)
Mutual labels:  documentation, guide
Feedmereadmes
Free README editing+feedback to make your open-source projects grow. See the README maturity model to help you keep going.
Stars: ✭ 1,064 (+885.19%)
Mutual labels:  documentation, docs
Data Science Best Resources
Carefully curated resource links for data science in one place
Stars: ✭ 1,104 (+922.22%)
Mutual labels:  data-science, cheatsheet
Datacomparer
dataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.
Stars: ✭ 58 (-46.3%)
Mutual labels:  data-science, data
Jsdoc Baseline
An experimental, extensible template for JSDoc.
Stars: ✭ 51 (-52.78%)
Mutual labels:  documentation, docs
Docs
OpenBMC Documentation
Stars: ✭ 105 (-2.78%)
Mutual labels:  documentation, cheatsheet
Openrefine
OpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (+7799.07%)
Mutual labels:  data-science, data
Docs
Documentation for The Things Network
Stars: ✭ 61 (-43.52%)
Mutual labels:  documentation, docs
Algorithms Cheatsheet Resources
🤓All the geeky stuffs you need to know at one place!
Stars: ✭ 60 (-44.44%)
Mutual labels:  cheatsheet, cheatsheets
Nord Docs
The official Nord website and documentation
Stars: ✭ 63 (-41.67%)
Mutual labels:  documentation, docs
Docsify Tabs
A docsify.js plugin for rendering tabbed content from markdown
Stars: ✭ 65 (-39.81%)
Mutual labels:  documentation, docs
Quickstart
🎯 A micro-form for user-specific installation instructions
Stars: ✭ 66 (-38.89%)
Mutual labels:  documentation, docs
Pysparkgeoanalysis
🌐 Interactive Workshop on GeoAnalysis using PySpark
Stars: ✭ 63 (-41.67%)
Mutual labels:  spark, pyspark
Rsparkling
RSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-39.81%)
Mutual labels:  data-science, spark
Graphia
A visualisation tool for the creation and analysis of graphs
Stars: ✭ 67 (-37.96%)
Mutual labels:  data-science, data
Magicbox
A platform that uses real-time data to inform life-saving humanitarian responses to emergency situations
Stars: ✭ 73 (-32.41%)
Mutual labels:  data-science, data
Csharp8cheatsheet
C# 8 Cheat Sheet, Default Interface Methods, Pattern Matching, Indices and Ranges, Nullable Reference Types, Asynchronous Streams, Caller Expression Attribute ,Static Local Functions, Default in Deconstruction., Alternative Interpolated Verbatim Strings, Using Declarations, Relax Ordering of ref and partial Modifiers, Disposable ref structs, Generic Attributes, Null Coalescing Assignment ,Disposable ref structs
Stars: ✭ 73 (-32.41%)
Mutual labels:  cheat, cheatsheets
Apache Spark Hands On
Educational notes,Hands on problems w/ solutions for hadoop ecosystem
Stars: ✭ 74 (-31.48%)
Mutual labels:  spark, cheatsheet
Ds Cheatsheets
List of Data Science Cheatsheets to rule the world
Stars: ✭ 9,452 (+8651.85%)
Mutual labels:  spark, cheatsheet
Cmd Command Cheat Sheet
CMD - Command Cheat Sheat ✅
Stars: ✭ 50 (-53.7%)
Mutual labels:  cheatsheet, cheatsheets
Spark Doc Zh
Apache Spark 官方文档中文版
Stars: ✭ 1,126 (+942.59%)
Mutual labels:  documentation, spark
Docnado
Rapid documentation tool that will blow you away...
Stars: ✭ 67 (-37.96%)
Mutual labels:  documentation, docs
Foliant
Comprehensive markdown-based documentation toolkit
Stars: ✭ 74 (-31.48%)
Mutual labels:  documentation, docs
Hnswlib
Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (+0%)
Mutual labels:  spark, pyspark
Flyte
Accelerate your ML and Data workflows to production. Flyte is a production grade orchestration system for your Data and ML workloads. It has been battle tested at Lyft, Spotify, freenome and others and truly open-source.
Stars: ✭ 1,242 (+1050%)
Mutual labels:  data-science, data
Wazuh Documentation
Wazuh - Project documentation
Stars: ✭ 82 (-24.07%)
Mutual labels:  documentation, reference
Deeplearning Mindmap
A mindmap summarising Deep Learning concepts.
Stars: ✭ 1,251 (+1058.33%)
Mutual labels:  data, cheatsheet
Globbing
Introduction to "globbing" or glob matching, a programming concept that allows "filepath expansion" and matching using wildcards.
Stars: ✭ 86 (-20.37%)
Mutual labels:  cheatsheet, guide
Gopup
数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字稿,影视票房数据,高校名单,疫情数据…
Stars: ✭ 1,229 (+1037.96%)
Mutual labels:  data-science, data
Maze
Maze Applied Reinforcement Learning Framework
Stars: ✭ 85 (-21.3%)
Mutual labels:  documentation, data-science
Spark python ml examples
Spark 2.0 Python Machine Learning examples
Stars: ✭ 87 (-19.44%)
Mutual labels:  spark, pyspark
Deepin Develop Guide
deepin develop guide(containing development environment configuration and debian package tutorial)
Stars: ✭ 90 (-16.67%)
Mutual labels:  docs, guide
1-60 of 3255 similar projects