DatacompyPandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (+36.11%)
Pico8 ApiUnofficial PICO-8 API with a lovely design!
Stars: ✭ 115 (+6.48%)
Python CheatsheetBasic Cheat Sheet for Python (PDF, Markdown and Jupyter Notebook)
Stars: ✭ 1,334 (+1135.19%)
Doc🦋 Raku documentation (tools and docs)
Stars: ✭ 259 (+139.81%)
Agile data code 2Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+282.41%)
Bad Data GuideAn exhaustive reference to problems seen in real-world data along with suggestions on how to resolve them.
Stars: ✭ 3,862 (+3475.93%)
W2vWord2Vec models with Twitter data using Spark.
Stars: ✭ 64 (-40.74%)
yii2-manual-chmYii 2 Guide/API/Docs compiled in various formats
Stars: ✭ 63 (-41.67%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+1138.89%)
PandocsThe infamous Pan Docs historical document: the single, most comprehensive Game Boy technical reference.
Stars: ✭ 158 (+46.3%)
Py2rsA quick reference guide for the Pythonista in the process of becoming a Rustacean
Stars: ✭ 690 (+538.89%)
GuidesDocumentation guides and tutorials for Clojure. Various authors.
Stars: ✭ 361 (+234.26%)
CheatsheetsCommunity-sourced cheatsheets
Stars: ✭ 430 (+298.15%)
JavascripterHelping junior developers navigate the complex world of software engineering without experiencing information overload.
Stars: ✭ 203 (+87.96%)
DocmaA powerful tool to easily generate beautiful HTML documentation from JavaScript (JSDoc), Markdown and HTML files.
Stars: ✭ 287 (+165.74%)
Pyspark Example ProjectExample project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+486.11%)
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+812.96%)
PycmMulti-class confusion matrix library in Python
Stars: ✭ 1,076 (+896.3%)
Rumble⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (-46.3%)
Awesome BigdataA curated list of awesome big data frameworks, resources and other awesomeness.
Stars: ✭ 10,478 (+9601.85%)
Pulsar SparkWhen Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-49.07%)
Parse Comments Parse JavaScript code comments. Works with block and line comments, and should work with CSS, LESS, SASS, or any language with the same comment formats.
Stars: ✭ 53 (-50.93%)
FeedmereadmesFree README editing+feedback to make your open-source projects grow. See the README maturity model to help you keep going.
Stars: ✭ 1,064 (+885.19%)
DatacomparerdataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.
Stars: ✭ 58 (-46.3%)
Jsdoc BaselineAn experimental, extensible template for JSDoc.
Stars: ✭ 51 (-52.78%)
DocsOpenBMC Documentation
Stars: ✭ 105 (-2.78%)
OpenrefineOpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (+7799.07%)
DocsDocumentation for The Things Network
Stars: ✭ 61 (-43.52%)
Nord DocsThe official Nord website and documentation
Stars: ✭ 63 (-41.67%)
Docsify TabsA docsify.js plugin for rendering tabbed content from markdown
Stars: ✭ 65 (-39.81%)
Quickstart🎯 A micro-form for user-specific installation instructions
Stars: ✭ 66 (-38.89%)
Pysparkgeoanalysis🌐 Interactive Workshop on GeoAnalysis using PySpark
Stars: ✭ 63 (-41.67%)
RsparklingRSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-39.81%)
GraphiaA visualisation tool for the creation and analysis of graphs
Stars: ✭ 67 (-37.96%)
MagicboxA platform that uses real-time data to inform life-saving humanitarian responses to emergency situations
Stars: ✭ 73 (-32.41%)
Csharp8cheatsheetC# 8 Cheat Sheet: Default Interface Methods, Pattern Matching, Indices and Ranges, Nullable Reference Types, Asynchronous Streams, Caller Expression Attribute, Static Local Functions, Default in Deconstruction, Alternative Interpolated Verbatim Strings, Using Declarations, Relax Ordering of ref and partial Modifiers, Disposable ref structs, Generic Attributes, Null Coalescing Assignment
Stars: ✭ 73 (-32.41%)
Apache Spark Hands OnEducational notes and hands-on problems with solutions for the Hadoop ecosystem
Stars: ✭ 74 (-31.48%)
Ds CheatsheetsList of Data Science Cheatsheets to rule the world
Stars: ✭ 9,452 (+8651.85%)
DocnadoRapid documentation tool that will blow you away...
Stars: ✭ 67 (-37.96%)
FoliantComprehensive markdown-based documentation toolkit
Stars: ✭ 74 (-31.48%)
HnswlibJava library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (+0%)
FlyteAccelerate your ML and Data workflows to production. Flyte is a production-grade orchestration system for your Data and ML workloads. It has been battle-tested at Lyft, Spotify, Freenome and others, and is truly open-source.
Stars: ✭ 1,242 (+1050%)
GlobbingIntroduction to "globbing" or glob matching, a programming concept that allows "filepath expansion" and matching using wildcards.
Stars: ✭ 86 (-20.37%)
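GopupData interfaces: Baidu, Google, Toutiao, and Weibo indices, macroeconomic data, interest rate data, currency exchange rates, Qianlima (high-growth) and unicorn companies, Xinwen Lianbo news transcripts, film and TV box office data, university lists, epidemic data…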
Stars: ✭ 1,229 (+1037.96%)
MazeMaze Applied Reinforcement Learning Framework
Stars: ✭ 85 (-21.3%)
Deepin Develop Guidedeepin development guide (covering development environment configuration and a Debian packaging tutorial)
Stars: ✭ 90 (-16.67%)