Spark PracticeApache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (+325.53%)
dbt-sugardbt-sugar is a CLI tool that allows users of dbt to have fun and ease performing actions around dbt models
Stars: ✭ 139 (+195.74%)
templatestsParticles website templates collection
Stars: ✭ 42 (-10.64%)
LinkisLinkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+4842.55%)
uptasticsearchAn Elasticsearch client tailored to data science workflows.
Stars: ✭ 47 (+0%)
Covid-19-d3Created with CodeSandbox
Stars: ✭ 13 (-72.34%)
Cc PysparkProcess Common Crawl data with Python and Spark
Stars: ✭ 147 (+212.77%)
ebispEmbedded Lisp
Stars: ✭ 46 (-2.13%)
kuwalaKuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…
Stars: ✭ 474 (+908.51%)
live deckA Real-Time Presentation Application Powered by Phoenix LiveView
Stars: ✭ 71 (+51.06%)
HnswlibJava library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (+129.79%)
frontatishA React native common components kit and helper methods,find the package at this link https://www.npmjs.com/package/frontatish
Stars: ✭ 14 (-70.21%)
cejaPySpark phonetic and string matching algorithms
Stars: ✭ 24 (-48.94%)
eksAWS EKS - kubernetes project
Stars: ✭ 149 (+217.02%)
preprocessyPython package for Customizable Data Preprocessing Pipelines
Stars: ✭ 34 (-27.66%)
Pysparkgeoanalysis🌐 Interactive Workshop on GeoAnalysis using PySpark
Stars: ✭ 63 (+34.04%)
Login-Register-FlutterAppLogin Register Auth App by Delicia Fernandes using Google and Facebook sign in.
Stars: ✭ 87 (+85.11%)
Awesome SparkA curated list of awesome Apache Spark packages and resources.
Stars: ✭ 1,061 (+2157.45%)
data-structures-algorithms-interviews👨💻 Repo contains my solutions to coding interview problems on various platforms. Will later convert into a React based web app for personal revision.
Stars: ✭ 16 (-65.96%)
SparkmagicJupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (+1929.79%)
first-pr-repoA step by step guide to help people make their first Pull Request
Stars: ✭ 29 (-38.3%)
Sparkling TitanicTraining models with Apache Spark, PySpark for Titanic Kaggle competition
Stars: ✭ 12 (-74.47%)
Commandline-Games-hacktoberfestA repository to share command line games. An opportunity to start and learn about open source code contributions flow.
Stars: ✭ 16 (-65.96%)
pyspark-ML-in-ColabPyspark in Google Colab: A simple machine learning (Linear Regression) model
Stars: ✭ 32 (-31.91%)
Plasma-Donor-AppAn open-source app that helps in connecting patients and plasma donors. This is a beginner-friendly repository that helps you learn the basics of android development, git, and GitHub. Happy Hacktober!
Stars: ✭ 58 (+23.4%)
Spark SyntaxThis is a repo documenting the best practices in PySpark.
Stars: ✭ 412 (+776.6%)
check-engineData validation library for PySpark 3.0.0
Stars: ✭ 29 (-38.3%)
javascript-jokesPR your joke if you know good ( or horrible ) js joke . I will post it on coding valley's insta page.
Stars: ✭ 66 (+40.43%)
Tdigestt-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark
Stars: ✭ 274 (+482.98%)
jupyterlab-sparkmonitorJupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook
Stars: ✭ 78 (+65.96%)
mmtf-workshop-2018Structural Bioinformatics Training Workshop & Hackathon 2018
Stars: ✭ 50 (+6.38%)
spark-extensionA library that provides useful extensions to Apache Spark and PySpark.
Stars: ✭ 25 (-46.81%)
Hackerrank-CodesHere are some of the solutions to HackerRank questions.
Stars: ✭ 63 (+34.04%)
autThe Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (+136.17%)
kafka-compose🎼 Docker compose files for various kafka stacks
Stars: ✭ 32 (-31.91%)
J.A.R.V.I.SJust A Rather Very Intelligent System
Stars: ✭ 36 (-23.4%)
eks-clusterQuickly spin up an AWS EKS Kubernetes cluster using AWS CloudFormation
Stars: ✭ 41 (-12.77%)
andaluh-jsTransliterate español (spanish) spelling to andaluz proposals using javascript
Stars: ✭ 22 (-53.19%)
pixieInstant Kubernetes-Native Application Observability
Stars: ✭ 3,238 (+6789.36%)
ResourcesNo description or website provided.
Stars: ✭ 25 (-46.81%)
TraverserTraverser is a Java library that helps software engineers implement advanced iteration of a data structure.
Stars: ✭ 45 (-4.26%)
AlgorithmsShort explanations and implementations of different algorithms in multiple languages
Stars: ✭ 37 (-21.28%)
ray-tracerMy ongoing effort to learn how to make Ray Tracers
Stars: ✭ 14 (-70.21%)