All Projects → DataEngineering → Similar Projects or Alternatives

519 Open source projects that are alternatives of or similar to DataEngineering

Spark Practice
Apache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (+325.53%)
Mutual labels:  pyspark
dbt-sugar
dbt-sugar is a CLI tool that allows users of dbt to have fun and ease performing actions around dbt models
Stars: ✭ 139 (+195.74%)
Mutual labels:  data-engineering
Spark Iforest
Isolation Forest on Spark
Stars: ✭ 166 (+253.19%)
Mutual labels:  pyspark
templates
tsParticles website templates collection
Stars: ✭ 42 (-10.64%)
Mutual labels:  hacktoberfest2020
Linkis
Linkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+4842.55%)
Mutual labels:  pyspark
uptasticsearch
An Elasticsearch client tailored to data science workflows.
Stars: ✭ 47 (+0%)
Mutual labels:  data-engineering
Learningapachespark
LearningApacheSpark
Stars: ✭ 155 (+229.79%)
Mutual labels:  pyspark
Covid-19-d3
Created with CodeSandbox
Stars: ✭ 13 (-72.34%)
Mutual labels:  hacktoberfest2020
Cc Pyspark
Process Common Crawl data with Python and Spark
Stars: ✭ 147 (+212.77%)
Mutual labels:  pyspark
ebisp
Embedded Lisp
Stars: ✭ 46 (-2.13%)
Mutual labels:  hacktoberfest2020
kuwala
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…
Stars: ✭ 474 (+908.51%)
Mutual labels:  pyspark
Eat pyspark in 10 days
pyspark🍒🥭 is delicious,just eat it!😋😋
Stars: ✭ 116 (+146.81%)
Mutual labels:  pyspark
live deck
A Real-Time Presentation Application Powered by Phoenix LiveView
Stars: ✭ 71 (+51.06%)
Mutual labels:  hacktoberfest2020
Hnswlib
Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (+129.79%)
Mutual labels:  pyspark
frontatish
A React native common components kit and helper methods,find the package at this link https://www.npmjs.com/package/frontatish
Stars: ✭ 14 (-70.21%)
Mutual labels:  hacktoberfest2020
Relation extraction
Relation Extraction using Deep learning(CNN)
Stars: ✭ 96 (+104.26%)
Mutual labels:  pyspark
ceja
PySpark phonetic and string matching algorithms
Stars: ✭ 24 (-48.94%)
Mutual labels:  pyspark
Pyspark Tutorial
PySpark Code for Hands-on Learners
Stars: ✭ 91 (+93.62%)
Mutual labels:  pyspark
eks
AWS EKS - kubernetes project
Stars: ✭ 149 (+217.02%)
Mutual labels:  eks
Spark python ml examples
Spark 2.0 Python Machine Learning examples
Stars: ✭ 87 (+85.11%)
Mutual labels:  pyspark
preprocessy
Python package for Customizable Data Preprocessing Pipelines
Stars: ✭ 34 (-27.66%)
Mutual labels:  data-engineering
Pysparkgeoanalysis
🌐 Interactive Workshop on GeoAnalysis using PySpark
Stars: ✭ 63 (+34.04%)
Mutual labels:  pyspark
Login-Register-FlutterApp
Login Register Auth App by Delicia Fernandes using Google and Facebook sign in.
Stars: ✭ 87 (+85.11%)
Mutual labels:  hacktoberfest2020
Awesome Spark
A curated list of awesome Apache Spark packages and resources.
Stars: ✭ 1,061 (+2157.45%)
Mutual labels:  pyspark
data-structures-algorithms-interviews
👨‍💻 Repo contains my solutions to coding interview problems on various platforms. Will later convert into a React based web app for personal revision.
Stars: ✭ 16 (-65.96%)
Mutual labels:  hacktoberfest2020
Sparkmagic
Jupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (+1929.79%)
Mutual labels:  pyspark
first-pr-repo
A step by step guide to help people make their first Pull Request
Stars: ✭ 29 (-38.3%)
Mutual labels:  hacktoberfest2020
Sparkling Titanic
Training models with Apache Spark, PySpark for Titanic Kaggle competition
Stars: ✭ 12 (-74.47%)
Mutual labels:  pyspark
eks-deep-dive-2019
Amazon EKS Deep Dive 2019
Stars: ✭ 61 (+29.79%)
Mutual labels:  eks
Spark Tdd Example
A simple Spark TDD example
Stars: ✭ 23 (-51.06%)
Mutual labels:  pyspark
Commandline-Games-hacktoberfest
A repository to share command line games. An opportunity to start and learn about open source code contributions flow.
Stars: ✭ 16 (-65.96%)
Mutual labels:  hacktoberfest2020
pyspark-ML-in-Colab
Pyspark in Google Colab: A simple machine learning (Linear Regression) model
Stars: ✭ 32 (-31.91%)
Mutual labels:  pyspark
Plasma-Donor-App
An open-source app that helps in connecting patients and plasma donors. This is a beginner-friendly repository that helps you learn the basics of android development, git, and GitHub. Happy Hacktober!
Stars: ✭ 58 (+23.4%)
Mutual labels:  hacktoberfest2020
Spark Syntax
This is a repo documenting the best practices in PySpark.
Stars: ✭ 412 (+776.6%)
Mutual labels:  pyspark
check-engine
Data validation library for PySpark 3.0.0
Stars: ✭ 29 (-38.3%)
Mutual labels:  pyspark
Pyspark Boilerplate
A boilerplate for writing PySpark Jobs
Stars: ✭ 318 (+576.6%)
Mutual labels:  pyspark
javascript-jokes
PR your joke if you know good ( or horrible ) js joke . I will post it on coding valley's insta page.
Stars: ✭ 66 (+40.43%)
Mutual labels:  hacktoberfest2020
Tdigest
t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark
Stars: ✭ 274 (+482.98%)
Mutual labels:  pyspark
jupyterlab-sparkmonitor
JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook
Stars: ✭ 78 (+65.96%)
Mutual labels:  pyspark
mmtf-workshop-2018
Structural Bioinformatics Training Workshop & Hackathon 2018
Stars: ✭ 50 (+6.38%)
Mutual labels:  pyspark
pyspark-for-data-processing
Code for my presentation: Using PySpark to Process Boat Loads of Data
Stars: ✭ 20 (-57.45%)
Mutual labels:  pyspark
spark-extension
A library that provides useful extensions to Apache Spark and PySpark.
Stars: ✭ 25 (-46.81%)
Mutual labels:  pyspark
Hackerrank-Codes
Here are some of the solutions to HackerRank questions.
Stars: ✭ 63 (+34.04%)
Mutual labels:  hacktoberfest2020
aut
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (+136.17%)
Mutual labels:  pyspark
lovelace-light-soft-ui-theme
🎨 Home Assistant soft UI light theme, with help from @JuanMTech, @thomasloven, and @N-l1.
Stars: ✭ 59 (+25.53%)
Mutual labels:  hacktoberfest2020
kafka-compose
🎼 Docker compose files for various kafka stacks
Stars: ✭ 32 (-31.91%)
Mutual labels:  pyspark
J.A.R.V.I.S
Just A Rather Very Intelligent System
Stars: ✭ 36 (-23.4%)
Mutual labels:  hacktoberfest2020
data processing course
Some class materials for a data processing course using PySpark
Stars: ✭ 50 (+6.38%)
Mutual labels:  pyspark
eks-cluster
Quickly spin up an AWS EKS Kubernetes cluster using AWS CloudFormation
Stars: ✭ 41 (-12.77%)
Mutual labels:  eks
Azure-Databricks-NYC-Taxi-Workshop
An Azure Databricks workshop leveraging the New York Taxi and Limousine Commission Trip Records dataset
Stars: ✭ 71 (+51.06%)
Mutual labels:  pyspark
andaluh-js
Transliterate español (spanish) spelling to andaluz proposals using javascript
Stars: ✭ 22 (-53.19%)
Mutual labels:  hacktoberfest2020
Leetcoding-Challenge
This repository contains Leetcode Challenge Submissions.
Stars: ✭ 26 (-44.68%)
Mutual labels:  hacktoberfest2020
NASSCOM-MHRD-IOT-Practical-Module 1-2
Arduino on TinkerCad
Stars: ✭ 26 (-44.68%)
Mutual labels:  hacktoberfest2020
pixie
Instant Kubernetes-Native Application Observability
Stars: ✭ 3,238 (+6789.36%)
Mutual labels:  eks
Resources
No description or website provided.
Stars: ✭ 25 (-46.81%)
Mutual labels:  hacktoberfest2020
SpoketoberfestResources
No description or website provided.
Stars: ✭ 16 (-65.96%)
Mutual labels:  hacktoberfest2020
Traverser
Traverser is a Java library that helps software engineers implement advanced iteration of a data structure.
Stars: ✭ 45 (-4.26%)
Mutual labels:  hacktoberfest2020
simplePythonProgram
No description or website provided.
Stars: ✭ 21 (-55.32%)
Mutual labels:  hacktoberfest2020
Algorithms
Short explanations and implementations of different algorithms in multiple languages
Stars: ✭ 37 (-21.28%)
Mutual labels:  hacktoberfest2020
ray-tracer
My ongoing effort to learn how to make Ray Tracers
Stars: ✭ 14 (-70.21%)
Mutual labels:  hacktoberfest2020
301-360 of 519 similar projects