soda-sparkSoda Spark is a PySpark library that helps you with testing your data in Spark Dataframes
Stars: ✭ 58 (+23.4%)
Mutual labels: pyspark, data-engineering
ButterfreeA tool for building feature stores.
Stars: ✭ 126 (+168.09%)
Mutual labels: pyspark, data-engineering
Pyspark Example ProjectExample project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+1246.81%)
Mutual labels: pyspark, data-engineering
jobAnalytics and searchJobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (-46.81%)
Mutual labels: pyspark, data-engineering
hacktoberfest2020This is a hacktoberfest repo with learning propose to make pull request(PR) and get contribute on opensource project
Stars: ✭ 13 (-72.34%)
Mutual labels: hacktoberfest2020
kdtree-in-pythonSource Code for K-d tree in Python series
Stars: ✭ 61 (+29.79%)
Mutual labels: hacktoberfest2020
VSCodePopThemeA port of Pop! Theme for VSCode
Stars: ✭ 33 (-29.79%)
Mutual labels: hacktoberfest2020
git-autocommitA bash script to automate pushing changes to github
Stars: ✭ 17 (-63.83%)
Mutual labels: hacktoberfest2020
iiitdmj-gpaGPA Calculator + Quiz Bot for IIITDM Jabalpur
Stars: ✭ 16 (-65.96%)
Mutual labels: hacktoberfest2020
algorithm-zooImplementations of algorithms from http://quantumalgorithmzoo.org/
Stars: ✭ 17 (-63.83%)
Mutual labels: hacktoberfest2020
hacktoberfest-beginnerMake your first contribution for Hacktoberfest in the simplest way
Stars: ✭ 16 (-65.96%)
Mutual labels: hacktoberfest2020
eksutilSample project to call Kubernetes API of an Amazon EKS cluster from AWS Lambda
Stars: ✭ 26 (-44.68%)
Mutual labels: eks
kuwalaKuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…
Stars: ✭ 474 (+908.51%)
Mutual labels: pyspark
first-pr-repoA step by step guide to help people make their first Pull Request
Stars: ✭ 29 (-38.3%)
Mutual labels: hacktoberfest2020
node-mongoose-setupNodejs MongoDB REST API Sarter
Stars: ✭ 48 (+2.13%)
Mutual labels: hacktoberfest2020
sparklanesA lightweight data processing framework for Apache Spark
Stars: ✭ 17 (-63.83%)
Mutual labels: pyspark
ml-in-productionThe practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow.
Stars: ✭ 29 (-38.3%)
Mutual labels: data-engineering