All Projects → jobAnalytics_and_search → Similar Projects or Alternatives

845 Open source projects that are alternatives of or similar to jobAnalytics_and_search

Goodreads etl pipeline
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+3072%)
Mutual labels:  airflow, s3, data-engineering, redshift
Data-Engineering-Projects
Personal Data Engineering Projects
Stars: ✭ 167 (+568%)
Udacity Data Engineering
Udacity Data Engineering Nano Degree (DEND)
Stars: ✭ 89 (+256%)
Mutual labels:  airflow, s3, redshift
Dataengineeringproject
Example end to end data engineering project.
Stars: ✭ 82 (+228%)
Mutual labels:  airflow, s3, data-engineering
AirflowETL
Blog post on ETL pipelines with Airflow
Stars: ✭ 20 (-20%)
awesome-sustainability-jobs
Dev jobs in the sustainability sector
Stars: ✭ 149 (+496%)
Mutual labels:  jobs, jobseeker, jobsearch
career-resources
Some SWE/PM/Designer related career resources for students
Stars: ✭ 154 (+516%)
Mutual labels:  jobs, jobseeker, jobsearch
collector
A job board data collector
Stars: ✭ 27 (+8%)
Mutual labels:  jobs, jobsearch
ob bulkstash
Bulk Stash is a docker rclone service to sync, or copy, files between different storage services. For example, you can copy files either to or from a remote storage services like Amazon S3 to Google Cloud Storage, or locally from your laptop to a remote storage.
Stars: ✭ 113 (+352%)
Mutual labels:  s3, data-pipeline
polygon-etl
ETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (+112%)
Mutual labels:  airflow, data-engineering
DataEngineering
This repo contains commands that data engineers use in day to day work.
Stars: ✭ 47 (+88%)
Mutual labels:  pyspark, data-engineering
astro
Astro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (+216%)
Mutual labels:  airflow, s3
Airflow Autoscaling Ecs
Airflow Deployment on AWS ECS Fargate Using Cloudformation
Stars: ✭ 136 (+444%)
Mutual labels:  airflow, data-engineering
counter-interview.dev
a collaborative collection of interview questions collected from both sides of the game: Interviewer(s) and Interviewee.
Stars: ✭ 102 (+308%)
Mutual labels:  jobseeker, jobsearch
js jobs bot
JS Jobs search telegram channel
Stars: ✭ 24 (-4%)
Mutual labels:  jobs, jobsearch
Aws Data Wrangler
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+9440%)
Mutual labels:  data-engineering, redshift
Data Engineering Howto
A list of useful resources to learn Data Engineering from scratch
Stars: ✭ 2,056 (+8124%)
Mutual labels:  data-engineering, data-pipeline
Locopy
locopy: Loading/Unloading to Redshift and Snowflake using Python.
Stars: ✭ 73 (+192%)
Mutual labels:  s3, redshift
vagas
Mural de vagas para desenvolvedor Android.
Stars: ✭ 748 (+2892%)
Mutual labels:  jobs, jobsearch
Pyspark Example Project
Example project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+2432%)
Mutual labels:  pyspark, data-engineering
Cluster Pack
A library on top of either pex or conda-pack to make your Python code easily available on a cluster
Stars: ✭ 23 (-8%)
Mutual labels:  s3, pyspark
Awesome Aws
A curated list of awesome Amazon Web Services (AWS) libraries, open source repos, guides, blogs, and other resources. Featuring the Fiery Meter of AWSome.
Stars: ✭ 9,895 (+39480%)
Mutual labels:  s3, redshift
Around Dataengineering
A Data Engineering & Machine Learning Knowledge Hub
Stars: ✭ 257 (+928%)
Mutual labels:  airflow, data-engineering
Udacity Data Engineering Projects
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Stars: ✭ 458 (+1732%)
Mutual labels:  airflow, data-engineering
Objinsync
Continuously synchronize directories from remote object store to local filesystem
Stars: ✭ 29 (+16%)
Mutual labels:  airflow, s3
Soda Sql
Metric collection, data testing and monitoring for SQL accessible data
Stars: ✭ 173 (+592%)
Mutual labels:  airflow, data-engineering
airflow-dbt-python
A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.
Stars: ✭ 111 (+344%)
Mutual labels:  airflow, data-engineering
aws-pdf-textract-pipeline
🔍 Data pipeline for crawling PDFs from the Web and transforming their contents into structured data using AWS textract. Built with AWS CDK + TypeScript
Stars: ✭ 141 (+464%)
Mutual labels:  s3, data-pipeline
udacity-data-eng-proj2
A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract data from S3, apply a series of transformations and load into S3 and Redshift.
Stars: ✭ 25 (+0%)
Mutual labels:  airflow, redshift
viewflow
Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.
Stars: ✭ 110 (+340%)
Mutual labels:  airflow, data-engineering
AirflowDataPipeline
Example of an ETL Pipeline using Airflow
Stars: ✭ 24 (-4%)
Mutual labels:  airflow, data-engineering
Azure-Certification-DP-200
Road to Azure Data Engineer Part-I: DP-200 - Implementing an Azure Data Solution
Stars: ✭ 54 (+116%)
Mutual labels:  data-engineering, data-lake
saisoku
Saisoku is a Python module that helps you build complex pipelines of batch file/directory transfer/sync jobs.
Stars: ✭ 40 (+60%)
Mutual labels:  s3, data-pipeline
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (+56%)
Mutual labels:  pyspark, data-pipeline
practical-data-engineering
Real estate dagster pipeline
Stars: ✭ 110 (+340%)
Mutual labels:  data-engineering, data-pipeline
growthbook
Open Source Feature Flagging and A/B Testing Platform
Stars: ✭ 2,342 (+9268%)
Mutual labels:  data-engineering, redshift
tellery
Tellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
Stars: ✭ 219 (+776%)
Mutual labels:  redshift, data-modeling
Yuniql
Free and open source schema versioning and database migration made natively with .NET Core.
Stars: ✭ 156 (+524%)
Mutual labels:  data-engineering, redshift
Remote Jobs
A list of semi to fully remote-friendly companies (jobs) in tech.
Stars: ✭ 17,863 (+71352%)
Mutual labels:  jobseeker, jobsearch
Butterfree
A tool for building feature stores.
Stars: ✭ 126 (+404%)
Mutual labels:  pyspark, data-engineering
Foundatio
Pluggable foundation blocks for building distributed apps.
Stars: ✭ 1,365 (+5360%)
Mutual labels:  s3, jobs
soda-spark
Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes
Stars: ✭ 58 (+132%)
Mutual labels:  pyspark, data-engineering
go-localstack
Go Wrapper for using localstack
Stars: ✭ 56 (+124%)
Mutual labels:  s3, redshift
opentrials-airflow
Configuration and definitions of Airflow for OpenTrials
Stars: ✭ 18 (-28%)
Mutual labels:  airflow, data-pipeline
dbt-ml-preprocessing
A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.
Stars: ✭ 128 (+412%)
Mutual labels:  redshift
gallia-core
A schema-aware Scala library for data transformation
Stars: ✭ 44 (+76%)
Mutual labels:  data-engineering
junior.guru
Learn to code and get your first job in tech 🐣
Stars: ✭ 27 (+8%)
Mutual labels:  jobs
LetsHack
Notes & HowTo's covering the Raspberry Pi, Arduino, ESP8266, ESP32, etc.
Stars: ✭ 37 (+48%)
Mutual labels:  s3
node-redshift
A simple collection of tools to help you get started with Amazon Redshift from node.js
Stars: ✭ 66 (+164%)
Mutual labels:  redshift
gozeit
GoZeit
Stars: ✭ 19 (-24%)
Mutual labels:  s3
s3-proxy
S3 Reverse Proxy with GET, PUT and DELETE methods and authentication (OpenID Connect and Basic Auth)
Stars: ✭ 106 (+324%)
Mutual labels:  s3
s3storage
Simple rails plugin that makes it easy to store uploaded files on Amazon S3
Stars: ✭ 15 (-40%)
Mutual labels:  s3
Dive-Into-AWS
Links to the Repos and Sections in our Dive into AWS Course.
Stars: ✭ 27 (+8%)
Mutual labels:  s3
vagas
💼 É dev? É devops? É bom? Quer mexer com muita tecnologia e desafios? Vem pro match!
Stars: ✭ 21 (-16%)
Mutual labels:  jobs
jobor
支持秒级分布式定时任务系统, A high performance distributed task scheduling system, Support multi protocol scheduling tasks
Stars: ✭ 52 (+108%)
Mutual labels:  jobs
vagas
Vagas e empresas que ativamente contratam pessoas desenvolvedoras Clojure no Brasil
Stars: ✭ 75 (+200%)
Mutual labels:  jobs
herd-mdl
Herd-MDL, a turnkey managed data lake in the cloud. See https://finraos.github.io/herd-mdl/ for more information.
Stars: ✭ 11 (-56%)
Mutual labels:  data-lake
terraform-aws-cloudtrail
Terraform module to provision an AWS CloudTrail and an encrypted S3 bucket with versioning to store CloudTrail logs
Stars: ✭ 78 (+212%)
Mutual labels:  s3
hiring-system
CodeCareer is seeking core contributors to take the lead on this project.
Stars: ✭ 16 (-36%)
Mutual labels:  jobs
firehoser
A wrapper around AWS Kinesis Firehose with retry logic and custom queuing behavior. Requires node >= 6.0.0
Stars: ✭ 22 (-12%)
Mutual labels:  redshift
1-60 of 845 similar projects