942 open source projects that are alternatives to or similar to openverse-catalog

openverse-api
The Openverse API allows programmatic access to search for CC-licensed and public domain digital media.
Stars: ✭ 41 (+51.85%)
Airflow Pipeline
An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Stars: ✭ 128 (+374.07%)
Mutual labels:  airflow, spark
Sparkler
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Stars: ✭ 362 (+1240.74%)
Mutual labels:  search-engine, spark
Goodreads etl pipeline
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+2837.04%)
Mutual labels:  airflow, spark
airflow-client-python
Apache Airflow - OpenApi Client for Python
Stars: ✭ 172 (+537.04%)
Mutual labels:  airflow, apache-airflow
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+1429.63%)
Mutual labels:  airflow, spark
airflow-user-management-plugin
A plugin for Apache Airflow that allows you to manage the users that can log in
Stars: ✭ 13 (-51.85%)
Mutual labels:  airflow, apache-airflow
Around Dataengineering
A Data Engineering & Machine Learning Knowledge Hub
Stars: ✭ 257 (+851.85%)
Mutual labels:  airflow, spark
Airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Stars: ✭ 24,101 (+89162.96%)
Mutual labels:  airflow, apache-airflow
airflow-code-editor
A plugin for Apache Airflow that allows you to edit DAGs in browser
Stars: ✭ 195 (+622.22%)
Mutual labels:  airflow, apache-airflow
Dataspherestudio
DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+4325.93%)
Mutual labels:  airflow, spark
Awesome Apache Airflow
Curated list of resources about Apache Airflow
Stars: ✭ 2,755 (+10103.7%)
Mutual labels:  airflow, apache-airflow
fairflow
Functional Airflow DAG definitions.
Stars: ✭ 38 (+40.74%)
Mutual labels:  airflow, apache-airflow
bigkube
Minikube for big data with Scala and Spark
Stars: ✭ 16 (-40.74%)
Mutual labels:  airflow, spark
Search Ads Web Service
Online search advertisement platform & Realtime Campaign Monitoring [Maybe Deprecated]
Stars: ✭ 30 (+11.11%)
Mutual labels:  search-engine, spark
airflow-prometheus-exporter
Export Airflow metrics (from mysql) in prometheus format
Stars: ✭ 25 (-7.41%)
Mutual labels:  airflow, apache-airflow
Udacity Data Engineering
Udacity Data Engineering Nano Degree (DEND)
Stars: ✭ 89 (+229.63%)
Mutual labels:  airflow, spark
airflow-boilerplate
A complete development environment setup for working with Airflow
Stars: ✭ 94 (+248.15%)
Mutual labels:  airflow, apache-airflow
viewflow
Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.
Stars: ✭ 110 (+307.41%)
Mutual labels:  airflow, apache-airflow
vim-www
Toolbox to open & search URLs from vim
Stars: ✭ 32 (+18.52%)
Mutual labels:  search-engine
visualize-data-with-python
A Jupyter notebook using some standard techniques for data science and data engineering to analyze data for the 2017 flooding in Houston, TX.
Stars: ✭ 60 (+122.22%)
Mutual labels:  spark
pytest-notebook
A pytest plugin for regression testing and regenerating Jupyter Notebooks
Stars: ✭ 35 (+29.63%)
Mutual labels:  pytest
lazarus-beginners-guide
A book written for new Lazarus users, named "Beginners’ Guide to Lazarus IDE". Moved to: https://gitlab.com/adnan360/lazarus-beginners-guide
Stars: ✭ 26 (-3.7%)
Mutual labels:  creative-commons
sparkar-volts
An extensive non-reactive Typescript framework that eases the development experience in Spark AR
Stars: ✭ 15 (-44.44%)
Mutual labels:  spark
pytest-faulthandler
py.test plugin that activates the fault handler module during testing
Stars: ✭ 27 (+0%)
Mutual labels:  pytest
evildork
Evildork targeting your fiancee👁️
Stars: ✭ 46 (+70.37%)
Mutual labels:  search-engine
pytest-localstack
Pytest plugin for local AWS integration tests
Stars: ✭ 66 (+144.44%)
Mutual labels:  pytest
spark-stringmetric
Spark functions to run popular phonetic and string matching algorithms
Stars: ✭ 51 (+88.89%)
Mutual labels:  spark
dotnetlive.search
Asp.Net Core + ElasticSearch
Stars: ✭ 18 (-33.33%)
Mutual labels:  search-engine
swordfish
Open-source distributed workflow scheduling tool that also supports streaming tasks.
Stars: ✭ 35 (+29.63%)
Mutual labels:  spark
pytest-pytorch
pytest plugin for a better developer experience when working with the PyTorch test suite
Stars: ✭ 36 (+33.33%)
Mutual labels:  pytest
iresearch
IResearch is a cross-platform, high-performance, document-oriented search engine library written entirely in C++ with a focus on pluggability of different ranking/similarity models
Stars: ✭ 121 (+348.15%)
Mutual labels:  search-engine
ml-in-production
The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow.
Stars: ✭ 29 (+7.41%)
Mutual labels:  apache-airflow
awesome-AI-kubernetes
❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++ etc
Stars: ✭ 95 (+251.85%)
Mutual labels:  spark
collector-filesystem
Norconex Filesystem Collector is a flexible crawler for collecting, parsing, and manipulating data ranging from local hard drives to network locations into various data repositories such as search engines.
Stars: ✭ 17 (-37.04%)
Mutual labels:  search-engine
hsploit
An advanced command-line search engine for Exploit-DB
Stars: ✭ 16 (-40.74%)
Mutual labels:  search-engine
python-page-object
📔 Page object design pattern implementation (python, pom, selenium, pytest, travisCI)
Stars: ✭ 41 (+51.85%)
Mutual labels:  pytest
indexer4j
Simple full text indexing and searching library for Java
Stars: ✭ 47 (+74.07%)
Mutual labels:  search-engine
Spark-Ar
Resources for Spark AR
Stars: ✭ 43 (+59.26%)
Mutual labels:  spark
airflow-dbt
Apache Airflow integration for dbt
Stars: ✭ 233 (+762.96%)
Mutual labels:  airflow
starter
Create vertical search web application in minutes with generator (based on ItemsAPI)
Stars: ✭ 21 (-22.22%)
Mutual labels:  search-engine
flow-indexer
Flow-Indexer indexes flows found in chunked log files from bro, nfdump, syslog, or pcap files
Stars: ✭ 43 (+59.26%)
Mutual labels:  search-engine
airflow-tutorial
Use Airflow to move data from multiple MySQL databases to BigQuery
Stars: ✭ 96 (+255.56%)
Mutual labels:  airflow
pytest-eth
PyTest plugin for testing smart contracts for Ethereum blockchain.
Stars: ✭ 23 (-14.81%)
Mutual labels:  pytest
module-search-mysql-legacy
No description or website provided.
Stars: ✭ 52 (+92.59%)
Mutual labels:  search-engine
data processing course
Some class materials for a data processing course using PySpark
Stars: ✭ 50 (+85.19%)
Mutual labels:  spark
pytest-it
Decorate your pytest suite with RSpec-style pytest markers, then run `pytest --it` to see a plaintext spec of the test structure.
Stars: ✭ 26 (-3.7%)
Mutual labels:  pytest
experiments
Code examples for my blog posts
Stars: ✭ 21 (-22.22%)
Mutual labels:  spark
pytest-snapshot
A plugin for snapshot testing with pytest.
Stars: ✭ 68 (+151.85%)
Mutual labels:  pytest
lupyne
Pythonic search engine based on PyLucene.
Stars: ✭ 61 (+125.93%)
Mutual labels:  search-engine
Data-Engineering-Projects
Personal Data Engineering Projects
Stars: ✭ 167 (+518.52%)
Mutual labels:  airflow
astro
Astro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (+192.59%)
Mutual labels:  airflow
pytest-watcher
Rerun pytest when your code changes
Stars: ✭ 60 (+122.22%)
Mutual labels:  pytest
jobAnalytics and search
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (-7.41%)
Mutual labels:  airflow
myrepo
continuous integration rep
Stars: ✭ 41 (+51.85%)
Mutual labels:  pytest
Free-Internet-Plugin
A free Internet is a better Internet. This Chrome browser plugin removes paywalled content from Google search results.
Stars: ✭ 121 (+348.15%)
Mutual labels:  search-engine
pytest-mock-server
Mock server plugin for pytest
Stars: ✭ 19 (-29.63%)
Mutual labels:  pytest
Horizon
A ZeroNet search engine
Stars: ✭ 15 (-44.44%)
Mutual labels:  search-engine
AirDataComputer
Air Data Computer
Stars: ✭ 29 (+7.41%)
Mutual labels:  airflow
code-compass
a contextual search engine for software packages built on import2vec embeddings (https://www.code-compass.com)
Stars: ✭ 33 (+22.22%)
Mutual labels:  search-engine