All Projects → krawler → Similar Projects or Alternatives

1099 Open source projects that are alternatives of or similar to krawler

Panther
Detect threats with log data and improve cloud security posture
Stars: ✭ 885 (+1635.29%)
Mutual labels:  etl
openrefine-batch
Shell script to run OpenRefine in batch mode (import, transform, export). It orchestrates OpenRefine (server) and a python client that communicates with the OpenRefine API.
Stars: ✭ 76 (+49.02%)
Mutual labels:  etl
Jupyter Renderers
Renderers and renderer extensions for JupyterLab
Stars: ✭ 395 (+674.51%)
Mutual labels:  geojson
pg-bifrost
PostgreSQL Logical Replication tool into Kinesis, S3 and RabbitMQ
Stars: ✭ 31 (-39.22%)
Mutual labels:  s3
pangeo-forge-recipes
Python library for building Pangeo Forge recipes.
Stars: ✭ 64 (+25.49%)
Mutual labels:  etl
Polybooljs
Boolean operations on polygons (union, intersection, difference, xor)
Stars: ✭ 333 (+552.94%)
Mutual labels:  geojson
Transformalize
Configurable Extract, Transform, and Load
Stars: ✭ 125 (+145.1%)
Mutual labels:  etl
Android Maps Utils
Maps SDK for Android Utility Library
Stars: ✭ 3,330 (+6429.41%)
Mutual labels:  geojson
Addax
Addax is an open source universal ETL tool that supports most of those RDBMS and NoSQLs on the planet, helping you transfer data from any one place to another.
Stars: ✭ 615 (+1105.88%)
Mutual labels:  etl
Geodata Br
Arquivos Geojson com perímetros dos municípios brasileiros por estado ( Brasil / Brazil )
Stars: ✭ 307 (+501.96%)
Mutual labels:  geojson
terraform-aws-s3-bucket
Terraform module that creates an S3 bucket with an optional IAM user for external CI/CD systems
Stars: ✭ 138 (+170.59%)
Mutual labels:  s3
Alltheplaces
A set of spiders and scrapers to extract location information from places that post their location on the internet.
Stars: ✭ 277 (+443.14%)
Mutual labels:  geojson
openrefine-docker
OpenRefine is a free, open source power tool for working with messy data and improving it. This repository contains Dockerbuild files for automated builds.
Stars: ✭ 19 (-62.75%)
Mutual labels:  etl
turf dart
A turf.js-like geospatial analysis library working with GeoJSON, written in pure Dart.
Stars: ✭ 14 (-72.55%)
Mutual labels:  geojson
Aws Data Wrangler
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+4576.47%)
Mutual labels:  etl
etl
M-Lab ingestion pipeline
Stars: ✭ 15 (-70.59%)
Mutual labels:  etl
FlowMaster
ETL flow framework based on Yaml configs in Python
Stars: ✭ 19 (-62.75%)
Mutual labels:  etl
geojson-bbox
Calculates extent/bbox for a given valid geojson object.
Stars: ✭ 25 (-50.98%)
Mutual labels:  geojson
graphchain
⚡️ An efficient cache for the execution of dask graphs.
Stars: ✭ 63 (+23.53%)
Mutual labels:  s3
spdr-etf-holdings
ETL for the SPDR ETF holdings XLS documents
Stars: ✭ 14 (-72.55%)
Mutual labels:  etl
logstash-output-s3
No description or website provided.
Stars: ✭ 55 (+7.84%)
Mutual labels:  s3
Mongo Es
A MongoDB to Elasticsearch connector
Stars: ✭ 185 (+262.75%)
Mutual labels:  etl
Dswarm Backoffice Web
The backoffice web application of d:swarm (https://github.com/dswarm/dswarm-documentation/wiki)
Stars: ✭ 11 (-78.43%)
Mutual labels:  etl
iex-stocks
ETL for the IEX Stocks API
Stars: ✭ 19 (-62.75%)
Mutual labels:  etl
countriesNowAPI
CountriesNow is an Open source API for retrieving geo-information for countries, including their states, cities, population, etc. 🌎
Stars: ✭ 78 (+52.94%)
Mutual labels:  geojson
TEAM
The Taxonomy for ETL Automation Metadata (TEAM) is a metadata management tool for data warehouse automation. It is part of the ecosystem for data warehouse automation, alongside the Virtual Data Warehouse pattern manager and the generic schema for Data Warehouse Automation.
Stars: ✭ 27 (-47.06%)
Mutual labels:  etl
xyr
Query any data source using SQL, works with the local filesystem, s3, and more. It should be a very tiny and lightweight alternative to AWS Athena, Presto ... etc.
Stars: ✭ 58 (+13.73%)
Mutual labels:  s3
TomboloDigitalConnector
The Tombolo Digital Connector enables users to combine different sources of data in a transparent and reproducible way.
Stars: ✭ 56 (+9.8%)
Mutual labels:  geojson
koza
Data transformation framework for LinkML data models
Stars: ✭ 21 (-58.82%)
Mutual labels:  etl
geojson-to-svg-cli
Command line tool to convert GeoJSON to SVG.
Stars: ✭ 22 (-56.86%)
Mutual labels:  geojson
Datax
DataX is an open source universal ETL tool that support Cassandra, ClickHouse, DBF, Hive, InfluxDB, Kudu, MySQL, Oracle, Presto(Trino), PostgreSQL, SQL Server
Stars: ✭ 116 (+127.45%)
Mutual labels:  etl
civil-services-geojson-app
Electron App for Loading GeoJSON files with Mapbox
Stars: ✭ 18 (-64.71%)
Mutual labels:  geojson
dddplus-archetype-demo
♨️ Using dddplus-archetype build a WMS in 5 minutes. 5分钟搭建一个仓储中台WMS!
Stars: ✭ 56 (+9.8%)
Mutual labels:  wms
awesome-integration
A curated list of awesome system integration software and resources.
Stars: ✭ 117 (+129.41%)
Mutual labels:  etl
Tuna
🐟 A streaming ETL for fish
Stars: ✭ 11 (-78.43%)
Mutual labels:  etl
neo4j-jdbc
JDBC driver for Neo4j
Stars: ✭ 110 (+115.69%)
Mutual labels:  etl
GeoJSON4EntityFramework
Create GeoJSON from Entity Framework Spatial Data or WKT
Stars: ✭ 18 (-64.71%)
Mutual labels:  geojson
lineage
Generate beautiful documentation for your data pipelines in markdown format
Stars: ✭ 16 (-68.63%)
Mutual labels:  etl
Aws Ecs Airflow
Run Airflow in AWS ECS(Elastic Container Service) using Fargate tasks
Stars: ✭ 107 (+109.8%)
Mutual labels:  etl
link-move
A model-driven dynamically-configurable framework to acquire data from external sources and save it to your database.
Stars: ✭ 32 (-37.25%)
Mutual labels:  etl
sparklanes
A lightweight data processing framework for Apache Spark
Stars: ✭ 17 (-66.67%)
Mutual labels:  etl
leaflet-geojson-selector
Show GeoJSON Layer like as Interactive Menu List
Stars: ✭ 88 (+72.55%)
Mutual labels:  geojson
s3-concat
Concat multiple files in s3
Stars: ✭ 35 (-31.37%)
Mutual labels:  s3
rivery cli
Rivery CLI
Stars: ✭ 16 (-68.63%)
Mutual labels:  etl
Metl
Metl is a simple, web-based integration platform that allows for several different styles of data integration including messaging, file based Extract/Transform/Load (ETL), and remote procedure invocation via Web Services. Read more at www.jumpmind.com/products/metl/overview
Stars: ✭ 185 (+262.75%)
Mutual labels:  etl
Bandar Log
Monitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 19 (-62.75%)
Mutual labels:  etl
dtd2mysql
MySQL / MariaDB import for DTD feeds (fares, timetable and routeing)
Stars: ✭ 25 (-50.98%)
Mutual labels:  etl
openrouteservice-docs
📝 This repository stores the swagger specifications of the openrouteservice API. Browse to swagger for a detailed overview.
Stars: ✭ 59 (+15.69%)
Mutual labels:  geojson
Csv2db
The CSV to database command line loader
Stars: ✭ 102 (+100%)
Mutual labels:  etl
GpsPrune
GpsPrune is a map-based application for viewing, editing and converting coordinate data from GPS systems.
Stars: ✭ 46 (-9.8%)
Mutual labels:  geojson
redis-connect-dist
Real-Time Event Streaming & Change Data Capture
Stars: ✭ 21 (-58.82%)
Mutual labels:  etl
open-geo-data-education
Open Geospatial Datasets for GIS Education: This is a repository of open geospatial datasets to be used in an educational context. I created these files over years of teaching Geographic Data Science and GIS. All original datasets are freely available online with open data licenses (see the dataset attribution for details). All the datasets in t…
Stars: ✭ 52 (+1.96%)
Mutual labels:  geojson
thain
Thain is a distributed flow schedule platform.
Stars: ✭ 81 (+58.82%)
Mutual labels:  etl
wikirepo
Python based Wikidata framework for easy dataframe extraction
Stars: ✭ 33 (-35.29%)
Mutual labels:  etl
Phila Airflow
Stars: ✭ 16 (-68.63%)
Mutual labels:  etl
etl
[READ-ONLY] PHP - ETL (Extract Transform Load) data processing library
Stars: ✭ 279 (+447.06%)
Mutual labels:  etl
DataX-src
DataX 是异构数据广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、SqlServer、Postgre、HDFS、Hive、ADS、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。
Stars: ✭ 21 (-58.82%)
Mutual labels:  etl
AirflowETL
Blog post on ETL pipelines with Airflow
Stars: ✭ 20 (-60.78%)
Mutual labels:  etl
cogj-spec
Cloud Optimized GeoJSON spec
Stars: ✭ 36 (-29.41%)
Mutual labels:  geojson
Aws Serverless Data Lake Framework
Enterprise-grade, production-hardened, serverless data lake on AWS
Stars: ✭ 179 (+250.98%)
Mutual labels:  etl
301-360 of 1099 similar projects