All Projects → etl → Similar Projects or Alternatives

333 Open source projects that are alternatives of or similar to etl

Texar Pytorch
Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 636 (+127.96%)
Mutual labels:  data-processing
Pandera
A light-weight, flexible, and expressive pandas data validation library
Stars: ✭ 506 (+81.36%)
Mutual labels:  data-processing
Awesome Web Scraping
List of libraries, tools and APIs for web scraping and data processing.
Stars: ✭ 4,510 (+1516.49%)
Mutual labels:  data-processing
Awesome Kafka
A list about Apache Kafka
Stars: ✭ 397 (+42.29%)
Mutual labels:  data-processing
Xidel
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Stars: ✭ 335 (+20.07%)
Mutual labels:  data-processing
Eternal
👾~ music, eternal ~ 👾
Stars: ✭ 323 (+15.77%)
Mutual labels:  data-processing
Dali
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Stars: ✭ 3,624 (+1198.92%)
Mutual labels:  data-processing
Nonechucks
Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
Stars: ✭ 304 (+8.96%)
Mutual labels:  data-processing
Rapidtables
Super fast list of dicts to pre-formatted tables conversion library for Python 2/3
Stars: ✭ 292 (+4.66%)
Mutual labels:  data-processing
Hub
Dataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+1334.77%)
Mutual labels:  data-processing
prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (-80.65%)
Mutual labels:  data-processing
baleen3
Baleen 3 is a data processing tool based on the Annot8 framework
Stars: ✭ 15 (-94.62%)
Mutual labels:  data-processing
data processing course
Some class materials for a data processing course using PySpark
Stars: ✭ 50 (-82.08%)
Mutual labels:  data-processing
pulserl
Apache Pulsar client library for Erlang/Elixir
Stars: ✭ 15 (-94.62%)
Mutual labels:  data-processing
alfa
♿ Suite of open and standards-based tools for performing reliable accessibility conformance testing at scale
Stars: ✭ 75 (-73.12%)
Mutual labels:  data-processing
meta-schema
Little DSL to make data processing sane with clojure.spec and spec-tools
Stars: ✭ 25 (-91.04%)
Mutual labels:  data-processing
pyGAPS
A framework for processing adsorption data and isotherm fitting
Stars: ✭ 36 (-87.1%)
Mutual labels:  data-processing
bonobo-sqlalchemy
PREVIEW - SQL databases in Bonobo, using sqlalchemy
Stars: ✭ 23 (-91.76%)
Mutual labels:  data-processing
cq
Clojure Command-line Data Processor for JSON, YAML, EDN, XML and more
Stars: ✭ 111 (-60.22%)
Mutual labels:  data-processing
Speech-Recognition
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (-92.47%)
Mutual labels:  data-processing
mech
🦾 Main repository for the Mech programming language. Start here!
Stars: ✭ 135 (-51.61%)
Mutual labels:  data-processing
traceml
Engine for ML/Data tracking, visualization, dashboards, and model UI for Polyaxon.
Stars: ✭ 445 (+59.5%)
Mutual labels:  data-processing
stargate
An Apache Pulsar client written in Elixir
Stars: ✭ 33 (-88.17%)
Mutual labels:  data-processing
Anatomy-of-System-Engineering
System Engineering Memory Map
Stars: ✭ 17 (-93.91%)
Mutual labels:  data-processing
ECG analysis
No description or website provided.
Stars: ✭ 32 (-88.53%)
Mutual labels:  data-processing
parallel-corpora-tools
Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.
Stars: ✭ 35 (-87.46%)
Mutual labels:  data-processing
rec-core
Data pipelining service
Stars: ✭ 19 (-93.19%)
Mutual labels:  data-processing
Processor
Ontology-driven Linked Data processor and server for SPARQL backends. Apache License.
Stars: ✭ 54 (-80.65%)
Mutual labels:  data-processing
blinkist-m4a-downloader
Grabs all of the audio files from all of the Blinkist books
Stars: ✭ 100 (-64.16%)
Mutual labels:  data-processing
machine-learning-data-pipeline
Pipeline module for parallel real-time data processing for machine learning models development and production purposes.
Stars: ✭ 22 (-92.11%)
Mutual labels:  data-processing
rsgislib
Remote Sensing and GIS Software Library; python module tools for processing spatial data.
Stars: ✭ 103 (-63.08%)
Mutual labels:  data-processing
processor
A simple and lightweight JavaScript data processing tool. Live demo:
Stars: ✭ 27 (-90.32%)
Mutual labels:  data-processing
perke
A keyphrase extractor for Persian
Stars: ✭ 60 (-78.49%)
Mutual labels:  data-processing
301-333 of 333 similar projects