All Projects → baleen3 → Similar Projects or Alternatives

59 Open source projects that are alternatives of or similar to baleen3

data processing course
Some class materials for a data processing course using PySpark
Stars: ✭ 50 (+233.33%)
Mutual labels:  data-processing
pulserl
Apache Pulsar client library for Erlang/Elixir
Stars: ✭ 15 (+0%)
Mutual labels:  data-processing
alfa
♿ Suite of open and standards-based tools for performing reliable accessibility conformance testing at scale
Stars: ✭ 75 (+400%)
Mutual labels:  data-processing
meta-schema
Little DSL to make data processing sane with clojure.spec and spec-tools
Stars: ✭ 25 (+66.67%)
Mutual labels:  data-processing
pyGAPS
A framework for processing adsorption data and isotherm fitting
Stars: ✭ 36 (+140%)
Mutual labels:  data-processing
sparklanes
A lightweight data processing framework for Apache Spark
Stars: ✭ 17 (+13.33%)
Mutual labels:  data-processing
bonobo-sqlalchemy
PREVIEW - SQL databases in Bonobo, using sqlalchemy
Stars: ✭ 23 (+53.33%)
Mutual labels:  data-processing
cq
Clojure Command-line Data Processor for JSON, YAML, EDN, XML and more
Stars: ✭ 111 (+640%)
Mutual labels:  data-processing
Speech-Recognition
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (+40%)
Mutual labels:  data-processing
mech
🦾 Main repository for the Mech programming language. Start here!
Stars: ✭ 135 (+800%)
Mutual labels:  data-processing
traceml
Engine for ML/Data tracking, visualization, dashboards, and model UI for Polyaxon.
Stars: ✭ 445 (+2866.67%)
Mutual labels:  data-processing
stargate
An Apache Pulsar client written in Elixir
Stars: ✭ 33 (+120%)
Mutual labels:  data-processing
Anatomy-of-System-Engineering
System Engineering Memory Map
Stars: ✭ 17 (+13.33%)
Mutual labels:  data-processing
ECG analysis
No description or website provided.
Stars: ✭ 32 (+113.33%)
Mutual labels:  data-processing
parallel-corpora-tools
Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.
Stars: ✭ 35 (+133.33%)
Mutual labels:  data-processing
rec-core
Data pipelining service
Stars: ✭ 19 (+26.67%)
Mutual labels:  data-processing
Processor
Ontology-driven Linked Data processor and server for SPARQL backends. Apache License.
Stars: ✭ 54 (+260%)
Mutual labels:  data-processing
blinkist-m4a-downloader
Grabs all of the audio files from all of the Blinkist books
Stars: ✭ 100 (+566.67%)
Mutual labels:  data-processing
machine-learning-data-pipeline
Pipeline module for parallel real-time data processing for machine learning models development and production purposes.
Stars: ✭ 22 (+46.67%)
Mutual labels:  data-processing
rsgislib
Remote Sensing and GIS Software Library; python module tools for processing spatial data.
Stars: ✭ 103 (+586.67%)
Mutual labels:  data-processing
processor
A simple and lightweight JavaScript data processing tool. Live demo:
Stars: ✭ 27 (+80%)
Mutual labels:  data-processing
perke
A keyphrase extractor for Persian
Stars: ✭ 60 (+300%)
Mutual labels:  data-processing
etl
[READ-ONLY] PHP - ETL (Extract Transform Load) data processing library
Stars: ✭ 279 (+1760%)
Mutual labels:  data-processing
Miller
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Stars: ✭ 4,633 (+30786.67%)
Mutual labels:  data-processing
Pxi
🧚 pxi (pixie) is a small, fast, and magical command-line data processor similar to jq, mlr, and awk.
Stars: ✭ 248 (+1553.33%)
Mutual labels:  data-processing
Amadeus
Harmonious distributed data analysis in Rust.
Stars: ✭ 240 (+1500%)
Mutual labels:  data-processing
Pysparkling
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
Stars: ✭ 231 (+1440%)
Mutual labels:  data-processing
Machine Learning Notebooks
Machine Learning notebooks for refreshing concepts.
Stars: ✭ 222 (+1380%)
Mutual labels:  data-processing
Vaspy
Manipulating VASP files with Python.
Stars: ✭ 185 (+1133.33%)
Mutual labels:  data-processing
Collapse
Advanced and Fast Data Transformation in R
Stars: ✭ 184 (+1126.67%)
Mutual labels:  data-processing
Texar
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 2,236 (+14806.67%)
Mutual labels:  data-processing
Padasip
Python Adaptive Signal Processing
Stars: ✭ 138 (+820%)
Mutual labels:  data-processing
Pulsar Flink
Elastic data processing with Apache Pulsar and Apache Flink
Stars: ✭ 126 (+740%)
Mutual labels:  data-processing
Data Processing Agreements
Collection of Data Processing Agreement (DPA) and GDPR compliance resources
Stars: ✭ 110 (+633.33%)
Mutual labels:  data-processing
Distributed Dataset
A distributed data processing framework in Haskell.
Stars: ✭ 108 (+620%)
Mutual labels:  data-processing
Bonobo
Extract Transform Load for Python 3.5+
Stars: ✭ 1,475 (+9733.33%)
Mutual labels:  data-processing
Bash Oneliner
A collection of handy Bash One-Liners and terminal tricks for data processing and Linux system maintenance.
Stars: ✭ 1,359 (+8960%)
Mutual labels:  data-processing
Machine Learning For Solar Energy Prediction
Predict the Power Production of a solar panel farm from Weather Measurements using Machine Learning
Stars: ✭ 94 (+526.67%)
Mutual labels:  data-processing
Broadway
Concurrent and multi-stage data ingestion and data processing with Elixir
Stars: ✭ 1,310 (+8633.33%)
Mutual labels:  data-processing
Forte
Forte is a flexible and powerful NLP builder FOR TExt. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 89 (+493.33%)
Mutual labels:  data-processing
Dialogpt
Large-scale pretraining for dialogue
Stars: ✭ 1,177 (+7746.67%)
Mutual labels:  data-processing
Pulsar Spark
When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (+266.67%)
Mutual labels:  data-processing
2019 Electronic Design Competition
【电赛】2019 全国大学生电子设计竞赛 (F题)纸张数量检测装置 (基于STM32F407 & FDC2214 & USART HMI)
Stars: ✭ 53 (+253.33%)
Mutual labels:  data-processing
Cbrain
CBRAIN is a flexible Ruby on Rails framework for accessing and processing of large data on high-performance computing infrastructures.
Stars: ✭ 51 (+240%)
Mutual labels:  data-processing
Mdsplus
The MDSplus data management system
Stars: ✭ 47 (+213.33%)
Mutual labels:  data-processing
Tdm
R package for normalizing RNA-seq data to make them comparable to microarray data.
Stars: ✭ 33 (+120%)
Mutual labels:  data-processing
Data Science On Gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+5660%)
Mutual labels:  data-processing
Dataflowjavasdk
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+5593.33%)
Mutual labels:  data-processing
Texar Pytorch
Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 636 (+4140%)
Mutual labels:  data-processing
Pandera
A light-weight, flexible, and expressive pandas data validation library
Stars: ✭ 506 (+3273.33%)
Mutual labels:  data-processing
Awesome Web Scraping
List of libraries, tools and APIs for web scraping and data processing.
Stars: ✭ 4,510 (+29966.67%)
Mutual labels:  data-processing
Awesome Kafka
A list about Apache Kafka
Stars: ✭ 397 (+2546.67%)
Mutual labels:  data-processing
Xidel
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Stars: ✭ 335 (+2133.33%)
Mutual labels:  data-processing
Eternal
👾~ music, eternal ~ 👾
Stars: ✭ 323 (+2053.33%)
Mutual labels:  data-processing
Dali
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Stars: ✭ 3,624 (+24060%)
Mutual labels:  data-processing
Nonechucks
Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
Stars: ✭ 304 (+1926.67%)
Mutual labels:  data-processing
Rapidtables
Super fast list of dicts to pre-formatted tables conversion library for Python 2/3
Stars: ✭ 292 (+1846.67%)
Mutual labels:  data-processing
Hub
Dataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+26586.67%)
Mutual labels:  data-processing
prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (+260%)
Mutual labels:  data-processing
1-59 of 59 similar projects