All Projects → autonomio → wrangle

autonomio / wrangle

Licence: MIT license
A data transformation package for deep learning with Autonomio, Keras and TensorFlow.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to wrangle

uptasticsearch
An Elasticsearch client tailored to data science workflows.
Stars: ✭ 47 (+213.33%)
Mutual labels:  etl
hamilton
A scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+3980%)
Mutual labels:  etl
architect big data solutions with spark
code, labs and lectures for the course
Stars: ✭ 40 (+166.67%)
Mutual labels:  etl
python mozetl
ETL jobs for Firefox Telemetry
Stars: ✭ 25 (+66.67%)
Mutual labels:  etl
nasdaq-symbols
ETL for the NASDAQ symbol file
Stars: ✭ 13 (-13.33%)
Mutual labels:  etl
sql-to-redis
🔄 Simple tool for ETL. From SQL to Redis.
Stars: ✭ 18 (+20%)
Mutual labels:  etl
flock
Flock: A Low-Cost Streaming Query Engine on FaaS Platforms
Stars: ✭ 232 (+1446.67%)
Mutual labels:  etl
etlflow
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various different tasks, jobs on GCP and AWS.
Stars: ✭ 38 (+153.33%)
Mutual labels:  etl
DQCS
数据质量控制系统
Stars: ✭ 34 (+126.67%)
Mutual labels:  etl
butterfly
Application transformation tool
Stars: ✭ 35 (+133.33%)
Mutual labels:  transformation
csvplus
csvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (+346.67%)
Mutual labels:  etl
covid-19
Data ETL & Analysis on the global and Mexican datasets of the COVID-19 pandemic.
Stars: ✭ 14 (-6.67%)
Mutual labels:  etl
dogETL
A lib to transform data from jdbc,csv,json to ecah other.
Stars: ✭ 15 (+0%)
Mutual labels:  etl
CVparser
CVparser is software for parsing or extracting data out of CV/resumes.
Stars: ✭ 28 (+86.67%)
Mutual labels:  etl
bigquery-kafka-connect
☁️ nodejs kafka connect connector for Google BigQuery
Stars: ✭ 17 (+13.33%)
Mutual labels:  etl
django-calaccess-raw-data
A Django app to download, extract and load campaign finance and lobbying activity data from the California Secretary of State's CAL-ACCESS database
Stars: ✭ 61 (+306.67%)
Mutual labels:  etl
PNG-Upscale
AI Super - Resolution
Stars: ✭ 116 (+673.33%)
Mutual labels:  resampling
mik
The Move to Islandora Kit is an extensible PHP command-line tool for converting source content and metadata into packages suitable for importing into Islandora (or other digital repository and preservations systems).
Stars: ✭ 32 (+113.33%)
Mutual labels:  etl
modeltime.resample
Resampling Tools for Time Series Forecasting with Modeltime
Stars: ✭ 12 (-20%)
Mutual labels:  resampling
singer-runner
A CLI and library to run Singer Taps and Targets
Stars: ✭ 33 (+120%)
Mutual labels:  etl


Wrangle

Data preparation for deep learning

Talos Travis Talos Coveralls

WrangleKey FeaturesInstallSupportIssuesLicenseDownload


Wrangle provides the building blocks for entirely avoiding redundant, easy-to-automate, data preparation tasks.

Wrangle

TL;DR

Wrangle dramatically simplifies 95% of data preparation tasks involved in advanced deep learning practice and provides the required building blocks for near-future automated machine intelligence workflows. Wrangle is created to solve the problem of avoiding beneficial workflow steps due to complexity, cognitive overhead, and the anxiety that comes with it.

Key Features

Because of the large number of functions, many of which are frequently used in common deep learning data preparation workflows, Wrangle is notably focused on namespace. All functions are named in a way where the name explains exactly what can be expected in terms of capability. Let's dissect a few as an example:

In col_to_binary col refers to what is being processed, in this case a column of a dataframe. to refers to the particular process, in this case a conversion. binary refers to the output. In this case a given column in a dataframe is converted into binary values. For example, a continuous column is converted to binary classes based on if the values are below or above mean value. Similarly array_reshape_conv1d can be understood as taking in an array, and reshaping it to conv1d layer required shape.

Wrangle key features include:

  • Resampling
  • Transformation
  • Renaming
  • Grouping
  • Merging
  • Correlations
  • Reshaping
  • Cleaning

Wrangle works on Linux, Mac OSX, and Windows systems.

Install

Stable version:

pip install wrangle

Daily development version:

pip install git+https://github.com/autonomio/wrangle.git@daily-dev

Support

If you want ask a "how can I use Wrangle to..." question, the right place is StackOverflow.

If you found a bug or want to suggest a feature, check the issues or create a new issue.

License

MIT License

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].