All Projects → alteryx → autonormalize

alteryx / autonormalize

Licence: BSD-3-Clause license
python library for automated dataset normalization

Programming Languages

python
139335 projects - #7 most used programming language
Makefile
30231 projects
shell
77523 projects

Projects that are alternatives of or similar to autonormalize

Compile Hero
🔰Visual Studio Code Extension For Compiling Language
Stars: ✭ 169 (+62.5%)
Mutual labels:  automatic
Errant
ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.
Stars: ✭ 208 (+100%)
Mutual labels:  automatic
scSeqR
This package has migrated to https://github.com/rezakj/iCellR please use iCellR instead of scSeqR for more functionalities and updates.
Stars: ✭ 16 (-84.62%)
Mutual labels:  normalization
Scapix
Scapix Language Bridge
Stars: ✭ 171 (+64.42%)
Mutual labels:  automatic
Django Easy Timezones
Easy timezones for Django based on GeoIP
Stars: ✭ 195 (+87.5%)
Mutual labels:  automatic
WeTextProcessing
Text Normalization & Inverse Text Normalization
Stars: ✭ 213 (+104.81%)
Mutual labels:  normalization
Automatic Gatsbyjs App Landing Page
Automatic GatsbyJS App Landing Page - Automatically generate iOS app landing page using GatsbyJS
Stars: ✭ 137 (+31.73%)
Mutual labels:  automatic
ORNA
Fast in-silico normalization algorithm for NGS data
Stars: ✭ 21 (-79.81%)
Mutual labels:  normalization
Ts Auto Mock
Typescript transformer to unlock automatic mock creation for interfaces and classes
Stars: ✭ 204 (+96.15%)
Mutual labels:  automatic
pyFileManager
This script lets you automatically relocate files based on their extensions. Very useful from the downloads folder !
Stars: ✭ 22 (-78.85%)
Mutual labels:  automatic
Simfix
Automatically fix programs by leveraging existing patches from other projects and similar code snippets from the faulty project.
Stars: ✭ 173 (+66.35%)
Mutual labels:  automatic
3d Bat
3D Bounding Box Annotation Tool (3D-BAT) Point cloud and Image Labeling
Stars: ✭ 179 (+72.12%)
Mutual labels:  automatic
class-norm
Class Normalization for Continual Zero-Shot Learning
Stars: ✭ 34 (-67.31%)
Mutual labels:  normalization
Coot
玩转 IFTTT 体验极客生活,互联网自动化神器【已经停止开发】
Stars: ✭ 172 (+65.38%)
Mutual labels:  automatic
rclone4pi
rclone4pi - Easy Install of rclone to Raspberry Pi Computer with automated folder sync demo
Stars: ✭ 91 (-12.5%)
Mutual labels:  automatic
Automatic Youtube Reddit Text To Speech Video Generator And Uploader
A series of 3 programs that will automatically receive scripts from Reddit, allow the user to edit them, then be sent off to a video generator where they will be uploaded to YouTube automatically.
Stars: ✭ 152 (+46.15%)
Mutual labels:  automatic
Knock
🔑 Scan the entire internet for SSH and Telnet services. Then hack them.
Stars: ✭ 60 (-42.31%)
Mutual labels:  automatic
MeetNinja-Google-Meet-Bot
A super dope tool that attends your Google Meet(s) for you. Flawlessly handles scheduled multiple (subsequent) Meet sessions. Also disables the camera & microphone, and shows timestamps of joining & ending times for each Meet. Supported: Google Chrome / Mozilla Firefox running on Linux / Mac / Windows
Stars: ✭ 56 (-46.15%)
Mutual labels:  automatic
keras-utility-layer-collection
Collection of custom layers and utility functions for Keras which are missing in the main framework.
Stars: ✭ 63 (-39.42%)
Mutual labels:  normalization
VF-BlenderAutoSaveRender
Automatically saves a numbered or dated image after every render and can extend the Blender output path with dynamic variables
Stars: ✭ 34 (-67.31%)
Mutual labels:  automatic

AutoNormalize

Tests

AutoNormalize is a Python library for automated datatable normalization. It allows you to build an EntitySet from a single denormalized table and generate features for machine learning using Featuretools.

Getting Started

Install

pip install featuretools[autonormalize]

Uninstall

pip uninstall autonormalize

Demos

API Reference

auto_entityset

auto_entityset(df, accuracy=0.98, index=None, name=None, time_index=None)

Creates a normalized entityset from a dataframe.

Arguments:

  • df (pd.Dataframe) : the dataframe containing data

  • accuracy (0 < float <= 1.00; default = 0.98) : the accuracy threshold required in order to conclude a dependency (i.e. with accuracy = 0.98, 0.98 of the rows must hold true the dependency LHS --> RHS)

  • index (str, optional) : name of column that is intended index of df

  • name (str, optional) : the name of created EntitySet

  • time_index (str, optional) : name of time column in the dataframe.

Returns:

  • entityset (ft.EntitySet) : created entity set

find_dependencies

find_dependencies(df, accuracy=0.98, index=None)

Finds dependencies within dataframe with the DFD search algorithm.

Returns:

  • dependencies (Dependencies) : the dependencies found in the data within the contraints provided

normalize_dataframe

normalize_dataframe(df, dependencies)

Normalizes dataframe based on the dependencies given. Keys for the newly created DataFrames can only be columns that are strings, ints, or categories. Keys are chosen according to the priority:

  1. shortest lenghts
  2. has "id" in some form in the name of an attribute
  3. has attribute furthest to left in the table

Returns:

  • new_dfs (list[pd.DataFrame]) : list of new dataframes

make_entityset

make_entityset(df, dependencies, name=None, time_index=None)

Creates a normalized EntitySet from dataframe based on the dependencies given. Keys are chosen in the same fashion as for normalize_dataframeand a new index will be created if any key has more than a single attribute.

Returns:

  • entityset (ft.EntitySet) : created EntitySet

normalize_entityset

normalize_entityset(es, accuracy=0.98)

Returns a new normalized EntitySet from an EntitySet with a single entity.

Arguments:

  • es (ft.EntitySet) : EntitySet with a single entity to normalize

Returns:

  • new_es (ft.EntitySet) : new normalized EntitySet

Built at Alteryx Innovation Labs

Alteryx Innovation Labs
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].