All Projects → dflib → Similar Projects or Alternatives

745 Open source projects that are alternatives of or similar to dflib

DataFrame
DataFrame Library for Java
Stars: ✭ 51 (+2%)
Eland
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (+370%)
Mutual labels:  etl, data-analysis, dataframe
daany
Daany - .NET DAta ANalYtics .NET library with the implementation of DataFrame, Time series decompositions and Linear Algebra routines BLASS and LAPACK.
Stars: ✭ 49 (-2%)
Mutual labels:  data-frame, series, dataframe
heidi
heidi : tidy data in Haskell
Stars: ✭ 24 (-52%)
Tablesaw
Java dataframe and visualization library
Stars: ✭ 2,785 (+5470%)
Mutual labels:  data-frame, data-analysis, dataframe
polars
Fast multi-threaded DataFrame library in Rust | Python | Node.js
Stars: ✭ 6,368 (+12636%)
Mutual labels:  dataframe, dataframe-library
Dominando-Pandas
Este repositório está destinado ao processo de aprendizagem da biblioteca Pandas.
Stars: ✭ 22 (-56%)
Mutual labels:  data-analysis, dataframe
Static Frame
Immutable and grow-only Pandas-like DataFrames with a more explicit and consistent interface.
Stars: ✭ 217 (+334%)
Mutual labels:  data-analysis, dataframe
Dataframe Js
A javascript library providing a new data structure for datascientists and developpers
Stars: ✭ 376 (+652%)
Mutual labels:  data-frame, dataframe
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (+58%)
Mutual labels:  etl, data-analysis
Awesome Business Intelligence
Actively curated list of awesome BI tools. PRs welcome!
Stars: ✭ 1,157 (+2214%)
Mutual labels:  etl, data-analysis
Datatable
A go in-memory table
Stars: ✭ 215 (+330%)
Mutual labels:  series, dataframe
Styleframe
A library that wraps pandas and openpyxl and allows easy styling of dataframes in excel
Stars: ✭ 252 (+404%)
Mutual labels:  data-frame, dataframe
Datacleaner
The premier open source Data Quality solution
Stars: ✭ 391 (+682%)
Mutual labels:  etl, data-analysis
Ether sql
A python library to push ethereum blockchain data into an sql database.
Stars: ✭ 41 (-18%)
Mutual labels:  etl, data-analysis
architect big data solutions with spark
code, labs and lectures for the course
Stars: ✭ 40 (-20%)
Mutual labels:  etl, data-analysis
Pandastable
Table analysis in Tkinter using pandas DataFrames.
Stars: ✭ 376 (+652%)
Mutual labels:  data-analysis, dataframe
Qframe
Immutable data frame for Go
Stars: ✭ 282 (+464%)
Mutual labels:  data-frame, dataframe
Etl unicorn
数据可视化, 数据挖掘, 数据处理 ETL
Stars: ✭ 156 (+212%)
Mutual labels:  etl, data-analysis
Dataframe
C++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types, continuous memory storage, and no pointers are involved
Stars: ✭ 828 (+1556%)
Mutual labels:  data-analysis, dataframe
Morpheus Core
The foundational library of the Morpheus data science framework
Stars: ✭ 203 (+306%)
Mutual labels:  data-analysis, dataframe
Nanny
A tidyverse suite for (pre-) machine-learning: cluster, PCA, permute, impute, rotate, redundancy, triangular, smart-subset, abundant and variable features.
Stars: ✭ 17 (-66%)
Mutual labels:  data-frame, data-analysis
hamilton
A scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+1124%)
Mutual labels:  etl, dataframe
Getting Started
This repository is a getting started guide to Singer.
Stars: ✭ 734 (+1368%)
Mutual labels:  etl, data-analysis
Airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+9738%)
Mutual labels:  etl, data-analysis
bow
Go data analysis / manipulation library built on top of Apache Arrow
Stars: ✭ 20 (-60%)
Mutual labels:  data-frame, dataframe
DataBridge.NET
Configurable data bridge for permanent ETL jobs
Stars: ✭ 16 (-68%)
Mutual labels:  etl
etlflow
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various different tasks, jobs on GCP and AWS.
Stars: ✭ 38 (-24%)
Mutual labels:  etl
Infinite Stories with Data
This repo consists of my analysis of random datasets using various statistical and visualization techniques.
Stars: ✭ 21 (-58%)
Mutual labels:  data-analysis
DataProfiler
What's in your data? Extract schema, statistics and entities from datasets
Stars: ✭ 843 (+1586%)
Mutual labels:  data-analysis
tianchi-diabetes
天池精准医疗大赛——人工智能辅助糖尿病遗传风险预测 第一赛季
Stars: ✭ 20 (-60%)
Mutual labels:  data-analysis
advanced-pandas
Pandas is a powerful tool for data exploration and analysis (including timeseries).
Stars: ✭ 22 (-56%)
Mutual labels:  data-analysis
awesome-dev.to
[UNMAINTAINED] A collection of awesome blog series on DEV.to
Stars: ✭ 18 (-64%)
Mutual labels:  series
Loan-Approval-Prediction
Loan Application Data Analysis
Stars: ✭ 61 (+22%)
Mutual labels:  data-analysis
iMOKA
interactive Multi Objective K-mer Analysis
Stars: ✭ 19 (-62%)
Mutual labels:  data-analysis
woodwork
Woodwork is a Python library that provides robust methods for managing and communicating data typing information.
Stars: ✭ 97 (+94%)
Mutual labels:  dataframe
ttbbeer
An R Dataset Package for US Beer Statistics From TTB 🍺
Stars: ✭ 23 (-54%)
Mutual labels:  data-analysis
mydataharbor
🇨🇳 MyDataHarbor是一个致力于解决任意数据源到任意数据源的分布式、高扩展性、高性能、事务级的数据同步中间件。帮助用户可靠、快速、稳定的对海量数据进行准实时增量同步或者定时全量同步,主要定位是为实时交易系统服务,亦可用于大数据的数据同步(ETL领域)。
Stars: ✭ 28 (-44%)
Mutual labels:  etl
Fraud-Detection-in-Online-Transactions
Detecting Frauds in Online Transactions using Anamoly Detection Techniques Such as Over Sampling and Under-Sampling as the ratio of Frauds is less than 0.00005 thus, simply applying Classification Algorithm may result in Overfitting
Stars: ✭ 41 (-18%)
Mutual labels:  data-analysis
mixedvines
Python package for canonical vine copula trees with mixed continuous and discrete marginals
Stars: ✭ 36 (-28%)
Mutual labels:  data-analysis
IndexedTables.jl
Flexible tables with ordered indices
Stars: ✭ 108 (+116%)
Mutual labels:  data-analysis
go-bqloader
bqloader is a simple ETL framework to load data from Cloud Storage into BigQuery.
Stars: ✭ 16 (-68%)
Mutual labels:  etl
bigquery-kafka-connect
☁️ nodejs kafka connect connector for Google BigQuery
Stars: ✭ 17 (-66%)
Mutual labels:  etl
cognipy
In-memory Graph Database and Knowledge Graph with Natural Language Interface, compatible with Pandas
Stars: ✭ 31 (-38%)
Mutual labels:  dataframe
computational-neuroscience
Short undergraduate course taught at University of Pennsylvania on computational and theoretical neuroscience. Provides an introduction to programming in MATLAB, single-neuron models, ion channel models, basic neural networks, and neural decoding.
Stars: ✭ 36 (-28%)
Mutual labels:  data-analysis
cobrix
A COBOL parser and Mainframe/EBCDIC data source for Apache Spark
Stars: ✭ 109 (+118%)
Mutual labels:  etl
singer-runner
A CLI and library to run Singer Taps and Targets
Stars: ✭ 33 (-34%)
Mutual labels:  etl
ipaddress
Data analysis of IP addresses and networks
Stars: ✭ 20 (-60%)
Mutual labels:  data-analysis
Moose
MOOSE - Platform for software and data analysis.
Stars: ✭ 110 (+120%)
Mutual labels:  data-analysis
ipychart
The power of Chart.js with Python
Stars: ✭ 48 (-4%)
Mutual labels:  data-analysis
arrow-datafusion
Apache Arrow DataFusion SQL Query Engine
Stars: ✭ 2,360 (+4620%)
Mutual labels:  dataframe
versatile-data-kit
Versatile Data Kit (VDK) is an open source framework that enables anybody with basic SQL or Python knowledge to create their own data pipelines.
Stars: ✭ 144 (+188%)
Mutual labels:  etl
akshare
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 5,155 (+10210%)
Mutual labels:  data-analysis
PDAP-Scrapers
Code relating to scraping public police data.
Stars: ✭ 72 (+44%)
Mutual labels:  etl
ospi
Open Source Presence Infographic of Indian Startups
Stars: ✭ 25 (-50%)
Mutual labels:  data-analysis
dsr
Introduction to Data Science with R (2017)
Stars: ✭ 25 (-50%)
Mutual labels:  data-analysis
dask-awkward
Native Dask collection for awkward arrays, and the library to use it.
Stars: ✭ 25 (-50%)
Mutual labels:  data-analysis
PandasVersusExcel
Python数据分析入门,数据分析师入门
Stars: ✭ 120 (+140%)
Mutual labels:  data-analysis
saddle
SADDLE: Scala Data Library
Stars: ✭ 23 (-54%)
Mutual labels:  dataframe
torch-dataframe
Utility class to manipulate dataset from CSV file
Stars: ✭ 67 (+34%)
Mutual labels:  dataframe
1-60 of 745 similar projects