Pyjanitor
Clean APIs for data cleaning. Python implementation of the R package janitor.
Stars: ✭ 647 (-33.3%)
hamilton
A scalable, general-purpose micro-framework for defining dataflows. You can use it to create dataframes, NumPy matrices, Python objects, ML models, etc.
Stars: ✭ 612 (-36.91%)
Koalas
Koalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+213.81%)
Jardin
A pandas.DataFrame-based ORM.
Stars: ✭ 81 (-91.65%)
saddle
SADDLE: Scala Data Library
Stars: ✭ 23 (-97.63%)
Dataframe
C++ DataFrame for statistical, financial, and ML analysis, written in modern C++ using native types and contiguous memory storage, with no pointers involved.
Stars: ✭ 828 (-14.64%)
Dominando-Pandas
This repository is intended for learning the Pandas library.
Stars: ✭ 22 (-97.73%)
Geni
A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-84.33%)
raccoon
Python DataFrame with fast inserts and appends
Stars: ✭ 64 (-93.4%)
Pandasvault
Advanced Pandas Vault: Utilities, Functions and Snippets (by @firmai).
Stars: ✭ 316 (-67.42%)
Foxcross
AsyncIO serving for data science models
Stars: ✭ 18 (-98.14%)
Pandastable
Table analysis in Tkinter using pandas DataFrames.
Stars: ✭ 376 (-61.24%)
Aws Data Wrangler
Pandas on AWS: easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and Excel).
Stars: ✭ 2,385 (+145.88%)
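Sequoia
An automatic stock-picking program for A-shares that implements the Turtle Trading rules, the bull-market buy points of Chan theory (缠中说禅), and several other technical patterns.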
Stars: ✭ 564 (-41.86%)
Dataframe Go
DataFrames for Go: for statistics, machine learning, and data manipulation/exploration
Stars: ✭ 487 (-49.79%)
Mars
Mars is a tensor-based unified framework for large-scale data computation which scales NumPy, pandas, scikit-learn and Python functions.
Stars: ✭ 2,308 (+137.94%)
tableau-scraping
Tableau scraper Python library. R and Python scripts to scrape data from Tableau viz.
Stars: ✭ 91 (-90.62%)
D6t Python
Accelerate data science
Stars: ✭ 118 (-87.84%)
Pandahouse
Pandas interface for the ClickHouse database
Stars: ✭ 126 (-87.01%)
Datasheets
Read data from, write data to, and modify the formatting of Google Sheets
Stars: ✭ 593 (-38.87%)
Modin
Modin: Speed up your Pandas workflows by changing a single line of code
Stars: ✭ 6,639 (+584.43%)
Danfojs
danfo.js is an open-source JavaScript library providing high-performance, intuitive, and easy-to-use data structures for manipulating and processing structured data.
Stars: ✭ 1,304 (+34.43%)
Boltzmannclean
Fill missing values in Pandas DataFrames using Restricted Boltzmann Machines
Stars: ✭ 23 (-97.63%)
Gspread Pandas
A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
Stars: ✭ 226 (-76.7%)
Eland
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (-75.77%)
Styleframe
A library that wraps pandas and openpyxl and allows easy styling of dataframes in Excel
Stars: ✭ 252 (-74.02%)
Pandasgui
PandasGUI is a GUI for viewing, plotting and analyzing Pandas DataFrames.
Stars: ✭ 2,495 (+157.22%)
cognipy
In-memory Graph Database and Knowledge Graph with Natural Language Interface, compatible with Pandas
Stars: ✭ 31 (-96.8%)
Pandas Ta
Technical Analysis Indicators: Pandas TA is an easy-to-use Python 3 Pandas extension with 130+ indicators
Stars: ✭ 962 (-0.82%)
Pdpipe
Easy pipelines for pandas DataFrames.
Stars: ✭ 590 (-39.18%)
Pystore
Fast data store for Pandas time-series data
Stars: ✭ 325 (-66.49%)
Panthera
Data-frames & arrays on Clojure
Stars: ✭ 168 (-82.68%)
Dask
Parallel computing with task scheduling
Stars: ✭ 9,309 (+859.69%)
Pandas Datareader
Extract data from a wide range of Internet sources into a pandas DataFrame.
Stars: ✭ 2,183 (+125.05%)
anesthetic
Nested Sampling post-processing and plotting
Stars: ✭ 34 (-96.49%)
PracticalMachineLearning
A collection of ML-related material, including notebooks, code, and a curated list of useful resources such as books and software. Almost everything mentioned here is free (as in speech, not as in free food) or open source.
Stars: ✭ 60 (-93.81%)
fer
Facial Expression Recognition
Stars: ✭ 32 (-96.7%)
preprocessy
Python package for Customizable Data Preprocessing Pipelines
Stars: ✭ 34 (-96.49%)
Engezny
Engezny is a Python package that quickly generates all possible charts from your dataframe and saves them for you. It currently supports only uni-parameter visualization, using pie, bar, and barh charts.
Stars: ✭ 25 (-97.42%)
blockchain-etl-streaming
Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (-94.12%)
mune
Simple stock price analytics
Stars: ✭ 14 (-98.56%)
Chatistics
A WhatsApp chat analyzer and statistics tool.
Stars: ✭ 32 (-96.7%)
heidi
heidi: tidy data in Haskell
Stars: ✭ 24 (-97.53%)
tempo
API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc.), AS OF joins, downsampling, and interpolation
Stars: ✭ 212 (-78.14%)
spark-vcf
Spark VCF data source implementation for Dataframes
Stars: ✭ 15 (-98.45%)
skutil
NOTE: skutil is now deprecated. See its sister project: https://github.com/tgsmith61591/skoot. Original description: A set of scikit-learn and h2o extension classes (as well as caret classes for Python). See more here: https://tgsmith61591.github.io/skutil
Stars: ✭ 29 (-97.01%)
dataframe
Structured data processing in Kotlin
Stars: ✭ 319 (-67.11%)
espandas
Reading and writing pandas DataFrames in Elasticsearch
Stars: ✭ 24 (-97.53%)
bow
Go data analysis / manipulation library built on top of Apache Arrow
Stars: ✭ 20 (-97.94%)
covid-19
Data ETL & Analysis on the global and Mexican datasets of the COVID-19 pandemic.
Stars: ✭ 14 (-98.56%)
polygon-etl
ETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (-94.54%)
tales-science-data
Companion repo to the GitBook, with notes on Data Science topics
Stars: ✭ 41 (-95.77%)
dstoolbox
Tools that make working with scikit-learn and pandas easier.
Stars: ✭ 43 (-95.57%)
datart
Datart is a next-generation Data Visualization Open Platform
Stars: ✭ 1,042 (+7.42%)