pyjanitorClean APIs for data cleaning. Python implementation of R package Janitor
Stars: ✭ 970 (+669.84%)
MarsMars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
Stars: ✭ 2,308 (+1731.75%)
PantheraData-frames & arrays on Clojure
Stars: ✭ 168 (+33.33%)
KoalasKoalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+2315.87%)
PystoreFast data store for Pandas time-series data
Stars: ✭ 325 (+157.94%)
ElandPython Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (+86.51%)
PdpipeEasy pipelines for pandas DataFrames.
Stars: ✭ 590 (+368.25%)
cognipyIn-memory Graph Database and Knowledge Graph with Natural Language Interface, compatible with Pandas
Stars: ✭ 31 (-75.4%)
PandasvaultAdvanced Pandas Vault — Utilities, Functions and Snippets (by @firmai).
Stars: ✭ 316 (+150.79%)
StyleframeA library that wraps pandas and openpyxl and allows easy styling of dataframes in excel
Stars: ✭ 252 (+100%)
DataframeC++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types, continuous memory storage, and no pointers are involved
Stars: ✭ 828 (+557.14%)
Danfojsdanfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.
Stars: ✭ 1,304 (+934.92%)
SequoiaA股自动选股程序,实现了海龟交易法则、缠中说禅牛市买点,以及其他若干种技术形态
Stars: ✭ 564 (+347.62%)
JardinA pandas.DataFrame-based ORM.
Stars: ✭ 81 (-35.71%)
raccoonPython DataFrame with fast insert and appends
Stars: ✭ 64 (-49.21%)
tableau-scrapingTableau scraper python library. R and Python scripts to scrape data from Tableau viz
Stars: ✭ 91 (-27.78%)
Dataframe GoDataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
Stars: ✭ 487 (+286.51%)
ModinModin: Speed up your Pandas workflows by changing a single line of code
Stars: ✭ 6,639 (+5169.05%)
hamiltonA scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+385.71%)
saddleSADDLE: Scala Data Library
Stars: ✭ 23 (-81.75%)
Dominando-PandasEste repositório está destinado ao processo de aprendizagem da biblioteca Pandas.
Stars: ✭ 22 (-82.54%)
vulknLove your Data. Love the Environment. Love VULKИ.
Stars: ✭ 43 (-65.87%)
DatasheetsRead data from, write data to, and modify the formatting of Google Sheets
Stars: ✭ 593 (+370.63%)
PyjanitorClean APIs for data cleaning. Python implementation of R package Janitor
Stars: ✭ 647 (+413.49%)
onelinerhub2.5k code solutions with clear explanation @ onelinerhub.com
Stars: ✭ 645 (+411.9%)
BoltzmanncleanFill missing values in Pandas DataFrames using Restricted Boltzmann Machines
Stars: ✭ 23 (-81.75%)
PandasguiPandasGUI is a GUI for viewing, plotting and analyzing Pandas DataFrames.
Stars: ✭ 2,495 (+1880.16%)
PandastableTable analysis in Tkinter using pandas DataFrames.
Stars: ✭ 376 (+198.41%)
FoxcrossAsyncIO serving for data science models
Stars: ✭ 18 (-85.71%)
Pandas TaTechnical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 130+ Indicators
Stars: ✭ 962 (+663.49%)
Sigmoidal aiTutoriais de Python, Data Science, Machine Learning e Deep Learning - Sigmoidal
Stars: ✭ 103 (-18.25%)
Apijson🚀 零代码、热更新、全自动 ORM 库,后端接口和文档零代码,前端(客户端) 定制返回 JSON 的数据和结构。 🚀 A JSON Transmission Protocol and an ORM Library for automatically providing APIs and Docs.
Stars: ✭ 12,559 (+9867.46%)
PygraphistryPyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
Stars: ✭ 1,365 (+983.33%)
Maps Location HistoryGet, Concatenate and Process you location history from Google Maps TimeLine
Stars: ✭ 99 (-21.43%)
UrbsA linear optimisation model for distributed energy systems
Stars: ✭ 123 (-2.38%)
DataxDataX is an open source universal ETL tool that support Cassandra, ClickHouse, DBF, Hive, InfluxDB, Kudu, MySQL, Oracle, Presto(Trino), PostgreSQL, SQL Server
Stars: ✭ 116 (-7.94%)
Tabula PySimple wrapper of tabula-java: extract table from PDF into pandas DataFrame
Stars: ✭ 1,351 (+972.22%)
Dat8General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+1103.17%)
KglabGraph-Based Data Science: an abstraction layer in Python for building knowledge graphs, integrated with popular graph libraries – atop Pandas, RDFlib, pySHACL, RAPIDS, NetworkX, iGraph, PyVis, pslpython, pyarrow, etc.
Stars: ✭ 98 (-22.22%)
SspipeSimple Smart Pipe: python productivity-tool for rapid data manipulation
Stars: ✭ 96 (-23.81%)
SwifterA package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
Stars: ✭ 1,844 (+1363.49%)
Sqliorm sql interface, Criteria, CriteriaBuilder, ResultMapBuilder
Stars: ✭ 1,644 (+1204.76%)
Df2gspreadManage Google Spreadsheets in Pandas DataFrame with Python
Stars: ✭ 114 (-9.52%)
Seaborn TutorialThis repository is my attempt to help Data Science aspirants gain necessary Data Visualization skills required to progress in their career. It includes all the types of plot offered by Seaborn, applied on random datasets.
Stars: ✭ 114 (-9.52%)
TricksterOpen Source HTTP Reverse Proxy Cache and Time Series Dashboard Accelerator
Stars: ✭ 1,306 (+936.51%)
Pymc Example ProjectExample PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning.
Stars: ✭ 90 (-28.57%)
PbpythonCode, Notebooks and Examples from Practical Business Python
Stars: ✭ 1,724 (+1268.25%)
Baidu poi search一个基于pyqt5的百度地图兴趣点GUI采集工具,可根据关键词搜索指定区域的兴趣点,并导出为excel文件
Stars: ✭ 113 (-10.32%)
MoonshotVectorized backtester and trading engine for QuantRocket
Stars: ✭ 88 (-30.16%)
SweetvizVisualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (+1369.05%)
Flow PipelineA set of tools and examples to run a flow-pipeline (sFlow, NetFlow)
Stars: ✭ 86 (-31.75%)
Jupyter DatatablesJupyter Notebook extension leveraging pandas DataFrames by integrating DataTables and ChartJS.
Stars: ✭ 127 (+0.79%)
Aws Data WranglerPandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+1792.86%)
Pandas VideosJupyter notebook and datasets from the pandas Q&A video series
Stars: ✭ 1,716 (+1261.9%)
PandarallelA simple and efficient tool to parallelize Pandas operations on all available CPUs
Stars: ✭ 1,887 (+1397.62%)