PypikaPyPika is a python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.
Stars: ✭ 1,111 (+195.48%)
DatafusionDataFusion has now been donated to the Apache Arrow project
Stars: ✭ 611 (+62.5%)
ModinModin: Speed up your Pandas workflows by changing a single line of code
Stars: ✭ 6,639 (+1665.69%)
Morphism⚡ Type-safe data transformer for JavaScript, TypeScript & Node.js.
Stars: ✭ 336 (-10.64%)
Climate Change Data🌍 A curated list of APIs, open data and ML/AI projects on climate change
Stars: ✭ 195 (-48.14%)
AthenaxSQL-based streaming analytics platform at scale
Stars: ✭ 1,178 (+213.3%)
DatasheetsRead data from, write data to, and modify the formatting of Google Sheets
Stars: ✭ 593 (+57.71%)
PdpipeEasy pipelines for pandas DataFrames.
Stars: ✭ 590 (+56.91%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-60.11%)
Morpheus CoreThe foundational library of the Morpheus data science framework
Stars: ✭ 203 (-46.01%)
PycmMulti-class confusion matrix library in Python
Stars: ✭ 1,076 (+186.17%)
PyfunctionalPython library for creating data pipelines with chain functional programming
Stars: ✭ 1,943 (+416.76%)
Anaconda ProjectTool for encapsulating, running, and reproducing data science projects
Stars: ✭ 153 (-59.31%)
DatabookA facebook for data
Stars: ✭ 26 (-93.09%)
Kranglkrangl is a {K}otlin DSL for data w{rangl}ing
Stars: ✭ 430 (+14.36%)
BigbashA converter that generates a bash one-liner from an SQL Select query (no DB necessary)
Stars: ✭ 230 (-38.83%)
PyjanitorClean APIs for data cleaning. Python implementation of R package Janitor
Stars: ✭ 647 (+72.07%)
DeveeldbDeveelDB is a complete SQL database system, primarly developed for .NET/Mono frameworks
Stars: ✭ 80 (-78.72%)
Umbrella"A collection of functional programming libraries that can be composed together.
Unlike a framework, thi.ng is a suite of instruments and you (the user) must be
the composer of. Geared towards versatility, not any specific type of music."
— @loganpowell via Twitter
Stars: ✭ 2,186 (+481.38%)
hawkweedYet another implementation of missing functions for Python
Stars: ✭ 20 (-94.68%)
bowGo data analysis / manipulation library built on top of Apache Arrow
Stars: ✭ 20 (-94.68%)
pedsType safe persistent/immutable data structures for Go
Stars: ✭ 57 (-84.84%)
Typed ImmutableImmutable and structurally typed data
Stars: ✭ 263 (-30.05%)
Data science blogsA repository to keep track of all the code that I end up writing for my blog posts.
Stars: ✭ 139 (-63.03%)
Blockchain2graphBlockchain2graph extracts blockchain data (bitcoin) and insert them into a graph database (neo4j).
Stars: ✭ 134 (-64.36%)
Data Science Resources👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (-54.52%)
BitsA bite sized library for dealing with bytes.
Stars: ✭ 16 (-95.74%)
Datagear数据可视化分析平台,使用Java语言开发,采用浏览器/服务器架构,支持SQL、CSV、Excel、HTTP接口、JSON等多种数据源
Stars: ✭ 266 (-29.26%)
Locopylocopy: Loading/Unloading to Redshift and Snowflake using Python.
Stars: ✭ 73 (-80.59%)
Dataframes.jlIn-memory tabular data in Julia
Stars: ✭ 951 (+152.93%)
SplitgraphSplitgraph command line client and python library
Stars: ✭ 209 (-44.41%)
ArqueroQuery processing and transformation of array-backed data tables.
Stars: ✭ 384 (+2.13%)
CubesLight-weight Python OLAP framework for multi-dimensional data analysis
Stars: ✭ 1,393 (+270.48%)
Android NosqlLightweight, simple structured NoSQL database for Android
Stars: ✭ 284 (-24.47%)
Tech.ml.datasetA Clojure high performance data processing system
Stars: ✭ 205 (-45.48%)
vec-la-fp↗️ A tiny (functional) 2d linear algebra library
Stars: ✭ 21 (-94.41%)
PyrsistentPersistent/Immutable/Functional data structures for Python
Stars: ✭ 1,621 (+331.12%)
LensA utility for working with nested data structures.
Stars: ✭ 104 (-72.34%)
NannyA tidyverse suite for (pre-) machine-learning: cluster, PCA, permute, impute, rotate, redundancy, triangular, smart-subset, abundant and variable features.
Stars: ✭ 17 (-95.48%)
Datacurator Filetreea standard filetree for /r/datacurator [ and r/datahoarder ]
Stars: ✭ 753 (+100.27%)
daanyDaany - .NET DAta ANalYtics .NET library with the implementation of DataFrame, Time series decompositions and Linear Algebra routines BLASS and LAPACK.
Stars: ✭ 49 (-86.97%)
PeroxideRust numeric library with R, MATLAB & Python syntax
Stars: ✭ 191 (-49.2%)
MimesisMimesis is a high-performance fake data generator for Python, which provides data for a variety of purposes in a variety of languages.
Stars: ✭ 3,439 (+814.63%)
StyleframeA library that wraps pandas and openpyxl and allows easy styling of dataframes in excel
Stars: ✭ 252 (-32.98%)
DataFrameDataFrame Library for Java
Stars: ✭ 51 (-86.44%)
dflibIn-memory Java DataFrame library
Stars: ✭ 50 (-86.7%)
Data Structures AlgorithmsMy implementation of 85+ popular data structures and algorithms and interview questions in Python 3 and C++
Stars: ✭ 273 (-27.39%)
TablesawJava dataframe and visualization library
Stars: ✭ 2,785 (+640.69%)
saddleSADDLE: Scala Data Library
Stars: ✭ 23 (-93.88%)
QframeImmutable data frame for Go
Stars: ✭ 282 (-25%)
KeypathkitKeyPathKit is a library that provides the standard functions to manipulate data along with a call-syntax that relies on typed keypaths to make the call sites as short and clean as possible.
Stars: ✭ 376 (+0%)
Bigquery UtilsUseful scripts, udfs, views, and other utilities for migration and data warehouse operations in BigQuery.
Stars: ✭ 338 (-10.11%)
SylphStream computing platform for bigdata
Stars: ✭ 362 (-3.72%)
AnonA UNIX Command To Anonymise Data
Stars: ✭ 341 (-9.31%)