AirbyteAirbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+1158.06%)
MetabaseThe simplest, fastest way to get business intelligence and analytics to everyone in your company 😋
Stars: ✭ 26,803 (+6754.99%)
SkdataPython tools for data analysis
Stars: ✭ 16 (-95.91%)
Knowledge RepoA next-generation curated knowledge sharing platform for data scientists and other technical professions.
Stars: ✭ 4,956 (+1167.52%)
Gopup数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字稿,影视票房数据,高校名单,疫情数据…
Stars: ✭ 1,229 (+214.32%)
FlyteAccelerate your ML and Data workflows to production. Flyte is a production grade orchestration system for your Data and ML workloads. It has been battle tested at Lyft, Spotify, freenome and others and truly open-source.
Stars: ✭ 1,242 (+217.65%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-79.8%)
ElasticR client for the Elasticsearch HTTP API
Stars: ✭ 227 (-41.94%)
Reddit DetectivePlay detective on Reddit: Discover political disinformation campaigns, secret influencers and more
Stars: ✭ 129 (-67.01%)
PycmMulti-class confusion matrix library in Python
Stars: ✭ 1,076 (+175.19%)
DatacomparerdataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.
Stars: ✭ 58 (-85.17%)
OpenrefineOpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (+2081.84%)
Etl with pythonETL with Python - Taught at DWH course 2017 (TAU)
Stars: ✭ 68 (-82.61%)
Tennis Crystal BallUltimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-72.63%)
AkshareAKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 4,334 (+1008.44%)
Locopylocopy: Loading/Unloading to Redshift and Snowflake using Python.
Stars: ✭ 73 (-81.33%)
Data Science HacksData Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (-30.18%)
GraphiaA visualisation tool for the creation and analysis of graphs
Stars: ✭ 67 (-82.86%)
Awesome BigdataA curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 10,478 (+2579.8%)
Data Science Resources👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (-56.27%)
PydbgenRandom dataframe and database table generator
Stars: ✭ 191 (-51.15%)
Data Science Live BookAn open source book to learn data science, data analysis and machine learning, suitable for all ages!
Stars: ✭ 193 (-50.64%)
TadA desktop application for viewing and analyzing tabular data
Stars: ✭ 2,275 (+481.84%)
Amazing Feature EngineeringFeature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (-44.25%)
ZebrasData analysis library for JavaScript built with Ramda
Stars: ✭ 192 (-50.9%)
KlibEasy to use Python library of customized functions for cleaning and analyzing data.
Stars: ✭ 192 (-50.9%)
Free Ai Resources🚀 FREE AI Resources - 🎓 Courses, 👷 Jobs, 📝 Blogs, 🔬 AI Research, and many more - for everyone!
Stars: ✭ 192 (-50.9%)
ChordPython package for creating beautiful interactive Chord Diagrams. Pro version available at https://m8.fyi/chord
Stars: ✭ 217 (-44.5%)
CqlCategorical Query Language IDE
Stars: ✭ 196 (-49.87%)
GradioCreate UIs for your machine learning model in Python in 3 minutes
Stars: ✭ 4,358 (+1014.58%)
Climate Change Data🌍 A curated list of APIs, open data and ML/AI projects on climate change
Stars: ✭ 195 (-50.13%)
Machine Learning ResourcesA curated list of awesome machine learning frameworks, libraries, courses, books and many more.
Stars: ✭ 226 (-42.2%)
Gspread PandasA package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
Stars: ✭ 226 (-42.2%)
DatascienceCurated list of Python resources for data science.
Stars: ✭ 3,051 (+680.31%)
DeepgraphAnalyze Data with Pandas-based Networks. Documentation:
Stars: ✭ 232 (-40.66%)
ArqueroQuery processing and transformation of array-backed data tables.
Stars: ✭ 384 (-1.79%)
Igela delightful machine learning tool that allows you to train, test, and use models without writing code
Stars: ✭ 2,956 (+656.01%)
TablesawJava dataframe and visualization library
Stars: ✭ 2,785 (+612.28%)
RetrieverQuickly download, clean up, and install public datasets into a database management system
Stars: ✭ 241 (-38.36%)
CjworkbenchThe data journalism platform with built in training
Stars: ✭ 244 (-37.6%)
dflibIn-memory Java DataFrame library
Stars: ✭ 50 (-87.21%)
validadaAnother library for defensive data analysis.
Stars: ✭ 29 (-92.58%)
PrettypandasA Pandas Styler class for making beautiful tables
Stars: ✭ 376 (-3.84%)
DtaleVisualizer for pandas data structures
Stars: ✭ 2,864 (+632.48%)
StreamlitStreamlit — The fastest way to build data apps in Python
Stars: ✭ 16,906 (+4223.79%)
fairlensIdentify bias and measure fairness of your data
Stars: ✭ 51 (-86.96%)
UrsUniversal Reddit Scraper - A comprehensive Reddit scraping command-line tool written in Python.
Stars: ✭ 275 (-29.67%)
SealionThe first machine learning framework that encourages learning ML concepts instead of memorizing class functions.
Stars: ✭ 278 (-28.9%)
XlearnHigh performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.
Stars: ✭ 2,968 (+659.08%)
SamplesSample projects using Material, Graph, and Algorithm.
Stars: ✭ 386 (-1.28%)
DagsterAn orchestration platform for the development, production, and observation of data assets.
Stars: ✭ 4,099 (+948.34%)
PreqlAn interpreted relational query language that compiles to SQL.
Stars: ✭ 257 (-34.27%)