RetrieverQuickly download, clean up, and install public datasets into a database management system
Stars: ✭ 241 (+159.14%)
Data Science Resources👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (+83.87%)
Data Science HacksData Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (+193.55%)
Dbg PdsDeutsche Boerse's Financial Trading Public Data Set
Stars: ✭ 124 (+33.33%)
Climate Change Data🌍 A curated list of APIs, open data and ML/AI projects on climate change
Stars: ✭ 195 (+109.68%)
JschemaA simple, easy to use data modeling framework for JavaScript
Stars: ✭ 261 (+180.65%)
HubDataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+4204.3%)
AkshareAKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 4,334 (+4560.22%)
AirbyteAirbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+5189.25%)
Datasets For GoodList of datasets to apply stats/machine learning/technology to the world of social good.
Stars: ✭ 174 (+87.1%)
Gspread PandasA package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
Stars: ✭ 226 (+143.01%)
Trump LiesTutorial: Web scraping in Python with Beautiful Soup
Stars: ✭ 201 (+116.13%)
Oie ResourcesA curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Stars: ✭ 283 (+204.3%)
DatasetsA repository of pretty cool datasets that I collected for network science and machine learning research.
Stars: ✭ 302 (+224.73%)
Datagear数据可视化分析平台,使用Java语言开发,采用浏览器/服务器架构,支持SQL、CSV、Excel、HTTP接口、JSON等多种数据源
Stars: ✭ 266 (+186.02%)
Agile data code 2Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+344.09%)
Browser Compat DataThis repository contains compatibility data for Web technologies as displayed on MDN
Stars: ✭ 3,710 (+3889.25%)
RioA Swiss-Army Knife for Data I/O
Stars: ✭ 467 (+402.15%)
Voice datasets🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).
Stars: ✭ 494 (+431.18%)
RowsA common, beautiful interface to tabular data, no matter the format
Stars: ✭ 739 (+694.62%)
Awesome StreamlitThe purpose of this project is to share knowledge on how awesome Streamlit is and can be
Stars: ✭ 769 (+726.88%)
Datastream.ioAn open-source framework for real-time anomaly detection using Python, ElasticSearch and Kibana
Stars: ✭ 814 (+775.27%)
Data PolygamyData Polygamy is a topology-based framework that allows users to query for statistically significant relationships between spatio-temporal data sets.
Stars: ✭ 39 (-58.06%)
Sweetie DataThis repo contains logstash of various honeypots
Stars: ✭ 163 (+75.27%)
SkdataPython tools for data analysis
Stars: ✭ 16 (-82.8%)
Qriyou're invited to a data party!
Stars: ✭ 1,003 (+978.49%)
PycmMulti-class confusion matrix library in Python
Stars: ✭ 1,076 (+1056.99%)
Free Ai Resources🚀 FREE AI Resources - 🎓 Courses, 👷 Jobs, 📝 Blogs, 🔬 AI Research, and many more - for everyone!
Stars: ✭ 192 (+106.45%)
DatacompyPandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (+58.06%)
ChordPython package for creating beautiful interactive Chord Diagrams. Pro version available at https://m8.fyi/chord
Stars: ✭ 217 (+133.33%)
Covid19zaCoronavirus COVID-19 (2019-nCoV) Data Repository and Dashboard for South Africa
Stars: ✭ 208 (+123.66%)
GraphiaA visualisation tool for the creation and analysis of graphs
Stars: ✭ 67 (-27.96%)
CartolaExtração de dados da API do CartolaFC, análise exploratória dos dados e modelos preditivos em R e Python - 2014-20. [EN] Data munging, analysis and modeling of CartolaFC - the most popular fantasy football game in Brazil and maybe in the world. Data cover years 2014-19.
Stars: ✭ 304 (+226.88%)
Covid19JSON time-series of coronavirus cases (confirmed, deaths and recovered) per country - updated daily
Stars: ✭ 1,177 (+1165.59%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-15.05%)
DatacleanerThe premier open source Data Quality solution
Stars: ✭ 391 (+320.43%)
Eseur Code DataCode and data used to create the examples in "Evidence-based Software Engineering based on the publicly available data"
Stars: ✭ 340 (+265.59%)
Machine Learning RoadmapA roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.
Stars: ✭ 5,277 (+5574.19%)
Blockchain2graphBlockchain2graph extracts blockchain data (bitcoin) and insert them into a graph database (neo4j).
Stars: ✭ 134 (+44.09%)
DatasheetsRead data from, write data to, and modify the formatting of Google Sheets
Stars: ✭ 593 (+537.63%)
PdpipeEasy pipelines for pandas DataFrames.
Stars: ✭ 590 (+534.41%)
Osint collectionMaintained collection of OSINT related resources. (All Free & Actionable)
Stars: ✭ 809 (+769.89%)
DataconfsA list of conferences connected with data worldwide.
Stars: ✭ 36 (-61.29%)
Php MlPHP-ML - Machine Learning library for PHP
Stars: ✭ 7,900 (+8394.62%)
Disk.frameFast Disk-Based Parallelized Data Manipulation Framework for Larger-than-RAM Data
Stars: ✭ 517 (+455.91%)
ColourColour Science for Python
Stars: ✭ 1,131 (+1116.13%)
LegislatorInterface to the Comparative Legislators Database
Stars: ✭ 62 (-33.33%)
MagicboxA platform that uses real-time data to inform life-saving humanitarian responses to emergency situations
Stars: ✭ 73 (-21.51%)
DatacomparerdataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.
Stars: ✭ 58 (-37.63%)
Just Dashboard📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (+1524.73%)
Openml RR package to interface with OpenML
Stars: ✭ 81 (-12.9%)
Knowledge RepoA next-generation curated knowledge sharing platform for data scientists and other technical professions.
Stars: ✭ 4,956 (+5229.03%)
OpenrefineOpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (+9073.12%)
Gopup数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字稿,影视票房数据,高校名单,疫情数据…
Stars: ✭ 1,229 (+1221.51%)
FlyteAccelerate your ML and Data workflows to production. Flyte is a production grade orchestration system for your Data and ML workloads. It has been battle tested at Lyft, Spotify, freenome and others and truly open-source.
Stars: ✭ 1,242 (+1235.48%)