Awesome Bigdata: A curated list of awesome big data frameworks, resources and other awesomeness.
Stars: ✭ 10,478 (+4247.72%)
Dataframes.jl: In-memory tabular data in Julia
Stars: ✭ 951 (+294.61%)
Pyspark Cheatsheet: 🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-55.19%)
Sweetie Data: This repo contains logstash of various honeypots
Stars: ✭ 163 (-32.37%)
Climate Change Data: 🌍 A curated list of APIs, open data and ML/AI projects on climate change
Stars: ✭ 195 (-19.09%)
dbcollection: A collection of popular datasets for deep learning.
Stars: ✭ 26 (-89.21%)
Datasets: 🎁 3,000,000+ Unsplash images made available for research and machine learning
Stars: ✭ 1,805 (+648.96%)
Datacompy: Pandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (-39%)
Pandas Datareader: Extract data from a wide range of Internet sources into a pandas DataFrame.
Stars: ✭ 2,183 (+805.81%)
Meglass: An eyeglass face dataset collected and cleaned for face recognition evaluation, CCBR 2018.
Stars: ✭ 281 (+16.6%)
Gspread Pandas: A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
Stars: ✭ 226 (-6.22%)
Cartola: Data extraction from the CartolaFC API, exploratory data analysis, and predictive models in R and Python, 2014-20. Data munging, analysis and modeling of CartolaFC, the most popular fantasy football game in Brazil and maybe in the world. Data cover years 2014-19.
Stars: ✭ 304 (+26.14%)
Agile data code 2: Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+71.37%)
Datacleaner: The premier open source Data Quality solution
Stars: ✭ 391 (+62.24%)
Machine Learning Roadmap: A roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.
Stars: ✭ 5,277 (+2089.63%)
Datasets: A repository of pretty cool datasets that I collected for network science and machine learning research.
Stars: ✭ 302 (+25.31%)
Knowledge Repo: A next-generation curated knowledge sharing platform for data scientists and other technical professions.
Stars: ✭ 4,956 (+1956.43%)
Airbyte: Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+1941.08%)
Datasets For Good: A list of datasets for applying stats, machine learning, and technology to the world of social good.
Stars: ✭ 174 (-27.8%)
Pdpipe: Easy pipelines for pandas DataFrames.
Stars: ✭ 590 (+144.81%)
Awesome Streamlit: The purpose of this project is to share knowledge on how awesome Streamlit is and can be
Stars: ✭ 769 (+219.09%)
Rows: A common, beautiful interface to tabular data, no matter the format
Stars: ✭ 739 (+206.64%)
Dataconfs: A list of conferences connected with data worldwide.
Stars: ✭ 36 (-85.06%)
Free Ai Resources: 🚀 FREE AI Resources - 🎓 Courses, 👷 Jobs, 📝 Blogs, 🔬 AI Research, and many more - for everyone!
Stars: ✭ 192 (-20.33%)
Qri: You're invited to a data party!
Stars: ✭ 1,003 (+316.18%)
Data Polygamy: Data Polygamy is a topology-based framework that allows users to query for statistically significant relationships between spatio-temporal data sets.
Stars: ✭ 39 (-83.82%)
Openrefine: OpenRefine is a free, open source power tool for working with messy data and improving it.
Stars: ✭ 8,531 (+3439.83%)
Pydataset: Instant access to many datasets in Python.
Stars: ✭ 880 (+265.15%)
Aesthetics: Image Aesthetics Toolkit. Includes a Fisher Vector implementation, the AVA (Image Aesthetic Visual Analysis) dataset, and a fast multi-threaded downloader.
Stars: ✭ 113 (-53.11%)
Covid 19 Uk Data: Coronavirus (COVID-19) UK Historical Data
Stars: ✭ 169 (-29.88%)
Chord: Python package for creating beautiful interactive Chord Diagrams. Pro version available at https://m8.fyi/chord
Stars: ✭ 217 (-9.96%)
Datatable: A Go in-memory table
Stars: ✭ 215 (-10.79%)
Dataset Serialize: JSON to DataSet and DataSet to JSON converter for Delphi and Lazarus (FPC)
Stars: ✭ 213 (-11.62%)
Dialogrpt: EMNLP 2020, "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"
Stars: ✭ 216 (-10.37%)
Datalad: Keep code, data, containers under control with git and git-annex
Stars: ✭ 234 (-2.9%)
Short Jokes Dataset: Python scripts for building the 'Short Jokes' dataset, featured on Kaggle
Stars: ✭ 215 (-10.79%)
Ava downloader: ⏬ Download the AVA dataset (A Large-Scale Database for Aesthetic Visual Analysis)
Stars: ✭ 214 (-11.2%)
Stocknet Dataset: A comprehensive dataset for stock movement prediction from tweets and historical stock prices.
Stars: ✭ 228 (-5.39%)
Omnianomaly: KDD 2019, Robust Anomaly Detection for Multivariate Time Series through Stochastic Recurrent Neural Network
Stars: ✭ 208 (-13.69%)
Weld: High-performance runtime for data analytics applications
Stars: ✭ 2,709 (+1024.07%)
Alphatools: Quantitative finance research tools in Python
Stars: ✭ 226 (-6.22%)
Reddit Hyped Stocks: A web application to explore currently hyped stocks on Reddit
Stars: ✭ 173 (-28.22%)
Taxize: A taxonomic toolbelt for R
Stars: ✭ 209 (-13.28%)
Data: Data and code behind the articles and graphics at FiveThirtyEight
Stars: ✭ 15,241 (+6224.07%)
Sc17: SuperComputing 2017 Deep Learning Tutorial
Stars: ✭ 211 (-12.45%)
Datascience: Curated list of Python resources for data science.
Stars: ✭ 3,051 (+1165.98%)
Dash: Analytical Web Apps for Python, R, Julia, and Jupyter. No JavaScript Required.
Stars: ✭ 15,592 (+6369.71%)
Cartoframes: CARTO Python package for data scientists
Stars: ✭ 208 (-13.69%)
Streamlit: The fastest way to build data apps in Python
Stars: ✭ 16,906 (+6914.94%)