Knowledge RepoA next-generation curated knowledge sharing platform for data scientists and other technical professions.
Stars: ✭ 4,956 (+22427.27%)
Bogus📇 A simple and sane fake data generator for C#, F#, and VB.NET. Based on and ported from the famed faker.js.
Stars: ✭ 5,083 (+23004.55%)
Octo CliCLI tool to expose data from any database as a serverless web service.
Stars: ✭ 653 (+2868.18%)
FootballdataA hodgepodge of JSON and CSV Football/Soccer data
Stars: ✭ 526 (+2290.91%)
TabulatorInteractive Tables and Data Grids for JavaScript
Stars: ✭ 4,329 (+19577.27%)
Core2dA multi-platform data driven 2D diagram editor.
Stars: ✭ 475 (+2059.09%)
Sensei GridSimple and lightweight data grid in JS/HTML
Stars: ✭ 808 (+3572.73%)
Isp Data PollutionISP Data Pollution to Protect Private Browsing History with Obfuscation
Stars: ✭ 425 (+1831.82%)
DatafusionDataFusion has now been donated to the Apache Arrow project
Stars: ✭ 611 (+2677.27%)
Machine Learning MindmapA mindmap summarising Machine Learning concepts, from Data Analysis to Deep Learning.
Stars: ✭ 5,339 (+24168.18%)
Bad Data GuideAn exhaustive reference to problems seen in real-world data along with suggestions on how to resolve them.
Stars: ✭ 3,862 (+17454.55%)
Sheetjs📗 SheetJS Community Edition -- Spreadsheet Data Toolkit
Stars: ✭ 28,479 (+129350%)
Disk.frameFast Disk-Based Parallelized Data Manipulation Framework for Larger-than-RAM Data
Stars: ✭ 517 (+2250%)
Awesome Ai Ml DlAwesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.
Stars: ✭ 831 (+3677.27%)
PybaseballPull current and historical baseball statistics using Python (Statcast, Baseball Reference, FanGraphs)
Stars: ✭ 484 (+2100%)
McwMicrosoft Cloud Workshop Project
Stars: ✭ 677 (+2977.27%)
RioA Swiss-Army Knife for Data I/O
Stars: ✭ 467 (+2022.73%)
Z1pZip Codes Validation and Parse.
Stars: ✭ 17 (-22.73%)
TensorbaseTensorBase BE is building a high performance, cloud neutral bigdata warehouse for SMEs fully in Rust.
Stars: ✭ 440 (+1900%)
Fsharp.dataF# Data: Library for Data Access
Stars: ✭ 631 (+2768.18%)
DataThis repository contains general data for Web technologies
Stars: ✭ 418 (+1800%)
Awesome StreamlitThe purpose of this project is to share knowledge on how awesome Streamlit is and can be
Stars: ✭ 769 (+3395.45%)
Finance Go📊 Financial markets data library implemented in go.
Stars: ✭ 392 (+1681.82%)
DatasheetsRead data from, write data to, and modify the formatting of Google Sheets
Stars: ✭ 593 (+2595.45%)
PaniniA super simple flat file generator.
Stars: ✭ 562 (+2454.55%)
SamplesSample projects using Material, Graph, and Algorithm.
Stars: ✭ 386 (+1654.55%)
TerriajsA library for building rich, web-based geospatial data platforms.
Stars: ✭ 699 (+3077.27%)
WebReact web interface for the OpenDota platform
Stars: ✭ 889 (+3940.91%)
Countly ServerCountly helps you get insights from your application. Available self-hosted or on private cloud.
Stars: ✭ 4,857 (+21977.27%)
MetabaseThe simplest, fastest way to get business intelligence and analytics to everyone in your company 😋
Stars: ✭ 26,803 (+121731.82%)
Sklearn ClassificationData Science Notebook on a Classification Task, using sklearn and Tensorflow.
Stars: ✭ 518 (+2254.55%)
Flight Prices ScraperAutomated Script to scrape flight prices from any website into a csv format
Stars: ✭ 17 (-22.73%)
Voice datasets🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).
Stars: ✭ 494 (+2145.45%)
SnowplowThe enterprise-grade behavioral data engine (web, mobile, server-side, webhooks), running cloud-natively on AWS and GCP
Stars: ✭ 5,935 (+26877.27%)
Machine Learning RoadmapA roadmap connecting many of the most important concepts in machine learning, how to learn them and what tools to use to perform them.
Stars: ✭ 5,277 (+23886.36%)
AtscanAdvanced dork Search & Mass Exploit Scanner
Stars: ✭ 817 (+3613.64%)
Data Engineering BookAccumulated knowledge and experience in the field of Data Engineering
Stars: ✭ 471 (+2040.91%)
FakerFaker is a pure Elixir library for generating fake data.
Stars: ✭ 673 (+2959.09%)
Udacity Data Engineering ProjectsFew projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Stars: ✭ 458 (+1981.82%)
Riceteacatpandarepo with challenge material for riceteacatpanda (2020)
Stars: ✭ 18 (-18.18%)
FetchSimple & Efficient data access for Scala and Scala.js
Stars: ✭ 453 (+1959.09%)
PyjanitorClean APIs for data cleaning. Python implementation of R package Janitor
Stars: ✭ 647 (+2840.91%)
Brasil.ioBackend do Brasil.IO (para código dos scripts de coleta de dados, veja o link na página de cada dataset)
Stars: ✭ 780 (+3445.45%)
FeatranA Scala feature transformation library for data science and machine learning
Stars: ✭ 420 (+1809.09%)
VadVoice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (+2727.27%)
Agile data code 2Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+1777.27%)
SkdataPython tools for data analysis
Stars: ✭ 16 (-27.27%)
React SpreadsheetSimple, customizable yet performant spreadsheet for React
Stars: ✭ 393 (+1686.36%)
Valid.js📝 A library for data validation.
Stars: ✭ 604 (+2645.45%)
DatacleanerThe premier open source Data Quality solution
Stars: ✭ 391 (+1677.27%)
Datacurator Filetreea standard filetree for /r/datacurator [ and r/datahoarder ]
Stars: ✭ 753 (+3322.73%)
PdpipeEasy pipelines for pandas DataFrames.
Stars: ✭ 590 (+2581.82%)
LpfmpointsEvolution of LPFM Stations
Stars: ✭ 19 (-13.64%)
Mithril DataA rich data model library for Mithril javascript framework
Stars: ✭ 17 (-22.73%)
BitsA bite sized library for dealing with bytes.
Stars: ✭ 16 (-27.27%)
RowsA common, beautiful interface to tabular data, no matter the format
Stars: ✭ 739 (+3259.09%)
IexfinancePython SDK for IEX Cloud
Stars: ✭ 573 (+2504.55%)