Data Science Ipython NotebooksData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+9655.75%)
Data Engineering BookAccumulated knowledge and experience in the field of Data Engineering
Stars: ✭ 471 (+108.41%)
Isp Data PollutionISP Data Pollution to Protect Private Browsing History with Obfuscation
Stars: ✭ 425 (+88.05%)
Disk.frameFast Disk-Based Parallelized Data Manipulation Framework for Larger-than-RAM Data
Stars: ✭ 517 (+128.76%)
Knowledge RepoA next-generation curated knowledge sharing platform for data scientists and other technical professions.
Stars: ✭ 4,956 (+2092.92%)
Data Science PortfolioPortfolio of data science projects completed by me for academic, self learning, and hobby purposes.
Stars: ✭ 559 (+147.35%)
Trump LiesTutorial: Web scraping in Python with Beautiful Soup
Stars: ✭ 201 (-11.06%)
Free Ai Resources🚀 FREE AI Resources - 🎓 Courses, 👷 Jobs, 📝 Blogs, 🔬 AI Research, and many more - for everyone!
Stars: ✭ 192 (-15.04%)
DataframeC++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types, continuous memory storage, and no pointers are involved
Stars: ✭ 828 (+266.37%)
Awesome StreamlitThe purpose of this project is to share knowledge on how awesome Streamlit is and can be
Stars: ✭ 769 (+240.27%)
Riceteacatpandarepo with challenge material for riceteacatpanda (2020)
Stars: ✭ 18 (-92.04%)
Agile data code 2Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+82.74%)
Pandas ProfilingCreate HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+3585.4%)
Data Science On GcpSource code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+282.3%)
Data Forge TsThe JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 967 (+327.88%)
Data PolygamyData Polygamy is a topology-based framework that allows users to query for statistically significant relationships between spatio-temporal data sets.
Stars: ✭ 39 (-82.74%)
SkootA package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn friendly interface in an effort to expedite the modeling process.
Stars: ✭ 50 (-77.88%)
BoltzmanncleanFill missing values in Pandas DataFrames using Restricted Boltzmann Machines
Stars: ✭ 23 (-89.82%)
Climate Change Data🌍 A curated list of APIs, open data and ML/AI projects on climate change
Stars: ✭ 195 (-13.72%)
DatacomparerdataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.
Stars: ✭ 58 (-74.34%)
Locopylocopy: Loading/Unloading to Redshift and Snowflake using Python.
Stars: ✭ 73 (-67.7%)
OpenrefineOpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (+3674.78%)
Gopup数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字稿,影视票房数据,高校名单,疫情数据…
Stars: ✭ 1,229 (+443.81%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-65.04%)
Daily Coding ProblemSeries of the problem 💯 and solution ✅ asked by Daily Coding problem👨🎓 website.
Stars: ✭ 90 (-60.18%)
SspipeSimple Smart Pipe: python productivity-tool for rapid data manipulation
Stars: ✭ 96 (-57.52%)
Ml PyxisTool for reading and writing datasets of tensors in a Lightning Memory-Mapped Database (LMDB). Designed to manage machine learning datasets with fast reading speeds.
Stars: ✭ 93 (-58.85%)
Pyspark Cheatsheet🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-52.21%)
GeniA Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-32.74%)
Gspread DataframeRead/write Google spreadsheets using pandas DataFrames
Stars: ✭ 118 (-47.79%)
Machine Learning With PythonPractice and tutorial-style notebooks covering wide variety of machine learning techniques
Stars: ✭ 2,197 (+872.12%)
LearnpythonforresearchThis repository provides everything you need to get started with Python for (social science) research.
Stars: ✭ 163 (-27.88%)
AuptimizerAn automatic ML model optimization tool.
Stars: ✭ 166 (-26.55%)
Dbg PdsDeutsche Boerse's Financial Trading Public Data Set
Stars: ✭ 124 (-45.13%)
Dat8General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+570.8%)
ButterfreeA tool for building feature stores.
Stars: ✭ 126 (-44.25%)
Cape PythonCollaborate on privacy-preserving policy for data science projects in Pandas and Apache Spark
Stars: ✭ 125 (-44.69%)
Data Forge JsJavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 139 (-38.5%)
TrafficA toolbox for processing and analysing air traffic data
Stars: ✭ 138 (-38.94%)
AcceleratorThe Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (-39.38%)
ExportsheetdataAdd-on for Google Sheets that allows sheets to be exported as JSON or XML.
Stars: ✭ 170 (-24.78%)
Soda SqlMetric collection, data testing and monitoring for SQL accessible data
Stars: ✭ 173 (-23.45%)
Py QuantmodPowerful financial charting library based on R's Quantmod | http://py-quantmod.readthedocs.io/en/latest/
Stars: ✭ 155 (-31.42%)
Pandas DatareaderExtract data from a wide range of Internet sources into a pandas DataFrame.
Stars: ✭ 2,183 (+865.93%)
PandasschemaA validation library for Pandas data frames using user-friendly schemas
Stars: ✭ 135 (-40.27%)
DatacleanerThe premier open source Data Quality solution
Stars: ✭ 391 (+73.01%)
Finance Go📊 Financial markets data library implemented in go.
Stars: ✭ 392 (+73.45%)
Pymc Example ProjectExample PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning.
Stars: ✭ 90 (-60.18%)
Blockchain2graphBlockchain2graph extracts blockchain data (bitcoin) and insert them into a graph database (neo4j).
Stars: ✭ 134 (-40.71%)
DtaleVisualizer for pandas data structures
Stars: ✭ 2,864 (+1167.26%)
Andrew Ng NotesThis is Andrew NG Coursera Handwritten Notes.
Stars: ✭ 180 (-20.35%)