AkshareAKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 4,334 (+248.95%)
AirbyteAirbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+296.05%)
DatacomparerdataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.
Stars: ✭ 58 (-95.33%)
Data Science HacksData Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (-78.02%)
PycmMulti-class confusion matrix library in Python
Stars: ✭ 1,076 (-13.37%)
SkdataPython tools for data analysis
Stars: ✭ 16 (-98.71%)
Gopup数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字稿,影视票房数据,高校名单,疫情数据…
Stars: ✭ 1,229 (-1.05%)
GraphiaA visualisation tool for the creation and analysis of graphs
Stars: ✭ 67 (-94.61%)
Data Science Resources👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (-86.23%)
DatacleanerThe premier open source Data Quality solution
Stars: ✭ 391 (-68.52%)
OpenrefineOpenRefine is a free, open source power tool for working with messy data and improving it
Stars: ✭ 8,531 (+586.88%)
Production Data ScienceProduction Data Science: a workflow for collaborative data science aimed at production
Stars: ✭ 388 (-68.76%)
Knowledge RepoA next-generation curated knowledge sharing platform for data scientists and other technical professions.
Stars: ✭ 4,956 (+299.03%)
Imbalanced LearnA Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Stars: ✭ 5,617 (+352.25%)
PdpipeEasy pipelines for pandas DataFrames.
Stars: ✭ 590 (-52.5%)
DexDex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (-0.32%)
MetabaseThe simplest, fastest way to get business intelligence and analytics to everyone in your company 😋
Stars: ✭ 26,803 (+2058.05%)
ElkiELKI Data Mining Toolkit
Stars: ✭ 613 (-50.64%)
Cookbook 2ndIPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (-43.32%)
RowsA common, beautiful interface to tabular data, no matter the format
Stars: ✭ 739 (-40.5%)
Riceteacatpandarepo with challenge material for riceteacatpanda (2020)
Stars: ✭ 18 (-98.55%)
Hyperlearn50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Stars: ✭ 1,204 (-3.06%)
SocratA Dynamic Web Toolbox for Interactive Data Processing, Analysis, and Visualization
Stars: ✭ 26 (-97.91%)
Awesome StreamlitThe purpose of this project is to share knowledge on how awesome Streamlit is and can be
Stars: ✭ 769 (-38.08%)
VdsVerteego Data Suite
Stars: ✭ 9 (-99.28%)
Data Science On GcpSource code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (-30.43%)
PachydermReproducible Data Science at Scale!
Stars: ✭ 5,305 (+327.13%)
DatasheetsRead data from, write data to, and modify the formatting of Google Sheets
Stars: ✭ 593 (-52.25%)
DataprooferA proofreader for your data
Stars: ✭ 628 (-49.44%)
NfstreamNFStream: a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (-49.92%)
PrefectThe easiest way to automate your data
Stars: ✭ 7,956 (+540.58%)
Cookbook 2nd CodeCode of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (-56.44%)
DataframeC++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types, continuous memory storage, and no pointers are involved
Stars: ✭ 828 (-33.33%)
ResourcesPyMC3 educational resources
Stars: ✭ 930 (-25.12%)
Model Describermodel-describer : Making machine learning interpretable to humans
Stars: ✭ 22 (-98.23%)
DataflowjavasdkGoogle Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (-31.24%)
Mlcourse.aiOpen Machine Learning Course
Stars: ✭ 7,963 (+541.14%)
Data Forge TsThe JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 967 (-22.14%)
ApogeeTools for dealing with APOGEE data
Stars: ✭ 34 (-97.26%)
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (-20.61%)
Janitorsimple tools for data cleaning in R
Stars: ✭ 981 (-21.01%)
Data PolygamyData Polygamy is a topology-based framework that allows users to query for statistically significant relationships between spatio-temporal data sets.
Stars: ✭ 39 (-96.86%)
TiledbThe Universal Storage Engine
Stars: ✭ 1,072 (-13.69%)
DartSelf-service data workflow management
Stars: ✭ 15 (-98.79%)
MathematicavsrExample projects, code, and documents for comparing Mathematica with R.
Stars: ✭ 41 (-96.7%)
Drake ExamplesExample workflows for the drake R package
Stars: ✭ 57 (-95.41%)
Tsrepr TSrepr: R package for time series representations
Stars: ✭ 75 (-93.96%)
Datacamp🍧 A repository that contains courses I have taken on DataCamp
Stars: ✭ 69 (-94.44%)
StartrA template for data journalism in R
Stars: ✭ 69 (-94.44%)