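Gopup - Data interface: Baidu, Google, Toutiao, and Weibo indices; macroeconomic data; interest rate data; currency exchange rates; "Qianlima" (high-growth) and unicorn companies; Xinwen Lianbo news broadcast transcripts; film box office data; university lists; epidemic data…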
Stars: ✭ 1,229 (+240.44%)
Sspipe - Simple Smart Pipe: Python productivity tool for rapid data manipulation
Stars: ✭ 96 (-73.41%)
Spark Py Notebooks - Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+270.64%)
Tennis Crystal Ball - Ultimate Tennis Statistics and Tennis Crystal Ball: Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-70.36%)
Setl - A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-78.12%)
Spark R Notebooks - R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-69.81%)
Scikit Learn - scikit-learn: machine learning in Python
Stars: ✭ 48,322 (+13285.6%)
kobe-every-shot-ever - A Los Angeles Times analysis of every shot in Kobe Bryant's NBA career
Stars: ✭ 66 (-81.72%)
whyqd - Data wrangling with simplicity, complete audit transparency, and speed
Stars: ✭ 16 (-95.57%)
Hyperlearn - Machine learning that is 50% faster and uses 50% less RAM. Sklearn rewritten with Numba. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Stars: ✭ 1,204 (+233.52%)
Pandasvault - Advanced Pandas Vault: Utilities, Functions and Snippets (by @firmai).
Stars: ✭ 316 (-12.47%)
Deeplearning Notes - Notes for the Deep Learning Specialization courses led by Andrew Ng.
Stars: ✭ 126 (-65.1%)
Aws Data Wrangler - Pandas on AWS: easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+560.66%)
Pandasschema - A validation library for Pandas data frames using user-friendly schemas
Stars: ✭ 135 (-62.6%)
DataProfiler - What's in your data? Extract schema, statistics and entities from datasets
Stars: ✭ 843 (+133.52%)
Py Quantmod - Powerful financial charting library based on R's Quantmod | http://py-quantmod.readthedocs.io/en/latest/
Stars: ✭ 155 (-57.06%)
Datasciencevm - Tools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
Stars: ✭ 153 (-57.62%)
Airbyte - Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+1262.6%)
Machine Learning With Python - Practice and tutorial-style notebooks covering a wide variety of machine learning techniques
Stars: ✭ 2,197 (+508.59%)
Tsrepr - TSrepr: R package for time series representations
Stars: ✭ 75 (-79.22%)
tutorials - Short programming tutorials pertaining to data analysis.
Stars: ✭ 14 (-96.12%)
Pydataroad - Open source for the WeChat official account (ID: PyDataLab)
Stars: ✭ 302 (-16.34%)
Finance - Here you can find all the quantitative finance algorithms that I've worked on and refined over the past year!
Stars: ✭ 194 (-46.26%)
Data Science Live Book - An open source book to learn data science, data analysis and machine learning, suitable for all ages!
Stars: ✭ 193 (-46.54%)
ipython-notebooks - A collection of Jupyter notebooks exploring different datasets.
Stars: ✭ 43 (-88.09%)
Machine Learning Resources - A curated list of awesome machine learning frameworks, libraries, courses, books and many more.
Stars: ✭ 226 (-37.4%)
Amazing Feature Engineering - Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (-39.61%)
Product-Categorization-NLP - Multi-class text classification of products based on their descriptions, using machine learning algorithms and neural networks (MLP, CNN, DistilBERT).
Stars: ✭ 30 (-91.69%)
Kneed - Knee point detection in Python 📈
Stars: ✭ 328 (-9.14%)
Datascience - Curated list of Python resources for data science.
Stars: ✭ 3,051 (+745.15%)
Orange3 - 🍊 📊 💡 Orange: Interactive data analysis
Stars: ✭ 3,152 (+773.13%)
Tablesaw - Java dataframe and visualization library
Stars: ✭ 2,785 (+671.47%)
Koalas - Koalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+743.21%)
PracticalMachineLearning - A collection of ML-related material including notebooks, code, and a curated list of useful resources such as books and software. Almost everything mentioned here is free (as in speech, not free food) or open source.
Stars: ✭ 60 (-83.38%)
Klib - Easy-to-use Python library of customized functions for cleaning and analyzing data.
Stars: ✭ 192 (-46.81%)
pandas-workshop - An introductory workshop on pandas with notebooks and exercises for following along.
Stars: ✭ 161 (-55.4%)
Data-Science-101 - Notes and tutorials on how to use Python, pandas, seaborn, NumPy, Matplotlib, and SciPy for data science.
Stars: ✭ 19 (-94.74%)
datatile - A library for managing, validating, summarizing, and visualizing data.
Stars: ✭ 419 (+16.07%)
online-course-recommendation-system - Built on data fetched from Pluralsight's course API. Uses a model trained with the K-means unsupervised clustering algorithm.
Stars: ✭ 31 (-91.41%)
Dominando-Pandas - This repository is intended for learning the Pandas library.
Stars: ✭ 22 (-93.91%)
Datscan - DatScan is an initiative to build an open-source CMS that aims to solve any problem through data analysis, using various modules and a large standardized module library.
Stars: ✭ 13 (-96.4%)
visions - Type System for Data Analysis in Python
Stars: ✭ 136 (-62.33%)
GreyNSights - Privacy-Preserving Data Analysis using Pandas
Stars: ✭ 18 (-95.01%)
fairlens - Identify bias and measure fairness of your data
Stars: ✭ 51 (-85.87%)
Dream3d - Data analysis program and framework for materials science data analytics, based on the SIMPL managing framework.
Stars: ✭ 73 (-79.78%)
Gradio - Create UIs for your machine learning model in Python in 3 minutes
Stars: ✭ 4,358 (+1107.2%)