Top 507 data-analysis open source projects

Crunchbase Ml
Mergers and acquisitions prediction based on M&A information from Crunchbase.
Kodiak
Enhance your feature engineering workflow with Kodiak
March Madness Data
NCAA brackets in JSON form
Eda miner
Swiss army knife, but for visualization, analytics, and machine learning. View docs here: http://edaminer.com/docs/ and a demo (don't abuse) here: http://edaminer.com/
Pyda 2e Zh
📖 [Translation] Python for Data Analysis, 2nd Edition
Data Science On Gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Vectorbt
Ultimate Python library for time series analysis and backtesting at scale
Dataflowjavasdk
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Socrat
A Dynamic Web Toolbox for Interactive Data Processing, Analysis, and Visualization
Raio X
📊 Data analysis of the women in the Computer Science program at UFCG
Nanny
A tidyverse suite for (pre-) machine-learning: cluster, PCA, permute, impute, rotate, redundancy, triangular, smart-subset, abundant and variable features.
Visualization Of Global Terrorism Database
📊 Visualization of the GTD with the Python Plotly library, including amazing graphs and animation 📼
Pyamplitude
A Python connector for Amplitude Analytics
Dataframe
C++ DataFrame for statistical, financial, and ML analysis, written in modern C++ using native types and contiguous memory storage, with no pointers involved
Getting Started
This repository is a getting started guide to Singer.
Siuba
Python library for using dplyr-like syntax with pandas and SQL
Imbalanced Learn
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Data Analysis And Machine Learning Projects
Repository of teaching materials, code, and data for my data analysis and machine learning projects.
Qs ledger
Quantified Self Personal Data Aggregator and Data Analysis
Cookbook 2nd Code
Code of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Rumale
Rumale is a machine learning library in Ruby
Dapy
Easy-to-use data analysis / manipulation framework for humans
Gonum
Gonum is a set of numeric libraries for the Go programming language. It contains libraries for matrices, statistics, optimization, and more
Knowledge Repo
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
Weibospider
⚡ A distributed crawler for Weibo, built with Celery and Requests.
Antd Umi Sys
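Enterprise BI system and data visualization platform. Main technologies: react, antd, umi, dva, es6, less, and others. Shared so we can learn from each other; if you like it, please star ⭐.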
Awesome R
A curated list of awesome R packages, frameworks and software.
Dataanalysisinaction
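(Completed) Detailed notes for the Geek Time course "Data Analysis in Action: 45 Lectures", including Markdown, images, mind maps, code, and data. The code can be read and tested directly!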
R
Exercises (incl. analyses) with R language (math+statistics)
Gop
GoPlus - The Go+ language for engineering, STEM education, and data science
Pydata Notebook
Chinese translation notes for Python for Data Analysis, 2nd Edition (2017)
Jupyter pivottablejs
Drag’n’drop Pivot Tables and Charts for Jupyter/IPython Notebook, care of PivotTable.js
Iclr2020 Openreviewdata
Script that crawls metadata from the ICLR OpenReview webpage. Includes tutorials on installing and using Selenium and ChromeDriver on Ubuntu.
Iris
A powerful, format-agnostic, and community-driven Python package for analysing and visualising Earth science data
The Elements Of Statistical Learning Python Notebooks
A series of Python Jupyter notebooks that help you better understand "The Elements of Statistical Learning" book
Datacleaner
The premier open source Data Quality solution
Pandastable
Table analysis in Tkinter using pandas DataFrames.
Bap
Bayesian Analysis with Python (Second Edition)
Prettypandas
A Pandas Styler class for making beautiful tables
181-240 of 507 data-analysis projects