All Categories → Data Processing → data-analysis

Top 507 data-analysis open source projects

Sqlpad
Web-based SQL editor run in your own private cloud. Supports MySQL, Postgres, SQL Server, Vertica, Crate, ClickHouse, Trino, Presto, SAP HANA, Cassandra, Snowflake, BigQuery, SQLite, and more with ODBC
Articles
A repository for the source code, notebooks, data, files, and other assets used in the data science and machine learning articles on LearnDataSci
Pandas Summary
An extension to pandas dataframes describe function.
Notebooks
interactive notebooks from Planet Engineering
Scikit Mobility
scikit-mobility: mobility analysis in Python
Corner.py
Make some beautiful corner plots
Kneed
Knee point detection in Python 📈
Akshare
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Ai Learn
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
Zat
Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Notebooks
All of our computational notebooks
Hyperspy
Multidimensional data analysis
Dianping textmining
大众点评评论文本挖掘,包括点评数据爬取、数据清洗入库、数据分析、评论情感分析等的完整挖掘项目
Sealion
The first machine learning framework that encourages learning ML concepts instead of memorizing class functions.
Knowage Server
Knowage is the professional open source suite for modern business analytics over traditional sources and big data systems.
Urs
Universal Reddit Scraper - A comprehensive Reddit scraping command-line tool written in Python.
Data Science Hacks
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Xlearn
High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.
Datagear
数据可视化分析平台,使用Java语言开发,采用浏览器/服务器架构,支持SQL、CSV、Excel、HTTP接口、JSON等多种数据源
Eseur Book
Issue handling for Evidence-based Software Engineering: based on the publicly available data
fairlens
Identify bias and measure fairness of your data
JimuReport
「低代码可视化报表」类似excel操作风格,在线拖拽完成设计!功能涵盖: 报表设计、图形报表、打印设计、大屏设计等,完全免费!秉承“简单、易用、专业”的产品理念,极大的降低报表开发难度、缩短开发周期、解决各类报表难题。
Data-Analysis
Different types of data analytics projects : EDA, PDA, DDA, TSA and much more.....
Google-Data-Analytics-Professional-Certificate
Quizzes & Assignment Solutions for Google Data Analytics Professional Certificate on Coursera. Also included a few resources on side that I found helpful.
mlmachine
mlmachine accelerates machine learning experimentation
twitter-analytics-wrapper
A simple Python wrapper to download tweets data from the Twitter Analytics platform. Particularly interesting for the impressions metrics that are unavailable on current Twitter API. Also works for the videos data.
data vis statistics geosciences
This repository contains the laboratory portion of an upper level undergraduate class in Python on data visualization and statistics for geo & space scientists. Labs are updated when the course is in session through the most recent branch. See master version for current class.
BilibiliCrawler
🌀 crawl bilibili user info and video info for data analysis | BiliBili爬虫
Guitar
A Simple and Efficient Distributed Multidimensional BI Analysis Engine.
TextGridTools
Read, write, and manipulate Praat TextGrid files with Python
leaflet heatmap
简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
demeter
Process and analyze X-ray Absorption Spectroscopy data using Feff and either Larch or Ifeffit.
Dominando-Pandas
Este repositório está destinado ao processo de aprendizagem da biblioteca Pandas.
241-300 of 507 data-analysis projects