SqlpadWeb-based SQL editor run in your own private cloud. Supports MySQL, Postgres, SQL Server, Vertica, Crate, ClickHouse, Trino, Presto, SAP HANA, Cassandra, Snowflake, BigQuery, SQLite, and more with ODBC
Data ScienceCollection of useful data science topics along with code and articles
ArticlesA repository for the source code, notebooks, data, files, and other assets used in the data science and machine learning articles on LearnDataSci
Quantitative NotebooksEducational notebooks on quantitative finance, algorithmic trading, financial modelling and investment strategy
Notebooksinteractive notebooks from Planet Engineering
KneedKnee point detection in Python 📈
AkshareAKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Ai Learn人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
ZatZeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Pydataroadopen source for wechat-official-account (ID: PyDataLab)
SealionThe first machine learning framework that encourages learning ML concepts instead of memorizing class functions.
Knowage ServerKnowage is the professional open source suite for modern business analytics over traditional sources and big data systems.
UrsUniversal Reddit Scraper - A comprehensive Reddit scraping command-line tool written in Python.
Data Science HacksData Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
XlearnHigh performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.
Datagear数据可视化分析平台,使用Java语言开发,采用浏览器/服务器架构,支持SQL、CSV、Excel、HTTP接口、JSON等多种数据源
Eseur BookIssue handling for Evidence-based Software Engineering: based on the publicly available data
LagoujobJob data mining repo for lagou.com
fairlensIdentify bias and measure fairness of your data
JimuReport「低代码可视化报表」类似excel操作风格,在线拖拽完成设计!功能涵盖: 报表设计、图形报表、打印设计、大屏设计等,完全免费!秉承“简单、易用、专业”的产品理念,极大的降低报表开发难度、缩短开发周期、解决各类报表难题。
Data-AnalysisDifferent types of data analytics projects : EDA, PDA, DDA, TSA and much more.....
trading sim📈📆 Backtest trading strategies concurrently using historical chart data from various financial exchanges.
mlmachinemlmachine accelerates machine learning experimentation
validadaAnother library for defensive data analysis.
GreyNSightsPrivacy-Preserving Data Analysis using Pandas
twitter-analytics-wrapperA simple Python wrapper to download tweets data from the Twitter Analytics platform. Particularly interesting for the impressions metrics that are unavailable on current Twitter API. Also works for the videos data.
kwxBERT, LDA, and TFIDF based keyword extraction in Python
r4dswebsitePublic repository for the R4DS community website.
growthbookOpen Source Feature Flagging and A/B Testing Platform
social-dataCode and data for eviction and housing analysis in the US
data vis statistics geosciencesThis repository contains the laboratory portion of an upper level undergraduate class in Python on data visualization and statistics for geo & space scientists. Labs are updated when the course is in session through the most recent branch. See master version for current class.
BilibiliCrawler🌀 crawl bilibili user info and video info for data analysis | BiliBili爬虫
visionsType System for Data Analysis in Python
GuitarA Simple and Efficient Distributed Multidimensional BI Analysis Engine.
CoreMSCoreMS is a comprehensive mass spectrometry software framework
sherlock🔎 Find usernames across social networks.
TextGridToolsRead, write, and manipulate Praat TextGrid files with Python
leaflet heatmap简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
demeterProcess and analyze X-ray Absorption Spectroscopy data using Feff and either Larch or Ifeffit.
Dominando-PandasEste repositório está destinado ao processo de aprendizagem da biblioteca Pandas.