All Projects → X-lab2017 → open-digger

X-lab2017 / open-digger

Licence: Apache-2.0 license
Open source analysis tools

Programming Languages

Jupyter Notebook
11667 projects
typescript
32286 projects

Projects that are alternatives of or similar to open-digger

elucidate
convenience functions to help researchers elucidate patterns in their data
Stars: ✭ 26 (-86.53%)
Mutual labels:  data-analysis
metrics
📈 What to measure, how to measure it.
Stars: ✭ 14 (-92.75%)
Mutual labels:  data-analysis
crazy-awesome-crypto
A list of awesome crypto and blockchain projects
Stars: ✭ 35 (-81.87%)
Mutual labels:  data-analysis
online-course-recommendation-system
Built on data from Pluralsight's course API fetched results. Works with model trained with K-means unsupervised clustering algorithm.
Stars: ✭ 31 (-83.94%)
Mutual labels:  data-analysis
tianchi-diabetes
天池精准医疗大赛——人工智能辅助糖尿病遗传风险预测 第一赛季
Stars: ✭ 20 (-89.64%)
Mutual labels:  data-analysis
FDBeye
R tools for eyetracker workflows.
Stars: ✭ 101 (-47.67%)
Mutual labels:  data-analysis
iMOKA
interactive Multi Objective K-mer Analysis
Stars: ✭ 19 (-90.16%)
Mutual labels:  data-analysis
facerec-bias-bfw
Source code and notebooks to reproduce experiments and benchmarks on Bias Faces in the Wild (BFW).
Stars: ✭ 40 (-79.27%)
Mutual labels:  data-analysis
LeTourDataSet
Every cyclist and stage of the Tour de France in two CSV files.
Stars: ✭ 61 (-68.39%)
Mutual labels:  data-analysis
meta-csv
A Clojure smart reader for CSV files
Stars: ✭ 20 (-89.64%)
Mutual labels:  data-analysis
RepSeP
Reproducible Self-Publishing - Demo Publications in the Most Common Formats
Stars: ✭ 14 (-92.75%)
Mutual labels:  data-analysis
Fraud-Detection-in-Online-Transactions
Detecting Frauds in Online Transactions using Anamoly Detection Techniques Such as Over Sampling and Under-Sampling as the ratio of Frauds is less than 0.00005 thus, simply applying Classification Algorithm may result in Overfitting
Stars: ✭ 41 (-78.76%)
Mutual labels:  data-analysis
osm-data-classification
Migrated to: https://gitlab.com/Oslandia/osm-data-classification
Stars: ✭ 23 (-88.08%)
Mutual labels:  data-analysis
dataquest-guided-projects-solutions
My dataquest project solutions
Stars: ✭ 35 (-81.87%)
Mutual labels:  data-analysis
tieba-zhuaqu
百度贴吧分布式爬虫,用于贴吧数据挖掘。从贴吧维度和用户维度进行数据分析
Stars: ✭ 56 (-70.98%)
Mutual labels:  data-analysis
advanced-pandas
Pandas is a powerful tool for data exploration and analysis (including timeseries).
Stars: ✭ 22 (-88.6%)
Mutual labels:  data-analysis
akshare
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 5,155 (+2570.98%)
Mutual labels:  data-analysis
datatile
A library for managing, validating, summarizing, and visualizing data.
Stars: ✭ 419 (+117.1%)
Mutual labels:  data-analysis
covidviz
Professional visualizations of COVID-19, emulating NYT, The Guardian, Washington Post, The Economist & others, using only Python & Altair.
Stars: ✭ 24 (-87.56%)
Mutual labels:  data-analysis
dflib
In-memory Java DataFrame library
Stars: ✭ 50 (-74.09%)
Mutual labels:  data-analysis

OpenDigger

apache2

OpenDigger is an open source analysis report project for all open source data initiated by X-lab, this project aims to combine the wisdom of global developers to jointly analyze and insight into open source related data to help everyone better understand and participate in open source.

Usage

OpenDigger can be used as an online analysis tool or cron task scripts, and is used to generate lots of data for open source reports and tools like:

For study purpose, you can checkout the Clickhouse demo notebook

  • Clickhouse Demo(notebook): A comprehensive demo notebook for Clickhouse driver.

Data

GitHub Event Log

We use GHArchive as our data source for GitHub event logs and the data service is provided by clickhouse cluster cloud service. For data details, please check the data docs.

Sample Data Usage

OpenDigger provides ClickHouse sample data and Jupyter notebook image to run OpenDigger in local environment, please refer to sample data doc.

Communication

Welcome to join the WeChat group by scanning the QRCode and I will invite you into our WeChat group.

License

We use Apache-2.0 license for code part, please make sure abide by the licenses when using the project.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].