Kddcup 2020: 6th Solution for 2020-KDDCUP Debiasing Challenge
Stars: ✭ 118 (-44.86%)
Pdftabextract: A set of tools for extracting tables from PDF files, helping to do data mining on (OCR-processed) scanned documents.
Stars: ✭ 1,969 (+820.09%)
Gitlogg: 💾 🧮 🤯 Parse the 'git log' of multiple repos to 'JSON'
Stars: ✭ 102 (-52.34%)
Data Science Toolkit: Collection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (-21.03%)
Bella: Bella is a pure Python post-exploitation data mining tool & remote administration tool for macOS. 🍎💻
Stars: ✭ 112 (-47.66%)
Sourced Ce: source{d} Community Edition (CE)
Stars: ✭ 153 (-28.5%)
Msnoise: A Python Package for Monitoring Seismic Velocity Changes using Ambient Seismic Noise | http://www.msnoise.org
Stars: ✭ 94 (-56.07%)
Data Science Resources: 👨🏽🏫 You can learn about what data science is and why it's important in today's modern world. Are you interested in data science? 🔋
Stars: ✭ 171 (-20.09%)
Accelerator: The Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (-35.98%)
Pyss3: A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI
Stars: ✭ 191 (-10.75%)
Tipdm: TipDM modeling platform, an open-source data mining tool.
Stars: ✭ 130 (-39.25%)
Pipeline: the `pipeline` shell command
Stars: ✭ 168 (-21.5%)
Raven: RAVEN is a flexible and multi-purpose probabilistic risk analysis, uncertainty quantification, parameter optimization and data knowledge-discovering framework.
Stars: ✭ 122 (-42.99%)
Estadistica Con R: Personal notes on statistics, machine learning, and the R programming language
Stars: ✭ 201 (-6.07%)
Cogcomp Nlpy: CogComp's light-weight Python NLP annotators
Stars: ✭ 115 (-46.26%)
Gensim: Topic Modelling for Humans
Stars: ✭ 12,763 (+5864.02%)
Graph sampling: Graph Sampling is a Python package containing various approaches that sample the original graph according to different sample sizes.
Stars: ✭ 99 (-53.74%)
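Alimusic: 🎼 Tianchi Alibaba Music popularity trend prediction competition. The project covers all of the core code from the preliminary round through the second round. The aggregated data for the second round can be downloaded from Baidu Netdisk; for a more detailed description of the approach, visit my blog.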
Stars: ✭ 147 (-31.31%)
Daggy: Daggy - Data Aggregation Utility. Open source, free, cross-platform, server-less, useful utility for remote or local data aggregation and streaming
Stars: ✭ 91 (-57.48%)
Efficient Apriori: An efficient Python implementation of the Apriori algorithm.
Stars: ✭ 145 (-32.24%)
Matrixprofile: A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, accessible to everyone.
Stars: ✭ 141 (-34.11%)
Lightgbm: A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
Stars: ✭ 13,293 (+6111.68%)
Easyocr: Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Stars: ✭ 13,379 (+6151.87%)
Smartproxy: HTTP(S) Rotating Residential proxies - Code examples & General information
Stars: ✭ 205 (-4.21%)
Striplog: Lithology and stratigraphic logs for wells or outcrop.
Stars: ✭ 133 (-37.85%)
Welly: Well handling
Stars: ✭ 168 (-21.5%)
Ail Framework: AIL framework - Analysis Information Leak framework
Stars: ✭ 191 (-10.75%)
Rightmove webscraper.py: Python class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
Stars: ✭ 125 (-41.59%)
Spypi: An (un-)ethical hacking-station based on Raspberry Pi and Python
Stars: ✭ 167 (-21.96%)
Openhistorian: The Open Source Time-Series Data Historian
Stars: ✭ 120 (-43.93%)
Ayakashi: ⚡️ Ayakashi.io - The next generation web scraping framework
Stars: ✭ 117 (-45.33%)
Pzad: Course "Applied Problems in Data Analysis" (CMC Faculty, Lomonosov Moscow State University)
Stars: ✭ 160 (-25.23%)
Lab Workshops: Materials for workshops on text mining, machine learning, and data visualization
Stars: ✭ 112 (-47.66%)
Emuto: manipulate JSON files
Stars: ✭ 180 (-15.89%)
Webplotdigitizer: HTML5-based online tool to extract numerical data from plot images.
Stars: ✭ 1,605 (+650%)
Gspan: Python implementation of the frequent subgraph mining algorithm gSpan. Directed graphs are supported.
Stars: ✭ 103 (-51.87%)
Tradingview Data Scraper: Extract price and indicator data from TradingView charts to create ML datasets
Stars: ✭ 203 (-5.14%)
Vizuka: Explore high-dimensional datasets and how your algo handles specific regions.
Stars: ✭ 100 (-53.27%)
Xioc: Extract indicators of compromise from text, including "escaped" ones.
Stars: ✭ 148 (-30.84%)
Papers Literature Ml Dl Rl Ai: Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning
Stars: ✭ 1,341 (+526.64%)
Rosie Pattern Language: Rosie Pattern Language (RPL) and the Rosie Pattern Engine have MOVED!
Stars: ✭ 146 (-31.78%)
Gwu data mining: Materials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (+1.4%)
Qminer: Analytic platform for real-time large-scale streams containing structured and unstructured data.
Stars: ✭ 206 (-3.74%)
Instascrape: Powerful and flexible Instagram scraping library for Python, providing easy-to-use and expressive tools for accessing data programmatically
Stars: ✭ 202 (-5.61%)
Chefboost: A lightweight decision tree framework for Python supporting regular algorithms (ID3, C4.5, CART, CHAID and regression trees) and some advanced techniques (Gradient Boosting (GBDT, GBRT, GBM), Random Forest and Adaboost), with categorical feature support
Stars: ✭ 176 (-17.76%)
Fantasy Basketball: Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with a genetic algorithm. Capstone Project for Machine Learning Engineer Nanodegree by Udacity.
Stars: ✭ 146 (-31.78%)