Gwu data mining: Materials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (-13.55%)
Data Science Toolkit: Collection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (-32.67%)
Gitlogg: 💾 🧮 🤯 Parse the 'git log' of multiple repos to 'JSON'
Stars: ✭ 102 (-59.36%)
Datascience: Curated list of Python resources for data science.
Stars: ✭ 3,051 (+1115.54%)
Graph sampling: Graph Sampling is a Python package containing various approaches that sample the original graph according to different sample sizes.
Stars: ✭ 99 (-60.56%)
Pipeline: the `pipeline` shell command
Stars: ✭ 168 (-33.07%)
Msnoise: A Python Package for Monitoring Seismic Velocity Changes using Ambient Seismic Noise | http://www.msnoise.org
Stars: ✭ 94 (-62.55%)
Qminer: Analytic platform for real-time large-scale streams containing structured and unstructured data.
Stars: ✭ 206 (-17.93%)
Pdftabextract: A set of tools for extracting tables from PDF files, helping to do data mining on (OCR-processed) scanned documents.
Stars: ✭ 1,969 (+684.46%)
Dc Hi guides: [Data Castle algorithm competition] Premium travel service order prediction, final rank 11
Stars: ✭ 83 (-66.93%)
Suod: (MLSys' 21) An Acceleration System for Large-scale Unsupervised Heterogeneous Outlier Detection (Anomaly Detection)
Stars: ✭ 245 (-2.39%)
Gensim: Topic Modelling for Humans
Stars: ✭ 12,763 (+4984.86%)
Tsrepr: TSrepr, an R package for time series representations
Stars: ✭ 75 (-70.12%)
Estadistica Con R: Personal notes on statistics, machine learning, and the R programming language
Stars: ✭ 201 (-19.92%)
Bee University: Project to collect university admission cutoff scores for 2014 - 2018 and analyze the data
Stars: ✭ 73 (-70.92%)
Sourced Ce: source{d} Community Edition (CE)
Stars: ✭ 153 (-39.04%)
Ffbe: Datamining for FFBE GL
Stars: ✭ 69 (-72.51%)
Deepgraph: Analyze Data with Pandas-based Networks. Documentation:
Stars: ✭ 232 (-7.57%)
Evalne: Source code for EvalNE, a Python library for evaluating Network Embedding methods.
Stars: ✭ 67 (-73.31%)
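Alimusic: 🎼 Tianchi Alibaba Music popularity trend prediction competition. The project covers all of the core code from the preliminary round through the second round. The aggregated second-round data can be downloaded from Baidu Netdisk; for a more detailed walkthrough of the approach, visit my blog.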
Stars: ✭ 147 (-41.43%)
Instascrape: Powerful and flexible Instagram scraping library for Python, providing easy-to-use and expressive tools for accessing data programmatically
Stars: ✭ 202 (-19.52%)
Gendis: Contains an implementation (sklearn API) of the algorithm proposed in "GENDIS: GEnetic DIscovery of Shapelets" and code to reproduce all experiments.
Stars: ✭ 59 (-76.49%)
Fantasy Basketball: Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with a genetic algorithm. Capstone Project for Machine Learning Engineer Nanodegree by Udacity.
Stars: ✭ 146 (-41.83%)
Etherscan Ml: Python Data Science and Machine Learning Library for the Ethereum and ERC-20 Blockchain
Stars: ✭ 55 (-78.09%)
Tweetfeels: Real-time sentiment analysis in Python using Twitter's streaming API
Stars: ✭ 249 (-0.8%)
Php Ml: PHP-ML - Machine Learning library for PHP
Stars: ✭ 7,900 (+3047.41%)
Tadw: An implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Stars: ✭ 43 (-82.87%)
Pyss3: A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI
Stars: ✭ 191 (-23.9%)
Helioml: A book about machine learning, statistics, and data mining for heliophysics
Stars: ✭ 36 (-85.66%)
Drugs Recommendation Using Reviews: Analyzing drug descriptions, conditions, and reviews, then recommending drugs with deep learning models for each health condition of a patient.
Stars: ✭ 35 (-86.06%)
Automlpipeline.jl: A package that makes it trivial to create and evaluate machine learning pipeline architectures.
Stars: ✭ 223 (-11.16%)
Invoice2data: Extract structured data from PDF invoices
Stars: ✭ 943 (+275.7%)
Accelerator: The Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (-45.42%)
Clevercsv: CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
Stars: ✭ 887 (+253.39%)
Data mining: The Ruby DataMining Gem is a small collection of several data mining algorithms
Stars: ✭ 10 (-96.02%)
Dataflowjavasdk: Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+240.24%)
Reaper: Social media scraping / data collection tool for the Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs
Stars: ✭ 240 (-4.38%)
Tipdm: TipDM modeling platform, an open-source data mining tool.
Stars: ✭ 130 (-48.21%)
Model Describer: model-describer, making machine learning interpretable to humans
Stars: ✭ 22 (-91.24%)
Biolitmap: Code for the paper "BIOLITMAP: a web-based geolocated and temporal visualization of the evolution of bioinformatics publications" in Oxford Bioinformatics.
Stars: ✭ 18 (-92.83%)
Stocktalk: Data collection tool for social media analytics
Stars: ✭ 765 (+204.78%)
Amazing Feature Engineering: Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (-13.15%)
Raven: RAVEN is a flexible and multi-purpose probabilistic risk analysis, uncertainty quantification, parameter optimization and data knowledge-discovering framework.
Stars: ✭ 122 (-51.39%)
Orange3: 🍊 📊 💡 Orange: Interactive data analysis
Stars: ✭ 3,152 (+1155.78%)
Prefixspan Py: The shortest yet efficient Python implementation of the sequential pattern mining algorithm PrefixSpan, closed sequential pattern mining algorithm BIDE, and generator sequential pattern mining algorithm FEAT.
Stars: ✭ 214 (-14.74%)
Ayakashi: ⚡️ Ayakashi.io - The next generation web scraping framework
Stars: ✭ 117 (-53.39%)