RavenRAVEN is a flexible and multi-purpose probabilistic risk analysis, uncertainty quantification, parameter optimization and data knowledge-discovering framework.
Kddcup 2020 6th Solution for 2020-KDDCUP Debiasing Challenge
Ayakashi⚡️ Ayakashi.io - The next generation web scraping framework
Lab WorkshopsMaterials for workshops on text mining, machine learning, and data visualization
BellaBella is a pure Python post-exploitation data mining tool & remote administration tool for macOS. 🍎💻
WebplotdigitizerHTML5 based online tool to extract numerical data from plot images.
GspanPython implementation of frequent subgraph mining algorithm gSpan. Directed graphs are supported.
Gitlogg💾 🧮 🤯 Parse the 'git log' of multiple repos to 'JSON'
VizukaExplore high-dimensional datasets and how your algo handles specific regions.
Graph samplingGraph Sampling is a Python package containing various approaches that sample the original graph at different sample sizes.
MsnoiseA Python Package for Monitoring Seismic Velocity Changes using Ambient Seismic Noise | http://www.msnoise.org
DaggyDaggy - Data Aggregation Utility. Open source, free, cross-platform, server-less, useful utility for remote or local data aggregation and streaming
Csmath 2020This mathematics course is taught to first-year Ph.D. students in computer science and related areas @ZJU
DexDex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Tsv UtilseBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.
Tsrepr TSrepr: R package for time series representations
Bee UniversityProject that collects university admission cutoff scores for 2014 - 2018 and analyzes the data
BoltFast approximate vector operations
FfbeDatamining for FFBE GL
GorseAn open source recommender system service written in Go
EvalneSource code for EvalNE, a Python library for evaluating Network Embedding methods.
Linkedingiveaway👨🏽🏫You can learn about anything over here: what giveaways I do and why they are important in today's world. Are you interested in giveaways?🔋
Wordtokenizers.jlHigh performance tokenizers for natural language processing and other related tasks
GendisContains an implementation (sklearn API) of the algorithm proposed in "GENDIS: GEnetic DIscovery of Shapelets" and code to reproduce all experiments.
Etherscan MlPython Data Science and Machine Learning Library for the Ethereum and ERC-20 Blockchain
PycmMulti-class confusion matrix library in Python
Php MlPHP-ML - Machine Learning library for PHP
CgnnCrystal Graph Neural Networks
TadwAn implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
HeliomlA book about machine learning, statistics, and data mining for heliophysics
MldmCohort-wide lecture course "Machine Learning and Data Analysis (Machine Learning and Data Mining)" at the Faculty of Computational Mathematics and Cybernetics (VMK), Lomonosov Moscow State University
Drugs Recommendation Using ReviewsAnalyzes drug descriptions, conditions, and reviews, then recommends drugs for each patient health condition using deep learning models.
Metasra PipelineMetaSRA: normalized sample-specific metadata for the Sequence Read Archive
SubdueThe Subdue graph miner discovers highly-compressing patterns in an input graph.
ClevercsvCleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
En Data miningData Mining Historical Newspaper Metadata (METS/ALTO formats)
Data miningThe Ruby DataMining gem is a small collection of several data mining algorithms
VectorbtUltimate Python library for time series analysis and backtesting at scale
DataflowjavasdkGoogle Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Awesome Ai BooksSome awesome AI-related books and PDFs for learning and downloading, plus some playground models to learn from
Model Describermodel-describer : Making machine learning interpretable to humans
BiolitmapCode for the paper "BIOLITMAP: a web-based geolocated and temporal visualization of the evolution of bioinformatics publications" in Oxford Bioinformatics.
Pyclusteringpyclustering is a Python, C++ data mining library.
StocktalkData collection tool for social media analytics
Cookbook 2ndIPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018