xforestA super-fast and scalable Random Forest library based on fast histogram decision tree algorithm and distributed bagging framework. It can be used for binary classification, multi-label classification, and regression tasks. This library provides both Python and command line interface to users.
Stars: ✭ 20 (-37.5%)
anomalyDetectionAn R package for implementing augmented network log anomaly detection procedures
Stars: ✭ 21 (-34.37%)
jdsJenesis Data Store: a dynamic, cross platform, high performance, ORM data-mapper. Designed to assist in rapid development and data mining
Stars: ✭ 17 (-46.87%)
HealthyPiOBSOLETE. Replaced by https://github.com/Protocentral/protocentral-healthypi-v3
Stars: ✭ 64 (+100%)
Rightmove webscraper.pyPython class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
Stars: ✭ 125 (+290.63%)
FSCNMFAn implementation of "Fusing Structure and Content via Non-negative Matrix Factorization for Embedding Information Networks".
Stars: ✭ 16 (-50%)
Kddcup 20206th Solution for 2020-KDDCUP Debiasing Challenge
Stars: ✭ 118 (+268.75%)
Cogcomp NlpyCogComp's light-weight Python NLP annotators
Stars: ✭ 115 (+259.38%)
Instagram-Comments-ScraperInstagram comment scraper using python and selenium. Save the comments into excel.
Stars: ✭ 73 (+128.13%)
hpipeWorkflow engine for various computing systems.
Stars: ✭ 26 (-18.75%)
BellaBella is a pure python post-exploitation data mining tool & remote administration tool for macOS. 🍎💻
Stars: ✭ 112 (+250%)
Awesome Datascience📝 An awesome Data Science repository to learn and apply for real world problems.
Stars: ✭ 17,520 (+54650%)
kmeansA simple implementation of K-means (and Bisecting K-means) clustering algorithm in Python
Stars: ✭ 18 (-43.75%)
Gitlogg💾 🧮 🤯 Parse the 'git log' of multiple repos to 'JSON'
Stars: ✭ 102 (+218.75%)
NIDS-Intrusion-DetectionSimple Implementation of Network Intrusion Detection System. KddCup'99 Data set is used for this project. kdd_cup_10_percent is used for training test. correct set is used for test. PCA is used for dimension reduction. SVM and KNN supervised algorithms are the classification algorithms of project. Accuracy : %83.5 For SVM , %80 For KNN
Stars: ✭ 45 (+40.63%)
Orange3🍊 📊 💡 Orange: Interactive data analysis
Stars: ✭ 3,152 (+9750%)
csmath-2021This mathematics course is taught for the first year Ph.D. students of computer science and related areas @zju
Stars: ✭ 30 (-6.25%)
Graph samplingGraph Sampling is a python package containing various approaches which samples the original graph according to different sample sizes.
Stars: ✭ 99 (+209.38%)
candis🎀 A data mining suite for gene expression data.
Stars: ✭ 28 (-12.5%)
iisInformation Inference Service of the OpenAIRE system
Stars: ✭ 16 (-50%)
XCloudOfficial Code for Paper <XCloud: Design and Implementation of AI Cloud Platform with RESTful API Service> (arXiv1912.10344)
Stars: ✭ 58 (+81.25%)
MsnoiseA Python Package for Monitoring Seismic Velocity Changes using Ambient Seismic Noise | http://www.msnoise.org
Stars: ✭ 94 (+193.75%)
tf-idf-pythonTerm frequency–inverse document frequency for Chinese novel/documents implemented in python.
Stars: ✭ 98 (+206.25%)
HFT-PredictionMachine learning approach to high frequency trading, MLP & RNN used
Stars: ✭ 19 (-40.62%)
rec-coreData pipelining service
Stars: ✭ 19 (-40.62%)
dee2Digital Expression Explorer 2 (DEE2): a repository of uniformly processed RNA-seq data
Stars: ✭ 32 (+0%)
Dc Hi guides[Data Castle 算法竞赛] 精品旅行服务成单预测 final rank 11
Stars: ✭ 83 (+159.38%)
iwwAI based web-wrapper for web-content-extraction
Stars: ✭ 61 (+90.63%)
ReaperSocial media scraping / data collection tool for the Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs
Stars: ✭ 240 (+650%)
heidiheidi : tidy data in Haskell
Stars: ✭ 24 (-25%)
neuromanticLatest Data Science Materials
Stars: ✭ 27 (-15.62%)
Tsrepr TSrepr: R package for time series representations
Stars: ✭ 75 (+134.38%)
gosquitogosquito ("go" + "mosquito") is a pluggable tool for data gathering, data processing and data transmitting to various destinations.
Stars: ✭ 25 (-21.87%)
DatascienceCurated list of Python resources for data science.
Stars: ✭ 3,051 (+9434.38%)
RavenRAVEN is a flexible and multi-purpose probabilistic risk analysis, uncertainty quantification, parameter optimization and data knowledge-discovering framework.
Stars: ✭ 122 (+281.25%)
evineInteractive CLI Web Crawler
Stars: ✭ 140 (+337.5%)
kasthack.ospГенератор сырых дампов пользователей VK.
Stars: ✭ 15 (-53.12%)
Bee UniversityProject thu thập điểm chuẩn đại học 2014 - 2018 và phân tích dữ liệu
Stars: ✭ 73 (+128.13%)
modelscriptREPO MOVED TO https://github.com/repetere/jsonstack-data - Data Science and Machine learning in JavaScript
Stars: ✭ 40 (+25%)
hh researchАвтоматизация поиска и исследования вакансий с сайта hh.ru (Headhunter) с помощью методов Python. Классификация данных, поиск статистических параметров.
Stars: ✭ 36 (+12.5%)
ecg-recorder-iosThis is the project page for the iOS app ECG Recorder for Polar H10 heart rate sensor.
Stars: ✭ 20 (-37.5%)
xgboost-smote-detect-fraudCan we predict accurately on the skewed data? What are the sampling techniques that can be used. Which models/techniques can be used in this scenario? Find the answers in this code pattern!
Stars: ✭ 59 (+84.38%)