Can we predict accurately on the skewed data? What are the sampling techniques that can be used. Which models/techniques can be used in this scenario? Find the answers in this code pattern!

Stars: ✭ 59 (+13.46%)

Mutual labels: data-mining

Reaper

Social media scraping / data collection tool for the Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs

Stars: ✭ 240 (+361.54%)

Mutual labels: data-mining

dh-core

Functional data science

Stars: ✭ 123 (+136.54%)

Mutual labels: data-mining

Datascience

Curated list of Python resources for data science.

Stars: ✭ 3,051 (+5767.31%)

Mutual labels: data-mining

2018-Tencent-Lookalike

2018-腾讯广告算法大赛-相似人群拓展(初赛)：10th/1563 (Top 0.64%)

Stars: ✭ 46 (-11.54%)

Mutual labels: data-mining

Deepgraph

Analyze Data with Pandas-based Networks. Documentation:

Stars: ✭ 232 (+346.15%)

Mutual labels: data-mining

if1007

Desenvolvimento de Aplicações com Arquitetura Baseada em Microservices

Stars: ✭ 78 (+50%)

Mutual labels: course

Automlpipeline.jl

A package that makes it trivial to create and evaluate machine learning pipeline architectures.

Stars: ✭ 223 (+328.85%)

Mutual labels: data-mining

iis

Information Inference Service of the OpenAIRE system

Stars: ✭ 16 (-69.23%)

Mutual labels: data-mining

Amazing Feature Engineering

Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.

Stars: ✭ 218 (+319.23%)

Mutual labels: data-mining

TextClassification

基于scikit-learn实现对新浪新闻的文本分类，数据集为100w篇文档，总计10类，测试集与训练集1:1划分。分类算法采用SVM和Bayes，其中Bayes作为baseline。

Stars: ✭ 86 (+65.38%)

Mutual labels: data-mining

Gwu data mining

Materials for GWU DNSC 6279 and DNSC 6290.

Stars: ✭ 217 (+317.31%)

Mutual labels: data-mining

website-to-json

Converts website to json using jQuery selectors

Stars: ✭ 37 (-28.85%)

Mutual labels: data-mining

Qminer

Analytic platform for real-time large-scale streams containing structured and unstructured data.

Stars: ✭ 206 (+296.15%)

Mutual labels: data-mining

curso-introduccion-pyqgis

Curso de Introducción al desarrollo con PyQGIS (por Germán Carrillo)

Stars: ✭ 28 (-46.15%)

Mutual labels: course

Estadistica Con R

Apuntes personales sobre estadística, machine learning y lenguaje de programación R

Stars: ✭ 201 (+286.54%)

Mutual labels: data-mining

kubernetes-localdev

Create a local Kubernetes development environment on macOS or Windows and WSL2, including HTTPS/TLS and OAuth2/OIDC authentication.

Stars: ✭ 210 (+303.85%)

Mutual labels: course

Instascrape

Powerful and flexible Instagram scraping library for Python, providing easy-to-use and expressive tools for accessing data programmatically

Stars: ✭ 202 (+288.46%)

Mutual labels: data-mining

rails contact list

Learn Ruby on Rails by creating an app from scratch

Stars: ✭ 60 (+15.38%)

Mutual labels: course

Pyss3

A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI

Stars: ✭ 191 (+267.31%)

Mutual labels: data-mining

AILA-Artificial-Intelligence-for-Legal-Assistance

Python implementations of the various methods used in FIRE 2019 conference.

Stars: ✭ 39 (-25%)

Mutual labels: data-mining

Dataaspirant codes

Complete machine learning model codes

Stars: ✭ 185 (+255.77%)

Mutual labels: data-mining

Heart disease prediction

Heart Disease prediction using 5 algorithms

Stars: ✭ 43 (-17.31%)

Mutual labels: data-mining

Awesome Machine Learning Interpretability

A curated list of awesome machine learning interpretability resources.

Stars: ✭ 2,404 (+4523.08%)

Mutual labels: data-mining

nodejs

⛳ Node.js 应用开发课程资料

Stars: ✭ 14 (-73.08%)

Mutual labels: course

Chefboost

A Lightweight Decision Tree Framework supporting regular algorithms: ID3, C4,5, CART, CHAID and Regression Trees; some advanced techniques: Gradient Boosting (GBDT, GBRT, GBM), Random Forest and Adaboost w/categorical features support for Python

Stars: ✭ 176 (+238.46%)

Mutual labels: data-mining

EasyMiner

Easy association rule mining and classification on the web

Stars: ✭ 14 (-73.08%)

Mutual labels: data-mining

Data Science Resources

👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋

Stars: ✭ 171 (+228.85%)

Mutual labels: data-mining

blinkist-m4a-downloader

Grabs all of the audio files from all of the Blinkist books

Stars: ✭ 100 (+92.31%)

Mutual labels: data-mining

Data Science Toolkit

Collection of stats, modeling, and data science tools in Python and R.

Stars: ✭ 169 (+225%)

Mutual labels: data-mining

skillbox

Выполненные и принятые домашние задания, а также другие материалы, которые могут помочь в обучении

Stars: ✭ 32 (-38.46%)

Mutual labels: course

Pipeline

the `pipeline` shell command

Stars: ✭ 168 (+223.08%)

Mutual labels: data-mining

KaliIntelligenceSuite

Kali Intelligence Suite (KIS) shall aid in the fast, autonomous, central, and comprehensive collection of intelligence by executing standard penetration testing tools. The collected data is internally stored in a structured manner to allow the fast identification and visualisation of the collected information.

Stars: ✭ 58 (+11.54%)

Mutual labels: data-mining

Pdftabextract

A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.

Stars: ✭ 1,969 (+3686.54%)

Mutual labels: data-mining

complete-gRPC

In this course, we are going to learn about gRPC and how to use it with protocol buffer

Stars: ✭ 53 (+1.92%)

Mutual labels: course

Gensim

Topic Modelling for Humans

Stars: ✭ 12,763 (+24444.23%)

Mutual labels: data-mining

simon-frontend

💹 SIMON is powerful, flexible, open-source and easy to use machine learning knowledge discovery platform 💻

Stars: ✭ 114 (+119.23%)

Mutual labels: data-mining

Sourced Ce

source{d} Community Edition (CE)

Stars: ✭ 153 (+194.23%)

Mutual labels: data-mining

Data-Mining-on-Social-Media

Python scripts to extract tweets and facebook posts from public users.

Stars: ✭ 99 (+90.38%)

Mutual labels: data-mining

Alimusic

🎼天池阿里音乐流行趋势预测大赛，项目中涵盖了从初赛到复赛的全部核心代码。复赛的聚合数据可以在百度网盘下载，更详细的思路介绍欢迎访问我的博客。

Stars: ✭ 147 (+182.69%)

Mutual labels: data-mining

hierarchical-clustering

A Python implementation of divisive and hierarchical clustering algorithms. The algorithms were tested on the Human Gene DNA Sequence dataset and dendrograms were plotted.

Stars: ✭ 62 (+19.23%)

Mutual labels: data-mining

Fantasy Basketball

Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with genetic algorithm. Capstone Project for Machine Learning Engineer Nanodegree by Udacity.

Stars: ✭ 146 (+180.77%)

Mutual labels: data-mining

tableaunoir

An online blackboard 🖉 with fridge magnets 🌈🧲 for teaching, and making animations 🏃 and presentations ⎚.

Stars: ✭ 149 (+186.54%)

Mutual labels: course

Twitterdatamining

Twitter数据挖掘及其可视化

Stars: ✭ 145 (+178.85%)

Mutual labels: data-mining

python for scientists

Python Open Courseware for Scientists and Engineers

Stars: ✭ 55 (+5.77%)

Mutual labels: course

sugarcube

Monoidal data processes.

Stars: ✭ 32 (-38.46%)

Mutual labels: data-mining

vuejs-egitimi

Vue.js ile Sıfırdan Uygulama Geliştirme Eğitimi uygulama ve proje dosyaları

Stars: ✭ 19 (-63.46%)

Mutual labels: course

hub-toolbox-python3

Hubness analysis and removal functions

Stars: ✭ 17 (-67.31%)

Mutual labels: data-mining

edge-computer-vision

Edge Computer Vision Course

Stars: ✭ 41 (-21.15%)

Mutual labels: course

Apriori-and-Eclat-Frequent-Itemset-Mining

Implementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mining frequent item sets in a set of transactions, implementation in Python.

Stars: ✭ 36 (-30.77%)

Mutual labels: data-mining

lt1

Course on Language Technologies and NLP

Stars: ✭ 15 (-71.15%)

Mutual labels: course

Semantic-Bus

object flow treatment, data transformation

Stars: ✭ 49 (-5.77%)

Mutual labels: data-mining

61-120 of 501 similar projects

‹

›

next*5