teanaps: An open-source Python library for natural language processing and text analysis.
Stars: ✭ 91 (+75%)
kenchi: A scikit-learn compatible library for anomaly detection
Stars: ✭ 36 (-30.77%)
Awesome Datascience: 📝 An awesome Data Science repository to learn and apply for real world problems.
Stars: ✭ 17,520 (+33592.31%)
Data-Analyst-Nanodegree: This repo consists of the projects that I completed as part of Udacity's Data Analyst Nanodegree curriculum.
Stars: ✭ 13 (-75%)
Orange3: 🍊 📊 💡 Orange: Interactive data analysis
Stars: ✭ 3,152 (+5961.54%)
xgboost-smote-detect-fraud: Can we predict accurately on skewed data? What sampling techniques can be used? Which models and techniques can be used in this scenario? Find the answers in this code pattern!
Stars: ✭ 59 (+13.46%)
Reaper: Social media scraping / data collection tool for the Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs
Stars: ✭ 240 (+361.54%)
dh-core: Functional data science
Stars: ✭ 123 (+136.54%)
Datascience: Curated list of Python resources for data science.
Stars: ✭ 3,051 (+5767.31%)
Deepgraph: Analyze Data with Pandas-based Networks. Documentation:
Stars: ✭ 232 (+346.15%)
if1007: Application Development with a Microservices-Based Architecture
Stars: ✭ 78 (+50%)
Automlpipeline.jl: A package that makes it trivial to create and evaluate machine learning pipeline architectures.
Stars: ✭ 223 (+328.85%)
iis: Information Inference Service of the OpenAIRE system
Stars: ✭ 16 (-69.23%)
Amazing Feature Engineering: Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (+319.23%)
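TextClassification: Text classification of Sina News articles using scikit-learn. The dataset contains 1 million documents in 10 categories, with a 1:1 split between test and training sets. The classifiers are SVM and Bayes, with Bayes serving as the baseline.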
Stars: ✭ 86 (+65.38%)
Gwu data mining: Materials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (+317.31%)
website-to-json: Converts website to json using jQuery selectors
Stars: ✭ 37 (-28.85%)
Qminer: Analytic platform for real-time large-scale streams containing structured and unstructured data.
Stars: ✭ 206 (+296.15%)
Estadistica Con R: Personal notes on statistics, machine learning, and the R programming language
Stars: ✭ 201 (+286.54%)
kubernetes-localdev: Create a local Kubernetes development environment on macOS or Windows and WSL2, including HTTPS/TLS and OAuth2/OIDC authentication.
Stars: ✭ 210 (+303.85%)
Instascrape: Powerful and flexible Instagram scraping library for Python, providing easy-to-use and expressive tools for accessing data programmatically
Stars: ✭ 202 (+288.46%)
rails contact list: Learn Ruby on Rails by creating an app from scratch
Stars: ✭ 60 (+15.38%)
Pyss3: A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI
Stars: ✭ 191 (+267.31%)
nodejs: ⛳ Node.js application development course materials
Stars: ✭ 14 (-73.08%)
Chefboost: A Lightweight Decision Tree Framework for Python supporting regular algorithms: ID3, C4.5, CART, CHAID and Regression Trees; some advanced techniques: Gradient Boosting (GBDT, GBRT, GBM), Random Forest and Adaboost; with categorical features support
Stars: ✭ 176 (+238.46%)
EasyMiner: Easy association rule mining and classification on the web
Stars: ✭ 14 (-73.08%)
Data Science Resources: 👨🏽🏫 You can learn about what data science is and why it's important in today's modern world. Are you interested in data science? 🔋
Stars: ✭ 171 (+228.85%)
Data Science Toolkit: Collection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (+225%)
skillbox: Completed and accepted homework assignments, as well as other materials that may help with learning
Stars: ✭ 32 (-38.46%)
Pipeline: the `pipeline` shell command
Stars: ✭ 168 (+223.08%)
KaliIntelligenceSuite: Kali Intelligence Suite (KIS) shall aid in the fast, autonomous, central, and comprehensive collection of intelligence by executing standard penetration testing tools. The collected data is internally stored in a structured manner to allow the fast identification and visualisation of the collected information.
Stars: ✭ 58 (+11.54%)
Pdftabextract: A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
Stars: ✭ 1,969 (+3686.54%)
complete-gRPC: In this course, we are going to learn about gRPC and how to use it with protocol buffers
Stars: ✭ 53 (+1.92%)
Gensim: Topic Modelling for Humans
Stars: ✭ 12,763 (+24444.23%)
simon-frontend: 💹 SIMON is a powerful, flexible, open-source, and easy-to-use machine learning knowledge discovery platform 💻
Stars: ✭ 114 (+119.23%)
Sourced Ce: source{d} Community Edition (CE)
Stars: ✭ 153 (+194.23%)
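Alimusic: 🎼 Tianchi Alibaba Music popularity trend prediction competition. The project covers all of the core code from the preliminary round through the semi-final round. The aggregated semi-final data can be downloaded from Baidu Netdisk; for a more detailed walkthrough of the approach, visit my blog.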
Stars: ✭ 147 (+182.69%)
hierarchical-clustering: A Python implementation of divisive and hierarchical clustering algorithms. The algorithms were tested on the Human Gene DNA Sequence dataset and dendrograms were plotted.
Stars: ✭ 62 (+19.23%)
Fantasy Basketball: Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with a genetic algorithm. Capstone Project for Machine Learning Engineer Nanodegree by Udacity.
Stars: ✭ 146 (+180.77%)
tableaunoir: An online blackboard 🖉 with fridge magnets 🌈🧲 for teaching, and making animations 🏃 and presentations ⎚.
Stars: ✭ 149 (+186.54%)
sugarcube: Monoidal data processes.
Stars: ✭ 32 (-38.46%)
vuejs-egitimi: Application and project files for the course on developing applications from scratch with Vue.js
Stars: ✭ 19 (-63.46%)
Apriori-and-Eclat-Frequent-Itemset-Mining: Python implementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mining frequent item sets in a set of transactions.
Stars: ✭ 36 (-30.77%)
lt1: Course on Language Technologies and NLP
Stars: ✭ 15 (-71.15%)
Semantic-Bus: Object flow treatment, data transformation
Stars: ✭ 49 (-5.77%)