scibloxsciblox - Easier Data Science and Machine Learning
Stars: ✭ 48 (+20%)
Medium-Stats-AnalysisExploring data and analyzing metrics for user-specific Medium Stats
Stars: ✭ 27 (-32.5%)
perkeA keyphrase extractor for Persian
Stars: ✭ 60 (+50%)
scikit-hubnessA Python package for hubness analysis and high-dimensional data mining
Stars: ✭ 41 (+2.5%)
nuts-mlFlow-based data pre-processing for deep learning
Stars: ✭ 32 (-20%)
Apriori-and-Eclat-Frequent-Itemset-MiningImplementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mining frequent item sets in a set of transactions, implementation in Python.
Stars: ✭ 36 (-10%)
AsclepiusOpen Price Comparison for US Hospitals
Stars: ✭ 20 (-50%)
TweetfeelsReal-time sentiment analysis in Python using twitter's streaming api
Stars: ✭ 249 (+522.5%)
Awesome Datascience📝 An awesome Data Science repository to learn and apply for real world problems.
Stars: ✭ 17,520 (+43700%)
conferencias matutinas amloCSVs de las versiones estenográficas de las conferencias matutinas del Presidente Andres Manuel López Obrador ( Mañaneras AMLO )
Stars: ✭ 25 (-37.5%)
website-to-jsonConverts website to json using jQuery selectors
Stars: ✭ 37 (-7.5%)
TextClassification基于scikit-learn实现对新浪新闻的文本分类,数据集为100w篇文档,总计10类,测试集与训练集1:1划分。分类算法采用SVM和Bayes,其中Bayes作为baseline。
Stars: ✭ 86 (+115%)
data-miningResources for the Data Mining for Bussiness and Governance course.
Stars: ✭ 52 (+30%)
PaperWeeklyAI📚「@MaiweiAI」Studying papers in the fields of computer vision, NLP, and machine learning algorithms every week.
Stars: ✭ 50 (+25%)
xforestA super-fast and scalable Random Forest library based on fast histogram decision tree algorithm and distributed bagging framework. It can be used for binary classification, multi-label classification, and regression tasks. This library provides both Python and command line interface to users.
Stars: ✭ 20 (-50%)
sugarcubeMonoidal data processes.
Stars: ✭ 32 (-20%)
non-api-fb-scraperScrape public FaceBook posts from any group or user into a .csv file without needing to register for any API access
Stars: ✭ 40 (+0%)
machine-learning-data-pipelinePipeline module for parallel real-time data processing for machine learning models development and production purposes.
Stars: ✭ 22 (-45%)
Orange3🍊 📊 💡 Orange: Interactive data analysis
Stars: ✭ 3,152 (+7780%)
bsu🎓Repository for university labs on FAMCS, BSU
Stars: ✭ 91 (+127.5%)
hierarchical-clusteringA Python implementation of divisive and hierarchical clustering algorithms. The algorithms were tested on the Human Gene DNA Sequence dataset and dendrograms were plotted.
Stars: ✭ 62 (+55%)
ReaperSocial media scraping / data collection tool for the Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs
Stars: ✭ 240 (+500%)
DatascienceCurated list of Python resources for data science.
Stars: ✭ 3,051 (+7527.5%)
PyDREAMPython Implementation of Decay Replay Mining (DREAM)
Stars: ✭ 22 (-45%)
dh-coreFunctional data science
Stars: ✭ 123 (+207.5%)
PySPODA Python package for spectral proper orthogonal decomposition (SPOD).
Stars: ✭ 50 (+25%)
MetQyRepository for R package MetQy (read related publication here: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6247936/)
Stars: ✭ 17 (-57.5%)
EasyMinerEasy association rule mining and classification on the web
Stars: ✭ 14 (-65%)
teanaps자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.
Stars: ✭ 91 (+127.5%)
imbalanced-ensembleClass-imbalanced / Long-tailed ensemble learning in Python. Modular, flexible, and extensible. | 模块化、灵活、易扩展的类别不平衡/长尾机器学习库
Stars: ✭ 199 (+397.5%)
xgboost-smote-detect-fraudCan we predict accurately on the skewed data? What are the sampling techniques that can be used. Which models/techniques can be used in this scenario? Find the answers in this code pattern!
Stars: ✭ 59 (+47.5%)
Semantic-Busobject flow treatment, data transformation
Stars: ✭ 49 (+22.5%)
hh researchАвтоматизация поиска и исследования вакансий с сайта hh.ru (Headhunter) с помощью методов Python. Классификация данных, поиск статистических параметров.
Stars: ✭ 36 (-10%)
software-analyticsA repository with my data analysis results of software artifacts
Stars: ✭ 37 (-7.5%)
iisInformation Inference Service of the OpenAIRE system
Stars: ✭ 16 (-60%)
kenchiA scikit-learn compatible library for anomaly detection
Stars: ✭ 36 (-10%)
MatminerData mining for materials science
Stars: ✭ 251 (+527.5%)
interpretable-mlTechniques & resources for training interpretable ML models, explaining ML models, and debugging ML models.
Stars: ✭ 17 (-57.5%)
LasioPython library for reading and writing well data using Log ASCII Standard (LAS) files
Stars: ✭ 234 (+485%)
Suod(MLSys' 21) An Acceleration System for Large-scare Unsupervised Heterogeneous Outlier Detection (Anomaly Detection)
Stars: ✭ 245 (+512.5%)
KaliIntelligenceSuiteKali Intelligence Suite (KIS) shall aid in the fast, autonomous, central, and comprehensive collection of intelligence by executing standard penetration testing tools. The collected data is internally stored in a structured manner to allow the fast identification and visualisation of the collected information.
Stars: ✭ 58 (+45%)
simon-frontend💹 SIMON is powerful, flexible, open-source and easy to use machine learning knowledge discovery platform 💻
Stars: ✭ 114 (+185%)
multiscorerA module for allowing the use of multiple metric functions in scikit's cross_val_score
Stars: ✭ 21 (-47.5%)
Data-Analyst-NanodegreeThis repo consists of the projects that I completed as a part of the Udacity's Data Analyst Nanodegree's curriculum.
Stars: ✭ 13 (-67.5%)