Apriori-and-Eclat-Frequent-Itemset-MiningImplementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mining frequent item sets in a set of transactions, implementation in Python.
Stars: ✭ 36 (+80%)
KaliIntelligenceSuiteKali Intelligence Suite (KIS) shall aid in the fast, autonomous, central, and comprehensive collection of intelligence by executing standard penetration testing tools. The collected data is internally stored in a structured manner to allow the fast identification and visualisation of the collected information.
Stars: ✭ 58 (+190%)
Data-Analyst-NanodegreeThis repo consists of the projects that I completed as a part of the Udacity's Data Analyst Nanodegree's curriculum.
Stars: ✭ 13 (-35%)
ECG analysisNo description or website provided.
Stars: ✭ 32 (+60%)
Medium-Stats-AnalysisExploring data and analyzing metrics for user-specific Medium Stats
Stars: ✭ 27 (+35%)
heidiheidi : tidy data in Haskell
Stars: ✭ 24 (+20%)
MetQyRepository for R package MetQy (read related publication here: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6247936/)
Stars: ✭ 17 (-15%)
bsu🎓Repository for university labs on FAMCS, BSU
Stars: ✭ 91 (+355%)
TurboDataMinerThe objective of this Burp Suite extension is the flexible and dynamic extraction, correlation, and structured presentation of information from the Burp Suite project as well as the flexible and dynamic on-the-fly modification of outgoing or incoming HTTP requests using Python scripts. Thus, Turbo Data Miner shall aid in gaining a better and fas…
Stars: ✭ 46 (+130%)
website-to-jsonConverts website to json using jQuery selectors
Stars: ✭ 37 (+85%)
COVID19TweetWNUT-2020 Task 2: Identification of informative COVID-19 English Tweets
Stars: ✭ 26 (+30%)
xforestA super-fast and scalable Random Forest library based on fast histogram decision tree algorithm and distributed bagging framework. It can be used for binary classification, multi-label classification, and regression tasks. This library provides both Python and command line interface to users.
Stars: ✭ 20 (+0%)
multiscorerA module for allowing the use of multiple metric functions in scikit's cross_val_score
Stars: ✭ 21 (+5%)
non-api-fb-scraperScrape public FaceBook posts from any group or user into a .csv file without needing to register for any API access
Stars: ✭ 40 (+100%)
conferencias matutinas amloCSVs de las versiones estenográficas de las conferencias matutinas del Presidente Andres Manuel López Obrador ( Mañaneras AMLO )
Stars: ✭ 25 (+25%)
neuromanticLatest Data Science Materials
Stars: ✭ 27 (+35%)
data-miningResources for the Data Mining for Bussiness and Governance course.
Stars: ✭ 52 (+160%)
TextClassification基于scikit-learn实现对新浪新闻的文本分类,数据集为100w篇文档,总计10类,测试集与训练集1:1划分。分类算法采用SVM和Bayes,其中Bayes作为baseline。
Stars: ✭ 86 (+330%)
perkeA keyphrase extractor for Persian
Stars: ✭ 60 (+200%)
EasyMinerEasy association rule mining and classification on the web
Stars: ✭ 14 (-30%)
gosquitogosquito ("go" + "mosquito") is a pluggable tool for data gathering, data processing and data transmitting to various destinations.
Stars: ✭ 25 (+25%)
simon-frontend💹 SIMON is powerful, flexible, open-source and easy to use machine learning knowledge discovery platform 💻
Stars: ✭ 114 (+470%)
PowerUp-2018The FRC 2018 programming repository for FRC Team 3695, Foximus Prime
Stars: ✭ 16 (-20%)
PyDREAMPython Implementation of Decay Replay Mining (DREAM)
Stars: ✭ 22 (+10%)
bookworm📚 social networks from novels
Stars: ✭ 72 (+260%)
python-notebooksA collection of Jupyter Notebooks used in conferences or just to have some snippets.
Stars: ✭ 14 (-30%)
teanaps자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.
Stars: ✭ 91 (+355%)
modelscriptREPO MOVED TO https://github.com/repetere/jsonstack-data - Data Science and Machine learning in JavaScript
Stars: ✭ 40 (+100%)
xgboost-smote-detect-fraudCan we predict accurately on the skewed data? What are the sampling techniques that can be used. Which models/techniques can be used in this scenario? Find the answers in this code pattern!
Stars: ✭ 59 (+195%)
iisInformation Inference Service of the OpenAIRE system
Stars: ✭ 16 (-20%)
hh researchАвтоматизация поиска и исследования вакансий с сайта hh.ru (Headhunter) с помощью методов Python. Классификация данных, поиск статистических параметров.
Stars: ✭ 36 (+80%)
scibloxsciblox - Easier Data Science and Machine Learning
Stars: ✭ 48 (+140%)
TIGERPython toolbox to evaluate graph vulnerability and robustness (CIKM 2021)
Stars: ✭ 103 (+415%)
hierarchical-clusteringA Python implementation of divisive and hierarchical clustering algorithms. The algorithms were tested on the Human Gene DNA Sequence dataset and dendrograms were plotted.
Stars: ✭ 62 (+210%)
LeetCodeAt present contains scraped data from around 1500 problems present on the site. More to follow....
Stars: ✭ 45 (+125%)
interpretable-mlTechniques & resources for training interpretable ML models, explaining ML models, and debugging ML models.
Stars: ✭ 17 (-15%)
dh-coreFunctional data science
Stars: ✭ 123 (+515%)
Ensemble-of-Multi-Scale-CNN-for-Dermatoscopy-ClassificationFully supervised binary classification of skin lesions from dermatoscopic images using an ensemble of diverse CNN architectures (EfficientNet-B6, Inception-V3, SEResNeXt-101, SENet-154, DenseNet-169) with multi-scale input.
Stars: ✭ 25 (+25%)
rankpruning🧹 Formerly for binary classification with noisy labels. Replaced by cleanlab.
Stars: ✭ 81 (+305%)
PySPODA Python package for spectral proper orthogonal decomposition (SPOD).
Stars: ✭ 50 (+150%)
cnn-text-classificationText classification with Convolution Neural Networks on Yelp, IMDB & sentence polarity dataset v1.0
Stars: ✭ 108 (+440%)
dee2Digital Expression Explorer 2 (DEE2): a repository of uniformly processed RNA-seq data
Stars: ✭ 32 (+60%)
iwwAI based web-wrapper for web-content-extraction
Stars: ✭ 61 (+205%)
PracticalMachineLearningA collection of ML related stuff including notebooks, codes and a curated list of various useful resources such as books and softwares. Almost everything mentioned here is free (as speech not free food) or open-source.
Stars: ✭ 60 (+200%)