twitter-analytics-wrapperA simple Python wrapper to download tweets data from the Twitter Analytics platform. Particularly interesting for the impressions metrics that are unavailable on current Twitter API. Also works for the videos data.
Stars: ✭ 44 (+100%)
DataprooferA proofreader for your data
Stars: ✭ 628 (+2754.55%)
ElkiELKI Data Mining Toolkit
Stars: ✭ 613 (+2686.36%)
Sourced Cesource{d} Community Edition (CE)
Stars: ✭ 153 (+595.45%)
Pipelinethe `pipeline` shell command
Stars: ✭ 168 (+663.64%)
FerretDeclarative web scraping
Stars: ✭ 4,837 (+21886.36%)
Cookbook 2nd CodeCode of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (+2359.09%)
PycmMulti-class confusion matrix library in Python
Stars: ✭ 1,076 (+4790.91%)
Model Describermodel-describer : Making machine learning interpretable to humans
Stars: ✭ 22 (+0%)
PracticalMachineLearningA collection of ML related stuff including notebooks, codes and a curated list of various useful resources such as books and softwares. Almost everything mentioned here is free (as speech not free food) or open-source.
Stars: ✭ 60 (+172.73%)
Rightmove webscraper.pyPython class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
Stars: ✭ 125 (+468.18%)
DatascienceCurated list of Python resources for data science.
Stars: ✭ 3,051 (+13768.18%)
Pydataroadopen source for wechat-official-account (ID: PyDataLab)
Stars: ✭ 302 (+1272.73%)
Ai Learn人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
Stars: ✭ 4,387 (+19840.91%)
kasthack.ospГенератор сырых дампов пользователей VK.
Stars: ✭ 15 (-31.82%)
NfstreamNFStream: a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (+2727.27%)
genieGenie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)
Stars: ✭ 21 (-4.55%)
Drugs Recommendation Using ReviewsAnalyzing the Drugs Descriptions, conditions, reviews and then recommending it using Deep Learning Models, for each Health Condition of a Patient.
Stars: ✭ 35 (+59.09%)
TipdmTipDM建模平台,开源的数据挖掘工具。
Stars: ✭ 130 (+490.91%)
Tsrepr TSrepr: R package for time series representations
Stars: ✭ 75 (+240.91%)
Amazing Feature EngineeringFeature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (+890.91%)
DeepgraphAnalyze Data with Pandas-based Networks. Documentation:
Stars: ✭ 232 (+954.55%)
python-notebooksA collection of Jupyter Notebooks used in conferences or just to have some snippets.
Stars: ✭ 14 (-36.36%)
DataflowjavasdkGoogle Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+3781.82%)
DexDex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (+5527.27%)
VectorbtUltimate Python library for time series analysis and backtesting at scale
Stars: ✭ 855 (+3786.36%)
Data Science Resources👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (+677.27%)
genieclustGenie++ Fast and Robust Hierarchical Clustering with Noise Point Detection - for Python and R
Stars: ✭ 34 (+54.55%)
Knowage ServerKnowage is the professional open source suite for modern business analytics over traditional sources and big data systems.
Stars: ✭ 276 (+1154.55%)
UrsUniversal Reddit Scraper - A comprehensive Reddit scraping command-line tool written in Python.
Stars: ✭ 275 (+1150%)
PyodA Python Toolbox for Scalable Outlier Detection (Anomaly Detection)
Stars: ✭ 5,083 (+23004.55%)
LagoujobJob data mining repo for lagou.com
Stars: ✭ 256 (+1063.64%)
taller SparkRTaller SparkR para las Jornadas de Usuarios de R
Stars: ✭ 12 (-45.45%)
Cookbook 2ndIPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (+3100%)
Etl unicorn数据可视化, 数据挖掘, 数据处理 ETL
Stars: ✭ 156 (+609.09%)
heidiheidi : tidy data in Haskell
Stars: ✭ 24 (+9.09%)
re-datare_data - fix data issues before your users & CEO would discover them 😊
Stars: ✭ 955 (+4240.91%)
Naive-Resume-MatchingText Similarity Applied to resume, to compare Resumes with Job Descriptions and create a score to rank them. Similar to an ATS.
Stars: ✭ 27 (+22.73%)
jdsJenesis Data Store: a dynamic, cross platform, high performance, ORM data-mapper. Designed to assist in rapid development and data mining
Stars: ✭ 17 (-22.73%)
covid-19COVID-19 World is yet another Project to build a Dashboard like app to showcase the data related to the COVID-19(Corona Virus).
Stars: ✭ 28 (+27.27%)
InfectCreate you virus in termux!
Stars: ✭ 33 (+50%)
Product-Categorization-NLPMulti-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
Stars: ✭ 30 (+36.36%)
pompScreen scraping and web crawling framework
Stars: ✭ 61 (+177.27%)
ipython-notebooksA collection of Jupyter notebooks exploring different datasets.
Stars: ✭ 43 (+95.45%)
pyglotaranA Python library for Global and Target Analysis of time-resolved spectroscopy data
Stars: ✭ 33 (+50%)
yt-channels-DS-AI-ML-CSA comprehensive list of 180+ YouTube Channels for Data Science, Data Engineering, Machine Learning, Deep learning, Computer Science, programming, software engineering, etc.
Stars: ✭ 1,038 (+4618.18%)
NIDS-Intrusion-DetectionSimple Implementation of Network Intrusion Detection System. KddCup'99 Data set is used for this project. kdd_cup_10_percent is used for training test. correct set is used for test. PCA is used for dimension reduction. SVM and KNN supervised algorithms are the classification algorithms of project. Accuracy : %83.5 For SVM , %80 For KNN
Stars: ✭ 45 (+104.55%)
PythonTipsDSPython Tips for Data Scientist
Stars: ✭ 23 (+4.55%)
Instagram-Comments-ScraperInstagram comment scraper using python and selenium. Save the comments into excel.
Stars: ✭ 73 (+231.82%)