ClevercsvCleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
Stars: ✭ 887 (+1747.92%)
Igela delightful machine learning tool that allows you to train, test, and use models without writing code
Stars: ✭ 2,956 (+6058.33%)
sugarcubeMonoidal data processes.
Stars: ✭ 32 (-33.33%)
GamA PyTorch implementation of "Graph Classification Using Structural Attention" (KDD 2018).
Stars: ✭ 227 (+372.92%)
Data miningThe Ruby DataMining Gem, is a little collection of several Data-Mining-Algorithms
Stars: ✭ 10 (-79.17%)
ChefboostA Lightweight Decision Tree Framework supporting regular algorithms: ID3, C4,5, CART, CHAID and Regression Trees; some advanced techniques: Gradient Boosting (GBDT, GBRT, GBM), Random Forest and Adaboost w/categorical features support for Python
Stars: ✭ 176 (+266.67%)
DataflowjavasdkGoogle Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+1679.17%)
DanmfA sparsity aware implementation of "Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection" (CIKM 2018).
Stars: ✭ 161 (+235.42%)
get smartiesDummy variable generation with fit/transform capabilities
Stars: ✭ 23 (-52.08%)
Fenchel Young LossesProbabilistic classification in PyTorch/TensorFlow/scikit-learn with Fenchel-Young losses
Stars: ✭ 152 (+216.67%)
Data Analysis主要是爬虫与数据分析项目总结,外加建模与机器学习,模型的评估。
Stars: ✭ 142 (+195.83%)
Data Science Resources👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (+256.25%)
Qlik Py ToolsData Science algorithms for Qlik implemented as a Python Server Side Extension (SSE).
Stars: ✭ 135 (+181.25%)
Model Describermodel-describer : Making machine learning interpretable to humans
Stars: ✭ 22 (-54.17%)
Role2vecA scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).
Stars: ✭ 134 (+179.17%)
Semantic-Busobject flow treatment, data transformation
Stars: ✭ 49 (+2.08%)
BiolitmapCode for the paper "BIOLITMAP: a web-based geolocated and temporal visualization of the evolution of bioinformatics publications" in Oxford Bioinformatics.
Stars: ✭ 18 (-62.5%)
Data Science ToolkitCollection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (+252.08%)
House Price Prediction房价预测完整项目:1.爬取链家网数据 2.处理后,用sklearn中几个逻辑回归机器学习模型和keras神经网络搭建模型预测房价 最终结果神经网络效果更好,R^2值0.75左右
Stars: ✭ 116 (+141.67%)
StocktalkData collection tool for social media analytics
Stars: ✭ 765 (+1493.75%)
TextClassification基于scikit-learn实现对新浪新闻的文本分类,数据集为100w篇文档,总计10类,测试集与训练集1:1划分。分类算法采用SVM和Bayes,其中Bayes作为baseline。
Stars: ✭ 86 (+79.17%)
SkproSupervised domain-agnostic prediction framework for probabilistic modelling
Stars: ✭ 107 (+122.92%)
DataprooferA proofreader for your data
Stars: ✭ 628 (+1208.33%)
Pipelinethe `pipeline` shell command
Stars: ✭ 168 (+250%)
KarateclubKarate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)
Stars: ✭ 1,190 (+2379.17%)
ElkiELKI Data Mining Toolkit
Stars: ✭ 613 (+1177.08%)
scikit-hubnessA Python package for hubness analysis and high-dimensional data mining
Stars: ✭ 41 (-14.58%)
PdftabextractA set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
Stars: ✭ 1,969 (+4002.08%)
Sklearn PorterTranspile trained scikit-learn estimators to C, Java, JavaScript and others.
Stars: ✭ 1,014 (+2012.5%)
Interpretable machine learning with pythonExamples of techniques for training interpretable ML models, explaining ML models, and debugging ML models for accuracy, discrimination, and security.
Stars: ✭ 530 (+1004.17%)
Lambda PacksPrecompiled packages for AWS Lambda
Stars: ✭ 997 (+1977.08%)
Rong360用户贷款风险预测
Stars: ✭ 489 (+918.75%)
AilearningAiLearning: 机器学习 - MachineLearning - ML、深度学习 - DeepLearning - DL、自然语言处理 NLP
Stars: ✭ 32,316 (+67225%)
GensimTopic Modelling for Humans
Stars: ✭ 12,763 (+26489.58%)
Combo(AAAI' 20) A Python Toolbox for Machine Learning Model Combination
Stars: ✭ 481 (+902.08%)
Computer Vision Actioncomputer vision learning, include python machine learning action; computer vision based on deep learning ;coursera deeplearning.ai and other cv learning materials collect ...
Stars: ✭ 19 (-60.42%)
Decision-Tree-ImplementationA python 3 implementation of decision tree (machine learning classification algorithm) from scratch
Stars: ✭ 19 (-60.42%)
Deception-Detection-on-Amazon-reviews-datasetA SVM model that classifies the reviews as real or fake. Used both the review text and the additional features contained in the data set to build a model that predicted with over 85% accuracy without using any deep learning techniques.
Stars: ✭ 42 (-12.5%)
foreshadowAn automatic machine learning system
Stars: ✭ 29 (-39.58%)
dh-coreFunctional data science
Stars: ✭ 123 (+156.25%)
compvInsanely fast Open Source Computer Vision library for ARM and x86 devices (Up to #50 times faster than OpenCV)
Stars: ✭ 155 (+222.92%)
Prefixspan PyThe shortest yet efficient Python implementation of the sequential pattern mining algorithm PrefixSpan, closed sequential pattern mining algorithm BIDE, and generator sequential pattern mining algorithm FEAT.
Stars: ✭ 214 (+345.83%)
DexDex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (+2479.17%)
DataCon🏆DataCon大数据安全分析大赛,2019年方向二(恶意代码检测)冠军源码、2020年方向五(恶意代码分析)季军源码
Stars: ✭ 69 (+43.75%)