TSrepr: R package for time series representations
Stars: ✭ 75 (+341.18%)
geodaData: Data package for accessing GeoDa datasets using R
Stars: ✭ 15 (-11.76%)
Suod: (MLSys '21) An Acceleration System for Large-scale Unsupervised Heterogeneous Outlier Detection (Anomaly Detection)
Stars: ✭ 245 (+1341.18%)
sugarcube: Monoidal data processes.
Stars: ✭ 32 (+88.24%)
datawizard: Magic potions to clean and transform your data 🧙
Stars: ✭ 149 (+776.47%)
Matminer: Data mining for materials science
Stars: ✭ 251 (+1376.47%)
ffscrapr: R API Client for Fantasy Football League Platforms
Stars: ✭ 55 (+223.53%)
Lasio: Python library for reading and writing well data using Log ASCII Standard (LAS) files
Stars: ✭ 234 (+1276.47%)
Statistical Learning: Lecture Slides and R Sessions for Trevor Hastie and Rob Tibshirani's "Statistical Learning" Stanford course
Stars: ✭ 223 (+1211.76%)
Asclepius: Open Price Comparison for US Hospitals
Stars: ✭ 20 (+17.65%)
rcppsimdjson: Rcpp Bindings for the 'simdjson' Header Library
Stars: ✭ 103 (+505.88%)
scikit-hubness: A Python package for hubness analysis and high-dimensional data mining
Stars: ✭ 41 (+141.18%)
packagefinder: Comfortable search for R packages on CRAN, either directly from the R console or with an RStudio add-in
Stars: ✭ 43 (+152.94%)
kenchi: A scikit-learn compatible library for anomaly detection
Stars: ✭ 36 (+111.76%)
lingtypology: R package for linguistic cartography and typological database search
Stars: ✭ 47 (+176.47%)
Tweetfeels: Real-time sentiment analysis in Python using Twitter's streaming API
Stars: ✭ 249 (+1364.71%)
scMCA: Mouse cell atlas
Stars: ✭ 45 (+164.71%)
rdomains: Classifying the content of domains
Stars: ✭ 47 (+176.47%)
Chirp: Interface to manage and centralize Google Alert information
Stars: ✭ 227 (+1235.29%)
nlrx: nlrx NetLogo R
Stars: ✭ 66 (+288.24%)
EasyMiner: Easy association rule mining and classification on the web
Stars: ✭ 14 (-17.65%)
Prefixspan Py: The shortest yet efficient Python implementation of the sequential pattern mining algorithm PrefixSpan, the closed sequential pattern mining algorithm BIDE, and the generator sequential pattern mining algorithm FEAT.
Stars: ✭ 214 (+1158.82%)
states: Create country-year/month/day panels consistent with the COW or Gleditsch & Ward independent states lists
Stars: ✭ 13 (-23.53%)
imbalanced-ensemble: Class-imbalanced / long-tailed ensemble learning in Python. A modular, flexible, and extensible library for class-imbalanced / long-tailed machine learning.
Stars: ✭ 199 (+1070.59%)
wqbc: An R package for water quality thresholds and index calculation for British Columbia
Stars: ✭ 16 (-5.88%)
Semantic-Bus: Object flow treatment, data transformation
Stars: ✭ 49 (+188.24%)
Luminescence: Development of the R package 'Luminescence'
Stars: ✭ 13 (-23.53%)
software-analytics: A repository with my data analysis results of software artifacts
Stars: ✭ 37 (+117.65%)
metadat: Meta-analytic datasets for R
Stars: ✭ 21 (+23.53%)
geneSCF inactive: GeneSCF moved to a dedicated GitHub page, https://github.com/genescf/GeneSCF
Stars: ✭ 21 (+23.53%)
suppdata: Grabbing SUPPlementary DATA in R
Stars: ✭ 31 (+82.35%)
Awesome Datascience: 📝 An awesome Data Science repository to learn from and apply to real-world problems.
Stars: ✭ 17,520 (+102958.82%)
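TextClassification: Text classification of Sina News articles using scikit-learn. The dataset contains 1 million documents across 10 categories, split 1:1 between test and training sets. The classifiers are SVM and Bayes, with Bayes serving as the baseline.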
Stars: ✭ 86 (+405.88%)
Orange3: 🍊 📊 💡 Orange: Interactive data analysis
Stars: ✭ 3,152 (+18441.18%)
PLNmodels: A collection of Poisson lognormal models for multivariate count data analysis
Stars: ✭ 44 (+158.82%)
metafor: A meta-analysis package for R
Stars: ✭ 174 (+923.53%)
Reaper: Social media scraping / data collection tool for the Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs
Stars: ✭ 240 (+1311.76%)
crminer: ⛔ ARCHIVED ⛔ Fetch 'Scholarly' Full Text from 'Crossref'
Stars: ✭ 17 (+0%)
Datascience: Curated list of Python resources for data science.
Stars: ✭ 3,051 (+17847.06%)
tidyhydat: An R package to import Water Survey of Canada hydrometric data and make it tidy
Stars: ✭ 67 (+294.12%)
Deepgraph: Analyze Data with Pandas-based Networks. Documentation:
Stars: ✭ 232 (+1264.71%)
DoReMIFaSol: Download data from the Insee website
Stars: ✭ 25 (+47.06%)
Automlpipeline.jl: A package that makes it trivial to create and evaluate machine learning pipeline architectures.
Stars: ✭ 223 (+1211.76%)
mlr3tuning: Hyperparameter optimization package of the mlr3 ecosystem
Stars: ✭ 44 (+158.82%)
Amazing Feature Engineering: Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (+1182.35%)
perke: A keyphrase extractor for Persian
Stars: ✭ 60 (+252.94%)
Gwu data mining: Materials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (+1176.47%)
syn: syn - the thesaurus
Stars: ✭ 45 (+164.71%)
Qminer: Analytic platform for real-time large-scale streams containing structured and unstructured data.
Stars: ✭ 206 (+1111.76%)
PaperWeeklyAI: 📚 「@MaiweiAI」 Studying papers in the fields of computer vision, NLP, and machine learning algorithms every week.
Stars: ✭ 50 (+194.12%)
RcppEigen: Rcpp integration for the Eigen templated linear algebra library
Stars: ✭ 89 (+423.53%)
oem: Penalized least squares estimation using the Orthogonalizing EM (OEM) algorithm
Stars: ✭ 22 (+29.41%)
rdflib: 📦 High-level wrapper around the redland package for common RDF applications
Stars: ✭ 47 (+176.47%)
rAltmetric: Query and visualize metrics from altmetric.com
Stars: ✭ 46 (+170.59%)