wildebeestFile processing pipelines
Stars: ✭ 86 (-80%)
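TextClassificationText classification of Sina News articles implemented with scikit-learn. The dataset contains 1,000,000 documents in 10 categories, split 1:1 between test and training sets. The classification algorithms are SVM and Bayes, with Bayes serving as the baseline.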
Stars: ✭ 86 (-80%)
HiveApache Hive
Stars: ✭ 4,031 (+837.44%)
DGFraud-TF2A Deep Graph-based Toolbox for Fraud Detection in TensorFlow 2.X
Stars: ✭ 84 (-80.47%)
spmf-pyPython SPMF Wrapper 🐍 🎁
Stars: ✭ 35 (-91.86%)
Monetdb OldThis is the official mirror of the MonetDB Mercurial repository. Please note that we do not accept pull requests on GitHub. The regression test results can be found on the MonetDB Testweb: http://monetdb.cwi.nl/testweb/web/status.php. For contributions, please see: https://www.monetdb.org/Developers
Stars: ✭ 317 (-26.28%)
Statistical-Learning-using-RA statistical learning application consisting of various machine learning algorithms, my implementations of them in R, and in-depth interpretations of the results. Documents and reports related to these techniques can be found on my RPubs profile.
Stars: ✭ 27 (-93.72%)
Mli ResourcesH2O.ai Machine Learning Interpretability Resources
Stars: ✭ 428 (-0.47%)
blogpost codesRepo with the code for my blog post articles
Stars: ✭ 41 (-90.47%)
pythonPython codes from tutorials on the Data Professor YouTube channel
Stars: ✭ 51 (-88.14%)
EasyMinerEasy association rule mining and classification on the web
Stars: ✭ 14 (-96.74%)
Pm4py CorePublic repository for the PM4Py (Process Mining for Python) project.
Stars: ✭ 313 (-27.21%)
PaperWeeklyAI📚「@MaiweiAI」Studying papers in the fields of computer vision, NLP, and machine learning algorithms every week.
Stars: ✭ 50 (-88.37%)
actComputational synthetic biology: Predicting DNA edits for bioengineering
Stars: ✭ 67 (-84.42%)
AsclepiusOpen Price Comparison for US Hospitals
Stars: ✭ 20 (-95.35%)
KyuubiKyuubi is a unified multi-tenant JDBC interface for large-scale data processing and analytics, built on top of Apache Spark
Stars: ✭ 363 (-15.58%)
schrutepyThe Entire Transcript from the Office in Tidy Format
Stars: ✭ 22 (-94.88%)
cocoon-demoCocoon – a flow-based workflow automation, data mining and visual analytics tool.
Stars: ✭ 19 (-95.58%)
sugarcubeMonoidal data processes.
Stars: ✭ 32 (-92.56%)
ThemisDatabase audit platform
Stars: ✭ 313 (-27.21%)
scikit-hubnessA Python package for hubness analysis and high-dimensional data mining
Stars: ✭ 41 (-90.47%)
software-analyticsA repository with my data analysis results of software artifacts
Stars: ✭ 37 (-91.4%)
Practical SqlCode and Data for the book "Practical SQL" by Anthony DeBarros, published by No Starch Press (2018).
Stars: ✭ 392 (-8.84%)
Data-ScienceUsing Kaggle Data and Real World Data for Data Science and prediction in Python, R, Excel, Power BI, and Tableau.
Stars: ✭ 15 (-96.51%)
kenchiA scikit-learn compatible library for anomaly detection
Stars: ✭ 36 (-91.63%)
SquealSqueal, a deep embedding of SQL in Haskell
Stars: ✭ 308 (-28.37%)
ClickhouseClickHouse® is a free analytics DBMS for big data
Stars: ✭ 21,089 (+4804.42%)
jdsJenesis Data Store: a dynamic, cross platform, high performance, ORM data-mapper. Designed to assist in rapid development and data mining
Stars: ✭ 17 (-96.05%)
MusoqUse SQL on various data sources
Stars: ✭ 252 (-41.4%)
ODSC India 2018My presentation at ODSC India 2018 about Deep Learning with Apache Spark
Stars: ✭ 26 (-93.95%)
SqlparserSimple SQL parser meant for querying CSV files
Stars: ✭ 249 (-42.09%)
LoukoumA simple SQL Query Builder
Stars: ✭ 305 (-29.07%)
E Commerce DbDatabase schema for e-commerce (webstores) sites.
Stars: ✭ 245 (-43.02%)
Mlinterview A curated awesome list of AI Startups in India & Machine Learning Interview Guide. Feel free to contribute!
Stars: ✭ 410 (-4.65%)
OcilibOCILIB (C and C++ Drivers for Oracle) - Open source C and C++ library for accessing Oracle databases
Stars: ✭ 245 (-43.02%)
InstrumentedsqlA sql driver that will wrap any other driver and log/trace all its calls
Stars: ✭ 244 (-43.26%)
SqlsSQL language server written in Go.
Stars: ✭ 301 (-30%)
DataATK Data - Data Access Framework for high-latency databases (Cloud SQL/NoSQL).
Stars: ✭ 243 (-43.49%)
UltimateppU++ is a C++ cross-platform rapid application development framework focused on programmer productivity. It includes a set of libraries (GUI, SQL, Network, etc.) and an integrated development environment (TheIDE).
Stars: ✭ 237 (-44.88%)
Baby squeel🐷 An expressive query DSL for Active Record 4 and 5
Stars: ✭ 362 (-15.81%)
ECG analysisNo description or website provided.
Stars: ✭ 32 (-92.56%)
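DatagearData visualization and analysis platform, developed in Java with a browser/server architecture, supporting multiple data sources including SQL, CSV, Excel, HTTP APIs, and JSON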
Stars: ✭ 266 (-38.14%)
vlainic.github.ioMy GitHub blog: things you might be interested in, and probably not...
Stars: ✭ 26 (-93.95%)
R-data-wranglingMaterials for my R data workshop. https://cengel.github.io/R-data-wrangling/
Stars: ✭ 17 (-96.05%)
CatenaCatena is a distributed database based on a blockchain, accessible using SQL.
Stars: ✭ 302 (-29.77%)
JsstoreA complete IndexedDB wrapper with SQL like syntax.
Stars: ✭ 430 (+0%)
Edge SqlCloudflare Workers providing a SQL API
Stars: ✭ 429 (-0.23%)
Sql ParserSQL Parser for C++. Building C++ object structure from SQL statements.
Stars: ✭ 420 (-2.33%)