calcuMLatorAn intelligently dumb calculator that uses machine learning
Stars: ✭ 30 (-6.25%)
big dataA collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (+6.25%)
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (+21.88%)
autThe Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (+246.88%)
pyspark-algorithmsPySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Stars: ✭ 72 (+125%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+368.75%)
Sales-PredictionIn depth analysis and forecasting of product sales based on the items, stores, transaction and other dependent variables like holidays and oil prices.
Stars: ✭ 56 (+75%)
basinBasin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser
Stars: ✭ 25 (-21.87%)
Devops Python Tools80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (+1168.75%)
regression-pythonIn this repository you can find many different, small, projects which demonstrate regression techniques using python programming language
Stars: ✭ 15 (-53.12%)
awesome-computer-vision-modelsA list of popular deep learning models related to classification, segmentation and detection problems
Stars: ✭ 419 (+1209.38%)
SoomvaarSoomvaar is the repo which 🏩 contains different collection of 👨💻🚀code in Python and 💫✨Machine 👬🏼 learning algorithms📗📕 that is made during 📃 my practice and learning of ML and Python✨💥
Stars: ✭ 41 (+28.13%)
hive to es同步Hive数据仓库数据到Elasticsearch的小工具
Stars: ✭ 21 (-34.37%)
oshinko-s2iThis is a place to put s2i images and utilities for spark application builders for openshift
Stars: ✭ 16 (-50%)
MLDemosMachine Learning Demonstrations: A graphical interface to draw data, apply a diverse array of machine learning tools to it, and directly see the results in a visual and understandable manner.
Stars: ✭ 46 (+43.75%)
smart-data-lakeSmart Automation Tool for building modern Data Lakes and Data Pipelines
Stars: ✭ 79 (+146.88%)
mlreefThe collaboration workspace for Machine Learning
Stars: ✭ 1,409 (+4303.13%)
sparserega collection of modern sparse (regularized) linear regression algorithms.
Stars: ✭ 55 (+71.88%)
rankpruning🧹 Formerly for binary classification with noisy labels. Replaced by cleanlab.
Stars: ✭ 81 (+153.13%)
bihmBidirectional Helmholtz Machines
Stars: ✭ 40 (+25%)
srqmAn introductory statistics course for social scientists, using Stata
Stars: ✭ 43 (+34.38%)
broomExtraHelpers for regression analyses using `{broom}` & `{easystats}` packages 📈 🔍
Stars: ✭ 45 (+40.63%)
cheapmlMachine Learning algorithms coded from scratch
Stars: ✭ 17 (-46.87%)
greycatGreyCat - Data Analytics, Temporal data, What-if, Live machine learning
Stars: ✭ 104 (+225%)
sia-cogVarious cognitive api for machine learning, vision, language intent alalysis. Covers traditional as well as deep learning model design and training.
Stars: ✭ 34 (+6.25%)
pycobrapython library implementing ensemble methods for regression, classification and visualisation tools including Voronoi tesselations.
Stars: ✭ 111 (+246.88%)
big-data-exploration[Archive] Intern project - Big Data Exploration using MongoDB - This Repository is NOT a supported MongoDB product
Stars: ✭ 43 (+34.38%)
the-apache-ignite-bookAll code samples, scripts and more in-depth examples for The Apache Ignite Book. Include Apache Ignite 2.6 or above
Stars: ✭ 65 (+103.13%)
learning-hadoop-and-sparkCompanion to Learning Hadoop and Learning Spark courses on Linked In Learning
Stars: ✭ 146 (+356.25%)
openPDCOpen Source Phasor Data Concentrator
Stars: ✭ 109 (+240.63%)
OLSTECOnLine Low-rank Subspace tracking by TEnsor CP Decomposition in Matlab: Version 1.0.1
Stars: ✭ 30 (-6.25%)
anovosAnovos - An Open Source Library for Scalable feature engineering Using Apache-Spark
Stars: ✭ 77 (+140.63%)
books-ML-and-DL.pdf Format Books for Machine and Deep Learning
Stars: ✭ 105 (+228.13%)
ml course"Learning Machine Learning" Course, Bogotá, Colombia 2019 #LML2019
Stars: ✭ 22 (-31.25%)
webhdfsNode.js WebHDFS REST API client
Stars: ✭ 88 (+175%)
Multi-Type-TD-TSRExtracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:
Stars: ✭ 174 (+443.75%)
dpkb大数据相关内容汇总,包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词:Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse
Stars: ✭ 123 (+284.38%)
deep-blueberryIf you've always wanted to learn about deep-learning but don't know where to start, then you might have stumbled upon the right place!
Stars: ✭ 17 (-46.87%)
LogAnalyzeHelper论坛日志分析系统清洗程序(包含IP规则库,UDF开发,MapReduce程序,日志数据)
Stars: ✭ 33 (+3.13%)
bbaiSet model hyperparameters using deterministic, exact algorithms.
Stars: ✭ 19 (-40.62%)
xgboost-smote-detect-fraudCan we predict accurately on the skewed data? What are the sampling techniques that can be used. Which models/techniques can be used in this scenario? Find the answers in this code pattern!
Stars: ✭ 59 (+84.38%)
Statistical-Learning-using-RThis is a Statistical Learning application which will consist of various Machine Learning algorithms and their implementation in R done by me and their in depth interpretation.Documents and reports related to the below mentioned techniques can be found on my Rpubs profile.
Stars: ✭ 27 (-15.62%)
iisInformation Inference Service of the OpenAIRE system
Stars: ✭ 16 (-50%)
MineColabRun Minecraft Server on Google Colab.
Stars: ✭ 135 (+321.88%)