data-algorithms-with-sparkO'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
Stars: ✭ 34 (+6.25%)
Mutual labels: machine-learning-algorithms, pyspark
autThe Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (+246.88%)
Mutual labels: hadoop, pyspark
calcuMLatorAn intelligently dumb calculator that uses machine learning
Stars: ✭ 30 (-6.25%)
Mutual labels: machine-learning-algorithms, regression-models
regression-pythonIn this repository you can find many different, small, projects which demonstrate regression techniques using python programming language
Stars: ✭ 15 (-53.12%)
Mutual labels: machine-learning-algorithms, regression-models
big dataA collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (+6.25%)
Mutual labels: hadoop, pyspark
basinBasin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser
Stars: ✭ 25 (-21.87%)
Mutual labels: hadoop, pyspark
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+368.75%)
Mutual labels: hadoop, pyspark
Devops Python Tools80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (+1168.75%)
Mutual labels: hadoop, pyspark
Sales-PredictionIn depth analysis and forecasting of product sales based on the items, stores, transaction and other dependent variables like holidays and oil prices.
Stars: ✭ 56 (+75%)
Mutual labels: machine-learning-algorithms, regression-models
pyspark-algorithmsPySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Stars: ✭ 72 (+125%)
Mutual labels: pyspark, rdd
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (+21.88%)
Mutual labels: hadoop, pyspark
flask-spark-dockerJust a boilerplate for PySpark and Flask
Stars: ✭ 32 (+0%)
Mutual labels: pyspark
sia-cogVarious cognitive api for machine learning, vision, language intent alalysis. Covers traditional as well as deep learning model design and training.
Stars: ✭ 34 (+6.25%)
Mutual labels: machine-learning-algorithms
SoomvaarSoomvaar is the repo which 🏩 contains different collection of 👨💻🚀code in Python and 💫✨Machine 👬🏼 learning algorithms📗📕 that is made during 📃 my practice and learning of ML and Python✨💥
Stars: ✭ 41 (+28.13%)
Mutual labels: machine-learning-algorithms
big-data-exploration[Archive] Intern project - Big Data Exploration using MongoDB - This Repository is NOT a supported MongoDB product
Stars: ✭ 43 (+34.38%)
Mutual labels: hadoop
oshinko-s2iThis is a place to put s2i images and utilities for spark application builders for openshift
Stars: ✭ 16 (-50%)
Mutual labels: pyspark
mnist-neural-network-deeplearnjs🍃 Using a Neural Network to recognize MNIST digets in JavaScript.
Stars: ✭ 26 (-18.75%)
Mutual labels: machine-learning-algorithms