Pyspark Setup DemoDemo of PySpark and Jupyter Notebook with the Jupyter Docker Stacks
Stars: ✭ 24 (-73.63%)
BtctradingTime Series Forecast with Bitcoin value, to detect upward/down trends with Machine Learning Algorithms
Stars: ✭ 99 (+8.79%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+64.84%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+1370.33%)
StocksPrograms for stock prediction and evaluation
Stars: ✭ 155 (+70.33%)
HandysparkHandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (+73.63%)
TcdfTemporal Causal Discovery Framework (PyTorch): discovering causal relationships between time series
Stars: ✭ 217 (+138.46%)
big dataA collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (-62.64%)
pyspark-cheatsheetPySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster
Stars: ✭ 115 (+26.37%)
Sci PypeA Machine Learning API with native redis caching and export + import using S3. Analyze entire datasets using an API for building, training, testing, analyzing, extracting, importing, and archiving. This repository can run from a docker container or from the repository.
Stars: ✭ 90 (-1.1%)
Stock Price PredictorThis project seeks to utilize Deep Learning models, Long-Short Term Memory (LSTM) Neural Network algorithm, to predict stock prices.
Stars: ✭ 146 (+60.44%)
DatasciencevmTools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
Stars: ✭ 153 (+68.13%)
Spark PracticeApache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (+119.78%)
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-57.14%)
SynapseMLSimple and Distributed Machine Learning
Stars: ✭ 3,355 (+3586.81%)
Stock AnalysisRegression, Scrapers, and Visualization
Stars: ✭ 255 (+180.22%)
Spark SyntaxThis is a repo documenting the best practices in PySpark.
Stars: ✭ 412 (+352.75%)
CortxCORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (+368.13%)
CoursesQuiz & Assignment of Coursera
Stars: ✭ 454 (+398.9%)
Spark Movie LensAn on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (+718.68%)
Deep Learning Time SeriesList of papers, code and experiments using deep learning for time series forecasting
Stars: ✭ 796 (+774.73%)
100daysofmlcodeMy journey to learn and grow in the domain of Machine Learning and Artificial Intelligence by performing the #100DaysofMLCode Challenge.
Stars: ✭ 146 (+60.44%)
Repo 2019BERT, AWS RDS, AWS Forecast, EMR Spark Cluster, Hive, Serverless, Google Assistant + Raspberry Pi, Infrared, Google Cloud Platform Natural Language, Anomaly detection, Tensorflow, Mathematics
Stars: ✭ 133 (+46.15%)
ClinicalbertClinicalBERT: Modeling Clinical Notes and Predicting Hospital Readmission (CHIL 2020 Workshop)
Stars: ✭ 175 (+92.31%)
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+983.52%)
Gdax Orderbook MlApplication of machine learning to the Coinbase (GDAX) orderbook
Stars: ✭ 60 (-34.07%)
W2vWord2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-29.67%)
pyspark-algorithmsPySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Stars: ✭ 72 (-20.88%)
Coinpusher📈 real-time cryptocurrency chart prediction based on neuronal-networks
Stars: ✭ 141 (+54.95%)
check-engineData validation library for PySpark 3.0.0
Stars: ✭ 29 (-68.13%)
mmtf-workshop-2018Structural Bioinformatics Training Workshop & Hackathon 2018
Stars: ✭ 50 (-45.05%)
autThe Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (+21.98%)
Griffon VmGriffon Data Science Virtual Machine
Stars: ✭ 128 (+40.66%)
Stock Prediction ModelsGathers machine learning and deep learning models for Stock forecasting including trading bots and simulations
Stars: ✭ 4,660 (+5020.88%)
PythonThis repository helps you understand python from the scratch.
Stars: ✭ 285 (+213.19%)
H2o 3H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+6115.38%)
Esper TvEsper instance for TV news analysis
Stars: ✭ 37 (-59.34%)
Attentive Neural Processesimplementing "recurrent attentive neural processes" to forecast power usage (w. LSTM baseline, MCDropout)
Stars: ✭ 33 (-63.74%)
Pysparkgeoanalysis🌐 Interactive Workshop on GeoAnalysis using PySpark
Stars: ✭ 63 (-30.77%)
SparkmagicJupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (+948.35%)
Stock Market Analysis And PredictionStock Market Analysis and Prediction is the project on technical analysis, visualization and prediction using data provided by Google Finance.
Stars: ✭ 112 (+23.08%)
Pythondatarepo for code published on pythondata.com
Stars: ✭ 113 (+24.18%)
Trading BotStock Trading Bot using Deep Q-Learning
Stars: ✭ 273 (+200%)
SkymapHigh-throughput gene to knowledge mapping through massive integration of public sequencing data.
Stars: ✭ 29 (-68.13%)