ColasGael / Machine Learning For Solar Energy Prediction
License: MIT
Predict the Power Production of a solar panel farm from Weather Measurements using Machine Learning
Stars: ✭ 94
Projects that are alternatives or similar to Machine Learning For Solar Energy Prediction
Rapidtables
Super fast list of dicts to pre-formatted tables conversion library for Python 2/3
Stars: ✭ 292 (+210.64%)
Mutual labels: data-processing
Texar Pytorch
Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 636 (+576.6%)
Mutual labels: data-processing
2019 Electronic Design Competition
[Electronic Design Contest] 2019 National Undergraduate Electronic Design Contest (Problem F): paper count detection device (based on STM32F407 & FDC2214 & USART HMI)
Stars: ✭ 53 (-43.62%)
Mutual labels: data-processing
Dali
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Stars: ✭ 3,624 (+3755.32%)
Mutual labels: data-processing
Awesome Web Scraping
List of libraries, tools and APIs for web scraping and data processing.
Stars: ✭ 4,510 (+4697.87%)
Mutual labels: data-processing
Data Science On Gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+819.15%)
Mutual labels: data-processing
prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (-42.55%)
Mutual labels: data-processing
Forte
Forte is a flexible and powerful NLP builder FOR TExt. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 89 (-5.32%)
Mutual labels: data-processing
Pandera
A light-weight, flexible, and expressive pandas data validation library
Stars: ✭ 506 (+438.3%)
Mutual labels: data-processing
Cbrain
CBRAIN is a flexible Ruby on Rails framework for accessing and processing of large data on high-performance computing infrastructures.
Stars: ✭ 51 (-45.74%)
Mutual labels: data-processing
Tdm
R package for normalizing RNA-seq data to make them comparable to microarray data.
Stars: ✭ 33 (-64.89%)
Mutual labels: data-processing
Nonechucks
Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
Stars: ✭ 304 (+223.4%)
Mutual labels: data-processing
Pulsar Spark
When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-41.49%)
Mutual labels: data-processing
Hub
Dataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+4158.51%)
Mutual labels: data-processing
Dataflowjavasdk
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+808.51%)
Mutual labels: data-processing
Broadway
Concurrent and multi-stage data ingestion and data processing with Elixir
Stars: ✭ 1,310 (+1293.62%)
Mutual labels: data-processing
Dialogpt
Large-scale pretraining for dialogue
Stars: ✭ 1,177 (+1152.13%)
Mutual labels: data-processing
Machine-Learning-for-Solar-Energy-Prediction
by Adele Kuzmiakova, Gael Colas and Alex McKeehan, graduate students from Stanford University
This is our final project for the CS229 "Machine Learning" class at Stanford (2017). Our instructors were Prof. Andrew Ng and Prof. Dan Boneh.
Language: Python, Matlab, R
Goal: predict the hourly power production of a photovoltaic power station from the measurements of a set of weather features.
The project can be divided into three parts:
- Data Pre-processing: we processed the raw weather data files (input) from the National Oceanic and Atmospheric Administration and the power production data files (output) from the Urbana-Champaign solar farm to obtain meaningful numeric values on an hourly basis;
- Feature Selection: we ran a correlation analysis between the weather features and the energy output to discard uninformative features, and we implemented Principal Component Analysis to reduce the dimension of our dataset;
- Machine Learning: we compared the performance of our ML algorithms. Implemented models include Weighted Linear Regression with and without dimension reduction, Boosting Regression Trees, and artificial Neural Networks with and without vanishing temporal gradient.
Our final report and poster are available at the root of the repository.
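The first two parts can be illustrated with a minimal sketch. It is not the project's actual code: the function names (`hourly_mean`, `pearson`, `select_features`), the feature names, the sample values, and the 0.3 correlation threshold are all assumptions made for illustration. It averages sub-hourly readings into hourly values, then keeps only the features whose absolute Pearson correlation with the power output reaches the threshold. PCA and the regression models are not covered here.

```python
from collections import defaultdict
from datetime import datetime
from math import sqrt


def hourly_mean(records):
    """Average (timestamp, value) readings that fall within the same hour."""
    buckets = defaultdict(list)
    for ts, value in records:
        hour = ts.replace(minute=0, second=0, microsecond=0)
        buckets[hour].append(value)
    return {hour: sum(vals) / len(vals) for hour, vals in sorted(buckets.items())}


def pearson(xs, ys):
    """Pearson correlation coefficient; 0.0 if either series is constant."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def select_features(features, target, threshold=0.3):
    """Keep features whose |correlation| with the target reaches the threshold.

    The default threshold is an arbitrary, hypothetical choice.
    """
    return [name for name, values in features.items()
            if abs(pearson(values, target)) >= threshold]


# Made-up sub-hourly readings, for illustration only.
readings = [
    (datetime(2017, 6, 1, 10, 0), 600.0),
    (datetime(2017, 6, 1, 10, 30), 700.0),
    (datetime(2017, 6, 1, 11, 15), 800.0),
]
hourly = hourly_mean(readings)

# Made-up hourly features and power output, for illustration only.
features = {
    "irradiance": [100.0, 300.0, 500.0, 700.0],
    "cloud_cover": [0.9, 0.6, 0.4, 0.1],
    "station_id": [7.0, 7.0, 7.0, 7.0],  # constant, carries no information
}
power = [10.0, 32.0, 49.0, 71.0]
selected = select_features(features, power)
```

Here the two readings taken during the 10:00 hour are merged into a single hourly value, and the constant `station_id` column is dropped because it has no correlation with the output. Filtering on the absolute value keeps strongly negative predictors such as the hypothetical `cloud_cover`.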