All Projects → ColasGael → Machine Learning For Solar Energy Prediction

ColasGael / Machine Learning For Solar Energy Prediction

Licence: mit
Predict the Power Production of a solar panel farm from Weather Measurements using Machine Learning

Programming Languages

python
139335 projects - #7 most used programming language
matlab
3953 projects

Projects that are alternatives of or similar to Machine Learning For Solar Energy Prediction

Rapidtables
Super fast list of dicts to pre-formatted tables conversion library for Python 2/3
Stars: ✭ 292 (+210.64%)
Mutual labels:  data-processing
Texar Pytorch
Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 636 (+576.6%)
Mutual labels:  data-processing
2019 Electronic Design Competition
【电赛】2019 全国大学生电子设计竞赛 (F题)纸张数量检测装置 (基于STM32F407 & FDC2214 & USART HMI)
Stars: ✭ 53 (-43.62%)
Mutual labels:  data-processing
Dali
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Stars: ✭ 3,624 (+3755.32%)
Mutual labels:  data-processing
Awesome Web Scraping
List of libraries, tools and APIs for web scraping and data processing.
Stars: ✭ 4,510 (+4697.87%)
Mutual labels:  data-processing
Data Science On Gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+819.15%)
Mutual labels:  data-processing
prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (-42.55%)
Mutual labels:  data-processing
Forte
Forte is a flexible and powerful NLP builder FOR TExt. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 89 (-5.32%)
Mutual labels:  data-processing
Pandera
A light-weight, flexible, and expressive pandas data validation library
Stars: ✭ 506 (+438.3%)
Mutual labels:  data-processing
Cbrain
CBRAIN is a flexible Ruby on Rails framework for accessing and processing of large data on high-performance computing infrastructures.
Stars: ✭ 51 (-45.74%)
Mutual labels:  data-processing
Eternal
👾~ music, eternal ~ 👾
Stars: ✭ 323 (+243.62%)
Mutual labels:  data-processing
Awesome Kafka
A list about Apache Kafka
Stars: ✭ 397 (+322.34%)
Mutual labels:  data-processing
Tdm
R package for normalizing RNA-seq data to make them comparable to microarray data.
Stars: ✭ 33 (-64.89%)
Mutual labels:  data-processing
Nonechucks
Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
Stars: ✭ 304 (+223.4%)
Mutual labels:  data-processing
Pulsar Spark
When Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-41.49%)
Mutual labels:  data-processing
Hub
Dataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+4158.51%)
Mutual labels:  data-processing
Dataflowjavasdk
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+808.51%)
Mutual labels:  data-processing
Broadway
Concurrent and multi-stage data ingestion and data processing with Elixir
Stars: ✭ 1,310 (+1293.62%)
Mutual labels:  data-processing
Dialogpt
Large-scale pretraining for dialogue
Stars: ✭ 1,177 (+1152.13%)
Mutual labels:  data-processing
Mdsplus
The MDSplus data management system
Stars: ✭ 47 (-50%)
Mutual labels:  data-processing

Machine-Learning-for-Solar-Energy-Prediction

by Adele Kuzmiakova, Gael Colas and Alex McKeehan, graduate students from Stanford University

This is our final project for the CS229: "Machine Learning" class in Stanford (2017). Our teachers were Pr. Andrew Ng and Pr. Dan Boneh.

Language: Python, Matlab, R

Goal: predict the hourly power production of a photovoltaic power station from the measurements of a set of weather features.

This project could be decomposed in 3 parts:

  • Data Pre-processing: we processed the raw weather data files (input) from the National Oceanographic and Atmospheric Administration and the power production data files (output) from Urbana-Champaign solar farm to get meaningful numeric values on an hourly basis ;
  • Feature Selection: we run correlation analysis between the weather features and the energy output to discard useless features, we also implemented Principal Component Analysis to reduce the dimension of our dataset ;
  • Machine Learning : we compared the performances of our ML algorithms. Implemented models include Weighted Linear Regression with and without dimension reduction, Boosting Regression Trees, and artificial Neural Networks with and without vanishing temporal gradient

Our final report and poster are available at the root.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].