All Projects → ecrc → exageostat

ecrc / exageostat

Licence: BSD-3-Clause license
A High Performance Unified Framework for Geostatistics on Manycore Systems.

Programming Languages

c
50402 projects - #5 most used programming language
shell
77523 projects
CMake
9771 projects

Projects that are alternatives of or similar to exageostat

pf
PF: a header only template library for fast particle filtering!
Stars: ✭ 41 (+105%)
Mutual labels:  statistical-models
Vision CoreML-App
This app predicts the age of a person from the picture input using camera or photos gallery. The app uses Core ML framework of iOS for the predictions. The Vision library of CoreML is used here. The trained model fed to the system is AgeNet.
Stars: ✭ 15 (-25%)
Mutual labels:  prediction
ichingshifa
Python 易經筮法、大衍之數、六十四卦、六爻、京房易、爻辭、日期占卦、占卜。A python package of Ichingshifa (also known as Yarrow Stalks Divination).
Stars: ✭ 23 (+15%)
Mutual labels:  prediction
The-Purchase-and-Redemption-Forecast-Challenge-baseline
天池“资金流入流出预测——挑战baseline”的解决方案,线上效果143.5
Stars: ✭ 78 (+290%)
Mutual labels:  prediction
ANTsR
Advanced Normalization Tools in R
Stars: ✭ 101 (+405%)
Mutual labels:  prediction
FTRLProximal
R package for online training of regression models using FTRL Proximal
Stars: ✭ 12 (-40%)
Mutual labels:  prediction
TensorFlow CNN
Example CNN on CIFAR-10 classification
Stars: ✭ 14 (-30%)
Mutual labels:  prediction
point-cloud-prediction
Self-supervised Point Cloud Prediction Using 3D Spatio-temporal Convolutional Networks
Stars: ✭ 97 (+385%)
Mutual labels:  prediction
Word-Prediction-Ngram
Next Word Prediction using n-gram Probabilistic Model with various Smoothing Techniques
Stars: ✭ 25 (+25%)
Mutual labels:  prediction
hector
The Hector Simple Climate Model
Stars: ✭ 72 (+260%)
Mutual labels:  climate-model
SGP.NET
C# SGP4 Satellite Prediction Library. Load satellites from TLEs, convert between coordinate systems and reference frames, observe satellites from ground stations, and more.
Stars: ✭ 15 (-25%)
Mutual labels:  prediction
Facial-Recognition-Tool
➰ A Facial Recognition tool based on Open-CV.
Stars: ✭ 44 (+120%)
Mutual labels:  prediction
pyGeoStatistics
Geostatistics in Python
Stars: ✭ 70 (+250%)
Mutual labels:  geostatistics
blockchain-predictor
Deep leraning cryptocurrency prediction with blockchain-based dataset
Stars: ✭ 34 (+70%)
Mutual labels:  prediction
AgeEstimateAdience
Age and Gender Estimation Using Convolutional Neural Network
Stars: ✭ 42 (+110%)
Mutual labels:  prediction
clarktech-ncaab-predictor
A machine learning project to predict NCAA Men's Basketball outcomes
Stars: ✭ 24 (+20%)
Mutual labels:  prediction
Tensorflow-Wide-Deep-Local-Prediction
This project demonstrates how to run and save predictions locally using exported tensorflow estimator model
Stars: ✭ 28 (+40%)
Mutual labels:  prediction
arima
ARIMA, SARIMA, SARIMAX and AutoARIMA models for time series analysis and forecasting in the browser and Node.js
Stars: ✭ 31 (+55%)
Mutual labels:  prediction
node apply-magic-sauce
No description or website provided.
Stars: ✭ 47 (+135%)
Mutual labels:  prediction
models-by-example
By-hand code for models and algorithms. An update to the 'Miscellaneous-R-Code' repo.
Stars: ✭ 43 (+115%)
Mutual labels:  maximum-likelihood-estimation

What is ExaGeoStat?

The Exascale GeoStatistics project (ExaGeoStat) is a parallel high performance unified framework for computational geostatistics on many-core systems. The project aims at optimizing the likelihood function for a given spatial data to provide an efficient way to predict missing observations in the context of climate/weather forecasting applications. This machine learning framework proposes a unified simulation code structure to target various hardware architectures, from commodity x86 to GPU accelerator-based shared and distributed-memory systems. ExaGeoStat enables statisticians to tackle computationally challenging scientific problems at large-scale, while abstracting the hardware complexity, through state-of-the-art high performance linear algebra software libraries.

Vision of ExaGeoStat

ExaGeoStat is a collaboration between the KAUST Statistics group and the Extreme Computing Research Center (ECRC). Its contribution lies not in a new algorithm nor in a new data set, but in demonstrating the routine use of the larger data sets becoming available to geospatial statisticians, thanks to the implementation of state-of-the-art statistical algorithms on high-performance computing (HPC) hardware.

We have built a standalone software framework (ExaGeoStat) that is able to run on a variety of hardware resources, including GPUs and massive distributed systems such as Shaheen, KAUST's Cray XC40 supercomputer, and to create a statistical model to predict environmental data (i.e., temperature, flow rates, soil moisture, wind speed, etc.) at spatial locations on which data is missing, and to exploit large amounts of data to reduce the effect of individual measurement errors. The best-known methods for such statistical processing have a cost that grows rapidly in the size of the data set, namely, in proportion to its cube, or third power. Thus, increasing the size of data set by a factor ten drives up the cost of the computation by a factor of a thousand, while simultaneously driving up the memory requirements by a factor o hundred.

For instance, according to this cubic growth in complexity, a computation that requires one minute would require nearly 17 hours on a data set just ten times larger. This creates a computational strain on standard statistics software, for which contemporary data sizes were not anticipated; and even if possible, it puts the computation beyond the interactive attention span of the analyst. Parallelism (assigning thousands of processors to the single task) and Moore's Law allow leading edge computers to handle such "big data" with ease, but the software bridge must be built. Furthermore, the software interface must resemble the interactive one with which working statisticians are familiar.

To summarize, the combination of emerging computing capabilities and emerging datasets promises significant advances in statistical analyses of environmental and many other phenomena. Such cross-disciplinary advances are natural at KAUST, which is why this relatively low-hanging fruit was ours to harvest earliest. Our roadmap takes now ExaGeoStat a step further on the algorithmic side by integrating tile low-rank matrix approximation. This low-rank matrix approximation permits to exploit the data sparisty of the operator with a user-controlled numerical accuracy. This further expands practical problem sizes for statisticians with modest computational resources.

Current Version: 1.1.0

Current Features of ExaGeoStat

Operations:

  1. Large-scale synthetic matrix generation.
  2. Univariate and bivariate modeling using dense, Tile Low-Rank (TLR), Diagonal Super-Tile (DST), and mixed-precision computation.
  3. Univariate and bivariate Predicting large-scale unknown measures in predefined geospatial locations.
  4. Univariate and bivariate parameter estimation assessment using MLOE/MMOM criteria.

Programming models:

  1. MPI
  2. Task-based programming models

External libraries:

  1. StarPU dynamic runtime system
  2. HiCMA
  3. Stars-H
  4. Chameleon

Installation

Installation requires at least CMake of version 3.2.3. To build ExaGeoStat, please follow these instructions:

  1. Get from git repository

    git clone [email protected]:ecrc/exageostat
    

    or

    git clone https://github.com/ecrc/exageostat
    
  2. Go into exageostat folder

    cd exageostat
    
  3. Get submodules

    git submodule update --init --recursive
    
  4. Create build directory and go there

    mkdir build && cd build
    
  5. Use CMake to get all the dependencies

    cmake .. -DCMAKE_INSTALL_PREFIX=/path/to/install/  -DEXAGEOSTAT_SCHED_STARPU=ON   -DEXAGEOSTAT_USE_NETCDF=ON -DEXAGEOSTAT_USE_HICMA=ON
    
  6. Build EXAGEOSTAT

    make -j
    
  7. Build local documentation (optional)

    make docs
    
  8. Install EXAGEOSTAT

    make install
    
  9. Add line

    export PKG_CONFIG_PATH=/path/to/install/lib/pkgconfig:$PKG_CONFIG_PATH
    

    to your .bashrc file.

Now you can use pkg-config executable to collect compiler and linker flags for EXAGEOSTAT.

References

  1. Sameh Abdulah, Hatem Ltaief, Ying Sun, Marc G. Genton, and David Keyes. "ExaGeoStat: A High Performance Unified Software for Geostatistics on Manycore Systems," IEEE Transactions on Parallel and Distributed Systems (2018).

  2. Sameh Abdulah, Hatem Ltaief, Ying Sun, Marc G. Genton, and David Keyes. "Parallel Approximation of the Maximum Likelihood Estimation for the Prediction of Large-Scale Geostatistics Simulations," IEEE Cluster Conference, Belfast, UK, Septemeber, 2018.

  3. Sameh Abdulah, Hatem Ltaief, Ying Sun, Marc G. Genton, and David Keyes. "Geostatistical Modeling and Prediction Using Mixed Precision Tile Cholesky Factorization." 2019 IEEE 26th International Conference on High Performance Computing, Data, and Analytics (HiPC). IEEE, 2019.

Handout

More information can be found in this handout: Handout

multivariate-modeling

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].