Sdc: Intel® Scalable Dataframe Compiler for Pandas*
Stars: ✭ 623 (+2125%)
clisops: Climate Simulation Operations
Stars: ✭ 17 (-39.29%)
bigstatsr: R package for statistical tools with big matrices stored on disk.
Stars: ✭ 139 (+396.43%)
SBTi-finance-tool: This toolkit helps companies and financial institutions to assess the temperature alignment of current targets, commitments, and investment and lending portfolios, and to use this information to develop targets for official validation by the SBTi. See the wiki for a change log.
Stars: ✭ 39 (+39.29%)
Geni: A Clojure dataframe library that runs on Spark
Stars: ✭ 152 (+442.86%)
awesome-climate-data: Data sources, programming libraries and open source organisations that are working on the climate emergency
Stars: ✭ 17 (-39.29%)
Spark With Python: Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+435.71%)
pcmdi metrics: Self-contained packages to run PCMDI Metrics
Stars: ✭ 44 (+57.14%)
xarrayutils: xarrayutils.readthedocs.io/
Stars: ✭ 50 (+78.57%)
hockeystick: Download and Visualize Essential Global Heating Data in R
Stars: ✭ 42 (+50%)
sparkucx: A high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing the UCX communication layer
Stars: ✭ 32 (+14.29%)
student-grade-analytics: Analyse academic and non-academic information of students and predict grades
Stars: ✭ 17 (-39.29%)
goes2go: Download and process GOES-16 and GOES-17 data from NOAA's archive on AWS using Python.
Stars: ✭ 77 (+175%)
IoT-system-PLC-data-to-InfluxDB: This project aims to provide free software to fetch data from PLCs (Siemens S7-300/400/1200/1500) and store it. The stack used is completely open source. InfluxDB is used for data storage, so the application follows the Big Data paradigm.
Stars: ✭ 26 (-7.14%)
Draco: DRACO: Byzantine-resilient Distributed Training via Redundant Gradients
Stars: ✭ 21 (-25%)
mljar-api-R: R wrapper for MLJAR API
Stars: ✭ 16 (-42.86%)
spark-root: Apache Spark Data Source for ROOT File Format
Stars: ✭ 28 (+0%)
rastercube: rastercube is a Python library for big data analysis of georeferenced time series data (e.g. MODIS NDVI)
Stars: ✭ 15 (-46.43%)
nebula: A distributed, fast open-source graph database featuring horizontal scalability and high availability
Stars: ✭ 8,196 (+29171.43%)
Wharton Stat 422 722: The official class webpage for Statistics 422/722 taught at Wharton in the Spring of 2017
Stars: ✭ 14 (-50%)
cft: Climate futures toolbox: easy MACA (MACAv2) climate data access 📦
Stars: ✭ 16 (-42.86%)
cloudberry: Big Data Visualization
Stars: ✭ 89 (+217.86%)
scarf: Toolkit for highly memory-efficient analysis of single-cell RNA-Seq, scATAC-Seq and CITE-Seq data. Analyze atlas-scale datasets with millions of cells on a laptop.
Stars: ✭ 54 (+92.86%)
beekeeper: Service for automatically managing and cleaning up unreferenced data
Stars: ✭ 43 (+53.57%)
nbodykit: Analysis kit for large-scale structure datasets, the massively parallel way
Stars: ✭ 93 (+232.14%)
SGDLibrary: MATLAB/Octave library for stochastic optimization algorithms: Version 1.0.20
Stars: ✭ 165 (+489.29%)
RemoteShuffleService: Celeborn provides an elastic and high-performance service for shuffle and spilled data.
Stars: ✭ 262 (+835.71%)
java-multithread: Code written for the Multithreading with Java course on the RinaldoDev YouTube channel.
Stars: ✭ 24 (-14.29%)
ParallelKMeans.jl: Parallel & lightning fast implementation of available classic and contemporary variants of the KMeans clustering algorithm
Stars: ✭ 45 (+60.71%)
siembol: An open-source, real-time Security Information & Event Management tool based on big data technologies, providing a scalable, advanced security analytics framework.
Stars: ✭ 153 (+446.43%)
dxram: A distributed in-memory key-value storage for billions of small objects.
Stars: ✭ 25 (-10.71%)
mptrac: Massive-Parallel Trajectory Calculations (MPTRAC) is a Lagrangian particle dispersion model for the analysis of atmospheric transport processes in the troposphere and stratosphere.
Stars: ✭ 19 (-32.14%)
img2dataset: Easily turn large sets of image URLs into an image dataset. Can download, resize and package 100M URLs in 20h on one machine.
Stars: ✭ 1,173 (+4089.29%)
matrix multiplication: Parallel Matrix Multiplication Using OpenMP, Pthreads, and MPI
Stars: ✭ 41 (+46.43%)
CS Book: 🔥 Latest computer science e-books. Provides downloads of the latest technical e-books. "All I want is to out-compete every one of you, or be out-competed by you!"
Stars: ✭ 40 (+42.86%)
GDLibrary: MATLAB library for gradient descent algorithms: Version 1.0.1
Stars: ✭ 50 (+78.57%)
lcbo-api: A crawler and API server for Liquor Control Board of Ontario retail data
Stars: ✭ 152 (+442.86%)
claire: Constrained Large Deformation Diffeomorphic Image Registration (CLAIRE)
Stars: ✭ 30 (+7.14%)
datalake-etl-pipeline: Simplified ETL process in Hadoop using Apache Spark. Includes a complete ETL pipeline for a data lake, with SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations.
Stars: ✭ 39 (+39.29%)
gan deeplearning4j: Automatic feature engineering with Generative Adversarial Networks, using Deeplearning4j and Apache Spark.
Stars: ✭ 19 (-32.14%)
boxtree: Quad/octree building for FMMs in Python and OpenCL
Stars: ✭ 52 (+85.71%)
inmetr: DEPRECATED. An R package to import historical data from Brazilian meteorological stations
Stars: ✭ 18 (-35.71%)
PSyclone: Domain-specific compiler for Finite Difference/Volume/Element Earth-system models in Fortran
Stars: ✭ 67 (+139.29%)
incubator-liminal: Apache Liminal's goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful experiment to an automated pipeline of model training, validation, deployment and inference in production. Liminal provides a Domain Specific Language to build ML workflows on top of Apache Airflow.
Stars: ✭ 117 (+317.86%)
automile-php: Automile offers a simple, smart, cutting-edge telematics solution for businesses to track and manage their business vehicles.
Stars: ✭ 28 (+0%)
FlameStream: Distributed stream processing model and its implementation
Stars: ✭ 14 (-50%)
lubeck: High-level linear algebra library for Dlang
Stars: ✭ 57 (+103.57%)
raccoon: Massively parallel FEM code for phase-field for fracture by Dolbow Lab at Duke University
Stars: ✭ 21 (-25%)
foreach: R package to provide foreach looping construct
Stars: ✭ 40 (+42.86%)
xarray-sentinel: Xarray backend to Copernicus Sentinel-1 satellite data products
Stars: ✭ 189 (+575%)