1. Gemm hlsScalable systolic array-based matrix-matrix multiplication implemented in Vivado HLS for Xilinx FPGAs.
2. DaceDaCe - Data Centric Parallel Programming
3. FblasBLAS implementation for Intel FPGA
4. SMIStreaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware
5. nccNeural Code Comprehension: A Learnable Representation of Code Semantics
6. npbenchNPBench - A Benchmarking Suite for High-Performance NumPy
7. pymlirPython interface for MLIR - the Multi-Level Intermediate Representation
8. deep-weatherDeep Learning for Post-Processing Ensemble Weather Forecasts
10. pspinPsPIN: A RISC-V in-network accelerator for flexible high-performance low-power packet processing