SparkApache Spark is a fast, in-memory data processing engine with elegant and expressive development API's to allow data workers to efficiently execute streaming, machine learning or SQL workloads that require fast iterative access to datasets.This project will have sample programs for Spark in Scala language .
Stars: ✭ 55 (+175%)
foliaFoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for proces…
Stars: ✭ 56 (+180%)
js-cfb💾 OLE File Container Format
Stars: ✭ 54 (+170%)
Gfa SpecGraphical Fragment Assembly (GFA) Format Specification
Stars: ✭ 117 (+485%)
MP4ParseC++ library for MP4 file parsing.
Stars: ✭ 55 (+175%)
miniparquetLibrary to read a subset of Parquet files
Stars: ✭ 38 (+90%)
GbxDumpA Microsoft Windows application that displays the contents of the file header of *.Gbx files used by the Nadeo game engine GameBox.
Stars: ✭ 19 (-5%)
Awkward 0.xManipulate arrays of complex data structures as easily as Numpy.
Stars: ✭ 216 (+980%)
MatioMATLAB MAT File I/O Library
Stars: ✭ 206 (+930%)
aoc-mgx-formatAge of Empires: The Conquerors - Savegame File Format
Stars: ✭ 56 (+180%)
TinyMATC/C++ library to handle writing simple Matlab(r) MAT file
Stars: ✭ 22 (+10%)
Python AltiumAltium schematic format documentation, SVG converter and TK viewer
Stars: ✭ 112 (+460%)
geosparkbring sf to spark in production
Stars: ✭ 53 (+165%)
Json Photoshop ScriptingJSON Photoshop Scripting project: alternative way of scripting Photoshop in JavaScript, based on JSON.
Stars: ✭ 42 (+110%)
jhdfA pure Java HDF5 library
Stars: ✭ 83 (+315%)
Bigdata PlaygroundA complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (+785%)
Tinyply 🌍 C++11 ply 3d mesh format importer & exporter
Stars: ✭ 358 (+1690%)
ParquetviewerSimple windows desktop application for viewing & querying Apache Parquet files
Stars: ✭ 145 (+625%)
MeshIOCloudCompare plugin for loading COLLADA, glTF, and IFC-SPF 3D models
Stars: ✭ 14 (-30%)
KlogA plain-text file format and command line tool for time tracking
Stars: ✭ 222 (+1010%)
ruby-magicSimple interface to libmagic for Ruby Programming Language
Stars: ✭ 23 (+15%)
zipdumpAnalyze zipfile, either local, or from url
Stars: ✭ 25 (+25%)
BitmapC++ Bitmap Library
Stars: ✭ 125 (+525%)
hPDBPDB parser in Haskell
Stars: ✭ 20 (+0%)
Tweet-Analysis-With-Kafka-and-SparkA real time analytics dashboard to analyze the trending hashtags and @ mentions at any location using kafka and spark streaming.
Stars: ✭ 18 (-10%)
nafNucleotide Archival Format - Compressed file format for DNA/RNA/protein sequences
Stars: ✭ 35 (+75%)
CoolerA cool place to store your Hi-C
Stars: ✭ 112 (+460%)
TinyTIFFlightweight TIFF reader/writer library (C/C++)
Stars: ✭ 91 (+355%)
qsvCSVs sliced, diced & analyzed.
Stars: ✭ 438 (+2090%)
mimesnifferA MIME type sniffer for Go.
Stars: ✭ 22 (+10%)
Uproot4ROOT I/O in pure Python and NumPy.
Stars: ✭ 80 (+300%)
firehoseInterchange format for results for static analysis tools
Stars: ✭ 62 (+210%)
parquet-extraA collection of Apache Parquet add-on modules
Stars: ✭ 30 (+50%)
miniplyA fast and easy-to-use PLY parsing library in a single c++11 header and cpp file
Stars: ✭ 29 (+45%)
NtrghidraFully Featured Nintendo DS Loader for Ghidra
Stars: ✭ 56 (+180%)
go-objOBJ file loader for golang
Stars: ✭ 16 (-20%)
openmrs-fhir-analyticsA collection of tools for extracting FHIR resources and analytics services on top of that data.
Stars: ✭ 55 (+175%)
mmtfThe specification of the MMTF format for biological structures
Stars: ✭ 40 (+100%)
KsflKSFL - Kreative Structured Format Library
Stars: ✭ 7 (-65%)
Vscode Data PreviewData Preview 🈸 extension for importing 📤 viewing 🔎 slicing 🔪 dicing 🎲 charting 📊 & exporting 📥 large JSON array/config, YAML, Apache Arrow, Avro, Parquet & Excel data files
Stars: ✭ 245 (+1125%)
nixNeuroscience information exchange format
Stars: ✭ 64 (+220%)
Parquetjsfully asynchronous, pure JavaScript implementation of the Parquet file format
Stars: ✭ 200 (+900%)
AudiofileA simple C++ library for reading and writing audio files.
Stars: ✭ 399 (+1895%)
Kaitai structKaitai Struct: declarative language to generate binary data parsers in C++ / C# / Go / Java / JavaScript / Lua / Perl / PHP / Python / Ruby
Stars: ✭ 2,736 (+13580%)
Parquet RsApache Parquet implementation in Rust
Stars: ✭ 144 (+620%)
Uproot3ROOT I/O in pure Python and NumPy.
Stars: ✭ 312 (+1460%)
dt-sql-parserSQL Parsers for BigData, built with antlr4.
Stars: ✭ 135 (+575%)
ReClassicficationMaybe one day a WINE-style implementation of the classic Mac Toolbox.
Stars: ✭ 29 (+45%)