BeamApache Beam is a unified programming model for Batch and Streaming
Stars: ✭ 5,149 (+18289.29%)
Mutual labels: big-data, beam
terraform-aws-kinesis-firehoseThis code creates a Kinesis Firehose in AWS to send CloudWatch log data to S3.
Stars: ✭ 25 (-10.71%)
Mutual labels: big-data
FlameStreamDistributed stream processing model and its implementation
Stars: ✭ 14 (-50%)
Mutual labels: big-data
beamdasmErlang\Elixir byte code viewer. BEAM file disassembler extension for Visual Studio Code.
Stars: ✭ 44 (+57.14%)
Mutual labels: beam
bigflowA Python framework for data processing on GCP.
Stars: ✭ 96 (+242.86%)
Mutual labels: beam
img2datasetEasily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Stars: ✭ 1,173 (+4089.29%)
Mutual labels: big-data
ngmswissgeol.ch gives you insight in geoscientific data - above and below the surface.
Stars: ✭ 23 (-17.86%)
Mutual labels: big-data
scarfToolkit for highly memory efficient analysis of single-cell RNA-Seq, scATAC-Seq and CITE-Seq data. Analyze atlas scale datasets with millions of cells on laptop.
Stars: ✭ 54 (+92.86%)
Mutual labels: big-data
spark-rootApache Spark Data Source for ROOT File Format
Stars: ✭ 28 (+0%)
Mutual labels: big-data
GDLibraryMatlab library for gradient descent algorithms: Version 1.0.1
Stars: ✭ 50 (+78.57%)
Mutual labels: big-data
gan deeplearning4jAutomatic feature engineering using Generative Adversarial Networks using Deeplearning4j and Apache Spark.
Stars: ✭ 19 (-32.14%)
Mutual labels: big-data
nebulaA distributed, fast open-source graph database featuring horizontal scalability and high availability
Stars: ✭ 8,196 (+29171.43%)
Mutual labels: big-data
automile-phpAutomile offers a simple, smart, cutting-edge telematics solution for businesses to track and manage their business vehicles.
Stars: ✭ 28 (+0%)
Mutual labels: big-data
IoT-system-PLC-data-to-InfluxDBThis project aim is to provide free software to fetch data from plcs (Siemens S7-300/400/1200/1500) and store it. Used stack is completly opensource. I used InfluDB as data storage, so application principle is following Big Data paradigm.
Stars: ✭ 26 (-7.14%)
Mutual labels: big-data
lubeckHigh level linear algebra library for Dlang
Stars: ✭ 57 (+103.57%)
Mutual labels: big-data
lcbo-apiA crawler and API server for Liquor Control Board of Ontario retail data
Stars: ✭ 152 (+442.86%)
Mutual labels: big-data
spark-recordsBulletproof Apache Spark jobs with fast root cause analysis of failures.
Stars: ✭ 67 (+139.29%)
Mutual labels: big-data
RemoteShuffleServiceCeleborn provides an elastic and high-performance service for shuffle and spilled data.
Stars: ✭ 262 (+835.71%)
Mutual labels: big-data