All Projects → pushshift → Parallel-NDJSON-Reader

pushshift / Parallel-NDJSON-Reader

Licence: other
Parallel NDJSON Reader for Python

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Parallel-NDJSON-Reader

Parallel
Parallel processing for PHP based on Amp.
Stars: ✭ 478 (+3576.92%)
Mutual labels:  multiprocessing, parallel, parallel-processing
Hamsters.js
100% Vanilla Javascript Multithreading & Parallel Execution Library
Stars: ✭ 517 (+3876.92%)
Mutual labels:  parallel, parallel-processing
java-multithread
Códigos feitos para o curso de Multithreading com Java, no canal RinaldoDev do YouTube.
Stars: ✭ 24 (+84.62%)
Mutual labels:  parallel, parallel-processing
gitall.rs
Run Git commands in all subdirectories really fast
Stars: ✭ 25 (+92.31%)
Mutual labels:  parallel, parallel-processing
Future.apply
🚀 R package: future.apply - Apply Function to Elements in Parallel using Futures
Stars: ✭ 159 (+1123.08%)
Mutual labels:  parallel, parallel-processing
Curl Easy
cURL wrapper for PHP. Supports parallel and non-blocking requests. For high speed crawling, see stil/curl-robot
Stars: ✭ 297 (+2184.62%)
Mutual labels:  multiprocessing, parallel
Pyexpool
Python Multi-Process Execution Pool: concurrent asynchronous execution pool with custom resource constraints (memory, timeouts, affinity, CPU cores and caching), load balancing and profiling capabilities of the external apps on NUMA architecture
Stars: ✭ 149 (+1046.15%)
Mutual labels:  multiprocessing, parallel-processing
NotEnoughAV1Encodes-Qt
Linux GUI for AV1 Encoders
Stars: ✭ 27 (+107.69%)
Mutual labels:  multiprocessing
PGD
A Parallel Graphlet Decomposition Library for Large Graphs
Stars: ✭ 68 (+423.08%)
Mutual labels:  parallel
Demo.Ndjson.AsyncStreams
Sample project for demonstrating how to use async streams and NDJSON to improve user experience by streaming JSON objects from server to client and client to server in .NET
Stars: ✭ 30 (+130.77%)
Mutual labels:  ndjson
await
28Kb, small memory footprint, single binary that run list of commands in parallel and waits for their termination
Stars: ✭ 73 (+461.54%)
Mutual labels:  parallel
ParMmg
Distributed parallelization of 3D volume mesh adaptation
Stars: ✭ 19 (+46.15%)
Mutual labels:  parallel
AppiumGrid
A framework for running appium tests in parallel across devices and also on desktop browser... U like it STAR it !!
Stars: ✭ 17 (+30.77%)
Mutual labels:  parallel
nemesyst
Generalised and highly customisable, hybrid-parallelism, database based, deep learning framework.
Stars: ✭ 17 (+30.77%)
Mutual labels:  parallel
atpbar
Progress bars for threading and multiprocessing tasks on terminal and Jupyter Notebook
Stars: ✭ 74 (+469.23%)
Mutual labels:  multiprocessing
WAND-PIC
WAND-PIC
Stars: ✭ 20 (+53.85%)
Mutual labels:  parallel
Mandelbrot-set-explorer
An interactive Mandelbrot set, made with Python3 and Tkinter
Stars: ✭ 31 (+138.46%)
Mutual labels:  multiprocessing
marathon
Cross-platform test runner written for Android and iOS projects
Stars: ✭ 398 (+2961.54%)
Mutual labels:  parallel
t8code
Parallel algorithms and data structures for tree-based AMR with arbitrary element shapes.
Stars: ✭ 37 (+184.62%)
Mutual labels:  parallel
spellbook
Functional library for Javascript
Stars: ✭ 14 (+7.69%)
Mutual labels:  parallel

Parallel NDJSON Reader

Purpose

This script can read and process newline delimited data extremely quickly. For NDJSON files, my 12 core Xeon was able to decode (json.loads) 90,000 Twitter objects per second. This script is basically limited by the amount of CPUs you have and how fast your I/O subsystem is.

Features

  • Ability to select number of cores used by setting the value of the n_chunks variable.
  • If the file is too small to split into N pieces, the script will scale to the maximum number of chunks possible. This script is not meant for small files since there is a little bit of startup time involved. This is meant to tear through big data (gigabytes / terabytes / petabytes).

[email protected]

https://pushshift.io/donations

End

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].