Uproot3ROOT I/O in pure Python and NumPy.
Stars: ✭ 312 (+44.44%)
Uproot4ROOT I/O in pure Python and NumPy.
Stars: ✭ 80 (-62.96%)
Vscode Data PreviewData Preview 🈸 extension for importing 📤 viewing 🔎 slicing 🔪 dicing 🎲 charting 📊 & exporting 📥 large JSON array/config, YAML, Apache Arrow, Avro, Parquet & Excel data files
Stars: ✭ 245 (+13.43%)
arrow-datafusionApache Arrow DataFusion SQL Query Engine
Stars: ✭ 2,360 (+992.59%)
Parquet MrApache Parquet
Stars: ✭ 1,278 (+491.67%)
Amazon S3 Find And ForgetAmazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
Stars: ✭ 115 (-46.76%)
RoapiCreate full-fledged APIs for static datasets without writing a single line of code.
Stars: ✭ 253 (+17.13%)
SdcIntel® Scalable Dataframe Compiler for Pandas*
Stars: ✭ 623 (+188.43%)
Rumble⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (-73.15%)
seapyState Estimation and Analysis in Python
Stars: ✭ 25 (-88.43%)
RootpyA pythonic interface for the ROOT libraries on top of the PyROOT bindings.
Stars: ✭ 186 (-13.89%)
autThe Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (-48.61%)
Bigdata PlaygroundA complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (-18.06%)
Root numpyThe interface between ROOT and NumPy
Stars: ✭ 130 (-39.81%)
vinumVinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics.
Stars: ✭ 57 (-73.61%)
graphiqueGraphQL service for arrow tables and parquet data sets.
Stars: ✭ 28 (-87.04%)
Data Science Ipython NotebooksData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+10107.41%)
DrillApache Drill is a distributed MPP query layer for self describing data
Stars: ✭ 1,619 (+649.54%)
GafferA large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (+660.19%)
spark-rootApache Spark Data Source for ROOT File Format
Stars: ✭ 28 (-87.04%)
KartothekA consistent table management library in python
Stars: ✭ 144 (-33.33%)
Cloud VolumeRead and write Neuroglancer datasets programmatically.
Stars: ✭ 63 (-70.83%)
Eel SdkBig Data Toolkit for the JVM
Stars: ✭ 140 (-35.19%)
ParquetviewerSimple windows desktop application for viewing & querying Apache Parquet files
Stars: ✭ 145 (-32.87%)
QilingQiling Advanced Binary Emulation Framework
Stars: ✭ 2,816 (+1203.7%)
MmlsparkSimple and Distributed Machine Learning
Stars: ✭ 2,899 (+1242.13%)
Data Science Live BookAn open source book to learn data science, data analysis and machine learning, suitable for all ages!
Stars: ✭ 193 (-10.65%)
PyboticsThe Python Toolbox for Robotics
Stars: ✭ 192 (-11.11%)
BohriumAutomatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (-3.24%)
Awkward 1.0Manipulate JSON-like data with NumPy-like idioms.
Stars: ✭ 203 (-6.02%)
Fashion RecommendationA clothing retrieval and visual recommendation model for fashion images.
Stars: ✭ 193 (-10.65%)
AlynDetect and fix skew in images containing text
Stars: ✭ 202 (-6.48%)
GtirbIntermediate Representation for Binary analysis and transformation
Stars: ✭ 190 (-12.04%)
PysrSimple, fast, and parallelized symbolic regression in Python/Julia via regularized evolution and simulated annealing
Stars: ✭ 213 (-1.39%)
TsalibTensor Shape Annotation Library (numpy, tensorflow, pytorch, ...)
Stars: ✭ 209 (-3.24%)
DythonA set of data tools in Python
Stars: ✭ 200 (-7.41%)
XtensorC++ tensors with broadcasting and lazy computing
Stars: ✭ 2,453 (+1035.65%)
Pisavar📡 🍍Detects activities of PineAP module and starts deauthentication attack (for fake access points - WiFi Pineapple Activities Detection)
Stars: ✭ 188 (-12.96%)
Parquetjsfully asynchronous, pure JavaScript implementation of the Parquet file format
Stars: ✭ 200 (-7.41%)
RxshellEasy shell access for Android apps using RxJava.
Stars: ✭ 189 (-12.5%)
PyresampleGeospatial image resampling in Python
Stars: ✭ 188 (-12.96%)
WindroseA Python Matplotlib, Numpy library to manage wind data, draw windrose (also known as a polar rose plot), draw probability density function and fit Weibull distribution
Stars: ✭ 208 (-3.7%)
Pyemma🚂 Python API for Emma's Markov Model Algorithms 🚂
Stars: ✭ 200 (-7.41%)
GunAn open source cybersecurity protocol for syncing decentralized graph data.
Stars: ✭ 15,172 (+6924.07%)
TftbA Python module for time-frequency analysis
Stars: ✭ 185 (-14.35%)
SeriloganalyzerRoslyn-based analysis for code using the Serilog logging library. Checks for common mistakes and usage problems.
Stars: ✭ 214 (-0.93%)
CalciteApache Calcite
Stars: ✭ 2,816 (+1203.7%)
Save Page StateA chrome extension to save the state of a page for further analysis
Stars: ✭ 208 (-3.7%)
Presto Go ClientA Presto client for the Go programming language.
Stars: ✭ 183 (-15.28%)