Openuba: A robust and flexible open source User & Entity Behavior Analytics (UEBA) framework used for Security Analytics. Developed with luv by Data Scientists & Security Analysts from the Cyber Security Industry. [PRE-ALPHA]
Stars: ✭ 127 (-94.42%)
Lpa Detector: Optimize and improve the Label propagation algorithm
Stars: ✭ 75 (-96.7%)
Yvm: [yvm] low performance garbage-collectable jvm
Stars: ✭ 173 (-92.39%)
Pandahouse: Pandas interface for Clickhouse database
Stars: ✭ 126 (-94.46%)
Kamu Cli: Next generation tool for decentralized exchange and transformation of semi-structured data
Stars: ✭ 69 (-96.97%)
Ignareo Isml Auto Voter: Ignareo the Carillon, a web spider template of ultimate concurrency built for leprechauns. Carillons as the best web spiders; Long live the golden years of leprechauns!
Stars: ✭ 154 (-93.23%)
Hobbyscript: Yet Another JVM/LLVM Dynamic Language (LLVM Backend WIP)
Stars: ✭ 72 (-96.83%)
Cape Python: Collaborate on privacy-preserving policy for data science projects in Pandas and Apache Spark
Stars: ✭ 125 (-94.5%)
Luigi Warehouse: A luigi powered analytics / warehouse stack
Stars: ✭ 72 (-96.83%)
Tfmesos: Tensorflow in Docker on Mesos #tfmesos #tensorflow #mesos
Stars: ✭ 194 (-91.47%)
Difacto dmlc: Distributed FM and LR based on Parameter Server with Ftrl
Stars: ✭ 126 (-94.46%)
Gojvm: JVM implementation in Go
Stars: ✭ 69 (-96.97%)
Fast Mrmr: An improved implementation of the classical feature selection method: minimum Redundancy and Maximum Relevance (mRMR).
Stars: ✭ 67 (-97.05%)
Scala Samples: Pieces of Scala code that explain Scala syntax and related things, such as what you can do with all this
Stars: ✭ 125 (-94.5%)
Kontextfrei: Writing application logic for Spark jobs that can be unit-tested without a SparkContext
Stars: ✭ 67 (-97.05%)
Password4j: Password4j is a user-friendly cryptographic library that supports Argon2, Bcrypt, Scrypt, PBKDF2 and various cryptographic hash functions.
Stars: ✭ 124 (-94.55%)
Capsule: Dead-Simple Packaging and Deployment for JVM Apps
Stars: ✭ 1,143 (-49.74%)
Powderkeg: Live-coding the cluster!
Stars: ✭ 152 (-93.32%)
Rsparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-97.14%)
Labgrid: Embedded systems control library for development, testing and installation
Stars: ✭ 124 (-94.55%)
W2v: Word2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-97.19%)
Bastion: Highly-available Distributed Fault-tolerant Runtime
Stars: ✭ 2,333 (+2.59%)
Pysparkgeoanalysis: 🌐 Interactive Workshop on GeoAnalysis using PySpark
Stars: ✭ 63 (-97.23%)
Fluentdispatch: 🌊 .NET Standard 2.1 framework which makes it easy to scaffold distributed systems and dispatch incoming load into units of work in a deterministic way.
Stars: ✭ 152 (-93.32%)
Eventql: Distributed "massively parallel" SQL query engine
Stars: ✭ 1,121 (-50.7%)
Spark Alchemy: Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Stars: ✭ 122 (-94.64%)
Mnesiac: Mnesia autoclustering made easy!
Stars: ✭ 62 (-97.27%)
Lightgbm: A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
Stars: ✭ 13,293 (+484.56%)
Jstack Review: JavaScript-based JVM thread dump analyzer
Stars: ✭ 61 (-97.32%)
Orbit: A distributed, serverless, peer-to-peer chat application on IPFS
Stars: ✭ 1,586 (-30.26%)
Waimak: Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.
Stars: ✭ 60 (-97.36%)
Jlitespider: A lite distributed Java spider framework :-)
Stars: ✭ 151 (-93.36%)
Zemberek Nlp Server: REST Docker server built on the Zemberek Turkish NLP Java library
Stars: ✭ 60 (-97.36%)
Orbit: Virtual actor framework for building distributed systems
Stars: ✭ 1,585 (-30.3%)
Arewedistributedyet: Website + Community effort to unlock the peer-to-peer web at arewedistributedyet.com ⚡🌐🔑
Stars: ✭ 189 (-91.69%)
Rumble: ⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (-97.45%)
Zparkio: Boilerplate framework to use Spark and ZIO together.
Stars: ✭ 121 (-94.68%)
Benchm Ml: A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
Stars: ✭ 1,835 (-19.31%)
Awesome Pulsar: A curated list of Pulsar tools, integrations and resources.
Stars: ✭ 57 (-97.49%)
Erlamsa: Erlang port of the famous radamsa fuzzer.
Stars: ✭ 56 (-97.54%)
Node Jvm: Java virtual machine in pure Node.js
Stars: ✭ 2,053 (-9.72%)
Isolation Forest: A Spark/Scala implementation of the isolation forest unsupervised outlier detection algorithm.
Stars: ✭ 139 (-93.89%)
Spark Py Notebooks: Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (-41.16%)
Mars: Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
Stars: ✭ 2,308 (+1.5%)
Linkis: Linkis helps easily connect to various back-end computation/storage engines (Spark, Python, TiDB...), exposes various interfaces (REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+2.15%)
Azos: A to Z Sky Operating System / Microservice Chassis Framework
Stars: ✭ 137 (-93.98%)
Lukyt: Small Java 8 JVM made in Lua
Stars: ✭ 95 (-95.82%)
Toydb: Distributed SQL database in Rust, written as a learning project
Stars: ✭ 1,329 (-41.56%)