SGDLibraryMATLAB/Octave library for stochastic optimization algorithms: Version 1.0.20
Stars: ✭ 165 (+534.62%)
ByteSlice"Byteslice: Pushing the envelop of main memory data processing with a new storage layout" (SIGMOD'15)
Stars: ✭ 24 (-7.69%)
dxramA distributed in-memory key-value storage for billions of small objects.
Stars: ✭ 25 (-3.85%)
nebulaA distributed, fast open-source graph database featuring horizontal scalability and high availability
Stars: ✭ 8,196 (+31423.08%)
subsemblesubsemble R package for ensemble learning on subsets of data
Stars: ✭ 40 (+53.85%)
incubator-liminalApache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful experiment to an automated pipeline of model training, validation, deployment and inference in production. Liminal provides a Domain Specific Language to build ML workflows on top of Apache Airflow.
Stars: ✭ 117 (+350%)
GDLibraryMatlab library for gradient descent algorithms: Version 1.0.1
Stars: ✭ 50 (+92.31%)
gan deeplearning4jAutomatic feature engineering using Generative Adversarial Networks using Deeplearning4j and Apache Spark.
Stars: ✭ 19 (-26.92%)
hadoop-deployment-bashCode for the deployment of Hadoop clusters, written in Bourne or Bourne Again shell.
Stars: ✭ 31 (+19.23%)
automile-phpAutomile offers a simple, smart, cutting-edge telematics solution for businesses to track and manage their business vehicles.
Stars: ✭ 28 (+7.69%)
liquibase-impalaLiquibase extension to add Impala Database support
Stars: ✭ 23 (-11.54%)
lubeckHigh level linear algebra library for Dlang
Stars: ✭ 57 (+119.23%)
aaocp一个对用户行为日志进行分析的大数据项目
Stars: ✭ 53 (+103.85%)
beekeeperService for automatically managing and cleaning up unreferenced data
Stars: ✭ 43 (+65.38%)
ngmswissgeol.ch gives you insight in geoscientific data - above and below the surface.
Stars: ✭ 23 (-11.54%)
clickhouse hadoopImport data from clickhouse to hadoop with pure SQL
Stars: ✭ 26 (+0%)
automile-netAutomile offers a simple, smart, cutting-edge telematics solution for businesses to track and manage their business vehicles.
Stars: ✭ 24 (-7.69%)
xcastA High-Performance Data Science Toolkit for the Earth Sciences
Stars: ✭ 28 (+7.69%)
hadoopofficeHadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
Stars: ✭ 56 (+115.38%)
big-data-upfRECSM-UPF Summer School: Social Media and Big Data Research
Stars: ✭ 21 (-19.23%)
siembolAn open-source, real-time Security Information & Event Management tool based on big data technologies, providing a scalable, advanced security analytics framework.
Stars: ✭ 153 (+488.46%)
FIW KRTFamilies In the WIld: A Kinship Recogntion Toolbox.
Stars: ✭ 18 (-30.77%)
flokkrDocumentation placeholder and utilities for all the other containers.
Stars: ✭ 30 (+15.38%)
storm-mlan online learning algorithm library for Storm
Stars: ✭ 18 (-30.77%)
shiftingA privacy-focused list of alternatives to mainstream services to help the competition.
Stars: ✭ 31 (+19.23%)
hive to es同步Hive数据仓库数据到Elasticsearch的小工具
Stars: ✭ 21 (-19.23%)
learning-sparkTidy up Spark and Hadoop tutorials.
Stars: ✭ 28 (+7.69%)
yildiz🦄🌟 Graph Database layer on top of Google Bigtable
Stars: ✭ 24 (-7.69%)
darwinAvro Schema Evolution made easy
Stars: ✭ 26 (+0%)
HadoopDedup🍉基于Hadoop和HBase的大规模海量数据去重
Stars: ✭ 27 (+3.85%)
data-viz-utilsFunctions for easily making publication-quality figures with matplotlib.
Stars: ✭ 16 (-38.46%)
pyspark-algorithmsPySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Stars: ✭ 72 (+176.92%)
SynapseMLSimple and Distributed Machine Learning
Stars: ✭ 3,355 (+12803.85%)
UBAUEBA Solution for Insider Security. This repo is archived. Thanks!
Stars: ✭ 36 (+38.46%)
CS Book🔥 Latest computer science e-books。提供最新技术类电子书下载, “我无非就是想卷死各位,或者被各位卷死!”
Stars: ✭ 40 (+53.85%)
smart-data-lakeSmart Automation Tool for building modern Data Lakes and Data Pipelines
Stars: ✭ 79 (+203.85%)
oci-clouderaTerraform module to deploy Cloudera on Oracle Cloud Infrastructure (OCI)
Stars: ✭ 20 (-23.08%)
lidboxEnd-to-end spoken language identification out of the box.
Stars: ✭ 39 (+50%)
learning-hadoop-and-sparkCompanion to Learning Hadoop and Learning Spark courses on Linked In Learning
Stars: ✭ 146 (+461.54%)
hive-jdbc-driverAn alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC
Stars: ✭ 31 (+19.23%)
beam-siteApache Beam Site
Stars: ✭ 28 (+7.69%)
awesome-toolscurated list of awesome tools and libraries for specific domains
Stars: ✭ 31 (+19.23%)
merkle-dbHigh-scalability analytics database built on immutable merkle-trees
Stars: ✭ 44 (+69.23%)