Kibble 1Apache Kibble - a tool to collect, aggregate and visualize data about any software project
Stars: ✭ 54 (+80%)
Kafka UiOpen-Source Web GUI for Apache Kafka Management
Stars: ✭ 230 (+666.67%)
ByteSlice"Byteslice: Pushing the envelop of main memory data processing with a new storage layout" (SIGMOD'15)
Stars: ✭ 24 (-20%)
Macro mlCourse Website on Macroeconomic Analysis with Machine Learning and Big Data
Stars: ✭ 53 (+76.67%)
meetups-archivosPpts, códigos y videos de las meetups, data science days, videollamadas y workshops. Data Science Research es una organización sin fines de lucro que busca difundir, descentralizar y difundir los conocimientos en Ciencia de Datos e Inteligencia Artificial en el Perú, dando oportunidades a nuevos talentos mediante MeetUps, Workshops y Semilleros …
Stars: ✭ 60 (+100%)
Datumbox FrameworkDatumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.
Stars: ✭ 1,063 (+3443.33%)
mmtf-sparkMethods for the parallel and distributed analysis and mining of the Protein Data Bank using MMTF and Apache Spark.
Stars: ✭ 20 (-33.33%)
LoL-Match-PredictionWin probability predictions for League of Legends matches using neural networks
Stars: ✭ 34 (+13.33%)
TraildbTrailDB is an efficient tool for storing and querying series of events
Stars: ✭ 1,029 (+3330%)
PoseidonA search engine which can hold 100 trillion lines of log data.
Stars: ✭ 1,793 (+5876.67%)
incubator-liminalApache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful experiment to an automated pipeline of model training, validation, deployment and inference in production. Liminal provides a Domain Specific Language to build ML workflows on top of Apache Airflow.
Stars: ✭ 117 (+290%)
beekeeperService for automatically managing and cleaning up unreferenced data
Stars: ✭ 43 (+43.33%)
ElandPython Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (+683.33%)
siembolAn open-source, real-time Security Information & Event Management tool based on big data technologies, providing a scalable, advanced security analytics framework.
Stars: ✭ 153 (+410%)
EgadsA Java package to automatically detect anomalies in large scale time-series data
Stars: ✭ 997 (+3223.33%)
AcceleratorThe Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (+356.67%)
CS Book🔥 Latest computer science e-books。提供最新技术类电子书下载, “我无非就是想卷死各位,或者被各位卷死!”
Stars: ✭ 40 (+33.33%)
Esper TvEsper instance for TV news analysis
Stars: ✭ 37 (+23.33%)
spark-recordsBulletproof Apache Spark jobs with fast root cause analysis of failures.
Stars: ✭ 67 (+123.33%)
bagriXML/Document DB on top of distributed cache
Stars: ✭ 40 (+33.33%)
RemoteShuffleServiceCeleborn provides an elastic and high-performance service for shuffle and spilled data.
Stars: ✭ 262 (+773.33%)
dxramA distributed in-memory key-value storage for billions of small objects.
Stars: ✭ 25 (-16.67%)
QcportalA client interface to the QCArchive Project (read-only image of QCFractal)
Stars: ✭ 29 (-3.33%)
img2datasetEasily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Stars: ✭ 1,173 (+3810%)
Lite Virtual ListVirtual list component library supporting waterfall flow based on vue
Stars: ✭ 223 (+643.33%)
GDLibraryMatlab library for gradient descent algorithms: Version 1.0.1
Stars: ✭ 50 (+66.67%)
SparkApache Spark - A unified analytics engine for large-scale data processing
Stars: ✭ 31,618 (+105293.33%)
lcbo-apiA crawler and API server for Liquor Control Board of Ontario retail data
Stars: ✭ 152 (+406.67%)
HamaMirror of Apache Hama
Stars: ✭ 129 (+330%)
gan deeplearning4jAutomatic feature engineering using Generative Adversarial Networks using Deeplearning4j and Apache Spark.
Stars: ✭ 19 (-36.67%)
PhoenixMirror of Apache Phoenix
Stars: ✭ 867 (+2790%)
FlameStreamDistributed stream processing model and its implementation
Stars: ✭ 14 (-53.33%)
ngmswissgeol.ch gives you insight in geoscientific data - above and below the surface.
Stars: ✭ 23 (-23.33%)
SparkjniA heterogeneous Apache Spark framework.
Stars: ✭ 11 (-63.33%)
automile-netAutomile offers a simple, smart, cutting-edge telematics solution for businesses to track and manage their business vehicles.
Stars: ✭ 24 (-20%)
iisInformation Inference Service of the OpenAIRE system
Stars: ✭ 16 (-46.67%)
Hazelcast JetDistributed Stream and Batch Processing
Stars: ✭ 855 (+2750%)
FIW KRTFamilies In the WIld: A Kinship Recogntion Toolbox.
Stars: ✭ 18 (-40%)
UsqlU-SQL Examples and Issue Tracking
Stars: ✭ 221 (+636.67%)
AutodlAutomated Deep Learning without ANY human intervention. 1'st Solution for AutoDL [email protected]
Stars: ✭ 854 (+2746.67%)
Quantitative-Big-Imaging-2018(Latest semester at https://github.com/kmader/Quantitative-Big-Imaging-2019) The material for the Quantitative Big Imaging course at ETHZ for the Spring Semester 2018
Stars: ✭ 50 (+66.67%)
iogrowCRMCRM for Social Selling, on Google. Integrated with LinkedIn, Twitter, Facebook & Gmail.
Stars: ✭ 28 (-6.67%)
eidea4企业框架 scm erp wms
Stars: ✭ 53 (+76.67%)
Library-SpringThe library web application where you can borrow books. It's Spring MVC and Hibernate project.
Stars: ✭ 73 (+143.33%)
KoalasKoalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+10046.67%)
FluoApache Fluo
Stars: ✭ 159 (+430%)
FlinkApache Flink is an open source project of The Apache Software Foundation (ASF).
The Apache Flink project originated from the Stratosphere research project.
Stars: ✭ 17,781 (+59170%)