pigletA compiler for Pig Latin to Spark and Flink.
Stars: ✭ 23 (-94.52%)
cryptoizationData visualization application showing all BTC transactions in real-time
Stars: ✭ 12 (-97.14%)
LeharVisualize data using relative ordering
Stars: ✭ 81 (-80.71%)
Apartment-Interest-PredictionPredict people interest in renting specific NYC apartments. The challenge combines structured data, geolocalization, time data, free text and images.
Stars: ✭ 17 (-95.95%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-81.19%)
SparklintA tool for monitoring and tuning Spark jobs for efficiency.
Stars: ✭ 316 (-24.76%)
aws-ai-ml-workshop-krA collection of localized (Korean) AWS AI/ML workshop materials for hands-on labs.
Stars: ✭ 65 (-84.52%)
Cleanframestype-class based data cleansing library for Apache Spark SQL
Stars: ✭ 75 (-82.14%)
Book本项目收藏这些年来看过或者听过的一些不错的书籍,在整理文件时看见这些,发现删掉有点可惜,放着又太浪费空间,本着分享的原则,就把它们共享出来,一方面给需要的读者提供这些书籍,另一方面也是一种像知识库的积累吧
Stars: ✭ 47 (-88.81%)
SparkFirely's open source FHIR server
Stars: ✭ 174 (-58.57%)
fixAllows you to use OpenAI Codex to fix errors in the command line.
Stars: ✭ 72 (-82.86%)
Lpa DetectorOptimize and improve the Label propagation algorithm
Stars: ✭ 75 (-82.14%)
WedatasphereWeDataSphere is a financial level one-stop open-source suitcase for big data platforms. Currently the source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!
Stars: ✭ 372 (-11.43%)
yt-channels-DS-AI-ML-CSA comprehensive list of 180+ YouTube Channels for Data Science, Data Engineering, Machine Learning, Deep learning, Computer Science, programming, software engineering, etc.
Stars: ✭ 1,038 (+147.14%)
Spark NlpState of the Art Natural Language Processing
Stars: ✭ 2,518 (+499.52%)
Luigi WarehouseA luigi powered analytics / warehouse stack
Stars: ✭ 72 (-82.86%)
migratorA backup solution and data migration utility for Android
Stars: ✭ 56 (-86.67%)
ai-background-removeCut out objects and remove backgrounds from pictures with artificial intelligence
Stars: ✭ 70 (-83.33%)
KontextfreiWriting application logic for Spark jobs that can be unit-tested without a SparkContext
Stars: ✭ 67 (-84.05%)
RsparklingRSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-84.52%)
ML-For-Beginners12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
Stars: ✭ 40,023 (+9429.29%)
W2vWord2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-84.76%)
Python文献下载助手(ArticelsHelper) 基线拉平程序(Baseline Alignment) Q-PCR数据处理(Q-PCR Data)
Stars: ✭ 28 (-93.33%)
YOLOv4MLNetUse the YOLO v4 and v5 (ONNX) models for object detection in C# using ML.Net
Stars: ✭ 61 (-85.48%)
RoffildlibraryLibrary for MQL5 (MetaTrader) with Python, Java, Apache Spark, AWS
Stars: ✭ 63 (-85%)
WaimakWaimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.
Stars: ✭ 60 (-85.71%)
ml-lpiMaterials for ML course at Lebedev Physical Institute
Stars: ✭ 31 (-92.62%)
Zemberek Nlp ServerZemberek Türkçe NLP Java Kütüphanesi üzerine REST Docker Sunucu
Stars: ✭ 60 (-85.71%)
Rumble⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (-86.19%)
AutovizAutomatically Visualize any dataset, any size with a single line of code. Created by Ram Seshadri. Collaborators Welcome. Permission Granted upon Request.
Stars: ✭ 310 (-26.19%)
NimbusmlPython machine learning package providing simple interoperability between ML.NET and scikit-learn components.
Stars: ✭ 265 (-36.9%)
spark-druid-olapSparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit.ly/2oBJSpP) an Integrated BI platform on Apache Spark.
Stars: ✭ 286 (-31.9%)
Deeplearning4jSuite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…
Stars: ✭ 12,277 (+2823.1%)
Docker HadoopA Docker container with a full Hadoop cluster setup with Spark and Zeppelin
Stars: ✭ 54 (-87.14%)
uber dataUber web interface crawler / scraper - Convert the trips table into a CSV file
Stars: ✭ 40 (-90.48%)
Spark Submit UiThis is a based on playframwork for submit spark app
Stars: ✭ 53 (-87.38%)
k3ai-coreK3ai-core is the core library for the GO installer. Go installer will replace the current bash installer
Stars: ✭ 23 (-94.52%)
Spark NkpNatural Korean Processor for Apache Spark
Stars: ✭ 50 (-88.1%)
Differentiable PlasticityImplementations of the algorithms described in Differentiable plasticity: training plastic networks with gradient descent, a research paper from Uber AI Labs.
Stars: ✭ 371 (-11.67%)
Awesome Recommendation EngineThe purpose of this tiny project is to put things together with the know how that i learned from the course big data expert from formacionhadoop.com The idea is to show how to play with apache spark streaming, kafka,mongo, spark machine learning algorithms.
Stars: ✭ 47 (-88.81%)
MLSummerSchoolМатериалы факультатива по машинному обучению и искусственному интеллекту
Stars: ✭ 27 (-93.57%)
GeopysparkGeoTrellis for PySpark
Stars: ✭ 167 (-60.24%)
RgfHome repository for the Regularized Greedy Forest (RGF) library. It includes original implementation from the paper and multithreaded one written in C++, along with various language-specific wrappers.
Stars: ✭ 341 (-18.81%)
Awesome Mlops😎 A curated list of awesome MLOps tools
Stars: ✭ 258 (-38.57%)
target-and-marketA data-driven tool to identify the best candidates for a marketing campaign and optimize it.
Stars: ✭ 19 (-95.48%)
ODSC India 2018My presentation at ODSC India 2018 about Deep Learning with Apache Spark
Stars: ✭ 26 (-93.81%)