HomeApacheCN 开源组织:公告、介绍、成员、活动、交流方式
Stars: ✭ 1,199 (+1294.19%)
Kamu CliNext generation tool for decentralized exchange and transformation of semi-structured data
Stars: ✭ 69 (-19.77%)
Gitpkguse a sub directory of a github repo as yarn / npm dependency directly
Stars: ✭ 54 (-37.21%)
Docker HadoopA Docker container with a full Hadoop cluster setup with Spark and Zeppelin
Stars: ✭ 54 (-37.21%)
Luigi WarehouseA luigi powered analytics / warehouse stack
Stars: ✭ 72 (-16.28%)
Sparkit LearnPySpark + Scikit-learn = Sparkit-learn
Stars: ✭ 1,073 (+1147.67%)
Utils4sscala、spark使用过程中,各种测试用例以及相关资料整理
Stars: ✭ 1,070 (+1144.19%)
Vitamin WebDecathlon Design System libraries for web applications
Stars: ✭ 70 (-18.6%)
Spark Submit UiThis is a based on playframwork for submit spark app
Stars: ✭ 53 (-38.37%)
Awesome SparkA curated list of awesome Apache Spark packages and resources.
Stars: ✭ 1,061 (+1133.72%)
Flow Mono CliA command line interface that aims to solve a few issues while working with flow typed codebases in a mono-repo.
Stars: ✭ 84 (-2.33%)
MleapMLeap: Deploy ML Pipelines to Production
Stars: ✭ 1,232 (+1332.56%)
Tf YarnTrain TensorFlow models on YARN in just a few lines of code!
Stars: ✭ 76 (-11.63%)
Npm Link Up🔄 Link your NPM projects automatically, for sophisticated / modular local development.
Stars: ✭ 68 (-20.93%)
UserscriptsVarious user scripts that add features to the review queue or to the chat room
Stars: ✭ 51 (-40.7%)
Spark Sklearn(Deprecated) Scikit-learn integration package for Apache Spark
Stars: ✭ 1,055 (+1126.74%)
Create React AppYarn Workspaces Monorepo support for Create-React-App / React-Scripts.
Stars: ✭ 76 (-11.63%)
Fast MrmrAn improved implementation of the classical feature selection method: minimum Redundancy and Maximum Relevance (mRMR).
Stars: ✭ 67 (-22.09%)
React Use ApiAsync HTTP request data for axios. Designed for diverse UI states, SSR and data pre-caching.
Stars: ✭ 49 (-43.02%)
LeharVisualize data using relative ordering
Stars: ✭ 81 (-5.81%)
KontextfreiWriting application logic for Spark jobs that can be unit-tested without a SparkContext
Stars: ✭ 67 (-22.09%)
Awesome Recommendation EngineThe purpose of this tiny project is to put things together with the know how that i learned from the course big data expert from formacionhadoop.com The idea is to show how to play with apache spark streaming, kafka,mongo, spark machine learning algorithms.
Stars: ✭ 47 (-45.35%)
Mobx React FormReactive MobX Form State Management
Stars: ✭ 1,031 (+1098.84%)
Have ItThe fastest NPM install does nothing because you already have it
Stars: ✭ 75 (-12.79%)
ThingsboardOpen-source IoT Platform - Device management, data collection, processing and visualization.
Stars: ✭ 10,526 (+12139.53%)
Magiql🌐 💫 Simple and powerful GraphQL Client, love child of react-query ❤️ relay
Stars: ✭ 45 (-47.67%)
RsparklingRSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-24.42%)
Ci Yarn UpgradeKeep NPM dependencies up-to-date with CI, providing version-to-version diff for each library
Stars: ✭ 85 (-1.16%)
Communitya community based on Node.js
Stars: ✭ 44 (-48.84%)
Spark GbtlrHybrid model of Gradient Boosting Trees and Logistic Regression (GBDT+LR) on Spark
Stars: ✭ 81 (-5.81%)
Cleanframestype-class based data cleansing library for Apache Spark SQL
Stars: ✭ 75 (-12.79%)
Spark BigqueryGoogle BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
Stars: ✭ 65 (-24.42%)
Delta ArchitectureStreaming data changes to a Data Lake with Debezium and Delta Lake pipeline
Stars: ✭ 43 (-50%)
JumbuneJumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,
Stars: ✭ 64 (-25.58%)
TsdxZero-config CLI for TypeScript package development
Stars: ✭ 9,010 (+10376.74%)
GatkOfficial code repository for GATK versions 4 and up
Stars: ✭ 1,002 (+1065.12%)
MlflowOpen source platform for the machine learning lifecycle
Stars: ✭ 10,898 (+12572.09%)
W2vWord2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-25.58%)
PixiedustPython Helper library for Jupyter Notebooks
Stars: ✭ 998 (+1060.47%)
SnappydataProject SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
Stars: ✭ 995 (+1056.98%)
Rails WebpackerRails on webpack and yarn with new webpacker gem. Multiple examples using react, vue and angular
Stars: ✭ 80 (-6.98%)
Ds CheatsheetsList of Data Science Cheatsheets to rule the world
Stars: ✭ 9,452 (+10890.7%)
Pysparkgeoanalysis🌐 Interactive Workshop on GeoAnalysis using PySpark
Stars: ✭ 63 (-26.74%)