Bootplus基于SpringBoot + Shiro + MyBatisPlus的权限管理框架
Stars: ✭ 88 (-96.42%)
AnseriniA Lucene toolkit for replicable information retrieval research
Stars: ✭ 573 (-76.71%)
PowderkegLive-coding the cluster!
Stars: ✭ 152 (-93.82%)
FessFess is very powerful and easily deployable Enterprise Search Server.
Stars: ✭ 561 (-77.2%)
Spark Nlp ModelsModels and Pipelines for the Spark NLP library
Stars: ✭ 88 (-96.42%)
Spark DariaEssential Spark extensions and helper methods ✨😲
Stars: ✭ 553 (-77.52%)
PyroaringbitmapAn efficient and light-weight ordered set of 32 bits integers.
Stars: ✭ 128 (-94.8%)
LopqTraining of Locally Optimized Product Quantization (LOPQ) models for approximate nearest neighbor search of high dimensional data in Python and Spark.
Stars: ✭ 530 (-78.46%)
SolrpluginsDice Solr Plugins from Simon Hughes Dice.com
Stars: ✭ 86 (-96.5%)
CdapAn open source framework for building data analytic applications.
Stars: ✭ 509 (-79.31%)
Sparkstreaming💥 🚀 封装sparkstreaming动态调节batch time(有数据就执行计算);🚀 支持运行过程中增删topic;🚀 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。
Stars: ✭ 179 (-92.72%)
PointblankData validation and organization of metadata for data frames and database tables
Stars: ✭ 480 (-80.49%)
CuesheetA framework for writing Spark 2.x applications in a pretty way
Stars: ✭ 86 (-96.5%)
JavaewahA compressed alternative to the Java BitSet class
Stars: ✭ 474 (-80.73%)
FeastFeature Store for Machine Learning
Stars: ✭ 2,576 (+4.72%)
Bdp Dataplatform大数据生态解决方案数据平台:基于大数据、数据平台、微服务、机器学习、商城、自动化运维、DevOps、容器部署平台、数据平台采集、数据平台存储、数据平台计算、数据平台开发、数据平台应用搭建的大数据解决方案。
Stars: ✭ 456 (-81.46%)
Hops ExamplesExamples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops
Stars: ✭ 84 (-96.59%)
FlintA Time Series Library for Apache Spark
Stars: ✭ 878 (-64.31%)
Lucene SolrApache Lucene and Solr open-source search software
Stars: ✭ 4,217 (+71.42%)
SmartstoreOpen Source ASP.NET Core Enterprise eCommerce Shopping Cart Solution
Stars: ✭ 82 (-96.67%)
TurniloBusiness intelligence, data exploration and visualization web application for Druid, formerly known as Swiv and Pivot
Stars: ✭ 427 (-82.64%)
OpenubaA robust, and flexible open source User & Entity Behavior Analytics (UEBA) framework used for Security Analytics. Developed with luv by Data Scientists & Security Analysts from the Cyber Security Industry. [PRE-ALPHA]
Stars: ✭ 127 (-94.84%)
Dji Firmware ToolsTools for handling firmwares of DJI products, with focus on quadcopters.
Stars: ✭ 424 (-82.76%)
FeatranA Scala feature transformation library for data science and machine learning
Stars: ✭ 420 (-82.93%)
HibitsetHierarchical bit set container
Stars: ✭ 81 (-96.71%)
Agile data code 2Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (-83.21%)
Cape PythonCollaborate on privacy-preserving policy for data science projects in Pandas and Apache Spark
Stars: ✭ 125 (-94.92%)
MarmarayGeneric Data Ingestion & Dispersal Library for Hadoop
Stars: ✭ 414 (-83.17%)
Spark GbtlrHybrid model of Gradient Boosting Trees and Logistic Regression (GBDT+LR) on Spark
Stars: ✭ 81 (-96.71%)
Devops Python Tools80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (-83.5%)
AztkAZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure
Stars: ✭ 152 (-93.82%)
BitvecA crate for managing memory bit by bit
Stars: ✭ 411 (-83.29%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-96.79%)
Awesome ElasticsearchA curated list of the most important and useful resources about elasticsearch: articles, videos, blogs, tips and tricks, use cases. All about Elasticsearch!
Stars: ✭ 4,168 (+69.43%)
Spark Bigquery ConnectorBigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
Stars: ✭ 126 (-94.88%)
IcebergIceberg is a table format for large, slow-moving tabular data
Stars: ✭ 393 (-84.02%)
HomeApacheCN 开源组织:公告、介绍、成员、活动、交流方式
Stars: ✭ 1,199 (-51.26%)
Hibernate SearchHibernate Search: full-text search for domain model
Stars: ✭ 382 (-84.47%)
Kraps RpcA RPC framework leveraging Spark RPC module
Stars: ✭ 175 (-92.89%)
RedashMake Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Stars: ✭ 20,147 (+718.98%)
BigdlBuilding Large-Scale AI Applications for Distributed Big Data
Stars: ✭ 3,813 (+55%)
NimrodNimrod - 基于 Spring Boot 构建 的 Java Web 平台企业级单体应用快速开发框架,适合中小型项目的应用和开发。所采用的技术栈包括 Spring Boot、Spring、Spring Web MVC、MyBatis、Thymeleaf 等,遵守阿里巴巴 Java 开发规约,帮助养成良好的编码习惯。整体采用 RBAC ( Role-Based Access Control ,基于角色的访问控制),具有严格的权限控制模块,支持系统与模块分离开发。最后希望这个项目能够对你有所帮助。Nimrod 开发交流群:547252502(QQ 群)
Stars: ✭ 125 (-94.92%)
WedatasphereWeDataSphere is a financial level one-stop open-source suitcase for big data platforms. Currently the source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!
Stars: ✭ 372 (-84.88%)
Ds CheatsheetsList of Data Science Cheatsheets to rule the world
Stars: ✭ 9,452 (+284.23%)
SparkmeasureThis is the development repository of SparkMeasure, a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics data.
Stars: ✭ 368 (-85.04%)
Cc PysparkProcess Common Crawl data with Python and Spark
Stars: ✭ 147 (-94.02%)
LogigskA Linux based software package to control led's on Logitech G910, G810, G610 and G410.
Stars: ✭ 107 (-95.65%)
TedsdsApache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Stars: ✭ 14 (-99.43%)
Live log analyzer sparkSpark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.
Stars: ✭ 14 (-99.43%)
Apache Spark Hands OnEducational notes,Hands on problems w/ solutions for hadoop ecosystem
Stars: ✭ 74 (-96.99%)
Seconds Kill基于 Springboot + Redis + Kafka 的秒杀系统,乐观锁 + 缓存 + 限流 + 异步,TPS 从 500 优化到 3000
Stars: ✭ 180 (-92.68%)
XsqlUnified SQL Analytics Engine Based on SparkSQL
Stars: ✭ 176 (-92.85%)
TransmogrifaiTransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (-15.28%)
HandysparkHandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (-93.58%)