pyspark-algorithms: PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Stars: ✭ 72 (+9.09%)
Iotdb: Apache IoTDB
Stars: ✭ 1,221 (+1750%)
merkle-db: High-scalability analytics database built on immutable merkle-trees
Stars: ✭ 44 (-33.33%)
Zeppelin: Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Stars: ✭ 5,513 (+8253.03%)
Helicalinsight: Helical Insight software is the world's first open source Business Intelligence framework, which helps you make sense of your data and make well-informed decisions.
Stars: ✭ 214 (+224.24%)
Cakebase: Cakebase is an asynchronous JSON database for Node.js.
Stars: ✭ 28 (-57.58%)
docs: Source code of the ArangoDB online documentation
Stars: ✭ 18 (-72.73%)
MochaDB: A .NET ACID RDBMS and NoSQL (with mods/tools) database.
Stars: ✭ 19 (-71.21%)
Clickhouse: ClickHouse® is a free analytics DBMS for big data
Stars: ✭ 21,089 (+31853.03%)
mmtf-spark: Methods for the parallel and distributed analysis and mining of the Protein Data Bank using MMTF and Apache Spark.
Stars: ✭ 20 (-69.7%)
shelfdb: A tiny document database for Python
Stars: ✭ 35 (-46.97%)
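awesome-coder-resources: A pit stop on your programming journey! [Continuously updated; stars are welcome, and come back often] [Contents: programming, learning, and reading resources, open-source projects, interview questions, websites, books, blogs, tutorials, and more]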
Stars: ✭ 54 (-18.18%)
Vue Virtual Scroll List: ⚡️A Vue component that supports large data lists with high rendering performance and efficiency.
Stars: ✭ 3,201 (+4750%)
Data Accelerator: Data Accelerator for Apache Spark simplifies onboarding to streaming of big data. It offers a rich, easy-to-use experience for creating, editing, and managing Spark jobs on Azure HDInsight or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (+274.24%)
Aws Etl Orchestrator: A serverless architecture for orchestrating ETL jobs in arbitrarily complex workflows using AWS Step Functions and AWS Lambda.
Stars: ✭ 245 (+271.21%)
couchdb-pkg: Apache CouchDB Packaging support files
Stars: ✭ 24 (-63.64%)
Kafka Ui: Open-Source Web GUI for Apache Kafka Management
Stars: ✭ 230 (+248.48%)
sgd: An R package for large-scale estimation with stochastic gradient descent
Stars: ✭ 55 (-16.67%)
incubator-tez: Mirror of Apache Tez (Incubating)
Stars: ✭ 60 (-9.09%)
bagri: XML/Document DB on top of a distributed cache
Stars: ✭ 40 (-39.39%)
ytpriv: YT metadata exporter
Stars: ✭ 28 (-57.58%)
arangors: Easy-to-use Rust driver for ArangoDB
Stars: ✭ 120 (+81.82%)
dislib: The Distributed Computing library for Python, implemented using the PyCOMPSs programming model for HPC.
Stars: ✭ 39 (-40.91%)
Clustering4Ever: C4E, a JVM-friendly library written in Scala for both local and distributed (Spark) clustering.
Stars: ✭ 126 (+90.91%)
masc: Microsoft's contributions for Spark with Apache Accumulo
Stars: ✭ 20 (-69.7%)
solr: Apache Solr open-source search software
Stars: ✭ 651 (+886.36%)
scikit-learn-intelex: Intel(R) Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
Stars: ✭ 887 (+1243.94%)
metriql: The metrics layer for your data. Join us at https://metriql.com/slack
Stars: ✭ 227 (+243.94%)
Koalas: Koalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+4512.12%)
android-thinkmap-treeview: Tree View; Mind map; Think map; tree map; custom view; relationship diagram; tree diagram; organization chart; hierarchy diagram
Stars: ✭ 314 (+375.76%)
Cboard: An easy-to-use, self-service open BI reporting and BI dashboard platform.
Stars: ✭ 2,795 (+4134.85%)
Hyperspace: An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
Stars: ✭ 246 (+272.73%)
yserial: NoSQL y_serial Python module – warehouse compressed objects with SQLite
Stars: ✭ 17 (-74.24%)
Trafodion: Apache Trafodion
Stars: ✭ 242 (+266.67%)
DataTanker: Embedded persistent key-value store for .NET. Pure C# code.
Stars: ✭ 53 (-19.7%)
Selinon: Advanced distributed task flow management on top of Celery
Stars: ✭ 237 (+259.09%)
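hocassian-people-neo4j: A NoSQL project for visualizing networks of personal connections. Non-relational databases present data in a form closer to how human memory works and should, in theory, become mainstream in applications. The hope is that this project can be a starting point for steadily lowering the barrier to entry for basic IT capabilities such as HelpDesk, data visualization, and data dashboards.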
Stars: ✭ 26 (-60.61%)
Eland: Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (+256.06%)
Books: A collection of books covering C & C++, git, Java, Keras, Linux, NLP, Python, Scala, TensorFlow, big data, recommender systems, databases, data mining, machine learning, deep learning, algorithms, and more.
Stars: ✭ 222 (+236.36%)
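cdp-service: A CDP data platform that helps enterprises fully understand their customers and deliver precisely targeted marketing personalized to each individual.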
Stars: ✭ 30 (-54.55%)
MindMap-Of-ES6: Mind maps of ES6 based on Ruan Yifeng's book "ECMAScript 6 入门" (Introduction to ECMAScript 6)
Stars: ✭ 28 (-57.58%)
Lite Virtual List: Vue-based virtual list component library that supports waterfall-flow layouts
Stars: ✭ 223 (+237.88%)
Nakedtensor: Bare-bones examples of machine learning in TensorFlow
Stars: ✭ 2,443 (+3601.52%)