Learned IndexesImplementation of BTree part for paper 'The Case for Learned Index Structures'
Stars: ✭ 64 (-41.28%)
Cleanframestype-class based data cleansing library for Apache Spark SQL
Stars: ✭ 75 (-31.19%)
SiodbThe simplicity of REST and the power of SQL combined in a database that automatized security and performance. Forget the database, develop faster and safer!
Stars: ✭ 31 (-71.56%)
Spark FfmFFM (Field-Awared Factorization Machine) on Spark
Stars: ✭ 101 (-7.34%)
Spark FlamegraphEasy CPU Profiling for Apache Spark applications
Stars: ✭ 30 (-72.48%)
Devops ResourcesDevOps resources - Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP
Stars: ✭ 1,194 (+995.41%)
SparkmagicJupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (+775.23%)
Apache Spark Hands OnEducational notes,Hands on problems w/ solutions for hadoop ecosystem
Stars: ✭ 74 (-32.11%)
Sqlfaker轻量级、易拓展的数据库智能填充Java开源库
Stars: ✭ 109 (+0%)
Lol dbalol_dba is a small package of rake tasks that scan your application models and displays a list of columns that probably should be indexed. Also, it can generate .sql migration scripts.
Stars: ✭ 1,363 (+1150.46%)
DashScalable Hashing on Persistent Memory
Stars: ✭ 86 (-21.1%)
Pysparkgeoanalysis🌐 Interactive Workshop on GeoAnalysis using PySpark
Stars: ✭ 63 (-42.2%)
Data Algorithms Book MapReduce, Spark, Java, and Scala for Data Algorithms Book
Stars: ✭ 949 (+770.64%)
WikipediatrendA convenience R package for getting Wikipedia article access statistics (and more).
Stars: ✭ 73 (-33.03%)
Scikit FdaFunctional Data Analysis Python package
Stars: ✭ 91 (-16.51%)
LabsResearch on distributed system
Stars: ✭ 73 (-33.03%)
Pystan2PyStan, the Python interface to Stan
Stars: ✭ 915 (+739.45%)
MahaA framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.
Stars: ✭ 101 (-7.34%)
FacsimileFacsimile Simulation Library
Stars: ✭ 20 (-81.65%)
OdindexOnedrive index transplanted from Heymind.
Stars: ✭ 91 (-16.51%)
FlintA Time Series Library for Apache Spark
Stars: ✭ 878 (+705.5%)
Luigi WarehouseA luigi powered analytics / warehouse stack
Stars: ✭ 72 (-33.94%)
Live log analyzer sparkSpark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.
Stars: ✭ 14 (-87.16%)
GriddbGridDB is a next-generation open source database that makes time series IoT and big data fast,and easy.
Stars: ✭ 1,587 (+1355.96%)
Pandas ProfilingCreate HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+7541.28%)
Fecon236Tools for financial economics. Curated wrapper over Python ecosystem. Source code for fecon235 Jupyter notebooks.
Stars: ✭ 72 (-33.94%)
AlembicA database migrations tool for SQLAlchemy.
Stars: ✭ 874 (+701.83%)
Jcabi JdbcFluent Wrapper of JDBC
Stars: ✭ 90 (-17.43%)
HotcoldSmart touch typing learning with instant key glow indications, live statistics, live graphs and dynamic course creation.
Stars: ✭ 12 (-88.99%)
AndlAndl is A New Database Language
Stars: ✭ 71 (-34.86%)
Sparkling TitanicTraining models with Apache Spark, PySpark for Titanic Kaggle competition
Stars: ✭ 12 (-88.99%)
Node Sonic Channel🦉 Sonic Channel integration for Node. Used in pair with Sonic, the fast, lightweight and schema-less search backend.
Stars: ✭ 101 (-7.34%)
PhoenixMirror of Apache Phoenix
Stars: ✭ 867 (+695.41%)
MareMaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.
Stars: ✭ 11 (-89.91%)
Cloudquerycloudquery transforms your cloud infrastructure into SQL or Graph database for easy monitoring, governance and security.
Stars: ✭ 1,300 (+1092.66%)
Bigdata Interview🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结
Stars: ✭ 857 (+686.24%)
MetriculousMeasure and visualize machine learning model performance without the usual boilerplate.
Stars: ✭ 71 (-34.86%)
DramaanalysisAn R package for analysis of dramatic texts
Stars: ✭ 10 (-90.83%)
Flink Learningflink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
Stars: ✭ 11,378 (+10338.53%)
KtormA lightweight ORM framework for Kotlin with strong-typed SQL DSL and sequence APIs.
Stars: ✭ 843 (+673.39%)
Live2SAP HANA Academy - Live2 project code samples for playlist https://www.youtube.com/playlist?list=PLkzo92owKnVyIXgkK__7Z1o_C7pyNc3SR
Stars: ✭ 8 (-92.66%)
ZeligA statistical framework that serves as a common interface to a large range of models
Stars: ✭ 89 (-18.35%)
Reporting Services Examples📕 Various example reports I use for SQL Server Reporting Services (SSRS) as well as documents for unit testing, requirements and a style guide template.
Stars: ✭ 63 (-42.2%)
Bigdata File ViewerA cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
Stars: ✭ 86 (-21.1%)
Sqlite orm❤️ SQLite ORM light header only library for modern C++
Stars: ✭ 1,121 (+928.44%)
Dynamodb OopSpeak fluent DynamoDB, write code with fashion, I Promise() 😃
Stars: ✭ 104 (-4.59%)
PorcupineThreading, Resiliency and Monitoring for Java EE 7/8
Stars: ✭ 99 (-9.17%)
Training MaterialA collection of code examples as well as presentations for training purposes
Stars: ✭ 85 (-22.02%)
EventqlDistributed "massively parallel" SQL query engine
Stars: ✭ 1,121 (+928.44%)
RoffildlibraryLibrary for MQL5 (MetaTrader) with Python, Java, Apache Spark, AWS
Stars: ✭ 63 (-42.2%)