NBiNBi is a testing framework (add-on to NUnit) for Business Intelligence and Data Access. The main goal of this framework is to let users create tests with a declarative approach based on an Xml syntax. By the means of NBi, you don't need to develop C# or Java code to specify your tests! Either, you don't need Visual Studio or Eclipse to compile y…
Stars: ✭ 102 (-64.34%)
Kotlin Spark ApiThis projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
Stars: ✭ 183 (-36.01%)
Graph-OLAPAn attempt to model an OLAP cube with Neo4j.
Stars: ✭ 37 (-87.06%)
KoalasKoalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+964.34%)
XsqlUnified SQL Analytics Engine Based on SparkSQL
Stars: ✭ 176 (-38.46%)
spark-stringmetricSpark functions to run popular phonetic and string matching algorithms
Stars: ✭ 51 (-82.17%)
Kraps RpcA RPC framework leveraging Spark RPC module
Stars: ✭ 175 (-38.81%)
Every Single Day I TldrA daily digest of the articles or videos I've found interesting, that I want to share with you.
Stars: ✭ 249 (-12.94%)
Spark NlpState of the Art Natural Language Processing
Stars: ✭ 2,518 (+780.42%)
MLiFCCourse Material for the machine learning in financial context bootcamp
Stars: ✭ 102 (-64.34%)
TransmogrifaiTransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Stars: ✭ 2,084 (+628.67%)
Data AcceleratorData Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (-13.64%)
Spark-ArResources for Spark AR
Stars: ✭ 43 (-84.97%)
Neo4j Spark ConnectorNeo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs
Stars: ✭ 245 (-14.34%)
Whylogs JavaProfile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (-42.66%)
harlanHarlan é o sistema modular que permite você automatizar toda sua governança cadastral da nuvem.
Stars: ✭ 25 (-91.26%)
LinkisLinkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+712.24%)
RecommendationsystemBook recommender system using collaborative filtering based on Spark
Stars: ✭ 244 (-14.69%)
GlowAn open-source toolkit for large-scale genomic analysis
Stars: ✭ 159 (-44.41%)
HandysparkHandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (-44.76%)
Hadoop Docker基于Docker构建的Hadoop开发测试环境,包含Hadoop,Hive,HBase,Spark
Stars: ✭ 238 (-16.78%)
dashinatorDashinator the daringly delightful dashboard. A replacement for dashing
Stars: ✭ 56 (-80.42%)
PowderkegLive-coding the cluster!
Stars: ✭ 152 (-46.85%)
swordfishOpen-source distribute workflow schedule tools, also support streaming task.
Stars: ✭ 35 (-87.76%)
MydatascienceportfolioApplying Data Science and Machine Learning to Solve Real World Business Problems
Stars: ✭ 227 (-20.63%)
AztkAZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure
Stars: ✭ 152 (-46.85%)
automile-phpAutomile offers a simple, smart, cutting-edge telematics solution for businesses to track and manage their business vehicles.
Stars: ✭ 28 (-90.21%)
Cc PysparkProcess Common Crawl data with Python and Spark
Stars: ✭ 147 (-48.6%)
Spark WorkshopApache Spark™ and Scala Workshops
Stars: ✭ 224 (-21.68%)
DatacompyPandas and Spark DataFrame comparison for humans
Stars: ✭ 147 (-48.6%)
Technology Talk汇总java生态圈常用技术框架、开源中间件,系统架构、数据库、大公司架构案例、常用三方类库、项目管理、线上问题排查、个人成长、思考等知识
Stars: ✭ 12,136 (+4143.36%)
Sagemaker SparkA Spark library for Amazon SageMaker.
Stars: ✭ 219 (-23.43%)
Spark AuthorizerA Spark SQL extension which provides SQL Standard Authorization for Apache Spark
Stars: ✭ 141 (-50.7%)
automile-netAutomile offers a simple, smart, cutting-edge telematics solution for businesses to track and manage their business vehicles.
Stars: ✭ 24 (-91.61%)
Data science blogsA repository to keep track of all the code that I end up writing for my blog posts.
Stars: ✭ 139 (-51.4%)
GimelBig Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-24.48%)
fastdata-clusterFast Data Cluster (Apache Cassandra, Kafka, Spark, Flink, YARN and HDFS with Vagrant and VirtualBox)
Stars: ✭ 20 (-93.01%)
Isolation ForestA Spark/Scala implementation of the isolation forest unsupervised outlier detection algorithm.
Stars: ✭ 139 (-51.4%)
neo4j-jdbcJDBC driver for Neo4j
Stars: ✭ 110 (-61.54%)
HorovodDistributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Stars: ✭ 11,943 (+4075.87%)
Spark Knnk-Nearest Neighbors algorithm on Spark
Stars: ✭ 205 (-28.32%)
OpaqueAn encrypted data analytics platform
Stars: ✭ 129 (-54.9%)
naruNeural Relation Understanding: neural cardinality estimators for tabular data
Stars: ✭ 76 (-73.43%)
Spark PracticeApache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (-30.07%)
BallistaDistributed compute platform implemented in Rust, and powered by Apache Arrow.
Stars: ✭ 2,274 (+695.1%)
MmlsparkSimple and Distributed Machine Learning
Stars: ✭ 2,899 (+913.64%)
ODSC India 2018My presentation at ODSC India 2018 about Deep Learning with Apache Spark
Stars: ✭ 26 (-90.91%)
sparkar-voltsAn extensive non-reactive Typescript framework that eases the development experience in Spark AR
Stars: ✭ 15 (-94.76%)