All Projects → microsoft → masc

microsoft / masc

Licence: Apache-2.0 license
Microsoft's contributions for Spark with Apache Accumulo

Programming Languages

java
68154 projects - #9 most used programming language
Jupyter Notebook
11667 projects
scala
5932 projects
shell
77523 projects

Projects that are alternatives of or similar to masc

nifi
Deploy a secured, clustered, auto-scaling NiFi service in AWS.
Stars: ✭ 37 (+85%)
Mutual labels:  big-data, apache
couchdb-pkg
Apache CouchDB Packaging support files
Stars: ✭ 24 (+20%)
Mutual labels:  big-data, apache
accumulo-testing
Apache Accumulo Testing
Stars: ✭ 14 (-30%)
Mutual labels:  big-data, accumulo
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+650%)
Mutual labels:  big-data, apache
Hive
Apache Hive
Stars: ✭ 4,031 (+20055%)
Mutual labels:  big-data, apache
hadoop-data-ingestion-tool
OLAP and ETL of Big Data
Stars: ✭ 17 (-15%)
Mutual labels:  big-data, apache
accumulo-docker
Apache Accumulo Docker
Stars: ✭ 17 (-15%)
Mutual labels:  big-data, accumulo
Tez
Apache Tez
Stars: ✭ 313 (+1465%)
Mutual labels:  big-data, apache
Gaffer
A large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (+8110%)
Mutual labels:  big-data, accumulo
Couchdb Docker
Semi-official Apache CouchDB Docker images
Stars: ✭ 194 (+870%)
Mutual labels:  big-data, apache
Selinon
An advanced distributed task flow management on top of Celery
Stars: ✭ 237 (+1085%)
Mutual labels:  big-data
Kafka Ui
Open-Source Web GUI for Apache Kafka Management
Stars: ✭ 230 (+1050%)
Mutual labels:  big-data
Clickhouse
ClickHouse® is a free analytics DBMS for big data
Stars: ✭ 21,089 (+105345%)
Mutual labels:  big-data
predictionio-template-recommender
PredictionIO Recommendation Engine Template (Scala-based parallelized engine)
Stars: ✭ 80 (+300%)
Mutual labels:  big-data
Eland
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (+1075%)
Mutual labels:  big-data
Koalas
Koalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+15120%)
Mutual labels:  big-data
Books
整理一些书籍 ,包含 C&C++ 、git 、Java、Keras 、Linux 、NLP 、Python 、Scala 、TensorFlow 、大数据 、推荐系统、数据库、数据挖掘 、机器学习 、深度学习 、算法等。
Stars: ✭ 222 (+1010%)
Mutual labels:  big-data
Lite Virtual List
Virtual list component library supporting waterfall flow based on vue
Stars: ✭ 223 (+1015%)
Mutual labels:  big-data
Nakedtensor
Bare bone examples of machine learning in TensorFlow
Stars: ✭ 2,443 (+12115%)
Mutual labels:  big-data
Detecting-Malicious-URL-Machine-Learning
No description or website provided.
Stars: ✭ 47 (+135%)
Mutual labels:  big-data

Microsoft MASC, an Apache Spark connector for Apache Accumulo

The goal of this repository is to facilitate the use of Apache Spark and its machine learning ecosystem with Apache Accumulo as an external data source.

Contents

  • The connector provides connectivity to read from / write to Accumulo using Spark. See the README for more details about supported functionality.

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

Build

Build Status Maven Central Maven Central

License

All code provided, except where otherwise documented in OpenSource and NOTICE, is covered by the Apache License 2.0

Trademarks

Apache®, Apache Spark, Apache Accumulo and Accumulo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].