All Projects → aut → Similar Projects or Alternatives

1453 Open source projects that are alternatives of or similar to aut

spark-sql-internals
The Internals of Spark SQL
Stars: ✭ 331 (+198.2%)
Mutual labels:  apache-spark
knime-r
KNIME Interactive R Statistics Integration
Stars: ✭ 18 (-83.78%)
Mutual labels:  analysis
oci-cloudera
Terraform module to deploy Cloudera on Oracle Cloud Infrastructure (OCI)
Stars: ✭ 20 (-81.98%)
Mutual labels:  hadoop
PHAT
Pathogen-Host Analysis Tool - A modern Next-Generation Sequencing (NGS) analysis platform
Stars: ✭ 17 (-84.68%)
Mutual labels:  analysis
spring-startup-analysis
Simple module to analyse bean construction in Java Spring
Stars: ✭ 76 (-31.53%)
Mutual labels:  analysis
beam-site
Apache Beam Site
Stars: ✭ 28 (-74.77%)
Mutual labels:  big-data
splink
Implementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
Stars: ✭ 181 (+63.06%)
Mutual labels:  spark
volkscv
A Python toolbox for computer vision research and project
Stars: ✭ 58 (-47.75%)
Mutual labels:  analysis
go-mnd
Magic number detector for Go.
Stars: ✭ 153 (+37.84%)
Mutual labels:  analysis
hadoopoffice
HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
Stars: ✭ 56 (-49.55%)
Mutual labels:  hadoop
predictionio-sdk-php
PredictionIO PHP SDK
Stars: ✭ 269 (+142.34%)
Mutual labels:  big-data
airavata-php-gateway
Mirror of Apache Airavata PHP Gateway
Stars: ✭ 15 (-86.49%)
Mutual labels:  big-data
ggshakeR
An analysis and visualization R package that works with publicly available soccer data
Stars: ✭ 69 (-37.84%)
Mutual labels:  analysis
shamash
Autoscaling for Google Cloud Dataproc
Stars: ✭ 31 (-72.07%)
Mutual labels:  spark
skein
A tool and library for easily deploying applications on Apache YARN
Stars: ✭ 128 (+15.32%)
Mutual labels:  hadoop
Bitcoin Analysis-
Python Bitcoin is widely used cryptocurrency for digital market. It is decentralised that means it is not own by government or any other company.Transactions are simple and easy as it doesn’t belong to any country.Records data are stored in Blockchain.Bitcoin price is variable and it is widely used so it is important to predict the price of it f…
Stars: ✭ 42 (-62.16%)
Mutual labels:  analysis
cummings.ee
A collection of the work of Edward Estlin Cummings, as it enters the public domain.
Stars: ✭ 32 (-71.17%)
Mutual labels:  digital-humanities
character-extraction
Extracts character names from a text file and performs analysis of text sentences containing the names.
Stars: ✭ 40 (-63.96%)
Mutual labels:  analysis
Spark-Ar
Resources for Spark AR
Stars: ✭ 43 (-61.26%)
Mutual labels:  spark
pandapower gui
A Graphical User Interface for the open source pandapower load flow program. [ I was so inexperienced when I started this, but maybe we can try again]
Stars: ✭ 28 (-74.77%)
Mutual labels:  analysis
platys-modern-data-platform
Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....
Stars: ✭ 35 (-68.47%)
Mutual labels:  hadoop
GroupDocs.Classification-for-.NET
GroupDocs.Classification-for-.NET samples and showcase (text and documents classification and sentiment analysis)
Stars: ✭ 38 (-65.77%)
Mutual labels:  analysis
ohloh-ui
Web Application for the Ohloh Stack.
Stars: ✭ 72 (-35.14%)
Mutual labels:  analysis
wiki
从diy行为艺术到diy苏格拉底式对话,从diy一个仪式到diy一次旷课,各种活动指南的百科。diy💔是706孵化的一个非代码开源项目。
Stars: ✭ 49 (-55.86%)
Mutual labels:  digital-humanities
pulsar-adapters
Apache Pulsar Adapters
Stars: ✭ 18 (-83.78%)
Mutual labels:  apache-spark
decaylanguage
Package to parse decay files, describe and convert particle decays between digital representations.
Stars: ✭ 34 (-69.37%)
Mutual labels:  analysis
hypothetical
Hypothesis and statistical testing in Python
Stars: ✭ 49 (-55.86%)
Mutual labels:  analysis
xxhadoop
Data Analysis Using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. This is My Daily Notes/Code/Demo. Don't fork, Just star !
Stars: ✭ 37 (-66.67%)
Mutual labels:  hadoop
census
📜Automated review of open source software projects
Stars: ✭ 111 (+0%)
Mutual labels:  analysis
TwitterSearch2Gephi
This windows CLI app lets you collect data from twitter via REST API and convert it into a CSV data set that can be used with Gephi. Other social networks (Reddit, Youtube, WWW) are also supported.
Stars: ✭ 21 (-81.08%)
Mutual labels:  analysis
jmx exporter-cloudera-hadoop
Prometheus jmx_exporter configurations for Cloudera Hadoop
Stars: ✭ 33 (-70.27%)
Mutual labels:  hadoop
fluent-plugin-webhdfs
Hadoop WebHDFS output plugin for Fluentd
Stars: ✭ 57 (-48.65%)
Mutual labels:  hadoop
dmarc-viewer
Django based web-app to visually analyze DMARC aggregate reports
Stars: ✭ 51 (-54.05%)
Mutual labels:  analysis
PysparkCheatsheet
PySpark Cheatsheet
Stars: ✭ 25 (-77.48%)
Mutual labels:  apache-spark
spark-stringmetric
Spark functions to run popular phonetic and string matching algorithms
Stars: ✭ 51 (-54.05%)
Mutual labels:  spark
jobAnalytics and search
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (-77.48%)
Mutual labels:  pyspark
booknlp
BookNLP, a natural language processing pipeline for books
Stars: ✭ 636 (+472.97%)
Mutual labels:  digital-humanities
ceja
PySpark phonetic and string matching algorithms
Stars: ✭ 24 (-78.38%)
Mutual labels:  pyspark
analysis-net
Static analysis framework for .NET programs.
Stars: ✭ 19 (-82.88%)
Mutual labels:  analysis
BigCLAM-ApacheSpark
Overlapping community detection in Large-Scale Networks using BigCLAM model build on Apache Spark
Stars: ✭ 40 (-63.96%)
Mutual labels:  apache-spark
Spark-and-Kafka IoT-Data-Processing-and-Analytics
Final Project for IoT: Big Data Processing and Analytics class. Analyzing U.S nationwide temperature from IoT sensors in real-time
Stars: ✭ 42 (-62.16%)
Mutual labels:  pyspark
visualize-data-with-python
A Jupyter notebook using some standard techniques for data science and data engineering to analyze data for the 2017 flooding in Houston, TX.
Stars: ✭ 60 (-45.95%)
Mutual labels:  spark
story-generator
Budget Visualization Tool to explore and analyse major fiscal indicators across various states in India
Stars: ✭ 17 (-84.68%)
Mutual labels:  analysis
scarf
Toolkit for highly memory efficient analysis of single-cell RNA-Seq, scATAC-Seq and CITE-Seq data. Analyze atlas scale datasets with millions of cells on laptop.
Stars: ✭ 54 (-51.35%)
Mutual labels:  big-data
disq
A library for manipulating bioinformatics sequencing formats in Apache Spark
Stars: ✭ 29 (-73.87%)
Mutual labels:  hadoop
ros hadoop
Hadoop splittable InputFormat for ROS. Process rosbag with Hadoop Spark and other HDFS compatible systems.
Stars: ✭ 92 (-17.12%)
Mutual labels:  hadoop
appdata-environment-desktop
A selection of script and the manual for Privacy International's data interception environment
Stars: ✭ 70 (-36.94%)
Mutual labels:  analysis
corc
An ORC File Scheme for the Cascading data processing platform.
Stars: ✭ 14 (-87.39%)
Mutual labels:  hadoop
hotmap
WebGL Heatmap Viewer for Big Data and Bioinformatics
Stars: ✭ 13 (-88.29%)
Mutual labels:  big-data
covid19analysis
COVID-10 Analysis
Stars: ✭ 16 (-85.59%)
Mutual labels:  analysis
BigInsights-on-Apache-Hadoop
Example projects for 'BigInsights for Apache Hadoop' on IBM Bluemix
Stars: ✭ 21 (-81.08%)
Mutual labels:  hadoop
polars
Fast multi-threaded DataFrame library in Rust | Python | Node.js
Stars: ✭ 6,368 (+5636.94%)
Mutual labels:  dataframe
Unitor
Tool for analysing and disassembling any unity game. Supports both mono and il2cpp.
Stars: ✭ 31 (-72.07%)
Mutual labels:  analysis
TraduXio
A participative platform for cultural texts translators
Stars: ✭ 19 (-82.88%)
Mutual labels:  digital-humanities
taint-with-frida
just an experiment
Stars: ✭ 17 (-84.68%)
Mutual labels:  analysis
pypar
Efficient and scalable parallelism using the message passing interface (MPI) to handle big data and highly computational problems.
Stars: ✭ 66 (-40.54%)
Mutual labels:  big-data
pathpy
pathpy is an OpenSource python package for the modeling and analysis of pathways and temporal networks using higher-order and multi-order graphical models
Stars: ✭ 124 (+11.71%)
Mutual labels:  analysis
net.jgp.books.spark.ch07
Spark in Action, 2nd edition - chapter 7 - Ingestion from files
Stars: ✭ 13 (-88.29%)
Mutual labels:  apache-spark
RemoteShuffleService
Celeborn provides an elastic and high-performance service for shuffle and spilled data.
Stars: ✭ 262 (+136.04%)
Mutual labels:  big-data
couchdb-mango
Mirror of Apache CouchDB Mango
Stars: ✭ 34 (-69.37%)
Mutual labels:  big-data
241-300 of 1453 similar projects