GeodeApache Geode
Stars: ✭ 2,016 (-11.46%)
Memex ExplorerViewers for statistics and dashboarding of Domain Search Engine data
Stars: ✭ 115 (-94.95%)
Azure Event Hubs SparkEnabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-93.85%)
Xlearning Xdmlextremely distributed machine learning
Stars: ✭ 113 (-95.04%)
Instagram BotAn Instagram bot developed using the Selenium Framework
Stars: ✭ 138 (-93.94%)
Avro Hadoop StarterExample MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
Stars: ✭ 110 (-95.17%)
PrestoThe official home of the Presto distributed SQL query engine for big data
Stars: ✭ 12,957 (+469.04%)
WaterdropProduction Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (-18.49%)
EchartsApache ECharts is a powerful, interactive charting and data visualization library for browser
Stars: ✭ 49,119 (+2057.18%)
N2h4네이버 뉴스 수집을 위한 도구
Stars: ✭ 177 (-92.23%)
SupersetApache Superset is a Data Visualization and Data Exploration Platform
Stars: ✭ 42,634 (+1772.38%)
Beyond Jupyter🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
Stars: ✭ 135 (-94.07%)
PulsarTurn large Web sites into tables and charts using simple SQLs.
Stars: ✭ 100 (-95.61%)
Holiday Cn📅🇨🇳 中国法定节假日数据 自动每日抓取国务院公告
Stars: ✭ 157 (-93.1%)
Mod auth casAn Apache httpd module for integrating with Apereo CAS Server project.
Stars: ✭ 130 (-94.29%)
StormkafkamonDumps state of Storm Kafka consumers
Stars: ✭ 99 (-95.65%)
HtconvertConvert .htaccess redirects to nginx.conf redirects
Stars: ✭ 171 (-92.49%)
GrawlerGrawler is a tool written in PHP which comes with a web interface that automates the task of using google dorks, scrapes the results, and stores them in a file.
Stars: ✭ 98 (-95.7%)
Hadoop CommonMirror of Apache Hadoop common
Stars: ✭ 155 (-93.19%)
Airflow PipelineAn Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Stars: ✭ 128 (-94.38%)
Wifi基于wifi抓取信息的大数据查询分析系统
Stars: ✭ 93 (-95.92%)
GoaccessGoAccess is a real-time web log analyzer and interactive viewer that runs in a terminal in *nix systems or through your browser.
Stars: ✭ 14,096 (+519.06%)
TinkerpopApache TinkerPop - a graph computing framework
Stars: ✭ 1,309 (-42.51%)
SpydraEphemeral Hadoop clusters using Google Compute Platform
Stars: ✭ 128 (-94.38%)
DockerwebA docker-powered bash script for shared web hosting management. The ultimate Docker LAMP/LEMP Stack.
Stars: ✭ 89 (-96.09%)
CorreiosA client library for Brazilian Correios APIs and services (SIGEP & SRO).
Stars: ✭ 153 (-93.28%)
Bhban rpa6개월 치 업무를 하루 만에 끝내는 업무 자동화(생능출판사, 2020)의 예제 코드입니다. 파이썬을 한 번도 배워본 적 없는 분들을 위한 예제이며, 엑셀부터 디자인, 매크로, 크롤링까지 업무 자동화와 관련된 다양한 분야 예제가 제공됩니다.
Stars: ✭ 124 (-94.55%)
Docker SupersetRepository for Docker Image of Apache-Superset. [Docker Image: https://hub.docker.com/r/abhioncbr/docker-superset]
Stars: ✭ 86 (-96.22%)
Qpid ProtonMirror of Apache Qpid Proton
Stars: ✭ 164 (-92.8%)
Spark StatesCustom state store providers for Apache Spark
Stars: ✭ 83 (-96.35%)
CorpuscrawlerCrawler for linguistic corpora
Stars: ✭ 127 (-94.42%)
Dig Etl EngineDownload DIG to run on your laptop or server.
Stars: ✭ 81 (-96.44%)
Parquet4sRead and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
Stars: ✭ 125 (-94.51%)
Guacamole Install Rhel 7Apache Guacamole installation bash script for RHEL 7 and CentOS 7 including options for Nginx, HTTPS, SSL, LDAP, Let's Encrypt certificates and more
Stars: ✭ 174 (-92.36%)
Big WhaleSpark、Flink等离线任务的调度以及实时任务的监控
Stars: ✭ 163 (-92.84%)
SquidwarcSquidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
Stars: ✭ 125 (-94.51%)
LucenenetApache Lucene.NET
Stars: ✭ 1,704 (-25.16%)
Php Apache TikaApache Tika bindings for PHP: extract text and metadata from documents, images and other formats
Stars: ✭ 76 (-96.66%)
Docker Spark🚢 Docker image for Apache Spark
Stars: ✭ 78 (-96.57%)
CrawlerGo process used to crawl websites
Stars: ✭ 147 (-93.54%)
ProxyA simple tool for fetching usable proxies from several websites.
Stars: ✭ 124 (-94.55%)
ChukwaMirror of Apache Chukwa
Stars: ✭ 77 (-96.62%)
Poi Android📈 Apache POI for Android
Stars: ✭ 77 (-96.62%)
Tf YarnTrain TensorFlow models on YARN in just a few lines of code!
Stars: ✭ 76 (-96.66%)
RareFast, realtime regex-extraction, and aggregation into common formats such as histograms, numerical summaries, tables, and more!
Stars: ✭ 76 (-96.66%)