OWASP Mth3l3m3nt Framework is a penetration testing aiding tool and exploitation framework. It fosters a principle of attack the web using the web as well as pentest on the go through its responsive interface.

Stars: ✭ 139 (-93.9%)

Mutual labels: apache

Gobblin

A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.

Stars: ✭ 2,006 (-11.9%)

Mutual labels: apache

Eel Sdk

Big Data Toolkit for the JVM

Stars: ✭ 140 (-93.85%)

Mutual labels: hadoop

Xlearning

AI on Hadoop

Stars: ✭ 1,709 (-24.95%)

Mutual labels: hadoop

Linkedin Profile Scraper

🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.

Stars: ✭ 171 (-92.49%)

Mutual labels: crawling

Htaccess

✂A collection of useful .htaccess snippets.

Stars: ✭ 11,830 (+419.54%)

Mutual labels: apache

Newspaper

News, full-text, and article metadata extraction in Python 3. Advanced docs:

Stars: ✭ 11,545 (+407.03%)

Mutual labels: crawling

Aliyun Emapreduce Datasources

Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.

Stars: ✭ 132 (-94.2%)

Mutual labels: hadoop

Abot

Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.

Stars: ✭ 1,961 (-13.88%)

Mutual labels: web-crawler

Collector Http

Norconex HTTP Collector is a flexible web crawler for collecting, parsing, and manipulating data from the Internet (or Intranet) to various data repositories such as search engines.

Stars: ✭ 130 (-94.29%)

Mutual labels: web-crawler

Massivedl

Download a large list of files concurrently

Stars: ✭ 141 (-93.81%)

Mutual labels: crawling

Geode

Apache Geode

Stars: ✭ 2,016 (-11.46%)

Mutual labels: apache

Azure Event Hubs Spark

Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs

Stars: ✭ 140 (-93.85%)

Mutual labels: apache

Apache exporter

Prometheus exporter for Apache.

Stars: ✭ 172 (-92.45%)

Mutual labels: apache

Instagram Bot

An Instagram bot developed using the Selenium Framework

Stars: ✭ 138 (-93.94%)

Mutual labels: crawling

Presto

The official home of the Presto distributed SQL query engine for big data

Stars: ✭ 12,957 (+469.04%)

Mutual labels: hadoop

Hbaseclient

HBase客户端数据管理软件

Stars: ✭ 135 (-94.07%)

Mutual labels: hadoop

N2h4

네이버 뉴스 수집을 위한 도구

Stars: ✭ 177 (-92.23%)

Mutual labels: crawling

Beyond Jupyter

🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)

Stars: ✭ 135 (-94.07%)

Mutual labels: apache

Holiday Cn

📅🇨🇳 中国法定节假日数据自动每日抓取国务院公告

Stars: ✭ 157 (-93.1%)

Mutual labels: crawling

Mod auth cas

An Apache httpd module for integrating with Apereo CAS Server project.

Stars: ✭ 130 (-94.29%)

Mutual labels: apache

Htconvert

Convert .htaccess redirects to nginx.conf redirects

Stars: ✭ 171 (-92.49%)

Mutual labels: apache

Hadoop Common

Mirror of Apache Hadoop common

Stars: ✭ 155 (-93.19%)

Mutual labels: hadoop

Calcite Avatica

Mirror of Apache Calcite - Avatica

Stars: ✭ 130 (-94.29%)

Mutual labels: hadoop

Gaffer

A large-scale entity and relation database supporting aggregation of properties

Stars: ✭ 1,642 (-27.89%)

Mutual labels: hadoop

Airflow Pipeline

An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR

Stars: ✭ 128 (-94.38%)

Mutual labels: hadoop

Goaccess

GoAccess is a real-time web log analyzer and interactive viewer that runs in a terminal in *nix systems or through your browser.

Stars: ✭ 14,096 (+519.06%)

Mutual labels: apache

Bigdata Playground

A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL

Stars: ✭ 177 (-92.23%)

Mutual labels: hadoop

Deeplearning4j

Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learni…

Stars: ✭ 12,277 (+439.17%)

Mutual labels: hadoop

Movie recommend

基于Spark的电影推荐系统，包含爬虫项目、web网站、后台管理系统以及spark推荐系统

Stars: ✭ 2,092 (-8.12%)

Mutual labels: hadoop

Serverpilot Letsencrypt

Automate the installation of Let's Encrypt SSL on the free plan of ServerPilot

Stars: ✭ 129 (-94.33%)

Mutual labels: apache

Spydra

Ephemeral Hadoop clusters using Google Compute Platform