All Projects → fb_scraper → Similar Projects or Alternatives

166 Open source projects that are alternatives of or similar to fb_scraper

flink-connector-kudu
基于Apache-bahir-kudu-connector的flink-connector-kudu,支持Flink1.11.x DynamicTableSource/Sink,支持Range分区等
Stars: ✭ 40 (-34.43%)
Mutual labels:  flink
gr-eventstream
gr-eventstream is a set of GNU Radio blocks for creating precisely timed events and either inserting them into, or extracting them from normal data-streams precisely. It allows for the definition of high speed time-synchronous c++ burst event handlers, as well as bridging to standard GNU Radio Async PDU messages with precise timing easily.
Stars: ✭ 38 (-37.7%)
Mutual labels:  extract-data
flink-demo
Flink Demo
Stars: ✭ 39 (-36.07%)
Mutual labels:  flink
seatunnel-example
seatunnel plugin developing examples.
Stars: ✭ 27 (-55.74%)
Mutual labels:  flink
flink-spark-submiter
从本地IDEA提交Flink/Spark任务到Yarn/k8s集群
Stars: ✭ 157 (+157.38%)
Mutual labels:  flink
hadoopoffice
HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
Stars: ✭ 56 (-8.2%)
Mutual labels:  flink
bigdata-doc
大数据学习笔记,学习路线,技术案例整理。
Stars: ✭ 37 (-39.34%)
Mutual labels:  flink
tf-idf-python
Term frequency–inverse document frequency for Chinese novel/documents implemented in python.
Stars: ✭ 98 (+60.66%)
Mutual labels:  tf-idf
Recommender-Systems
Implementing Content based and Collaborative filtering(with KNN, Matrix Factorization and Neural Networks) in Python
Stars: ✭ 46 (-24.59%)
Mutual labels:  tf-idf
dockerfiles
Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )
Stars: ✭ 29 (-52.46%)
Mutual labels:  flink
extract-colors-py
Extract colors from an image. Colors are grouped based on visual similarities using the CIE76 formula.
Stars: ✭ 48 (-21.31%)
Mutual labels:  extract-data
pygrams
Extracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence
Stars: ✭ 52 (-14.75%)
Mutual labels:  tf-idf
TextAudit
一个短视频app文本审核模块的实现思路及demo
Stars: ✭ 63 (+3.28%)
Mutual labels:  tf-idf
logparser
Easy parsing of Apache HTTPD and NGINX access logs with Java, Hadoop, Hive, Pig, Flink, Beam, Storm, Drill, ...
Stars: ✭ 139 (+127.87%)
Mutual labels:  flink
wink-bm25-text-search
Fast Full Text Search based on BM25
Stars: ✭ 44 (-27.87%)
Mutual labels:  tf-idf
Insider-Trading
This program extracts insider trading data from the sec website and stores it in excel file for the specified time frame.
Stars: ✭ 43 (-29.51%)
Mutual labels:  extract-data
text-classification-cn
中文文本分类实践,基于搜狗新闻语料库,采用传统机器学习方法以及预训练模型等方法
Stars: ✭ 81 (+32.79%)
Mutual labels:  tf-idf
dpkb
大数据相关内容汇总,包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词:Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse
Stars: ✭ 123 (+101.64%)
Mutual labels:  flink
SentimentAnalysis
(BOW, TF-IDF, Word2Vec, BERT) Word Embeddings + (SVM, Naive Bayes, Decision Tree, Random Forest) Base Classifiers + Pre-trained BERT on Tensorflow Hub + 1-D CNN and Bi-Directional LSTM on IMDB Movie Reviews Dataset
Stars: ✭ 40 (-34.43%)
Mutual labels:  tf-idf
clusterix
Visual exploration of clustered data.
Stars: ✭ 44 (-27.87%)
Mutual labels:  tf-idf
text-classification-baseline
Pipeline for fast building text classification TF-IDF + LogReg baselines.
Stars: ✭ 55 (-9.84%)
Mutual labels:  tf-idf
ResumeRise
An NLP tool which classifies and summarizes resumes
Stars: ✭ 29 (-52.46%)
Mutual labels:  tf-idf
flink-training-troubleshooting
No description or website provided.
Stars: ✭ 41 (-32.79%)
Mutual labels:  flink
bns-short-text-similarity
📖 Use Bi-normal Separation to find document vectors which is used to compute similarity for shorter sentences.
Stars: ✭ 24 (-60.66%)
Mutual labels:  tf-idf
Websockets-Vertx-Flink-Kafka
A simple request response cycle using Websockets, Eclipse Vert-x server, Apache Kafka, Apache Flink.
Stars: ✭ 14 (-77.05%)
Mutual labels:  flink
html2data
Library and cli for extracting data from HTML via CSS selectors
Stars: ✭ 62 (+1.64%)
Mutual labels:  extract-data
Lidea
大型分布式系统实时监控平台
Stars: ✭ 28 (-54.1%)
Mutual labels:  flink
devsearch
A web search engine built with Python which uses TF-IDF and PageRank to sort search results.
Stars: ✭ 52 (-14.75%)
Mutual labels:  tf-idf
apache-flink-jdbc-streaming
Sample project for Apache Flink with Streaming Engine and JDBC Sink
Stars: ✭ 22 (-63.93%)
Mutual labels:  flink
emma
A quotation-based Scala DSL for scalable data analysis.
Stars: ✭ 61 (+0%)
Mutual labels:  flink
open-stream-processing-benchmark
This repository contains the code base for the Open Stream Processing Benchmark.
Stars: ✭ 37 (-39.34%)
Mutual labels:  flink
minimal-search-engine
最小のサーチエンジン/PageRank/tf-idf
Stars: ✭ 18 (-70.49%)
Mutual labels:  tf-idf
dlink
Dinky is an out of the box one-stop real-time computing platform dedicated to the construction and practice of Unified Streaming & Batch and Unified Data Lake & Data Warehouse. Based on Apache Flink, Dinky provides the ability to connect many big data frameworks including OLAP and Data Lake.
Stars: ✭ 1,535 (+2416.39%)
Mutual labels:  flink
Keywords-Abstract-TFIDF-TextRank4ZH
使用tf-idf, TextRank4ZH等不同方式从中文文本中提取关键字,从中文文本中提取摘要和关键词
Stars: ✭ 26 (-57.38%)
Mutual labels:  tf-idf
topic modelling financial news
Topic modelling on financial news with Natural Language Processing
Stars: ✭ 51 (-16.39%)
Mutual labels:  tf-idf
coolplayflink
Flink: Stateful Computations over Data Streams
Stars: ✭ 14 (-77.05%)
Mutual labels:  flink
SANSA-Stack
Big Data RDF Processing and Analytics Stack built on Apache Spark and Apache Jena http://sansa-stack.github.io/SANSA-Stack/
Stars: ✭ 130 (+113.11%)
Mutual labels:  flink
Real-time-Data-Warehouse
Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi
Stars: ✭ 52 (-14.75%)
Mutual labels:  flink
TiBigData
TiDB connectors for Flink/Hive/Presto
Stars: ✭ 192 (+214.75%)
Mutual labels:  flink
Archived-SANSA-Query
SANSA Query Layer
Stars: ✭ 31 (-49.18%)
Mutual labels:  flink
parquet-flinktacular
How to use Parquet in Flink
Stars: ✭ 29 (-52.46%)
Mutual labels:  flink
review-notes
团队分享学习、复盘笔记资料共享。Java、Scala、Flink...
Stars: ✭ 27 (-55.74%)
Mutual labels:  flink
flink-deployer
A tool that help automate deployment to an Apache Flink cluster
Stars: ✭ 143 (+134.43%)
Mutual labels:  flink
html-table-extractor
extract data from html table
Stars: ✭ 74 (+21.31%)
Mutual labels:  extract-data
Content-based-Recommender-System
It is a content based recommender system that uses tf-idf and cosine similarity for N Most SImilar Items from a dataset
Stars: ✭ 64 (+4.92%)
Mutual labels:  tf-idf
flink-connectors
Apache Flink connectors for Pravega.
Stars: ✭ 84 (+37.7%)
Mutual labels:  flink
flink-client
Java library for managing Apache Flink via the Monitoring REST API
Stars: ✭ 48 (-21.31%)
Mutual labels:  flink
flink-streaming-source-analysis
flink 流处理源码分析
Stars: ✭ 47 (-22.95%)
Mutual labels:  flink
FlinkExperiments
Experiments with Apache Flink.
Stars: ✭ 3 (-95.08%)
Mutual labels:  flink
piglet
A compiler for Pig Latin to Spark and Flink.
Stars: ✭ 23 (-62.3%)
Mutual labels:  flink
fdp-modelserver
An umbrella project for multiple implementations of model serving
Stars: ✭ 47 (-22.95%)
Mutual labels:  flink
Keyword-Extracter
Problem Statement: Given a particular PDF/Text document ,How to extract keywords and arrange in order of their weightage using Python?
Stars: ✭ 17 (-72.13%)
Mutual labels:  tf-idf
KeywordExtraction
Implementation of algorithm in keyword extraction,including TextRank,TF-IDF and the combination of both
Stars: ✭ 95 (+55.74%)
Mutual labels:  tf-idf
flink-learn
Learning Flink : Flink CEP,Flink Core,Flink SQL
Stars: ✭ 70 (+14.75%)
Mutual labels:  flink
ArkSavegameToolkitNet
Library for reading ARK Survival Evolved savegame files using C#.
Stars: ✭ 19 (-68.85%)
Mutual labels:  extract-data
FlinkTutorial
FlinkTutorial 专注大数据Flink流试处理技术。从基础入门、概念、原理、实战、性能调优、源码解析等内容,使用Java开发,同时含有Scala部分核心代码。欢迎关注我的博客及github。
Stars: ✭ 46 (-24.59%)
Mutual labels:  flink
soan
Social Analysis based on Whatsapp data
Stars: ✭ 106 (+73.77%)
Mutual labels:  tf-idf
cassandra.realtime
Different ways to process data into Cassandra in realtime with technologies such as Kafka, Spark, Akka, Flink
Stars: ✭ 25 (-59.02%)
Mutual labels:  flink
fb-post-screenshot
Firefox Web Extension to save Facebook posts as images
Stars: ✭ 18 (-70.49%)
Mutual labels:  facebook-scraper
Nepali-News-Classifier
Text Classification of Nepali Language Document. This Mini Project was done for the partial fulfillment of NLP Course : COMP 473.
Stars: ✭ 13 (-78.69%)
Mutual labels:  tf-idf
1-60 of 166 similar projects