acl19 subtaggerCode for ACL '19 paper: Towards Improving Neural Named Entity Recognition with Gazetteers
Stars: ✭ 33 (-29.79%)
Awesome Persian Nlp IrCurated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
Stars: ✭ 460 (+878.72%)
scikitcrf NERPython library for custom entity recognition using Sklearn CRF
Stars: ✭ 17 (-63.83%)
react-taggyA simple zero-dependency React component for tagging user-defined entities within a block of text.
Stars: ✭ 29 (-38.3%)
Dan Jurafsky Chris Manning NlpMy solutions to the Natural Language Processing course taught by Dan Jurafsky and Chris Manning in Winter 2012.
Stars: ✭ 124 (+163.83%)
PersianNERNamed-Entity Recognition in Persian Language
Stars: ✭ 48 (+2.13%)
BM25Transformer(Python) transform a document-term matrix to an Okapi/BM25 representation
Stars: ✭ 50 (+6.38%)
diskA distributed network disk system built on Hadoop + HBase + Spring Boot
Stars: ✭ 53 (+12.77%)
LogAnalyzeHelperData-cleaning program for a forum log analysis system (includes an IP rule library, UDF development, MapReduce programs, and log data)
Stars: ✭ 33 (-29.79%)
CrossNERCrossNER: Evaluating Cross-Domain Named Entity Recognition (AAAI-2021)
Stars: ✭ 87 (+85.11%)
DRhardSIGIR'21: Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track.
Stars: ✭ 93 (+97.87%)
pyspark-ML-in-ColabPyspark in Google Colab: A simple machine learning (Linear Regression) model
Stars: ✭ 32 (-31.91%)
anonymisationAnonymization of legal cases (Fr) based on Flair embeddings
Stars: ✭ 85 (+80.85%)
tutorialsA tutorial series by Preferred.AI
Stars: ✭ 136 (+189.36%)
learning-sparkTidy up Spark and Hadoop tutorials.
Stars: ✭ 28 (-40.43%)
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-17.02%)
arabic-taggerAQMAR Arabic Tagger: Sequence tagger with cost-augmented structured perceptron training
Stars: ✭ 38 (-19.15%)
skeinA tool and library for easily deploying applications on Apache YARN
Stars: ✭ 128 (+172.34%)
dockerfilesMulti docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )
Stars: ✭ 29 (-38.3%)
hyperdraftTurn your notes into a website.
Stars: ✭ 59 (+25.53%)
the-apache-ignite-bookAll code samples, scripts and more in-depth examples for The Apache Ignite Book. Include Apache Ignite 2.6 or above
Stars: ✭ 65 (+38.3%)
disqA library for manipulating bioinformatics sequencing formats in Apache Spark
Stars: ✭ 29 (-38.3%)
anonymization-apiHow to build and deploy an anonymization API with FastAPI
Stars: ✭ 51 (+8.51%)
corcAn ORC File Scheme for the Cascading data processing platform.
Stars: ✭ 14 (-70.21%)
IP-TrackerTrack any IP address with IP-Tracker. IP-Tracker is developed for Linux and Termux. You can retrieve information about any IP address using IP-Tracker.
Stars: ✭ 53 (+12.77%)
almanacsA recipe for everything 🗒️
Stars: ✭ 47 (+0%)
oci-clouderaTerraform module to deploy Cloudera on Oracle Cloud Infrastructure (OCI)
Stars: ✭ 20 (-57.45%)
nlp-cheat-sheet-pythonNLP Cheat Sheet, Python, spacy, LexNPL, NLTK, tokenization, stemming, sentence detection, named entity recognition
Stars: ✭ 69 (+46.81%)
rastercuberastercube is a python library for big data analysis of georeferenced time series data (e.g. MODIS NDVI)
Stars: ✭ 15 (-68.09%)
banglabertThis repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accepted in Findings of the Annual Conference of the North American Chap…
Stars: ✭ 186 (+295.74%)
deepnlpNLP practice projects from when I was younger
Stars: ✭ 11 (-76.6%)
big-data-exploration[Archive] Intern project - Big Data Exploration using MongoDB - This Repository is NOT a supported MongoDB product
Stars: ✭ 43 (-8.51%)
entity spell systemAn entity and spell system C++ Godot engine module, for complex (optionally multiplayer) RPGs.
Stars: ✭ 86 (+82.98%)
kleros-api-DEPRECATEDA JavaScript library that makes it easy to build relayers and other DApps that use the Kleros protocol. DEPRECATED: use https://github.com/kleros/archon for interfacing with standard arbitration contracts.
Stars: ✭ 20 (-57.45%)
molminerPython library and command-line tool for extracting compounds from scientific literature. Written in Python.
Stars: ✭ 38 (-19.15%)
xxhadoopData analysis using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. These are my daily notes, code, and demos. Don't fork, just star!
Stars: ✭ 37 (-21.28%)
typeorm-factoriesCreate factories for your TypeORM entities. Useful for NestJS applications
Stars: ✭ 43 (-8.51%)
srctools for fast reading of docs
Stars: ✭ 40 (-14.89%)
allsummarizerMultilingual automatic text summarizer using statistical approach and extraction
Stars: ✭ 28 (-40.43%)
HARCode for WWW2019 paper "A Hierarchical Attention Retrieval Model for Healthcare Question Answering"
Stars: ✭ 22 (-53.19%)
nalcosSearch Git commits in natural language
Stars: ✭ 50 (+6.38%)
languaA suite of language tools
Stars: ✭ 29 (-38.3%)
ml4irMachine Learning for Information Retrieval
Stars: ✭ 75 (+59.57%)
kexKex is a python library for unsupervised keyword extraction from a document, providing an easy interface and benchmarks on 15 public datasets.
Stars: ✭ 46 (-2.13%)
Few-NERDCode and data of ACL 2021 paper "Few-NERD: A Few-shot Named Entity Recognition Dataset"
Stars: ✭ 317 (+574.47%)
cs6101The Web IR / NLP Group (WING)'s public reading group at the National University of Singapore.
Stars: ✭ 17 (-63.83%)
CogIECogIE: An Information Extraction Toolkit for Bridging Text and CogNet. ACL 2021
Stars: ✭ 47 (+0%)