Spacy Transformers🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
Stars: ✭ 919 (+331.46%)
PytextrankPython implementation of TextRank for phrase extraction and summarization of text documents
Stars: ✭ 1,675 (+686.38%)
ArachnidPowerful web scraping framework for Crystal
Stars: ✭ 68 (-68.08%)
Spacy Stanza💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy
Stars: ✭ 508 (+138.5%)
Ner AnnotatorNamed Entity Recognition (NER) Annotation tool for SpaCy. Generates Training Data as JSON which can be readily used.
Stars: ✭ 127 (-40.38%)
Spacy Lookups Data📂 Additional lookup tables and data resources for spaCy
Stars: ✭ 48 (-77.46%)
Spacy Course👩‍🏫 Advanced NLP with spaCy: A free online course
Stars: ✭ 1,920 (+801.41%)
CltkThe Classical Language Toolkit
Stars: ✭ 650 (+205.16%)
ExcelcyExcel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG.
Stars: ✭ 89 (-58.22%)
Python nlp tutorialThis repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)
Stars: ✭ 72 (-66.2%)
Spacy💫 Industrial-strength Natural Language Processing (NLP) in Python
Stars: ✭ 21,978 (+10218.31%)
TextacyNLP, before and after spaCy
Stars: ✭ 1,849 (+768.08%)
Text Analytics With PythonLearn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Stars: ✭ 1,132 (+431.46%)
Web ScrapingDetailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, SHFE and news data crawlers on BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist
Stars: ✭ 153 (-28.17%)
Project TauroA Router WiFi key recovery/cracking tool with a twist.
Stars: ✭ 52 (-75.59%)
Cookiecutter Spacy FastapiCookiecutter API for creating Custom Skills for Azure Search using Python and Docker
Stars: ✭ 179 (-15.96%)
Stealth🚀 Stealth - Secure, Peer-to-Peer, Private and Automateable Web Browser/Scraper/Proxy
Stars: ✭ 659 (+209.39%)
Jupyterlab Prodigy🧬 A JupyterLab extension for annotating data with Prodigy
Stars: ✭ 97 (-54.46%)
MordecaiFull text geoparsing as a Python library
Stars: ✭ 579 (+171.83%)
PhpscraperPHP Scraper - a highly opinionated web-interface for PHP
Stars: ✭ 148 (-30.52%)
Mexican Government ReportText Mining on the 2019 Mexican Government Report, covering from extracting text from a PDF file to plotting the results.
Stars: ✭ 473 (+122.07%)
Spacy Graphql🤹‍♀️ Query spaCy's linguistic annotations using GraphQL
Stars: ✭ 81 (-61.97%)
100projectsofcodeA list of practical knowledge-building projects.
Stars: ✭ 1,183 (+455.4%)
Subreddit AnalyzerA comprehensive Data and Text Mining workflow for submissions and comments from any given public subreddit.
Stars: ✭ 447 (+109.86%)
Rasa💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
Stars: ✭ 13,219 (+6106.1%)
Sense2vec🦆 Contextually-keyed word vectors
Stars: ✭ 1,184 (+455.87%)
Spacy Wordnetspacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface
Stars: ✭ 156 (-26.76%)
CascadiaGo cascadia package command line CSS selector
Stars: ✭ 67 (-68.54%)
Html MetadataMetaData html scraper and parser for Node.js (supports Promises and callback style)
Stars: ✭ 129 (-39.44%)
DragonfireThe open-source virtual assistant for Ubuntu-based Linux distributions
Stars: ✭ 1,120 (+425.82%)
Displacy Ent💥 displaCy-ent.js: An open-source named entity visualiser for the modern web
Stars: ✭ 191 (-10.33%)
Scrapy CraigslistWeb Scraping Craigslist's Engineering Jobs in NY with Scrapy
Stars: ✭ 54 (-74.65%)
SoupWeb Scraper in Go, similar to BeautifulSoup
Stars: ✭ 1,685 (+691.08%)
PyinflectA Python module for word inflections designed for use with spaCy.
Stars: ✭ 52 (-75.59%)
Spacymoji💙 Emoji handling and meta data for spaCy with custom extension attributes
Stars: ✭ 151 (-29.11%)
Lambda PacksPrecompiled packages for AWS Lambda
Stars: ✭ 997 (+368.08%)
Spacy Js🎀 JavaScript API for spaCy with Python REST API
Stars: ✭ 123 (-42.25%)
ScispacyA full spaCy pipeline and models for scientific/biomedical documents.
Stars: ✭ 855 (+301.41%)
Neuralcoref✨Fast Coreference Resolution in spaCy with Neural Networks
Stars: ✭ 2,453 (+1051.64%)
Spacy Models💫 Models for the spaCy Natural Language Processing (NLP) library
Stars: ✭ 796 (+273.71%)
LemminflectA Python module for English lemmatization and inflection.
Stars: ✭ 105 (-50.7%)
SpidrA versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+207.98%)
Wheelwright🎡 Automated build repo for Python wheels and source packages
Stars: ✭ 148 (-30.52%)
Tageditor🏖TagEditor - Annotation tool for spaCy
Stars: ✭ 92 (-56.81%)
KlayersPython Packages as AWS Lambda Layers
Stars: ✭ 557 (+161.5%)
CleannlpR package providing annotators and a normalized data model for natural language processing
Stars: ✭ 174 (-18.31%)
Awesome CrawlerA collection of awesome web crawlers and spiders in different languages
Stars: ✭ 4,793 (+2150.23%)
DaftlistingsA library that enables programmatic interaction with daft.ie. Daft.ie has nationwide coverage and contains about 80% of the total available properties in Ireland.
Stars: ✭ 86 (-59.62%)
ScrappleA framework for creating semi-automatic web content extractors
Stars: ✭ 464 (+117.84%)
Detect CmsPHP Library for detecting CMS
Stars: ✭ 78 (-63.38%)
SpacyrR wrapper to spaCy NLP
Stars: ✭ 202 (-5.16%)
Thinc🔮 A refreshing functional take on deep learning, compatible with your favorite libraries
Stars: ✭ 2,422 (+1037.09%)
NegspacyspaCy pipeline object for negating concepts in text
Stars: ✭ 162 (-23.94%)
Practical Machine Learning With PythonMaster the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
Stars: ✭ 1,868 (+777%)
DframcyDataframe Integration with spaCy.
Stars: ✭ 74 (-65.26%)