All Projects → microsoft → SkillsExtractorCognitiveSearch

microsoft / SkillsExtractorCognitiveSearch

Licence: MIT License
Azure Search Cognitive Skill to extract technical and business skills from text

Programming Languages

python
139335 projects - #7 most used programming language
powershell
5483 projects
Dockerfile
14818 projects

Projects that are alternatives of or similar to SkillsExtractorCognitiveSearch

botsfromzerotohero
This repository contains all the code samples of "Developing Intelligent Bots from Zero to Hero" channel9/MVA course.
Stars: ✭ 14 (-72.55%)
Mutual labels:  azure-search
named-entity-recognition-template
Build a deep learning model for predicting the named entities from text.
Stars: ✭ 51 (+0%)
Mutual labels:  named-entity-recognition
nlp ner workshop
Named-Entity-Recognition Workshop
Stars: ✭ 15 (-70.59%)
Mutual labels:  named-entity-recognition
ner
A command-line utility for extracting names of people, places, and organizations from text on macOS.
Stars: ✭ 75 (+47.06%)
Mutual labels:  named-entity-recognition
mitie-ruby
Named-entity recognition for Ruby
Stars: ✭ 77 (+50.98%)
Mutual labels:  named-entity-recognition
react-taggy
A simple zero-dependency React component for tagging user-defined entities within a block of text.
Stars: ✭ 29 (-43.14%)
Mutual labels:  named-entity-recognition
AlpacaTag
AlpacaTag: An Active Learning-based Crowd Annotation Framework for Sequence Tagging (ACL 2019 Demo)
Stars: ✭ 126 (+147.06%)
Mutual labels:  named-entity-recognition
LNEx
📍 🏢 🏦 🏣 🏪 🏬 LNEx: Location Name Extractor
Stars: ✭ 21 (-58.82%)
Mutual labels:  named-entity-recognition
bern
A neural named entity recognition and multi-type normalization tool for biomedical text mining
Stars: ✭ 151 (+196.08%)
Mutual labels:  named-entity-recognition
IE Paper Notes
Paper notes for Information Extraction, including Relation Extraction (RE), Named Entity Recognition (NER), Entity Linking (EL), Event Extraction (EE), Named Entity Disambiguation (NED).
Stars: ✭ 14 (-72.55%)
Mutual labels:  named-entity-recognition
eve-bot
EVE bot, a customer service chatbot to enhance virtual engagement for Twitter Apple Support
Stars: ✭ 31 (-39.22%)
Mutual labels:  named-entity-recognition
AspNetCoreAzureSearch
ASP.NET Core with Azure Cognitive Search
Stars: ✭ 12 (-76.47%)
Mutual labels:  azure-search
NER corpus chinese
NER(命名实体识别)中文语料,一站式获取
Stars: ✭ 102 (+100%)
Mutual labels:  named-entity-recognition
ner-d
Python module for Named Entity Recognition (NER) using natural language processing.
Stars: ✭ 14 (-72.55%)
Mutual labels:  named-entity-recognition
huner
Named Entity Recognition for biomedical entities
Stars: ✭ 44 (-13.73%)
Mutual labels:  named-entity-recognition
BERT-NER
Using pre-trained BERT models for Chinese and English NER with 🤗Transformers
Stars: ✭ 114 (+123.53%)
Mutual labels:  named-entity-recognition
watchman
Watchman: An open-source social-media event-detection system
Stars: ✭ 18 (-64.71%)
Mutual labels:  named-entity-recognition
CLNER
[ACL-IJCNLP 2021] Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning
Stars: ✭ 50 (-1.96%)
Mutual labels:  named-entity-recognition
clinical concept extraction
Clinical Concept Extraction with Contextual Word Embedding
Stars: ✭ 34 (-33.33%)
Mutual labels:  named-entity-recognition
TorchBlocks
A PyTorch-based toolkit for natural language processing
Stars: ✭ 85 (+66.67%)
Mutual labels:  named-entity-recognition

Deprecation Notice

We've launched a better version of this service with Azure Cognitive Serivces - Text Analytics in the new V3 of the Named Entity Recognition (NER) endpoint.

NER V3 Information

You can read more about this work and how to use it here:

(NEW) Custom Entity Lookup Skill

Azure Cognitive Search recently introduced a new built-in Cognitive Skill that does essentially what this repository does. You provide a dictionary of terms you want to match and it will extract those for you from any text field in your search index. You can read more about that here: https://docs.microsoft.com/en-us/azure/search/cognitive-search-skill-custom-entity-lookup

Custom Cognitive Skill with Containers

This repo is no longer supported but you're free to use the index and skill definitions provided to enable the personalized job recommendations scenario. If you are just looking to deploy a container as a custom skill, I highly recommend utilizing this more generic cookiecutter repository: https://github.com/microsoft/cookiecutter-spacy-fastapi. We will continue to support this project. For more information on deploying Containers on Azure see:


Introduction

The Skills Extractor is a Named Entity Recognition (NER) model that takes text as input, extracts skill entities from that text, then matches these skills to a knowledge base (in this sample a simple JSON file) containing metadata on each skill. It then returns a flat list of the skills identified.

Definitions

What is a Cognitive Skill?

A Cognitive Skill is a Feature of Azure Search designed to Augment data in a search index.

What is a Skill in terms of the Skills Extractor?

A Skill is a Technical Concept/Tool or a Business related/Personal attribute.

Example skills: Machine Learning, Artificial Intelligence, PyTorch, Business, Advertising

For the current goals of the service, we are focused on technical skills. Technical skills are the abilities and knowledge needed to perform specific tasks. They are practical, and often relate to mechanical, information technology, mathematical, or scientific tasks. The Taxonomies the API pulls from primarily consist of concepts and tools related to technology. For example, Programming Languages are considered a higher-level technical skill, and C# or Python are a sub of that larger skill.

P.S. Sorry for the confusing naming.

Skill Sources

We pull skills and technologies from many open online sources and build Record Linkage models to conflate skills and categories across each source into a single Knowledge Graph.

Here is a list of our sources:

Use Cases

The original idea stemmed from a few organizational needs. Here are a few:

  • Determine the skills required for a job opening at your company and match applicant resumes based on skills.

  • Extract skills from Learning Content that your company creates to improve search and recommendations.

  • Identify the technical and professional skills of your team or organization and work to close skill gaps.

Prerequisites

Before running this sample, you must have the following:

  • Install the Azure CLI. This article requires the Azure CLI version 2.0 or later. Run az --version to find the version you have.
    You can also use the Azure Cloud Shell.

Dependencies

Quickstart: Extract Skills for your data in Azure Search using a Custom Cognitive Skill

If you're unfamiliar with Azure Search Cognitive Skills you can read more about them here: https://docs.microsoft.com/en-us/azure/search/cognitive-search-concept-intro

Follow one of the scenarios below

  1. Extract Skills from an Existing Search Index
  2. Use the sample Search Scenario of extracting Skills from Jobs and Resumes

Create your own Custom Cognitive Skill

If you would like to create your own Custom Skill leveraging the NLP power of the Python Ecosystem you can use this cookiecutter project to bootstrap a containerized API to deploy in your own infrastructure.

https://github.com/Microsoft/cookiecutter-azure-search-cognitive-skill

Feedback & Support

We’re eager to improve, so please take a couple of minutes to answer some questions about your experience https://aka.ms/AA4xoy5. For support, please contact: [email protected]

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.microsoft.com.

When you submit a pull request, a CLA-bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., label, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].