microsoft / Recognizers Text

License: MIT
Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, and date/time expressed in multiple languages (full support for ZH, EN, FR, ES, PT, DE, IT, TR, HI; partial support for NL, JA, KO, SV). Contributions are greatly welcome! Packages are available at https://www.nuget.org/profiles/Recognizers.Text and https://www.npmjs.com/~recognizers.text

Projects that are alternatives to or similar to Recognizers Text

extractacy
Spacy pipeline object for extracting values that correspond to a named entity (e.g., birth dates, account numbers, laboratory results)
Stars: ✭ 47 (-94.86%)
Mutual labels:  ner, entity-extraction
Ner Bert Pytorch
PyTorch solution for the named entity recognition task using Google AI's pre-trained BERT model.
Stars: ✭ 249 (-72.79%)
Mutual labels:  ner, entity-extraction
Nlp.js
An NLP library for building bots, with entity extraction, sentiment analysis, automatic language identification, and much more
Stars: ✭ 4,670 (+410.38%)
Mutual labels:  hacktoberfest, entity-extraction
Libfaketime
libfaketime modifies the system time for a single application
Stars: ✭ 1,932 (+111.15%)
Mutual labels:  hacktoberfest, datetime
scikitcrf NER
Python library for custom entity recognition using Sklearn CRF
Stars: ✭ 17 (-98.14%)
Mutual labels:  ner, entity-extraction
Date Io
Abstraction over common javascript date management libraries
Stars: ✭ 382 (-58.25%)
Mutual labels:  hacktoberfest, datetime
Entity Recognition Datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Stars: ✭ 891 (-2.62%)
Mutual labels:  ner, entity-extraction
Grafana Sunandmoon Datasource
SunAndMoon is a datasource plugin for Grafana that calculates the position of Sun and Moon as well as the Moon illumination.
Stars: ✭ 20 (-97.81%)
Mutual labels:  hacktoberfest
Event Bot
📣 Discord Bot to make announcements about upcoming sessions for the Fellows using Google Calendar and Calendly
Stars: ✭ 21 (-97.7%)
Mutual labels:  hacktoberfest
Cdc With Pact
Stars: ✭ 20 (-97.81%)
Mutual labels:  hacktoberfest
Node Openid Client
OpenID Certified™ Relying Party (OpenID Connect/OAuth 2.0 Client) implementation for Node.js.
Stars: ✭ 887 (-3.06%)
Mutual labels:  hacktoberfest
Ad Bs Converter
A javascript implementation to convert bikram samvat to anno domini and vice-versa
Stars: ✭ 20 (-97.81%)
Mutual labels:  hacktoberfest
App Cc
Aplicativo móvel para uso de alunos, professores e técnicos do curso de Ciência da Computação da UFFS
Stars: ✭ 21 (-97.7%)
Mutual labels:  hacktoberfest
Wingpanel Indicator Nightlight
A Wingpanel indicator for Night Light
Stars: ✭ 20 (-97.81%)
Mutual labels:  hacktoberfest
React Weather App
⛅️ A weather app built with React.
Stars: ✭ 21 (-97.7%)
Mutual labels:  hacktoberfest
Coursera Financialaid
Stars: ✭ 20 (-97.81%)
Mutual labels:  hacktoberfest
Magma
Platform for building access networks and modular network services
Stars: ✭ 913 (-0.22%)
Mutual labels:  hacktoberfest
Kamekazi
Self destructing messages
Stars: ✭ 21 (-97.7%)
Mutual labels:  hacktoberfest
Lab
Lab wraps Git or Hub, making it simple to clone, fork, and interact with repositories on GitLab
Stars: ✭ 911 (-0.44%)
Mutual labels:  hacktoberfest
Socli
Stack overflow command line client. Search and browse stack overflow without leaving the terminal 💻
Stars: ✭ 911 (-0.44%)
Mutual labels:  hacktoberfest

Microsoft Recognizers Text Overview

Microsoft.Recognizers.Text provides robust recognition and resolution of entities such as numbers, units, and date/time, expressed in multiple languages. Full support is available for Chinese, English, French, Spanish, Portuguese, German, Italian, Turkish, and Hindi, with partial support for Dutch, Japanese, Korean, and Swedish. More are on the way.
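
To make the distinction between recognition and resolution concrete, here is a minimal, self-contained Python sketch. It does not use the Microsoft.Recognizers.Text packages and does not reflect their implementation; the function name `recognize_numbers`, the toy `NUMBER_WORDS` vocabulary, and the shape of the result dictionaries are assumptions made purely for illustration. Recognition finds the span of text, and resolution maps that span to a normalized value.

```python
import re
from typing import Dict, List

# Hypothetical toy vocabulary (an assumption for this sketch, English only).
NUMBER_WORDS: Dict[str, int] = {
    "zero": 0, "one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
    "six": 6, "seven": 7, "eight": 8, "nine": 9, "ten": 10,
}

# Matches either digits (with an optional decimal part) or one of the number words.
TOKEN_RE = re.compile(
    r"\b(?:\d+(?:\.\d+)?|" + "|".join(NUMBER_WORDS) + r")\b",
    re.IGNORECASE,
)


def recognize_numbers(text: str) -> List[dict]:
    """Return one result per number found: the matched span plus a resolved value."""
    results = []
    for match in TOKEN_RE.finditer(text):
        token = match.group()
        # Resolution step: number words become digits, digit strings pass through.
        word_value = NUMBER_WORDS.get(token.lower())
        value = str(word_value) if word_value is not None else token
        results.append({
            "text": token,            # recognition: what was matched
            "start": match.start(),   # recognition: where it starts
            "end": match.end(),       # recognition: where it ends (exclusive)
            "resolution": {"value": value},
        })
    return results


if __name__ == "__main__":
    for result in recognize_numbers("I have two apples and 3.5 pears"):
        print(result)
```

A real recognizer has to handle compound numbers, ordinals, units, and many languages, which is why the per-language coverage varies.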

Utilizing the Project

Microsoft.Recognizers.Text powers pre-built entities in both LUIS (Language Understanding Intelligent Service) and Microsoft Bot Framework, as well as base entity types in Text Analytics Cognitive Service. It is also available as standalone packages (for the base classes and the different entity recognizers).

The Microsoft.Recognizers.Text packages currently target four platforms:

Contributions are greatly welcome, both fixes and extensions for the currently supported languages and expansion to new ones, especially Dutch, Japanese, Korean, Hindi, and others. More info below.

Help

If you have any questions, please go ahead and open an issue, even if it's not an actual bug. Issues are an acceptable discussion forum as well.

Contributing

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

Good starting points for contribution are:

  • the list of open issues (especially those marked as help wanted);
  • the JSON spec cases temporarily marked as NotSupported (Specs); and
  • translating JSON test spec cases that work in English, but don't yet exist in a target language.

The links below describe the project structure and provide both an overview and tips on how to contribute (although some steps may have become a little out-of-date). Thank you!

Supported Entities across Cultures

The table below summarizes the currently supported entities. Support for English is usually more complete than for other languages. The primary platform is .NET (shown in the table), and support should propagate to the others.

Entity Type EN ZH-CN NL FR DE IT JA KO PT ES
Number (cardinal)
Ordinal PA/EO
Percentage PA/EO
Number Range PA/EO PA/EO
Unit - Age SO
Unit - Currency SP
Unit - Dimensions SP
Unit - Temperature SP
Choice - Boolean SO
Seq. - E-mail G G* G G G G G* G* G G
Seq. - GUID G G G G G G G G G G
Seq. - Social G G G G G G G G G G
Seq. - IP Address G G G G G G G G G G
Seq. - Phone Number G G G G G G G G G G
Seq. - URL G G* G G G G G* G* G G
DateTime (+subtypes) PA SP SP
Entity Type SV BG TR HI AR
Number (cardinal) PA/EO
Ordinal PA/EO
Percentage PA/EO
Number Range PA/EO
Unit - Age
Unit - Currency
Unit - Dimensions
Unit - Temperature
Choice - Boolean
Seq. - E-mail G G G G G
Seq. - GUID G G G G G
Seq. - Social G G G G G
Seq. - IP Address G G G G G
Seq. - Phone Number
Seq. - URL G G G G* G*
DateTime (+subtypes)
  • G: Generic entity, not language-specific (* Unicode TLDs not supported);
  • EO: Extraction-only (parsing/resolution/normalization pending);
  • PA: Partial support (type not fully supported);
  • SO: Specs-only (test specs coverage OK, but support pending);
  • SP: Partial specs;
  • SI: Very initial specs (typically language support start for a new language).
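
As a rough illustration of why the "G" entities need no per-language work, the sketch below extracts GUIDs and IPv4 addresses with plain regular expressions. It uses only the Python standard library; the patterns, the `Extraction` class, and the `extract_generic` function are hypothetical and much simpler than whatever the project itself uses. The same patterns match regardless of the language of the surrounding sentence.

```python
import re
from dataclasses import dataclass
from typing import List

# Hypothetical patterns for illustration only (assumptions, not the project's own).
GUID_RE = re.compile(
    r"\b[0-9a-fA-F]{8}-[0-9a-fA-F]{4}-[0-9a-fA-F]{4}-[0-9a-fA-F]{4}-[0-9a-fA-F]{12}\b"
)
IPV4_RE = re.compile(
    r"\b(?:(?:25[0-5]|2[0-4]\d|1?\d?\d)\.){3}(?:25[0-5]|2[0-4]\d|1?\d?\d)\b"
)


@dataclass
class Extraction:
    type_name: str  # "guid" or "ip"
    text: str       # the matched substring
    start: int      # start offset in the input
    end: int        # end offset in the input (exclusive)


def extract_generic(text: str) -> List[Extraction]:
    """Find GUIDs and IPv4 addresses; no language-specific resources are consulted."""
    results = []
    for type_name, pattern in (("guid", GUID_RE), ("ip", IPV4_RE)):
        for match in pattern.finditer(text):
            results.append(
                Extraction(type_name, match.group(), match.start(), match.end())
            )
    # Report results in the order they appear in the text.
    return sorted(results, key=lambda r: r.start)


if __name__ == "__main__":
    english = "Server 10.0.0.1 has id 123e4567-e89b-12d3-a456-426614174000"
    french = "Le serveur 10.0.0.1 a l'identifiant 123e4567-e89b-12d3-a456-426614174000"
    for sentence in (english, french):
        print([(r.type_name, r.text) for r in extract_generic(sentence)])
```

Entities such as numbers, units, and date/time do not work this way, since their surface forms differ per language, which is where the PA, EO, SO, SP, and SI markers come in.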