All Projects → UglyToad → PragmaticSegmenterNet

UglyToad / PragmaticSegmenterNet

Licence: other
Port of PragmaticSegmenter for sentence boundary detection

Programming Languages

C#
18002 projects
HTML
75241 projects

Projects that are alternatives of or similar to PragmaticSegmenterNet

ImcSegmentationPipeline
A pixel classification based multiplexed image segmentation pipeline
Stars: ✭ 62 (+148%)
Mutual labels:  segmentation
cellpose-napari
napari plugin for cellpose (see www.cellpose.org) - an anatomical segmentation tool
Stars: ✭ 30 (+20%)
Mutual labels:  segmentation
axondeepseg
Axon/Myelin segmentation using Deep Learning
Stars: ✭ 102 (+308%)
Mutual labels:  segmentation
Brain-Segmentation
Brain Segmentation on MRBrains18
Stars: ✭ 37 (+48%)
Mutual labels:  segmentation
mutex-watershed
The mutex watershed for image segmentation.
Stars: ✭ 54 (+116%)
Mutual labels:  segmentation
deepseg
Chinese word segmentation in tensorflow 2.x
Stars: ✭ 23 (-8%)
Mutual labels:  segmentation
airs
Road Segmentation in Satellite Aerial Images
Stars: ✭ 51 (+104%)
Mutual labels:  segmentation
keras-semantic-segmentation-example
Example of semantic segmentation in Keras
Stars: ✭ 53 (+112%)
Mutual labels:  segmentation
Skin Lesion Detection Deep Learning
Skin lesion detection from dermoscopic images using Convolutional Neural Networks
Stars: ✭ 48 (+92%)
Mutual labels:  segmentation
navis
Python 3 library for analysis of neuroanatomical data
Stars: ✭ 68 (+172%)
Mutual labels:  segmentation
root painter
RootPainter: Deep Learning Segmentation of Biological Images with Corrective Annotation
Stars: ✭ 28 (+12%)
Mutual labels:  segmentation
maskSLIC
Simple linear iterative clustering (SLIC) in a region of interest (ROI)
Stars: ✭ 28 (+12%)
Mutual labels:  segmentation
segmentation-enhanced-resunet
Urban building extraction in Daejeon region using Modified Residual U-Net (Modified ResUnet) and applying post-processing.
Stars: ✭ 34 (+36%)
Mutual labels:  segmentation
UNI-EM
A unified environment for DNN-based automated segmentation of neuronal EM images
Stars: ✭ 33 (+32%)
Mutual labels:  segmentation
wink-nlp
Developer friendly Natural Language Processing ✨
Stars: ✭ 312 (+1148%)
Mutual labels:  sentence-boundary-detection
probabilistic nlg
Tensorflow Implementation of Stochastic Wasserstein Autoencoder for Probabilistic Sentence Generation (NAACL 2019).
Stars: ✭ 28 (+12%)
Mutual labels:  sentence
crowd density segmentation
The code for preparing the training data for crowd counting / segmentation algorithm.
Stars: ✭ 21 (-16%)
Mutual labels:  segmentation
image segmentation dl
🍞 基于深度学习方法的图像分割(含语义分割、实例分割、全景分割)。
Stars: ✭ 76 (+204%)
Mutual labels:  segmentation
Active-Contour-Model-Matlab
Some matlab code of Active Contour Model for image segmentation
Stars: ✭ 44 (+76%)
Mutual labels:  segmentation
Brainy
Brainy is a virtual MRI analyzer. Just upload the MRI scan file and get 3 different classes of tumors detected and segmented. In Beta.
Stars: ✭ 29 (+16%)
Mutual labels:  segmentation

PragmaticSegmenterNet

Build status

This project is a direct port of Pragmatic Segmenter which provides rule-based sentence boundary detection.

Usage

The Segmenter class provides the Segment method which in the simplest usage takes a string:

using PragmaticSegmenterNet;

IReadOnlyList<string> result = Segmenter.Segment("One Sentence. And another sentence.");

// ["One Sentence.", "And another sentence."]

IReadOnlyList<string> result2 = Segmenter.Segment("Anything.", Language.Italian);

// ["Anything"]

The Segment method has a number of optional parameters:

IReadOnlyList<string> Segment(string text, Language language = Language.English, bool cleanText = true, DocumentType documentType = DocumentType.Any)
  • Language - An enum representing the supported languages, the default is English, see the supported languages list below for the list of currently supported languages.
  • CleanText - A boolean indicating whether the input text should be cleaned prior to segmentation. Cleaning removes extra newlines and whitespace. Defaults to true.
  • DocumentType - Used by the text cleaning process to determine which reformatting to apply. For PDFs this handles newlines in the middle of a sentence whereas for HTML documents this will handle HMTL tags. Defaults to any which does not apply any special formatting.

Languages

  • English = 0 (default)
  • Amharic = 1
  • Arabic = 2
  • Armenian = 3
  • Bulgarian = 4
  • Burmese = 5
  • Chinese = 6
  • Danish = 7
  • Dutch = 8
  • French = 9
  • German = 10
  • Greek = 11
  • Hindi = 12
  • Italian = 13
  • Japanese = 14
  • Kazakh = 15 (partial support, potentially only for the Cyrillic form of the alphabet)
  • Persian = 16
  • Polish = 17
  • Russian = 18
  • Spanish = 19
  • Urdu = 20

Credit

This project wouldn't be possible without the work done by Pragmatic Segmenter team.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].