Hrnet Semantic SegmentationThe OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919
Stars: ✭ 2,369 (+411.66%)
Setr PytorchRethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
Stars: ✭ 96 (-79.27%)
HRFormerThis is an official implementation of our NeurIPS 2021 paper "HRFormer: High-Resolution Transformer for Dense Prediction".
Stars: ✭ 357 (-22.89%)
Pvt Stars: ✭ 379 (-18.14%)
Medical TransformerPytorch Code for "Medical Transformer: Gated Axial-Attention for Medical Image Segmentation"
Stars: ✭ 153 (-66.95%)
U-Net-SatelliteRoad Detection from satellite images using U-Net.
Stars: ✭ 38 (-91.79%)
LEDNetThis is an unofficial implemention of LEDNet https://arxiv.org/abs/1905.02423
Stars: ✭ 37 (-92.01%)
enformer-pytorchImplementation of Enformer, Deepmind's attention network for predicting gene expression, in Pytorch
Stars: ✭ 146 (-68.47%)
MONAILabelMONAI Label is an intelligent open source image labeling and learning tool.
Stars: ✭ 249 (-46.22%)
DSegInvariant Superpixel Features for Object Detection
Stars: ✭ 18 (-96.11%)
UCTransNetImplementation of our AAAI'22 work: 'UCTransNet: Rethinking the Skip Connections in U-Net from a Channel-wise Perspective with Transformer'.
Stars: ✭ 132 (-71.49%)
FragmentVCAny-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
Stars: ✭ 134 (-71.06%)
php-halHAL+JSON & HAL+XML API transformer outputting valid (PSR-7) API Responses.
Stars: ✭ 30 (-93.52%)
transformer-sltSign Language Translation with Transformers (COLING'2020, ECCV'20 SLRTP Workshop)
Stars: ✭ 92 (-80.13%)
RSTNetRSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words (CVPR 2021)
Stars: ✭ 71 (-84.67%)
CAPECylinder and Plane Extraction from Depth Cameras
Stars: ✭ 107 (-76.89%)
Restormer[CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.
Stars: ✭ 586 (+26.57%)
pedxPython tools for working with PedX dataset.
Stars: ✭ 26 (-94.38%)
CarND-Detect-Lane-Lines-And-VehiclesUse segmentation networks to recognize lane lines and vehicles. Infer position and curvature of lane lines relative to self.
Stars: ✭ 66 (-85.75%)
LightNetLightNet: Light-weight Networks for Semantic Image Segmentation (Cityscapes and Mapillary Vistas Dataset)
Stars: ✭ 710 (+53.35%)
DigiPathAIDigital Pathology AI
Stars: ✭ 43 (-90.71%)
EntityEntitySeg Toolbox: Towards Open-World and High-Quality Image Segmentation
Stars: ✭ 313 (-32.4%)
Walk-TransformerFrom Random Walks to Transformer for Learning Node Embeddings (ECML-PKDD 2020) (In Pytorch and Tensorflow)
Stars: ✭ 26 (-94.38%)
MinTLMinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems
Stars: ✭ 61 (-86.83%)
Image-CaptionUsing LSTM or Transformer to solve Image Captioning in Pytorch
Stars: ✭ 36 (-92.22%)
mobilenet segmentationBinary semantic segmentation with UNet based on MobileNetV2 encoder
Stars: ✭ 18 (-96.11%)
volkscvA Python toolbox for computer vision research and project
Stars: ✭ 58 (-87.47%)
TadTREnd-to-end Temporal Action Detection with Transformer. [Under review for a journal publication]
Stars: ✭ 55 (-88.12%)
LaTeX-OCRpix2tex: Using a ViT to convert images of equations into LaTeX code.
Stars: ✭ 1,566 (+238.23%)
golgothaContextualised Embeddings and Language Modelling using BERT and Friends using R
Stars: ✭ 39 (-91.58%)
Dynamic ORB SLAM2Visual SLAM system that can identify and exclude dynamic objects.
Stars: ✭ 89 (-80.78%)
transformerNeutron: A pytorch based implementation of Transformer and its variants.
Stars: ✭ 60 (-87.04%)
Vision-Language-TransformerVision-Language Transformer and Query Generation for Referring Segmentation (ICCV 2021)
Stars: ✭ 127 (-72.57%)
TS-CAMCodes for TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization.
Stars: ✭ 96 (-79.27%)
EmbeddingEmbedding模型代码和学习笔记总结
Stars: ✭ 25 (-94.6%)
text2keywordsTrained T5 and T5-large model for creating keywords from text
Stars: ✭ 53 (-88.55%)
Lyrics-to-Audio-AlignmentAligns text (lyrics) with monophonic singing voice (audio). The algorithm uses structural segmentation to segment the audio into structures and then uses hidden markov models to obtain alignment within segments. The final alignment is concatenation of time stamps of lyrics within the segments for each song.
Stars: ✭ 57 (-87.69%)
M3DETRCode base for M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers
Stars: ✭ 47 (-89.85%)
OverlapPredator[CVPR 2021, Oral] PREDATOR: Registration of 3D Point Clouds with Low Overlap.
Stars: ✭ 293 (-36.72%)
visualizationa collection of visualization function
Stars: ✭ 189 (-59.18%)
uoaisCodes of paper "Unseen Object Amodal Instance Segmentation via Hierarchical Occlusion Modeling", ICRA 2022
Stars: ✭ 77 (-83.37%)
Learning-Lab-C-LibraryThis library provides a set of basic functions for different type of deep learning (and other) algorithms in C.This deep learning library will be constantly updated
Stars: ✭ 20 (-95.68%)
deformer[ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering
Stars: ✭ 111 (-76.03%)
pcanPrototypical Cross-Attention Networks for Multiple Object Tracking and Segmentation, NeurIPS 2021 Spotlight
Stars: ✭ 294 (-36.5%)
torch-points3dPytorch framework for doing deep learning on point clouds.
Stars: ✭ 1,823 (+293.74%)