picovoiceThe end-to-end platform for building voice products at scale
Stars: ✭ 316 (+454.39%)
sepia-stt-serverSEPIA server to support open-source speech recognition via WebSocket connection.
Stars: ✭ 45 (-21.05%)
Paper-NotesPaper notes in deep learning/machine learning and computer vision
Stars: ✭ 37 (-35.09%)
RobustnessCorruption and Perturbation Robustness (ICLR 2019)
Stars: ✭ 463 (+712.28%)
web-voice-processorA library for real-time voice processing in web browsers
Stars: ✭ 69 (+21.05%)
TensorlayerDeep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥
Stars: ✭ 6,796 (+11822.81%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-8.77%)
cepCEP is a software platform designed for users that want to learn or rapidly prototype using standard A.I. components.
Stars: ✭ 140 (+145.61%)
Jetson InferenceHello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
Stars: ✭ 5,191 (+9007.02%)
specAugmentTensor2tensor experiment with SpecAugment
Stars: ✭ 46 (-19.3%)
SymbolSymbol .net library
Stars: ✭ 14 (-75.44%)
Divide And Co Training[Paper 2020] Towards Better Accuracy-efficiency Trade-offs: Divide and Co-training. Plus an image classification toolbox that includes ResNet, Wide-ResNet, ResNeXt, ResNeSt, ResNeXSt, SENet, Shake-Shake, DenseNet, PyramidNet, and EfficientNet.
Stars: ✭ 54 (-5.26%)
py-faster-rcnn-imagenetTrain Faster R-CNN on the ImageNet dataset; related blog post: https://andrewliao11.github.io/object/detection/2016/07/23/detection/
Stars: ✭ 133 (+133.33%)
BAKESelf-distillation with Batch Knowledge Ensembling Improves ImageNet Classification
Stars: ✭ 79 (+38.6%)
Ssd.pytorchA PyTorch Implementation of Single Shot MultiBox Detector
Stars: ✭ 4,499 (+7792.98%)
torchsubbandPyTorch implementation of subband decomposition
Stars: ✭ 63 (+10.53%)
kosrKorean speech recognition based on the Transformer
Stars: ✭ 25 (-56.14%)
ASR-Audio-Data-LinksA list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+214.04%)
EspressoEspresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (+1317.54%)
cozmo-tensorflow🤖 Cozmo the Robot recognizes objects with TensorFlow
Stars: ✭ 61 (+7.02%)
CvTThis is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
Stars: ✭ 262 (+359.65%)
visualsearchVisual Search is a little app to find and cluster similar images using Tagbox
Stars: ✭ 31 (-45.61%)
MsdnetMulti-Scale Dense Networks for Resource Efficient Image Classification (ICLR 2018 Oral)
Stars: ✭ 443 (+677.19%)
wav2vec2-liveLive speech recognition using Facebook's wav2vec 2.0 model.
Stars: ✭ 205 (+259.65%)
TriangleGANTriangleGAN, ACM MM 2019.
Stars: ✭ 28 (-50.88%)
sharpmaskTensorFlow implementation of DeepMask and SharpMask
Stars: ✭ 31 (-45.61%)
Constrained attention filter(ECCV 2020) Tensorflow implementation of A Generic Visualization Approach for Convolutional Neural Networks
Stars: ✭ 36 (-36.84%)
good-speech-web-clientPractice your speech level in any language using speech recognition
Stars: ✭ 26 (-54.39%)
obviA Polymer 3+ webcomponent / button for doing speech recognition
Stars: ✭ 54 (-5.26%)
Class Balanced LossClass-Balanced Loss Based on Effective Number of Samples. CVPR 2019
Stars: ✭ 433 (+659.65%)
TF-Speech-Recognition-Challenge-SolutionSource code of the model used in the TensorFlow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in the top 5% on the private leaderboard.
Stars: ✭ 58 (+1.75%)
aws-rekognitionA Laravel Package/Facade for the AWS Rekognition API
Stars: ✭ 20 (-64.91%)
Pytorch image classificationPyTorch implementation of image classification models for CIFAR-10/CIFAR-100/MNIST/FashionMNIST/Kuzushiji-MNIST/ImageNet
Stars: ✭ 795 (+1294.74%)
object-flaw-detector-cppDetect various irregularities of a product as it moves along a conveyor belt.
Stars: ✭ 19 (-66.67%)
GFNet[NeurIPS 2021] Global Filter Networks for Image Classification
Stars: ✭ 199 (+249.12%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-63.16%)
Asrt speechrecognitionA Deep-Learning-Based Chinese Speech Recognition System
Stars: ✭ 4,943 (+8571.93%)
idear🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (+47.37%)
Image-Detection-SamplesSample app supporting the "Building a MVP with Face recognition and AR" and "Quest of a Hero part 2" presentations. It offers two ways to build a face detection mechanism: one based on OpenCV and one using the Camera 2 API.
Stars: ✭ 36 (-36.84%)
nested-transformerNested Hierarchical Transformer https://arxiv.org/pdf/2105.12723.pdf
Stars: ✭ 174 (+205.26%)
Keras SincnetKeras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-17.54%)
EffcientNetV2EfficientNetV2 implementation using PyTorch
Stars: ✭ 94 (+64.91%)
CCAligner🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.
Stars: ✭ 131 (+129.82%)
RhinoOn-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (+612.28%)
ocr recognitionUses Java, OpenCV, and Tesseract OCR to detect and recognize words in images; uses Python to generate jTessBoxEditor training box files
Stars: ✭ 52 (-8.77%)
concurrent-video-analytic-pipeline-optimization-sample-lCreate a concurrent video analysis pipeline featuring multistream face and human pose detection, vehicle attribute detection, and the ability to encode multiple videos to local storage in a single stream.
Stars: ✭ 39 (-31.58%)
Tensorflow-Keyword-SpottingKeyword spotting using various architectures, such as a convolutional VGGNet, a 1D convolutional network, and CTC.
Stars: ✭ 27 (-52.63%)
rosechoTianbot Rosecho (Tianecho), a Chinese-language voice interaction module with plug-and-play ROS support
Stars: ✭ 28 (-50.88%)
ImagenetTrial of Kaggle ImageNet object localization with YOLO v3 on Google Cloud
Stars: ✭ 56 (-1.75%)
Image recognitionPackages for image recognition - Robocup TU/e Robotics
Stars: ✭ 53 (-7.02%)
Formant AnalyzeriOS application for finding formants in spoken sounds
Stars: ✭ 43 (-24.56%)