All Projects → team-re-verb → Re Verb

team-re-verb / Re Verb

Licence: mit
speaker diarization system using an LSTM

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Re Verb

Photonix
This is a new web-based photo management application. Run it on your home server and it will let you find the right photo from your collection on any device. Smart filtering is made possible by object recognition, location awareness, color analysis and other ML algorithms.
Stars: ✭ 592 (+2092.59%)
Mutual labels:  ai, ml
Hyperparameter hunter
Easy hyperparameter optimization and automatic result saving across machine learning algorithms and libraries
Stars: ✭ 648 (+2300%)
Mutual labels:  ai, ml
Xai
XAI - An eXplainability toolbox for machine learning
Stars: ✭ 596 (+2107.41%)
Mutual labels:  ai, ml
Fastocloud
Self-hosted IPTV/NVR/CCTV/Video service (Community version)
Stars: ✭ 464 (+1618.52%)
Mutual labels:  ai, ml
Awesome Ai Ml Dl
Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.
Stars: ✭ 831 (+2977.78%)
Mutual labels:  ai, ml
Gofamily
🔥 大厂 BAT 面试高频知识点,后端技术体系。包含了 C GO Python, 网络,Redis ,MySQL ,消息队列 ,高并发,微服务,缓存,操作系统,算法,LeetCode 刷题等知识
Stars: ✭ 474 (+1655.56%)
Mutual labels:  ai, redis
Ffdl
Fabric for Deep Learning (FfDL, pronounced fiddle) is a Deep Learning Platform offering TensorFlow, Caffe, PyTorch etc. as a Service on Kubernetes
Stars: ✭ 640 (+2270.37%)
Mutual labels:  ai, ml
Differentiable Plasticity
Implementations of the algorithms described in Differentiable plasticity: training plastic networks with gradient descent, a research paper from Uber AI Labs.
Stars: ✭ 371 (+1274.07%)
Mutual labels:  ai, ml
Aoe
AoE (AI on Edge,终端智能,边缘计算) 是一个终端侧AI集成运行时环境 (IRE),帮助开发者提升效率。
Stars: ✭ 759 (+2711.11%)
Mutual labels:  ai, ml
Flutter Ai Rubik Cube Solver
Flutter-Python rubiks cube solver.
Stars: ✭ 744 (+2655.56%)
Mutual labels:  ai, ml
Awesome Mlops
A curated list of references for MLOps
Stars: ✭ 7,119 (+26266.67%)
Mutual labels:  ai, ml
M Mall Admin
🐶 微信小程序-小商城后台(基于 Node.js、MongoDB、Redis 开发的系统...)
Stars: ✭ 895 (+3214.81%)
Mutual labels:  redis, express
Metaflow
🚀 Build and manage real-life data science projects with ease!
Stars: ✭ 5,108 (+18818.52%)
Mutual labels:  ai, ml
Ai Economist
Foundation is a flexible, modular, and composable framework to model socio-economic behaviors and dynamics with both agents and governments. This framework can be used in conjunction with reinforcement learning to learn optimal economic policies, as done by the AI Economist (https://www.einstein.ai/the-ai-economist).
Stars: ✭ 537 (+1888.89%)
Mutual labels:  ai, ml
Kglib
Grakn Knowledge Graph Library (ML R&D)
Stars: ✭ 405 (+1400%)
Mutual labels:  ai, ml
Node Express Mongodb Jwt Rest Api Skeleton
This is a basic API REST skeleton written on JavaScript using async/await. Great for building a starter web API for your front-end (Android, iOS, Vue, react, angular, or anything that can consume an API). Demo of frontend in VueJS here: https://github.com/davellanedam/vue-skeleton-mvp
Stars: ✭ 603 (+2133.33%)
Mutual labels:  redis, express
Csinva.github.io
Slides, paper notes, class notes, blog posts, and research on ML 📉, statistics 📊, and AI 🤖.
Stars: ✭ 342 (+1166.67%)
Mutual labels:  ai, ml
Node Tutorial
☺️Some of the node tutorial -《Node学习笔记》
Stars: ✭ 364 (+1248.15%)
Mutual labels:  redis, express
Windows Machine Learning
Samples and Tools for Windows ML.
Stars: ✭ 663 (+2355.56%)
Mutual labels:  ai, ml
Wheels
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+3200%)
Mutual labels:  ai, ml

Logo

Logo


About the project

RE: VERB is speaker diarization system, it allows the user to send/record audio of a conversation and receive timestamps of who spoke when

RE:VERB is our final project in Magshimim, and consists of a web client and a server.

  • The client can record audio and show the the timestamp results graphically

  • The server can be used with many other clients with the simple REST API it has.

Built With

client

server

  • Pytorch - library for deep learning with python that has great support for GPUs with CUDA

  • Express.js - Node.js web server framework

Getting Started

The project contains the server and the web client(a CLI client also exists for debug purposes).

the server is located at ./server and the web client is located at ./client/website.


Server

The model alongside the scripts for downloading, training and the weights from our training is located at ./server/speech_diarization/model

we used Docker to create a cross-platform environment to run the server on.

The server is made up of:

  • a container for the web server
  • a container for the diarization process
  • a container for a redis database that will allow the others to communicate

docker compose will run and manage all 3 at once

Docker and docker-compose need to be installed in order to build and run the server, all the rest will be taken care of.

Installing

cd server
docker-compose up

This will run all 3 containers and install dependencies.

If you make a change in the server, use

docker-compose up --build

to rebuild.

usage:

sending a HTTP POST request with an audio file to the server at http://localhost:1337/upload (default port and url) will return a JSON file with the timestamps in milliseconds.

{"0": [[40, 120], [3060, 3460], [3480, 3560]], "1": [[1260, 1660], [1680, 1960]]}

Client

The client needs npm or yarn to be installed, more info about the client can be found here.

to install:

cd client/website
npm install

afterwards you can use

npm run serve

to run a development server


Authors

License

This project is licensed under the MIT License - see the LICENSE.md file for details

Acknowledgments

  • The diarization algorithm is an implementation of this research, we also used their implementation of the spectral clustering

  • We took inspiration and some code from Harry volek's implementation of a different but similar problem - Speaker Verification

Future Plans

  • We had problems with training on the AMI corpus so we used the TIMIT corpus for the model provided.

  • We plan to train again on the VoxCeleb 1 and 2 datasets which contain a lot more data and hopefully improve feature extraction

  • We want to add integration with a speech-to-text service and transcribe the created segments

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].