All Projects → midas-research → Audino

midas-research / Audino

Licence: mit
Open source audio annotation tool for humans™

Programming Languages

javascript
184084 projects - #8 most used programming language
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Audino

Dtln
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
Stars: ✭ 147 (-80.14%)
Mutual labels:  speech-processing, audio-processing
Sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+3.24%)
Mutual labels:  speech-processing, audio-processing
DiViMe
ACLEW Diarization Virtual Machine
Stars: ✭ 28 (-96.22%)
Mutual labels:  speech-processing, audio-processing
Keras Sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-93.65%)
Mutual labels:  speech-processing, audio-processing
Surfboard
Novoic's audio feature extraction library
Stars: ✭ 318 (-57.03%)
Mutual labels:  speech-processing, audio-processing
Awesome Twitter Data
A list of Twitter datasets and related resources.
Stars: ✭ 533 (-27.97%)
Mutual labels:  datasets
Datasets For Recommender Systems
This is a repository of a topic-centric public data sources in high quality for Recommender Systems (RS)
Stars: ✭ 564 (-23.78%)
Mutual labels:  datasets
Wave U Net
Implementation of the Wave-U-Net for audio source separation
Stars: ✭ 506 (-31.62%)
Mutual labels:  audio-processing
Awesome Dataset Tools
🔧 A curated list of awesome dataset tools
Stars: ✭ 495 (-33.11%)
Mutual labels:  datasets
Awesome Transit
Community list of transit APIs, apps, datasets, research, and software 🚌🌟🚋🌟🚂
Stars: ✭ 713 (-3.65%)
Mutual labels:  datasets
Label Studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Stars: ✭ 7,264 (+881.62%)
Mutual labels:  datasets
Soundfingerprinting
Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.
Stars: ✭ 554 (-25.14%)
Mutual labels:  audio-processing
Datasette
An open source multi-tool for exploring and publishing data
Stars: ✭ 5,640 (+662.16%)
Mutual labels:  datasets
Audio Visualizer Android
🎵 [Android Library] A light-weight and easy-to-use Audio Visualizer for Android.
Stars: ✭ 581 (-21.49%)
Mutual labels:  audio-processing
Speech Denoising Wavenet
A neural network for end-to-end speech denoising
Stars: ✭ 516 (-30.27%)
Mutual labels:  speech-processing
Awesome Diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (-9.05%)
Mutual labels:  speech-processing
Voice datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).
Stars: ✭ 494 (-33.24%)
Mutual labels:  datasets
Annotated Semantic Relationships Datasets
A collections of public and free annotated datasets of relationships between entities/nominals (Portuguese and English)
Stars: ✭ 553 (-25.27%)
Mutual labels:  datasets
Beethoven
🎸 A maestro of pitch detection.
Stars: ✭ 601 (-18.78%)
Mutual labels:  audio-processing
Chromaprint
C library for generating audio fingerprints used by AcoustID
Stars: ✭ 553 (-25.27%)
Mutual labels:  audio-processing

audino

GitHub license PRs Welcome

audino is an open source audio annotation tool. It provides features such as transcription and labeling which enables annotation for Voice Activity Detection (VAD), Diarization, Speaker Identification, Automated Speech Recognition, Emotion Recognition tasks and more.

Features

Current features of the tool include:

  1. Multi-language support
  2. Collaborative annotation
  3. JWT based authentication
  4. User-level project, role and data assignment
  5. Project-level API Key based datapoint creation
  6. Emoji support
  7. Flexibility in label creation

Usage

Note: Please see getting started guide for configurations and concrete usage.

Please install the following dependencies to run audino on your system:

  1. git [tested on v2.23.0]
  2. docker [tested on v19.03.8, build afacb8b]
  3. docker-compose [tested on v1.25.5, build 8a1c60f6]

Clone the repository

$ git clone https://github.com/midas-research/audino.git
$ cd audino

Note for Windows users: Please configure git to handle line endings correctly as services might throw an error and not come up. You can do this by cloning the project this way:

$ git clone https://github.com/midas-research/audino.git --config core.autocrlf=input

For Production

You can either run the project on default configuration or modify them to your need. Note: Before proceeding further, you might need to give docker sudo access or run the commands listed below as sudo.

To build the services, run:

$ docker-compose -f docker-compose.prod.yml build

To bring up the services, run:

$ docker-compose -f docker-compose.prod.yml up

Then, in browser, go to http://0.0.0.0/ to view the application.

To bring down the services, run:

$ docker-compose -f docker-compose.prod.yml down

For Development

Similar to production setup, you need to use development configuration for working on the project, fixing bugs and making contributions. Note: Before proceeding further, you might need to give docker sudo access or run the commands listed below as sudo.

To build the services, run:

$ docker-compose -f docker-compose.dev.yml build

To bring up the services, run:

$ docker-compose -f docker-compose.dev.yml up

Then, in browser, go to http://localhost:3000/ to view the application.

To bring down the services, run:

$ docker-compose -f docker-compose.dev.yml down

Tutorials

We provide a set of tutorials to guide users to achieve certain tasks. If you feel something is missing and should be included, please open an issue.

Citation

Currently, the paper is under review. For now, please cite it as:

@misc{grover2020audino,
    title={audino: A Modern Annotation Tool for Audio and Speech},
    author={Manraj Singh Grover and Pakhi Bamdev and Yaman Kumar and Mika Hama and Rajiv Ratn Shah},
    year={2020},
    eprint={2006.05236},
    archivePrefix={arXiv},
    primaryClass={cs.SD}
}

License

MIT © MIDAS, IIIT Delhi

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].