midas-research / Audino

License: MIT

Open source audio annotation tool for humans™

Labels

machine-learning datasets audio-processing speech-processing

audino

audino is an open source audio annotation tool. It provides features such as transcription and labeling, which enable annotation for Voice Activity Detection (VAD), Diarization, Speaker Identification, Automated Speech Recognition, Emotion Recognition and similar tasks.

Features

Current features of the tool include:

Multi-language support
Collaborative annotation
JWT based authentication
User-level project, role and data assignment
Project-level API Key based datapoint creation
Emoji support
Flexibility in label creation

Usage

Note: Please see the getting started guide for configuration and concrete usage.

Please install the following dependencies to run audino on your system:

git [tested on v2.23.0]
docker [tested on v19.03.8, build afacb8b]
docker-compose [tested on v1.25.5, build 8a1c60f6]
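Before cloning, it can help to confirm that the three tools are on your PATH. The following is a minimal sketch, not part of the audino repository: the function name missing_deps is hypothetical, and it only checks that each command exists, not that its version matches the tested ones listed above.

```shell
#!/bin/sh
# Print each required tool that is not found on PATH.
# Returns 0 when every tool is present, 1 otherwise.
# missing_deps is a hypothetical helper, not shipped with audino.
missing_deps() {
    rc=0
    for tool in "$@"; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            echo "missing: $tool"
            rc=1
        fi
    done
    return "$rc"
}

# Check the three tools named in the dependency list.
if missing_deps git docker docker-compose; then
    echo "all dependencies found"
fi
```

To compare versions against the tested ones, run git --version, docker --version and docker-compose --version by hand.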

Clone the repository

$ git clone https://github.com/midas-research/audino.git
$ cd audino

Note for Windows users: Please configure git to handle line endings correctly; otherwise the services might throw an error and fail to come up. You can do this by cloning the project this way:

$ git clone https://github.com/midas-research/audino.git --config core.autocrlf=input
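If the services still fail after cloning, one way to check whether a file was checked out with Windows line endings is to look for carriage returns in it. This is a small sketch under assumptions: has_crlf is a hypothetical helper, and the ./*.sh pattern is only illustrative, since which files in the repository are sensitive to line endings is not stated here.

```shell
#!/bin/sh
# Succeed (exit 0) when the given file contains at least one carriage return.
# has_crlf is a hypothetical helper, not shipped with audino.
has_crlf() {
    cr=$(printf '\r')
    grep -q "$cr" "$1"
}

# Illustrative scan: report shell scripts in the current directory that
# contain CRLF line endings. Adjust the pattern to the files you suspect.
for f in ./*.sh; do
    [ -f "$f" ] || continue
    if has_crlf "$f"; then
        echo "CRLF line endings found in $f"
    fi
done
```

Running git config core.autocrlf inside the clone should print input if the clone command above was used.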

For Production

You can either run the project with the default configuration or modify it to suit your needs. Note: Before proceeding, you might need to grant your user permission to run docker, or run the commands listed below with sudo.

To build the services, run:

$ docker-compose -f docker-compose.prod.yml build

To bring up the services, run:

$ docker-compose -f docker-compose.prod.yml up

Then, in a browser, go to http://0.0.0.0/ to view the application.

To bring down the services, run:

$ docker-compose -f docker-compose.prod.yml down

For Development

As with the production setup, use the development configuration when working on the project, fixing bugs or making contributions. Note: Before proceeding, you might need to grant your user permission to run docker, or run the commands listed below with sudo.

To build the services, run:

$ docker-compose -f docker-compose.dev.yml build

To bring up the services, run:

$ docker-compose -f docker-compose.dev.yml up

Then, in a browser, go to http://localhost:3000/ to view the application.

To bring down the services, run:

$ docker-compose -f docker-compose.dev.yml down
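The production and development commands differ only in the compose file they name, so the two sets can be wrapped in one helper. The sketch below is a convenience under assumptions, not something the repository provides: compose_file and audino_compose are hypothetical names, and the helper simply forwards build, up or down to docker-compose with the file names used above.

```shell
#!/bin/sh
# Map an environment name to the compose file used in the commands above.
# Both functions are hypothetical helpers, not shipped with audino.
compose_file() {
    case "$1" in
        prod) echo "docker-compose.prod.yml" ;;
        dev)  echo "docker-compose.dev.yml" ;;
        *)    echo "unknown environment: $1 (expected prod or dev)" >&2
              return 1 ;;
    esac
}

# Run one docker-compose action (build, up or down) for an environment.
# Usage: audino_compose <prod|dev> <build|up|down>
audino_compose() {
    file=$(compose_file "$1") || return 1
    docker-compose -f "$file" "$2"
}
```

For example, after sourcing the file from the repository root, audino_compose dev build followed by audino_compose dev up is equivalent to the two development commands above, and audino_compose dev down stops the services.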

Tutorials

We provide a set of tutorials that guide users through specific tasks. If you feel something is missing and should be included, please open an issue.

Citation

Currently, the paper is under review. For now, please cite it as:

@misc{grover2020audino,
    title={audino: A Modern Annotation Tool for Audio and Speech},
    author={Manraj Singh Grover and Pakhi Bamdev and Yaman Kumar and Mika Hama and Rajiv Ratn Shah},
    year={2020},
    eprint={2006.05236},
    archivePrefix={arXiv},
    primaryClass={cs.SD}
}

License

MIT © MIDAS, IIIT Delhi
