A text-to-speech program using VAE on Mel spectrograms of phonemes.
-
Updated
Jun 3, 2020 - Python
A text-to-speech program using VAE on Mel spectrograms of phonemes.
Raspberry Pi audio-to-visual pipeline with sentiment analysis
Sound Classification Using CNN
Mini Automatic Speech Recognition
Extraction of text from audio clip using moviepy and speech_recognition library, python
Music genre classification is a machine learning model by which the model can predict music and classify the music based on popular genres like pop,jazz,rock,hip-hop,lofi etc.
BeatFarmer stands out in the realm of audio analysis tools by not only offering deep insights into audio samples but also by being a source of creative inspiration for music producers, hobbyists, and enthusiasts. With BeatFarmer, elevate your music production process and bring organization and innovation to your audio samples.
Audio Classification
UrbanSound8K Audio Classifier: TensorFlow Model
Silly Sequencer assigns a random sample to each note of each channel of a MIDI file and outputs the resulting audio file.
Speaker diarization simulation built with python
8th place solution in 2020 AICup - Music Transcription
An application that collects and preprocesses audio clips of single-word utterances found in WAV files
This project is about a Music genre classification, which classifies the type of genre (Blues, Disco, Rock, etc..) using a CNN.
Classification of digits based on their Audio Inputs.
In this notebook, we are recognizing digits from 0 to 9 based on audio recordings file. Input data will be in the form of speech signal and output will be a single digit.
Cat and Dog audio classification using CNN
An attempt at the speech emotion recognition (SER) task on the CREMA-D dataset using TensorFlow 1D & 2D RCNN models.
Add a description, image, and links to the librosa topic page so that developers can more easily learn about it.
To associate your repository with the librosa topic, visit your repo's landing page and select "manage topics."