This is a web application that performs speech recognition on audio files. It allows you to upload an audio file (in .wav, .mp3, or .ogg format) and transcribes the speech content using an AI speech recognition model.
- Accepts .wav, .mp3, and .ogg audio file formats.
- Transcribes the speech content of the uploaded audio file.
- Displays the transcription result in a user-friendly interface.
- Provides a "Copy" button to easily copy the transcription text.
- Python 3.6 or higher
- Flask (install using
pip install flask
) - Whisper (install using
pip install openai-whisper
)
-
Clone the repository:
git clone https://github.com/hakunamatata1997/Speec-to-Text-WebApp.git
-
Navigate to the project directory:
cd speech-recognition-webapp
-
Install the required dependencies:
pip install -r requirements.txt
-
Start the Flask development server:
python app.py
-
Open your web browser and go to
http://localhost:52323
. -
Upload an audio file using the provided form.
-
Click the "Recognize Speech" button to initiate the transcription process.
-
The transcription result will be displayed on the page. You can copy the text by clicking the "Copy" button.
- The Whisper ASR model used in this project is developed by OpenAI. Visit the Whisper GitHub repository for more information.