hamos ASR Web Application

🗣️ Automatic Speech Recognition using Whisper by OpenAI ✨

Description

This web application utilizes the Whisper ASR system developed by OpenAI to perform automatic speech recognition. It allows users to upload audio files in various formats and generates transcriptions using the selected Whisper model.

Features

Supports popular audio formats including WAV, MP3, OGG, WMA, AAC, FLAC, MP4, and FLV.
Conversion of uploaded audio files to MP3 format for compatibility.
Selection of different Whisper model types (Tiny, Base, Small, Medium, Large).
Real-time transcript generation.
Downloadable transcripts as text files.

Installation

Clone the repository:

git clone https://github.com/your-username/whisper-asr-webapp.git

pip install -r requirements.txt

Usage

1.Run the application:

streamlit run app.py

2.Access the web application in your browser at http://localhost:8501.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
transcripts		transcripts
uploads		uploads
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
demo.gif		demo.gif
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

hamos ASR Web Application

Description

Features

Installation

Usage

About

Releases

Packages

Languages

License

abdelkareemkobo/hamos

Folders and files

Latest commit

History

Repository files navigation

hamos ASR Web Application

Description

Features

Installation

Usage

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages