This is a submission for the AssemblyAI Challenge : Sophisticated Speech-to-Text.
What I Built
I built a Speech-to-Text Application that showcases the power of Universal-2, AssemblyAI’s latest speech-to-text model. The application:
- Supports Multilingual Transcription Users can choose from multiple languages, ensuring global accessibility.
- Outputs with Formatting and Timestamps application delivers well-structured transcripts, complete with proper nouns, punctuation, and timestamps.
- User-Friendly Interface built using Streamlit, the app features an intuitive frontend for easy navigation and interaction.
Demo
Journey
Incorporating Universal-2:
The application utilizes Universal-2 through AssemblyAI’s robust API. The backend:
- Uploads audio files using AssemblyAI's upload endpoint.
- Submits transcription requests, including optional parameters like language_code and punctuate.
- Polls transcription progress until completion and fetches the final transcript with timestamps, and word-by-word breakdown.
Screenshots
Team Submission:
I worked on this project independently-Sunder Kumar
Top comments (0)