Upload a video file and it will be transcribed to text using the Shotstack Ingest API. You can also provide the URL of a video file online. The video transcription process will generate a VTT subtitle file that can be used to display subtitles on the video. In this demo the VTT file is used with the native HTML5 video player to overlay and display the subtitles.
Adding subtitles to your video content will make it more accessible to a wider audience. Subtitles help people who are deaf or have hearing impairments. They are also useful for users who have the volume on their devices muted or disabled by default.
Generative AI and Large Language Models (LLMs) make it possible to take a video transcription and repurpose it for other uses. For example, you can use the transcription to generate a blog post or a social media post. You could generate a summary of the video or suggestions on how the script could be improved.
Whether you want to generate subtitles for a video or use the transcription for other purposes, the Shotstack Ingest API makes it easy to transcribe a video and generate the VTT (or SRT) subtitle file. The API can handle video transcoding, audio
The Shotstack Ingest API, is used to upload, transform and store videos and images and is just one of the API's on offer. One of the Ingest API's features is transcription which is showcased in this open source demo.
Included with the demo source code is the HTML form and jQuery script that captures the form input and sends the data via an AJAX call to a Node.js Express application. The Node.js backend validates the request, formats the API JSON payload correctly, and POSTs it to the Ingest API. The API then handles the video transcription and generates the VTT subtitle file before sending everything back to the web page for display.
The open-source code for this project is available on GitHub. The project might inspire you to create your own subtitles or transcription project or act as the starting point for a new AI video project.