I’m trying to make use of the close_caption setting in the Meetings REST API [1]. Since we have an in-house Wowza infrastructure, we’d like to provide Zoom captions during our webcast events when they originate from Zoom meetings or webinars.
To do this, I need to understand which standard is used to embed captions in the stream, and how I should configure my Wowza application to support and display them correctly. Currently, this isn’t working as expected—when using standard HLS players [2], I only see very small caption fragments displayed at the top of the screen.
This is a good question. Zoom automated captions are based on transcription from automated speech recognition. I believe it is relaying what one would receiving in the VTT or TXT files for speakers. However, I believe 708 is being used because you have the ability to customize the captions
in a meeting or webinar.