Zoom audio stream access

Hello, I am trying to build a feature, where I need to access the speakers audio stream, use that and process it to text. I have gone through many posts saying to use RTMP server, creating a saperate desktop apps for capturing the audio, and using Meeting Bots, but none of them seems to be viable to use for creating a light weight app. Is there any other way for using the access for the audio stream directly? And also any possible suggestions for this case?

@viwinkumar.p20 , unfortunately the options are only RTMP live streaming, desktop app, or meeting bots.

The lightest weight method would be to use the Recall.ai API for meeting bots – it’s literally just one API call in your app.