Need advice on approach for bidirectional AI streaming app

I’m building a bidirectional, conversational, voice ai app that will participate in Zoom meetings. I tried the web meeting SDK and my bot is joining the meeting. However, it seems that I need to use the Video SDK to access the video and audio live stream. I need the raw pcm audio stream that will be sent to my GenAI backend.

The documentation on the live stream use case isn’t great and bidirectional is almost non existent. I’m not 100% sure that I can achieve a working demo of this app using the web approach versus building a native app. Please can someone advise on which SDK I can use to achieve this use case in the simplest way. Any samples or demoes would be appreciated.

@Steve4m , if you are building something which needs to be bidirectional, you will probably need to use Meeting SDK.

In this case, you will either need to use Windows Meeting SDK or Linux Meeting SDK to get raw PCM audio, pass it to your backend service, generate a PCM audio and use raw data to send raw audio back to the meeting.

thanks for confirming. Does that mean that there isn’t a MacOS SDK that can handle this use case?

@Steve4m macOS SDK would work as well. From a hosting and deployment point of view, windows and Linux are the more common server OS