Need advice on approach for bidirectional AI streaming app

Steve4m · April 14, 2025, 8:41pm

I’m building a bidirectional, conversational, voice ai app that will participate in Zoom meetings. I tried the web meeting SDK and my bot is joining the meeting. However, it seems that I need to use the Video SDK to access the video and audio live stream. I need the raw pcm audio stream that will be sent to my GenAI backend.

The documentation on the live stream use case isn’t great and bidirectional is almost non existent. I’m not 100% sure that I can achieve a working demo of this app using the web approach versus building a native app. Please can someone advise on which SDK I can use to achieve this use case in the simplest way. Any samples or demoes would be appreciated.

chunsiong.zoom · April 16, 2025, 7:51am

@Steve4m , if you are building something which needs to be bidirectional, you will probably need to use Meeting SDK.

In this case, you will either need to use Windows Meeting SDK or Linux Meeting SDK to get raw PCM audio, pass it to your backend service, generate a PCM audio and use raw data to send raw audio back to the meeting.

Steve4m · April 16, 2025, 9:01am

thanks for confirming. Does that mean that there isn’t a MacOS SDK that can handle this use case?

chunsiong.zoom · April 16, 2025, 11:40am

@Steve4m macOS SDK would work as well. From a hosting and deployment point of view, windows and Linux are the more common server OS

Topic		Replies	Views
We need to get the audio stream from a meeting, can we get it using the Meeting SDK? Or can we only get it using the Video SDK? Video SDK api	2	858	February 25, 2023
RoadMap for access to Audio and Video raw content in meetng Zoom Apps	7	709	January 31, 2023
Queries related to Custom Zoom Bot Web video-sdk	3	266	July 19, 2023
Making live translation app Meeting SDK	2	269	May 8, 2024
Zoom Bot Streaming Video Meeting SDK video-sdk	12	1519	February 16, 2024

Need advice on approach for bidirectional AI streaming app

Related topics