Raw media streams via Meeting SDK for web

Is it possible to access raw media streams via the Zoom Meeting SDK for web?

The documentation about accessing media streams, states that a bot “initiates raw recording using StartRawRecording or StartLiveStreamWithSteamingURL (Meeting SDK for Windows), or mediaCapture (Meeting SDK for web)”.
Then it states “The bot obtains raw audio and/or video data and does the required processing.”

This is confusing, as it indicates one can obtain raw audio using the Meeting SDK for web but it links to instructions oriented towards Windows.

@gibron ,

There is no build in functionality in Zoom Meeting SDK for Web to access raw media stream.

Initial raw recording is a step which is done prior to actual recording, this would enable the prompt “this meeting is being recorded” to users.

The mediacapture in Meeting SDK for web is to enable the prompt. This mediacapture is to ensure developers who are using other ways of recording using Meeting SDK for web, can complies with privacy regulations.

@chunsiong.zoom If the Zoom Meeting SDK for Web does not currently support accessing media streams, then the documentation page is written in an extremely misleading way: Meeting Bots Accessing Media Streams.

It is titled “Accessing Media Streams” and is filled with examples and implications about the Zoom Meeting SDK for Web.

I cannot imagine we are the first team to arrive that this should be possible based on the docs.

4 Likes

Hey @gibron , although the Zoom Meeting SDK doesn’t support accessing media streams, you can use the Zoom Windows SDK or the very newly released Zoom Linux SDK :slight_smile:

If you are using the Zoom Windows or Linux SDK, here are the steps on how to build a meeting bot:

  1. Spin up a server. We recommend AWS, GCP, or Digital Ocean.
  2. Use either the Windows or Linux Zoom SDK to launch an instance of the Zoom client.
  3. Once you have the Zoom SDK launched, and use the Raw Data functionality to extract the video and audio streams.
  4. This will return the video in I420 raw frames and audio in PCM 16LE raw format, so you’ll need to encode the audio and video yourself afterwards.
  5. Once you have one instance of this working, you’ll need to scale this across several servers if you want to run multiple bots simultaneously, which is required to have bots for multiple meetings.

Finally, another option is Recall.ai. It’s a simple 3rd party API that lets you use meeting bots to get raw audio/video from meetings without you needing to spend months to build, scale and maintain these bots.

Let me know if you have any questions!

Thanks Amanda. We ended up rolling our own windows server to test out the Windows SDK.

Did you (or anyone else reading this reply) ever overcome the obstacles identified here: Not receiving audio on `AudioRawData` interface if the SDK doesn't detect a microphone or an audio device on the OS?

Thanks for tip re: Linux SDK, will have to look at that as well; I see it is in beta now.

can you provide a (full working) example and/or elaborate on what “… and use the Raw Data functionality” means & entails, exactly? THANK YOU!!!

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.