Zoom Audio bots

I’m making an audio bot for Zoom. I have a backend that generates a response based on the received audio stream. I’m trying to figure out how to do this with the Meeting SDK, but I haven’t found an easy way to add a bot to an existing meeting and get an audio stream from it without having to run an app for Windows or Mac. Any ideas?

@anmalygin it’s true there’s no easy way to add a bot to a Zoom meeting.

Happy to break it down for you if you want to do it yourself, though. To build a meeting bot:

  1. Spin up a server. We recommend AWS, GCP, or Digital Ocean.
  2. Use either the Windows or Mac Zoom SDK to launch an instance of the Zoom client.
  3. Once the Zoom SDK is launched, use the Raw Data functionality to extract the video and audio streams.
  4. This will return the video in I420 raw frames and audio in PCM 16LE raw format, so you’ll need to encode the audio and video yourself afterwards.
  5. Once you have one instance of this working, you’ll need to scale this across several servers if you want to run multiple bots simultaneously, which is required to have bots for multiple meetings.
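To give a feel for step 4, here is a rough Python sketch of what handling the raw formats can look like. It is not Zoom SDK code: `pcm16le_to_wav` and `i420_frame_size` are hypothetical helper names, and the 32 kHz mono default is an assumption you should replace with whatever sample rate and channel count your raw audio callback actually reports. Wrapping PCM in a WAV container is not compression, only the simplest way to get a playable file; a real encoder (e.g. Opus or AAC for audio, H.264 for video) would slot in at the same point.

```python
import io
import wave


def pcm16le_to_wav(pcm: bytes, sample_rate: int = 32000, channels: int = 1) -> bytes:
    """Wrap raw PCM 16-bit little-endian samples in a WAV container.

    sample_rate and channels are assumptions; use the values reported
    alongside the raw audio you actually receive.
    """
    bytes_per_frame = 2 * channels  # 16-bit samples are 2 bytes each
    if len(pcm) % bytes_per_frame != 0:
        raise ValueError("PCM buffer does not contain a whole number of frames")

    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav_file:
        wav_file.setnchannels(channels)
        wav_file.setsampwidth(2)
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(pcm)
    return buf.getvalue()


def i420_frame_size(width: int, height: int) -> int:
    """Return the byte size of one I420 frame.

    I420 stores a full-resolution Y plane followed by U and V planes,
    each subsampled by 2 horizontally and vertically.
    """
    chroma_width = (width + 1) // 2
    chroma_height = (height + 1) // 2
    return width * height + 2 * chroma_width * chroma_height


if __name__ == "__main__":
    # 10 ms of silence at the assumed 32 kHz mono rate: 320 samples.
    silence = b"\x00\x00" * 320
    wav_bytes = pcm16le_to_wav(silence)
    print(len(wav_bytes), "bytes of WAV data")
    print(i420_frame_size(640, 360), "bytes per 640x360 I420 frame")
```

The frame size calculation is useful for sizing buffers and for sanity-checking that each video callback delivers a complete frame before you pass it to an encoder.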

Another option is Recall.ai. It’s a simple 3rd party API that lets you use meeting bots to get the raw audio/video from meetings + output video/audio without you needing to spend months to build, scale and maintain these bots.

Let me know if you have any questions!


Thanks!
We tried the first option and it is really not easy :)

We will try Recall.ai. My boss has scheduled a meeting with you today.

On Tue, Sep 20, 2022 at 06:19, Amanda via Zoom Developer Forum <notifications@zoomdeveloper.discoursemail.com> wrote: