Zoom API for Seamless Voice Interaction

kais.lamine.work · July 17, 2023, 2:27pm

Hello everyone,

I am excited to share my project involving a voice bot designed to conduct job interviews on Visio conference platforms like Zoom. The voice bot has been trained to simulate real recruiter-led job interviews, where it asks questions and evaluates candidate responses to determine the next steps of the interview process.

To achieve this, I need the platform to provide APIs that allow me to manage the flow of the interview seamlessly. Here are the key functionalities I am seeking through APIs:

1-Joining a Zoom meeting using the interviewer’s host account.
2-Capturing the candidate’s audio responses and feeding them back to the voice bot.
3-Sending voice responses from the voice bot to Zoom, allowing the interviewer to deliver them.

While researching similar topics, I found some discussions dating back to 2020. However, I believe Zoom’s APIs have undergone significant development since then. As someone new to setting up APIs, I would greatly appreciate your input on this matter. Specifically, I would like to know which APIs are most suitable for achieving these functionalities and whether it is possible to accomplish them using Zoom’s APIs.

I am grateful for the excellent developer documentation provided and look forward to your valuable insights.

Best regards,
Kais

gianni.zoom · July 24, 2023, 7:05pm

Hi @kais.lamine.work ,

Welcome to the Zoom Developer Platform!

Here’s some guidance on using bots with Zoom: Meeting Bots Accessing Media Streams

We will also being sharing a blog post soon on using bots with real time audio which can definitely help on your journey. I’ll be sure to share when it becomes available.

kais.lamine.work · July 25, 2023, 10:57am

Hi @gianni.zoom ,
Thank you for your reply!
Will be waiting for the blog post, it could help me out a lot

amanda-recallai · August 8, 2023, 1:17am

@kais.lamine.work, you *can do this via a meeting bot.

Happy to break it down how to do it. To build a meeting bot:

Spin up a server. We recommend AWS, GCP, or Digital Ocean.
Use either the Windows or Mac Zoom SDK to launch an instance of the Zoom client.
Once you have the Zoom SDK launched, and use the Raw Data functionality to extract the video and audio streams.
This will return the video in I420 raw frames and audio in PCM 16LE raw format, so you’ll need to encode the audio and video yourself afterwards.
Run the audio through a transcription provider like AWS Transcribe.
Output audio from the bot to respond.
Once you have one instance of this working, you’ll need to scale this across several servers if you want to run multiple bots simultaneously, which is required to have bots for multiple meetings.

Another option is Recall.ai. It’s a simple 3rd party API that lets you use meeting bots to get the raw audio/video from meetings + output video/audio without you needing to spend months to build, scale and maintain these bots.

Let me know if you have any questions!

gianni.zoom · August 8, 2023, 4:30pm

Hi @kais.lamine.work ,

While not exactly what you’re trying to do, could these blog posts be helpful?

Topic		Replies	Views
Zoom APIs for Seamless Voice Interaction Video SDK recording , api	0	276	July 7, 2023
Is it possible to use the Zoom Companion APIs? API and Webhooks api	1	665	January 22, 2024
How to create zoom bot which can join and record meeting using official zoom API or SDK way? API and Webhooks api	7	4008	April 21, 2024
Total newbie question on zoom API API and Webhooks	2	528	February 3, 2022
Join a bot to a meeting App Marketplace	2	5556	August 21, 2020

Zoom API for Seamless Voice Interaction

Related Topics