Description:
Our team would like to integrate a solution with Zoom. We are considering analyzing video and audio in real time to compute metrics, so we need access to the microphone buffer and the video buffer.
Specific case
Our first test targets Windows, but the project also applies to Macs and mobile devices. Is there any chance of developing once against a single API that covers all of them?
Do you have any examples for these cases?
Best regards
            
@jesus.londono, if you are trying to analyze the audio and video from Zoom meetings in real time, there are a few options:
 1. Use the Zoom RTMP live-streaming API 
Pros: 
Doesn’t require any 3rd party services 
Lighter weight than building and running a Zoom bot 
 
Cons: 
Needs to be initiated on a per-meeting basis 
You need to set up an RTMP server to receive the data, which requires engineering effort to deploy, scale, and monitor 
Participants can get spooked by the “live” badge that appears in the meeting (even if it’s a private meeting) 
You don’t get separate audio or video streams; you get one mixed stream. 
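
For option 1, here is a minimal sketch of the receiving side. It assumes `ffmpeg` is installed and that an RTMP server you run exposes the Zoom stream at some URL (the URL in the comment is a placeholder). `rms_level` and `stream_levels` are hypothetical helper names, not part of any Zoom API. The sketch decodes the mixed audio to 16 kHz mono PCM and yields one loudness value per second:

```python
import array
import math
import subprocess
import sys

SAMPLE_RATE = 16000   # assumption: resample to 16 kHz mono for analysis
BYTES_PER_SAMPLE = 2  # signed 16-bit PCM


def rms_level(pcm: bytes) -> float:
    """Root-mean-square amplitude of signed 16-bit little-endian PCM."""
    usable = len(pcm) - len(pcm) % BYTES_PER_SAMPLE  # drop a trailing odd byte
    samples = array.array("h")
    samples.frombytes(pcm[:usable])
    if sys.byteorder == "big":
        samples.byteswap()  # ffmpeg's s16le output is little-endian
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def stream_levels(rtmp_url: str):
    """Yield one RMS value per second of audio pulled from rtmp_url."""
    cmd = [
        "ffmpeg", "-loglevel", "error",
        "-i", rtmp_url,
        "-vn",                      # ignore the video track
        "-f", "s16le",              # raw signed 16-bit little-endian PCM
        "-ac", "1",                 # mono
        "-ar", str(SAMPLE_RATE),
        "pipe:1",                   # write to stdout
    ]
    chunk_size = SAMPLE_RATE * BYTES_PER_SAMPLE  # one second of audio
    with subprocess.Popen(cmd, stdout=subprocess.PIPE) as proc:
        while True:
            chunk = proc.stdout.read(chunk_size)
            if not chunk:
                break
            yield rms_level(chunk)


# Example (placeholder URL, requires a running RTMP server):
# for level in stream_levels("rtmp://localhost/live/zoom"):
#     print(f"{level:.1f}")
```

Because the stream is mixed, a metric like this describes the meeting as a whole, not individual speakers.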
 
 2. Build a desktop app to capture Zoom audio and video 
Pros: 
One of the most cost-effective solutions 
 
Cons: 
You need to build a separate app for Windows, Mac and Linux 
Tapping into system audio is especially difficult on Mac 
The app runs on the user’s computer, so it can slow the machine down or make the fans spin up 
You don’t get separate audio or video streams; you get one mixed stream. 
 
 3. Build a Zoom bot with the Meeting SDK 
Pros: 
You get the separate audio streams and video streams in real-time per participant. This is great for analysis. 
 
Cons: 
It is very heavy-weight as you would need to spin up multiple servers to run the Zoom client for the bot 
Running infrastructure for Zoom bot costs more than live streaming. 
You need to encode the raw video and audio yourself 
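
To illustrate why per-participant streams are useful for analysis, here is a minimal sketch of a talk-time metric. It does not call the Meeting SDK. It assumes your bot already hands you each participant's audio as signed 16-bit little-endian mono PCM chunks. The class name, the default sample rate, and the loudness threshold are all hypothetical values you would need to check and tune against what your bot actually receives:

```python
import struct
from collections import defaultdict


class TalkTimeTracker:
    """Accumulates seconds of above-threshold audio per participant."""

    def __init__(self, sample_rate: int = 32000, threshold: float = 500.0):
        self.sample_rate = sample_rate  # assumption: verify against your stream
        self.threshold = threshold      # assumption: RMS level treated as speech
        self.seconds = defaultdict(float)

    def add_chunk(self, participant_id: str, pcm: bytes) -> None:
        """Add one chunk of 16-bit little-endian mono PCM for a participant."""
        count = len(pcm) // 2
        if count == 0:
            return
        samples = struct.unpack(f"<{count}h", pcm[: count * 2])
        rms = (sum(s * s for s in samples) / count) ** 0.5
        if rms >= self.threshold:
            self.seconds[participant_id] += count / self.sample_rate

    def report(self) -> dict:
        """Return a plain dict of participant id to seconds spoken."""
        return dict(self.seconds)
```

You would call `add_chunk` from whatever callback delivers raw audio in your bot, then read `report()` periodically. A simple energy threshold like this will count background noise as speech, so treat it as a starting point only.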
 
 4. Use Recall.ai 
It’s a unified API that lets you send meeting bots to video conferencing platforms to capture the audio, video and transcription in real time.
Pros: 
Handles spinning up the servers and providing the real-time raw audio and transcript, so all you interact with is a simple API. 
You get the separate audio streams and video streams in real-time per participant. This is great for analysis. 
Works on any Zoom plan (including Free) 
Works on all video conferencing platforms. 
 
Cons: 
It’s another 3rd party service in your stack 
 
Let me know if you have any questions!
              
system (system) closed this topic on December 2, 2022, 2:22am.

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.