Hi,
I am writing to seek guidance on implementing programmatic audio injection using the Zoom Meeting SDK for a legitimate business application.
PROJECT CONTEXT:
We are developing an application that needs to programmatically inject audio content (audio files, text-to-speech, generated tones) into Zoom meetings for participants to hear. This is for a legitimate business use case involving automated announcements and audio content delivery during meetings.
TECHNICAL ENVIRONMENT:
- Platform: Web-based application (JavaScript)
- SDK: Zoom Meeting SDK v4.0.0 (Web)
- Authentication: Meeting SDK auth endpoint (Node.js backend)
- Meeting Type: Regular Zoom meetings with SDK app credentials
METHODS ATTEMPTED:
- Web Audio API with MediaStream manipulation
  - Attempted to replace microphone stream with combined audio
  - Result: Blocked by Zoom SDK security restrictions
- Virtual Audio Microphone approach
  - Researched ZoomVideoSDKVirtualAudioMic functionality
  - Issue: Not available in Meeting SDK, requires Video SDK
- Screen sharing with computer audio
  - Attempted to use screen sharing to transmit audio
  - Result: Limited functionality, requires manual user interaction
- Direct audio stream injection
  - Tried various Web Audio API routing methods
  - Result: Zoom SDK prevents direct stream manipulation
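For context, the first and fourth attempts above follow the usual Web Audio pattern of mixing the microphone and a second source into one MediaStream. The block below is a minimal sketch of that pattern, not our production code: the function name `buildMixedStream` and the `fileGain` parameter are illustrative, and it uses only standard Web Audio API calls, no Zoom SDK APIs. It assumes the caller already has an `AudioContext`, a microphone `MediaStream` (for example from `getUserMedia`), and an `<audio>` element.

```javascript
// Sketch only: standard Web Audio API calls, no Zoom SDK APIs involved.
// `audioContext`, `micStream`, and `mediaElement` are supplied by the caller.
function buildMixedStream(audioContext, micStream, mediaElement, fileGain = 0.8) {
  // The destination node exposes the output of the graph as a MediaStream.
  const destination = audioContext.createMediaStreamDestination();

  // Route the real microphone into the mix.
  const micSource = audioContext.createMediaStreamSource(micStream);
  micSource.connect(destination);

  // Route the audio file (an <audio> element) through a gain node into the same mix.
  const fileSource = audioContext.createMediaElementSource(mediaElement);
  const gainNode = audioContext.createGain();
  gainNode.gain.value = fileGain; // illustrative level, tune as needed
  fileSource.connect(gainNode);
  gainNode.connect(destination);

  // This stream carries microphone + file audio combined.
  return destination.stream;
}
```

The combined stream is produced correctly in the browser; the open issue is that we have not found a supported way to hand it to the Meeting SDK in place of the default microphone input.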
SPECIFIC QUESTIONS:
- Virtual Audio Mic Support: Does the current Zoom Meeting SDK (Web) support virtual audio microphone functionality similar to ZoomVideoSDKVirtualAudioMic mentioned in Video SDK documentation?
- Official Audio Injection Method: What is the recommended/official way to programmatically inject audio content into meetings using the Meeting SDK?
- SDK Upgrade Path: Should we migrate to the Zoom Video SDK for proper audio injection support? If so, what are the differences in implementation?
- API Availability: Are there any undocumented APIs or beta features for audio injection that we could access?
- Alternative Approaches: What methods do you recommend for legitimate audio injection use cases?
BUSINESS REQUIREMENT:
We need to be able to:
- Play audio files during meetings
- Generate text-to-speech announcements
- Create audio notifications/alerts
- Stream audio content to meeting participants
TECHNICAL SPECIFICATIONS NEEDED:
- Audio format requirements (sample rate, bit depth, channels)
- Chunk size and timing specifications
- Any SDK configuration requirements
- Platform limitations or requirements
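To make the format and chunking questions concrete, here is a minimal sketch of how a generated tone could be produced and split into fixed-duration chunks. The values used (48 kHz, 16-bit signed mono PCM, 20 ms chunks) are placeholder assumptions, not Zoom specifications, and the function names `generateTonePcm` and `splitIntoChunks` are illustrative. The real numbers are exactly what we are asking for.

```javascript
// Assumed placeholder format: 16-bit signed PCM, mono. Not a Zoom specification.
function generateTonePcm(frequencyHz, durationMs, sampleRate = 48000, amplitude = 0.5) {
  const totalSamples = Math.round((durationMs / 1000) * sampleRate);
  const samples = new Int16Array(totalSamples);
  for (let i = 0; i < totalSamples; i++) {
    // Sine wave in [-1, 1], scaled to the 16-bit signed range.
    const value = Math.sin((2 * Math.PI * frequencyHz * i) / sampleRate);
    samples[i] = Math.round(value * amplitude * 32767);
  }
  return samples;
}

// Split a PCM buffer into chunks of `chunkMs` milliseconds each.
// The last chunk may be shorter if the buffer does not divide evenly.
function splitIntoChunks(samples, chunkMs, sampleRate = 48000) {
  const samplesPerChunk = Math.round((chunkMs / 1000) * sampleRate);
  if (samplesPerChunk <= 0) {
    throw new RangeError('chunkMs is too small for the given sample rate');
  }
  const chunks = [];
  for (let offset = 0; offset < samples.length; offset += samplesPerChunk) {
    // subarray creates a view without copying; the end index is clamped.
    chunks.push(samples.subarray(offset, offset + samplesPerChunk));
  }
  return chunks;
}

// Example: a 100 ms, 440 Hz tone split into 20 ms chunks.
const tone = generateTonePcm(440, 100);
const toneChunks = splitIntoChunks(tone, 20);
```

Knowing the required sample rate, bit depth, channel count, and chunk duration would let us replace these defaults with the supported values.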
CURRENT CONFIGURATION:
// Current SDK initialization
var authEndpoint = "http://localhost:4000";
var meetingNumber = "85613539124";
var role = 0; // Participant
var userName = "Arun";
var userEmail = "arun@asdasda*.**";
ZoomMtg.init({
  leaveUrl: leaveUrl,
  patchJsMedia: true,
  leaveOnPageUnload: true,
  // … rest of configuration
});
REQUEST:
Could you please provide:
- Official documentation or examples for audio injection
- Code samples demonstrating proper implementation
- Clarification on SDK capabilities and limitations
- Guidance on whether Meeting SDK or Video SDK is appropriate for our use case
- Any licensing or approval requirements for audio injection features
We are committed to implementing this properly according to Zoom’s guidelines and security requirements. Any guidance you can provide would be greatly appreciated.
Thank you for your time and assistance.