Can a Zoom App record the user’s own voice (not the meeting) using Zoom App SDK?

Hi everyone,

I’m building a Zoom App (web-based) and I want to clarify whether a specific use case is supported by the Zoom App SDK.

I already have a web app where users can record their own voice messages (for example, short voice reminders or prompts). I’m exploring whether this same functionality can be implemented inside a Zoom App.

To be very clear:

  • :cross_mark: I am not trying to record the Zoom meeting audio

  • :cross_mark: I am not trying to record other participants

  • :white_check_mark: I only want to allow the current user to record their own microphone input, similar to a normal web voice recorder

  • The recorded audio would be used later as a personal voice reminder, not stored as a meeting recording

My questions:

  1. Does the Zoom App SDK allow access to the user’s microphone for custom audio recording (outside of meeting recording)?

  2. Can standard browser APIs like getUserMedia() be used inside a Zoom App for this purpose?

  3. Are there any permission or policy limitations I should be aware of for this use case?

Any guidance or references to relevant documentation would be greatly appreciated.
Thanks!

No - today a Zoom App can’t capture the user’s mic for arbitrary “voice memo” recording. Zoom has confirmed the Zoom client proactively blocks navigator.mediaDevices.getUserMedia() in Zoom Apps and that Zoom Apps don’t currently provide access to media devices.

Because of that, standard browser recording flows that rely on getUserMedia() won’t work inside the in-client embedded browser. The practical workaround is to open your existing recorder page in the system browser via zoomSdk.openUrl (note the domain whitelisting requirement called out in that same doc section).

There isn’t a Marketplace permission you can toggle to enable mic capture inside the Zoom App; it’s a client-side privacy/security restriction. If your requirements ever shift to meeting audio/video, Zoom’s supported paths are Meeting SDK raw data or Realtime Media Streams (RTMS) (both are for meeting media, not standalone mic memos).