Ineed to decode YU420p stream buffer to RGB image for C# or C++ language


Description
I need to decode zoom frame YUV420 planar to RGB image formatter as PNG or JPEG

Which Windows Meeting SDK version?
5.11.4

Screenshots

Troubleshooting Routes
Render recordings from Video raw data - #14 by chump

Device (please complete the following information):

  • Device: lenovo
  • OS: Windows 10

Additional context
I need a decoding algorithm or Code for C++ or C# instead

@zoomuser, happy to see if we can help.

Are you building a recording bot, or are you building a desktop app?

Yes, it’s a recording process in a bot.

Hi pro.

I am going to build a meeting bot that records audio only. Can you please help to share some hints that I should start with

Really appreciated your help. Thanks!

Hey @zoomuser, the JPEG and PNG specifications are quite complicated – this is most likely not something you want to write yourself by hand. What we’d suggest is to use a tool like FFMPEG, which has this capability built-in.
We wrote a bit more about this here: How to Convert Zoom Raw Data into PNG and MP4

Hey @nguyenhavinh, happy to share how to build a meeting bot.

If you’re using the Zoom Raw Data SDK, these are the steps:

  1. Spin up a server. We recommend AWS, GCP, or Digital Ocean.
  2. Use either the Windows or Mac Zoom SDK to launch an instance of the Zoom client.
  3. Once you have the Zoom SDK launched, and use the Raw Data functionality to extract the video and audio streams.
  4. This will return the video in I420 raw frames and audio in PCM 16LE raw format, so you’ll need to encode the audio and video yourself afterwards.
  5. Once you have one instance of this working, you’ll need to scale this across several servers if you want to run multiple bots simultaneously, which is required to have bots for multiple meetings.

You could also use Twilio to build a bot that dials in to meetings. This is much easier, but some meetings don’t have a dial-in option enabled - just keep that in mind.

Finally, another option is Recall.ai. It’s a simple 3rd party API that lets you use meeting bots to get raw audio/video from meetings without you needing to spend months to build, scale and maintain these bots.

Let me know if you have any questions!

1 Like

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.