/media-azure-serverless-workflow

Audio Diagnose, Media Enhance, and Speech-to-Text running on Azure Serverless Functions.

Primary language: Python. License: BSD 3-Clause "New" or "Revised" License (BSD-3-Clause).

description: An example media workflow with Dolby.io, presented at the 2021 Azure Serverless Conf
languages: python
products: azure-functions, azure-storage, azure-cognitive-services

Media Pipeline

This sample shows how to use Azure Functions and Dolby.io to build a media processing pipeline that transcribes and enhances input media.
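The overall flow can be sketched as three stages chained together. This is a minimal illustration, not the sample's actual code: `run_pipeline`, `PipelineResult`, and the three stage callables are hypothetical names, and the ordering (diagnose first, then enhance, then transcribe the enhanced file) is an assumption. The sketch also assumes that a higher diagnose score means better audio and that media scoring below SCORE_THRESHOLD skips the later stages.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class PipelineResult:
    """Outcome of one pipeline run (hypothetical container)."""
    score: float
    enhanced_url: Optional[str] = None
    transcript: Optional[str] = None


def run_pipeline(
    input_url: str,
    score_threshold: float,
    diagnose: Callable[[str], float],
    enhance: Callable[[str], str],
    transcribe: Callable[[str], str],
) -> PipelineResult:
    """Chain the three stages; each stage is passed in as a callable.

    In a real deployment the callables would wrap the Dolby.io and
    Azure Cognitive Services requests. Here they are placeholders.
    """
    score = diagnose(input_url)

    # Assumption: a score below the threshold means the audio quality
    # is too low, so the remaining stages are skipped.
    if score < score_threshold:
        return PipelineResult(score=score)

    enhanced_url = enhance(input_url)
    # Assumption: the enhanced media, not the original, is transcribed.
    transcript = transcribe(enhanced_url)
    return PipelineResult(score=score, enhanced_url=enhanced_url, transcript=transcript)
```

Passing the stages in as callables keeps the control flow testable without network access; stub functions can stand in for the real services during local development.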

Getting Started

Prerequisites

Params.json

  • "cog_api_key": Azure Cognitive Services Speech-to-Text API key.

  • "dolbyio_api_key": Dolby.io Media API key.

  • "azure_api_key": Azure storage account API key.

  • "AZURE_ACC_NAME": Azure storage account name.

  • "AZURE_CONTAINER": Azure storage container name.

  • "AZURE_BLOB_OUTPUT_SUFFIX": the suffix added to the name of the enhanced version of the input media.

  • "SCORE_THRESHOLD": the score threshold that decides whether the audio quality of the input media is too low.

  • "transcription_url": the Azure Cognitive Services input URL for transcription, which depends on the region.

  • "tunneling_url": the URL of your serverless HTTP trigger, or of your HTTP tunnel if you run locally.
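A small loader can catch a missing key before any function runs. The sketch below is illustrative only: `load_params` and `enhanced_blob_name` are hypothetical helpers, not part of the sample, and the rule of inserting the suffix before the file extension is an assumption about how AZURE_BLOB_OUTPUT_SUFFIX is applied.

```python
import json
from pathlib import Path, PurePosixPath

# Keys listed in the Prerequisites section.
REQUIRED_KEYS = (
    "cog_api_key",
    "dolbyio_api_key",
    "azure_api_key",
    "AZURE_ACC_NAME",
    "AZURE_CONTAINER",
    "AZURE_BLOB_OUTPUT_SUFFIX",
    "SCORE_THRESHOLD",
    "transcription_url",
    "tunneling_url",
)


def load_params(path="Params.json"):
    """Read the params file and fail early if any required key is missing."""
    params = json.loads(Path(path).read_text(encoding="utf-8"))
    missing = [key for key in REQUIRED_KEYS if key not in params]
    if missing:
        raise KeyError(f"Params file is missing keys: {', '.join(missing)}")
    return params


def enhanced_blob_name(blob_name, suffix):
    """Build the output blob name.

    Assumption: the suffix goes between the base name and the extension,
    for example "input.wav" with "_enhanced" becomes "input_enhanced.wav".
    """
    blob = PurePosixPath(blob_name)
    return str(blob.with_name(f"{blob.stem}{suffix}{blob.suffix}"))
```

Calling `load_params()` once at startup and passing the resulting dictionary to each function keeps the API keys out of the code itself.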

Steps

Local with VS Code (Recommended for development)

Server with VS Code (Recommended for deployment)

References