/audioinsight

AudioInsight is a web application that processes audio, generates transcriptions, and allows users to ask questions about the related audio.

Primary LanguageTypeScriptMIT LicenseMIT

Audio Insight Running Exampl

AudioInsight

AudioInsight is a full-stack application that processes audio, generates transcriptions, and allows users to ask questions about the related audio.

Its creation was motivated by participation in a dev.to challenge.

Table of Contents

How to Install

  1. Start by cloning this repository:
git clone git@github.com:gabrielsenadev/audioinsight.git
  1. Install dependencies:
npm ci
  1. Configure your environment

See Environment Variables.

  1. Run application
npm run dev

Environment Variables

This application depends on some providers to work with ai and database. It has been developed with minimal provider dependency. So, if you prefer a different provider, you can easily switch.

Cloudflare AI:

This application integrates with the Cloudflare AI ecosystem to utilize AI Models.

  • CLOUDFLARE_ACCOUNT_ID
  • CLOUDFLARE_API_TOKEN

Netlify Blobs:

For storing audio data, this application relies on Netlify Blobs. You will need a Netlify Site and Account.

  • NETLIFY_SITE_ID
  • NETLIFY_TOKEN

MongoDB:

MongoDB is used to store chats and chat messages.

  • MONGODB_URL
  • MONGODB_DATABASE

Technologies and Tools

  • Next.js: Frontend React Application and API Application
  • Cloudflare AI: Interaction with Cloudflare AI Models
  • MongoDB: Database to store chats and messages
  • Netlify Blobs: To store audio files
  • Zod: User input API validation
  • Tabler Icons: Application icons
  • Wavesurfer: Audio wave visualization player