Pinned Repositories
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
agents
Build real-time multimodal AI applications 🤖🎙️📹
audio-to-speech-pipeline
This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline
financial-demo
A demo for a financial services bot
flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
indic-punct
mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
xla
Enabling PyTorch on XLA Devices (e.g. Google TPU)
testcontainers-python
Testcontainers is a Python library that providing a friendly API to run Docker container. It is designed to create runtime environment to use during your automatic tests.
mehadi92's Repositories
mehadi92/agents
Build real-time multimodal AI applications 🤖🎙️📹
mehadi92/audio-to-speech-pipeline
This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline
mehadi92/financial-demo
A demo for a financial services bot
mehadi92/flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
mehadi92/indic-punct
mehadi92/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
mehadi92/nvidia-terraform-modules
Infrastructure as code for GPU accelerated managed Kubernetes clusters.
mehadi92/rasa-demo
:tiger: Sara - the Rasa Demo Bot: An example of a contextual AI assistant built with the open source Rasa Stack
mehadi92/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
mehadi92/wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.