AKSHILMY's Stars
getomni-ai/zerox
Zero-shot PDF OCR with gpt-4o-mini
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
SuyashMore/MevonAI-Speech-Emotion-Recognition
Identify the emotion of multiple speakers in an Audio Segment
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
Yazdi9/Talking_Face_Avatar
Avatar Generation For Characters and Game Assets Using Deep Fakes
google/generative-ai-docs
Documentation for Google's Gen AI site - including the Gemini API and Gemma
dawntcherian/Google-speech-to-text-python-websocket-server-using-microphone-stream
Python WebSocket server that converts an input audio stream from a microphone to text using Google Speech-to-Text
diego-vicente/som-tsp
Solving the Traveling Salesman Problem using Self-Organizing Maps
ruslanmv/Speech-to-Text-by-using-React
How to create a website in React that converts speech to text using Google Cloud Platform
aishwaryanr/awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
krishnadey30/LeetCode-Questions-CompanyWise
Contains Company Wise Questions sorted based on Frequency and all time
hacksider/Deep-Live-Cam
Real-time face swap and one-click video deepfake with only a single image
microsoft/autogen
A programming framework for agentic AI 🤖
Dhriti03/Noise-Reduction
Real-time noise cancellation for audio signals, for example removing construction noise by denoising the signal.
Zheng-Chong/CatVTON
CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters) and 3) Simplified Inference (< 8G VRAM for 1024x768 resolution).
casbin/pycasbin
An authorization library that supports access control models like ACL, RBAC, ABAC in Python
ashishps1/awesome-system-design-resources
Learn System Design concepts and prepare for interviews using free resources.
pytorch/torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
typesense/typesense
Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
HeyPuter/puter
🌐 The Internet OS! Free, Open-Source, and Self-Hostable.
AKSHILMY/tidb-vector-python
TiDB Vector SDK for Python, including code examples. Join our Discord: https://discord.gg/XzSW23Jg9p
pgvector/pgvector
Open-source vector similarity search for Postgres
aws-samples/rag-with-amazon-opensearch-and-sagemaker
Question Answering Generative AI application with Large Language Models (LLMs) and Amazon OpenSearch Service
magic-research/magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
KwaiVGI/LivePortrait
Bring portraits to life!
youssefHosni/Awesome-AI-Data-Guided-Projects
A curated list of data science & AI guided projects to start building your portfolio
patronus-ai/Lynx-hallucination-detection
SolarEdgeTech/pyctuator
Monitor Python applications using Spring Boot Admin
robustsam/RobustSAM
RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)
adam-maj/deep-learning
A deep-dive on the entire history of deep-learning