singharyan's Stars
toeverything/AFFiNE
There can be more than Notion and Miro. AFFiNE (pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable and ready to use.
RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
wandb/openui
OpenUI lets you describe UI using your imagination, then see it rendered live.
VikParuchuri/marker
Convert PDF to markdown quickly with high accuracy
ScrapeGraphAI/Scrapegraph-ai
Python scraper based on AI
ItzCrazyKns/Perplexica
Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI.
eosphoros-ai/DB-GPT
AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
jianchang512/pyvideotrans
Translate a video from one language to another and add dubbing. API calls are also supported.
WasmEdge/WasmEdge
WasmEdge is a lightweight, high-performance, and extensible WebAssembly runtime for cloud native, edge, and decentralized applications. It powers serverless apps, embedded functions, microservices, smart contracts, and IoT devices.
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
collabora/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
entropy-research/Devon
Devon: An open-source pair programmer
Camb-ai/MARS5-TTS
MARS5 speech model (TTS) from CAMB.AI
DigitalPhonetics/IMS-Toucan
Controllable and fast Text-to-Speech for over 7000 languages!
ictnlp/StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
pykeio/ort
Fast ML inference & training for Rust with ONNX Runtime
met4citizen/TalkingHead
Talking Head (3D): A JavaScript class for real-time lip-sync using Ready Player Me full-body 3D avatars.
airbadge-dev/airbadge
calvinckho/capacitor-jitsi-meet
This plugin is used to make video calls using Jitsi video platform (https://meet.jit.si) on iOS and Android using Ionic Capacitor.
caliwyr/Software
Software Tools 👻
SyedAhkam/spacy-wasm
spaCy on the web
drbh/html2svelte
✏️ Convert HTML to Svelte components in a snap
Rafaelmdcarneiro/fire_chat_rust
An LLM interface (chat bot) implemented in pure Rust using HuggingFace/Candle over Axum Websockets, an SQLite Database, and a Leptos (Wasm) frontend packaged with Tauri.
RafaelPortacio/InfraNodus-on-Streamlit
A Text Network generator from PDF, with visualization and classification. Based on the InfraNodus tool made by Dmitry Paranyushkin.
ostix360/optimized-LLM
An attempt to create the most optimized LLM architecture
promplate/pyth-on-line
Online Python IDE with built-in Copilot
Satellite-im/UplinkWeb
Alpha Web Frontend for Uplink