Kang812's Stars
instructkr/LogicKor
A multi-domain reasoning benchmark for Korean language models
roboflow/notebooks
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
Sense-X/Co-DETR
[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training
Beomi/KoAlpaca
KoAlpaca: An open-source language model that understands Korean instructions
ToTheBeginning/PuLID
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
feizc/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
lobehub/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge management / RAG), Multi-Modals (Vision/TTS), and a plugin system. One-click FREE deployment of your private ChatGPT/Claude application.
dimitribarbot/sd-webui-live-portrait
LivePortrait for AUTOMATIC1111 Stable Diffusion WebUI
Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
pytube/pytube
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
bukosabino/ta
Technical Analysis Library using Pandas and Numpy
microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
SakanaAI/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
stack-auth/stack
Open-source Auth0/Clerk alternative
J-Seo/KoCommonGEN-V2
KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models
huggingface/speech-to-speech
Speech To Speech: an effort toward an open-source, modular GPT-4o
Chainlit/chainlit
Build Conversational AI in minutes ⚡️
black-forest-labs/flux
Official inference repo for FLUX.1 models
LG-AI-EXAONE/EXAONE-3.0
Official repository for EXAONE built by LG AI Research
lm-sys/RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
mlabonne/llm-datasets
High-quality datasets, tools, and concepts for LLM fine-tuning.
pytorch/torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
StreamPot/StreamPot
Run FFmpeg as an API with fluent-ffmpeg compatibility, queues and S3 storage.
NUS-HPC-AI-Lab/SpeeD
SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
BennyKok/comfyui-deploy
An open-source, Vercel-like deployment platform for ComfyUI
Picsart-AI-Research/Text2Video-Zero
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
BAAI-DCAI/Bunny
A family of lightweight multimodal models.
syvaidya/openstego
OpenStego is a steganography application that provides two functions: a) Data Hiding: it can hide any data within an image file. b) Watermarking: it can watermark image files with an invisible signature, which can be used to detect unauthorized file copying.