SUNGBEOMCHOI's Stars
abi/screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
tldraw/tldraw
whiteboard / infinite canvas SDK
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
meta-llama/llama3
The official Meta Llama 3 GitHub site
RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
crewAIInc/crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
VikParuchuri/marker
Convert PDF to markdown + JSON quickly with high accuracy
Anjok07/ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
myshell-ai/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.
Layout-Parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
camelot-dev/camelot
A Python library to extract tabular data from PDFs
mckinsey/vizro
Vizro is a low-code toolkit for building high-quality data visualization apps.
AI-Citizen/SolidGPT
Developer AI Persona Search Agent
AkariAsai/self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
Anil-matcha/Awesome-GPT-Store
Custom GPT Store - A collection of major publicly available GPTs
lukasschwab/arxiv.py
Python wrapper for the arXiv API
BobLd/DocumentLayoutAnalysis
Document Layout Analysis resources and repos for development with PdfPig.
nomadkaraoke/python-audio-separator
Easy-to-use stem (e.g. instrumental/vocals) separation from the CLI or as a Python package, using a variety of amazing pre-trained models (primarily from UVR)
yuwvandy/KG-LLM-MDQA
BM-K/Sentence-Embedding-Is-All-You-Need
Korean Sentence Embedding Repository
vis-nlp/ChartQA
mitmedialab/vizml
Plotly dataset-visualization pairs, feature extraction scripts, and model training code for VizML (CHI 2019)
LynnHaDo/Document-Layout-Analysis
Object Detection Model for Scanned Documents
Atipico1/Kor-IR
Kor-IR: Korean Information Retrieval Benchmark
jfma-USTC/HRDoc
Dataset and scripts for HRDoc
j-rausch/DSG
SUNGBEOMCHOI/Korean-Streaming-ASR
Korean Streaming ASR (with Denoiser and Conformer CTC)