BroadJJ's Stars
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
seaweedfs/seaweedfs
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.
openai/openai-python
The official Python library for the OpenAI API
Delgan/loguru
Python logging made (stupidly) simple
huggingface/text-generation-inference
Large Language Model Text Generation Inference
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
bentoml/BentoML
The easiest way to serve AI apps and models - Build reliable Inference APIs, LLM apps, Multi-model chains, RAG service, and much more!
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
mercari/ml-system-design-pattern
System design patterns for machine learning
google-research/big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
ankush-me/SynthText
Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.
lichao-sun/Mora
Mora: More like Sora for Generalist Video Generation
glotlabs/gdrive
Google Drive CLI Client
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
zsyOAOA/ResShift
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS@2023 Spotlight, TPAMI@2024)
songys/AwesomeKorean_Data
한국어 데이터 세트 링크
nlpai-lab/KULLM
☁️ 구름(KULLM): 고려대학교에서 개발한, 한국어에 특화된 LLM
lichao-sun/SoraReview
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".
csslc/CCSR
Official codes of CCSR: Improving the Stability of Diffusion Models for Content Consistent Super-Resolution
fh2019ustc/DocTr-Plus
The official code for “Deep Unrestricted Document Image Rectification”, TMM, 2023.
Ree1s/IDM
VamosC/CLIP4STR
An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".
furkanbiten/idl_data
OCR Annotations from Amazon Textract for Industry Documents Library
project-deepform/deepform
Experimental form data extraction for journalism
jkc-ai/mwp_kr_data