Pinned Repositories
DeepSeek-VL2
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
FudanOCR
A toolbox of scene text super-resolution and recognition
optimum
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools
export_llama_to_onnx
export llama to onnx
Macaw-LLM
Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration
HUST-OBC
Oracle Bone Script data collected by VLRLab of HUST
JSRAN
Puzzle-Pieces-Picker
Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction
test
test code uploud
VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
Pengjie-W's Repositories
Pengjie-W/HUST-OBC
Oracle Bone Script data collected by VLRLab of HUST
Pengjie-W/Puzzle-Pieces-Picker
Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction
Pengjie-W/JSRAN
Pengjie-W/test
test code uploud