Pinned Repositories
alipay
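Alipay payments handled with a single PHP file, covering desktop website payment, mobile website payment, cash red envelopes, consumption red envelopes, QR code payment, JSAPI payment, single transfers to an Alipay account, transaction settlement (account splitting, profit sharing), web authorization to obtain user information, and more.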
AnyGPT
Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"
basic_shapes_object_detection_model
Model that is fine-tuned to detect some basic geometric shapes
Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
clip
A modified CLIP that supports Chinese and an 8K input length
clip-multimodal-ml
CLIP-UNet
Official implementation of CLIP-UNet in PyTorch.
ConvFormer
csdn
Storage for CSDN blog articles and code; public
hash
One-way (irreversible) hashing with Bcrypt and Argon
gg22mm's Repositories
gg22mm/AnyGPT
Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"
gg22mm/basic_shapes_object_detection_model
Model that is fine-tuned to detect some basic geometric shapes
gg22mm/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
gg22mm/clip
A modified CLIP that supports Chinese and an 8K input length
gg22mm/clip-multimodal-ml
gg22mm/CLIP-UNet
Official implementation of CLIP-UNet in PyTorch.
gg22mm/csdn
Storage for CSDN blog articles and code; public
gg22mm/Fengshenbang-LM
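Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at the IDEA Research Institute, serving as infrastructure for Chinese AIGC and cognitive intelligence.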
gg22mm/fetch_sse_post
Streaming with fetch: native fetch + SSE + POST
gg22mm/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
gg22mm/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
gg22mm/LLM-Dojo
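Welcome to LLM-Dojo, an open-source place to learn about large models (the best learning always happens in projects). It includes an open-source large-model training framework and the llm_tricks module, which contains implementations of various large-model tricks along with explanations of the principles behind them! 👩🎓👨🎓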
gg22mm/Long-CLIP
[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
gg22mm/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large models, implementing incremental pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, and ORPO.
gg22mm/metahuman-stream
Real time interactive streaming digital human
gg22mm/nlp-in-action-public
Natural language processing projects in action.
gg22mm/ONE-PEACE
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
gg22mm/Pix2SeqV2-Pytorch
Simple implementation of Pix2seqV2 (multi-task)
gg22mm/python_rtmpstream
A Python library for pushing real-time RTMP audio/video streams
gg22mm/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
gg22mm/Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
gg22mm/Real-ESRGAN_-
PyTorch implementation of Real-ESRGAN model
gg22mm/rembg
Rembg is a tool to remove image backgrounds
gg22mm/RT-ODLab
YOLO Tutorial
gg22mm/stable-diffusion
Speechless at the original stable-diffusion
gg22mm/TransUNet
This repository includes the official project of TransUNet, presented in our paper: TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation.
gg22mm/yolo-v1
PyTorch implementation of the YOLOv1 architecture presented in "You Only Look Once: Unified, Real-Time Object Detection" by Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi
gg22mm/yolo5_test_v1
gg22mm/yolo6
gg22mm/yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite