getfox's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
TencentARC/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
karpathy/llm.c
LLM training in simple, raw C/CUDA
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
openai/universe
Universe: a software platform for measuring and training an AI's general intelligence across the world's supply of games, websites and other applications.
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
THUDM/CogVLM
A state-of-the-art open visual language model | Multimodal pre-trained model
MineDojo/Voyager
An Open-Ended Embodied Agent with Large Language Models
THUDM/VisualGLM-6B
Chinese and English multimodal conversational language model
openai/transformer-debugger
promptslab/Awesome-Prompt-Engineering
This repository contains hand-curated resources for Prompt Engineering, with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM, etc.
openai/mujoco-py
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
X-PLUG/MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
lyuwenyu/RT-DETR
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
TencentARC/MotionCtrl
Official Code for MotionCtrl [SIGGRAPH 2024]
sudharsan13296/Hands-On-Meta-Learning-With-Python
Learning to Learn using One-Shot Learning, MAML, Reptile, Meta-SGD and more with TensorFlow
MetaGLM/glm-cookbook
Examples and guides for using the GLM APIs
SunzeY/AlphaCLIP
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
TencentARC/LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.
Improbable-AI/VisionProTeleop
VisionOS App + Python Library to stream head / wrist / finger tracking data from Vision Pro to any robots.
Meituan-AutoML/VisionLLaMA
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks
airockchip/rknn-llm
Tongjilibo/build_MiniLLM_from_scratch
Build a MiniLLM from scratch (pretrain + SFT + DPO, work in progress)
jennyzzt/awesome-open-ended
Awesome Open-ended AI
daniel89710/lightNet-TRT
LightNet-TRT is a high-efficiency and real-time implementation of convolutional neural networks (CNNs) using Edge AI.
JiaDingCN/GeminiFusion
ZainZh/sunriseX3_yolov8
Deployment tool for Horizon development boards