getfox's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
TencentARC/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
openai/gym
A toolkit for developing and comparing reinforcement learning algorithms.
Unity-Technologies/ml-agents
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
google-deepmind/mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
ltdrdata/ComfyUI-Manager
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, this extension provides a hub feature and convenience functions to access a wide range of information within ComfyUI.
MineDojo/Voyager
An Open-Ended Embodied Agent with Large Language Models
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
openai/transformer-debugger
promptslab/Awesome-Prompt-Engineering
This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc
X-PLUG/MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
openai/mujoco-py
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
lyuwenyu/RT-DETR
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
mit-han-lab/efficientvit
Efficient vision foundation models for high-resolution generation and perception.
google-deepmind/open_x_embodiment
vimalabs/VIMA
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
Jumpat/SegAnyGAussians
The official implementation of SAGA (Segment Any 3D GAussians)
TencentARC/LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.
Improbable-AI/VisionProTeleop
VisionOS App + Python Library to stream head / wrist / finger tracking data from Vision Pro to any robots.
Meituan-AutoML/VisionLLaMA
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks
jennyzzt/awesome-open-ended
Awesome Open-ended AI
daniel89710/lightNet-TRT
LightNet-TRT is a high-efficiency and real-time implementation of convolutional neural networks (CNNs) using Edge AI.
adeeb10abbas/ros2-docker-dev
Run ROS1/2 with GUI support without hassle!
FeiGeChuanShu/ncnn-android-depth_anything
a Android demo of depth_anything_v1 and depth_anything_v2
measure-infinity/mulan-code
liuliu/swift-mujoco
Swift Binding for MuJoCo: https://mujoco.org/