getfox

getfox's Stars

openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language: Python 70.9k 576 08.4k
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Language: Python 55.9k 403 3.6k 5.9k
TencentARC/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Language: Python 35.9k 507 4755.9k
openai/gym
A toolkit for developing and comparing reinforcement learning algorithms.
Language: Python 34.8k 1.1k 1.8k 8.6k
Unity-Technologies/ml-agents
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
Language: C# 17.2k 555 2.9k 4.2k
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
12.5k 271 117802
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
Language: Jupyter Notebook 9.5k 102 161765
google-deepmind/mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
Language: Jupyter Notebook 8.2k 106 1.5k 817
ltdrdata/ComfyUI-Manager
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, this extension provides a hub feature and convenience functions to access a wide range of information within ComfyUI.
Language: JavaScript 6.8k 39 524879
MineDojo/Voyager
An Open-Ended Embodied Agent with Large Language Models
Language: JavaScript 5.6k 64 152534
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Language: Python 4.6k 38 452447
openai/transformer-debugger
Language: Python 4k 25 14236
promptslab/Awesome-Prompt-Engineering
This repository contains hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM, etc.
Language: Python 3.9k 72 0352
X-PLUG/MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Language: Python 3k 49 58274
openai/mujoco-py
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
Language: Cython 2.9k 197 645814
lyuwenyu/RT-DETR
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
Language: Python 2.6k 25 440297
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Language: Python 2.3k 41 634306
mit-han-lab/efficientvit
Efficient vision foundation models for high-resolution generation and perception.
Language: Python 2.3k 38 138185
google-deepmind/open_x_embodiment
Language: Jupyter Notebook 843 18 7559
vimalabs/VIMA
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
Language: Python 769 17 5286
Jumpat/SegAnyGAussians
The official implementation of SAGA (Segment Any 3D GAussians)
Language: Jupyter Notebook 588 11 11942
TencentARC/LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.
Language: Python 479 20 3235
Improbable-AI/VisionProTeleop
VisionOS App + Python Library to stream head / wrist / finger tracking data from Vision Pro to any robots.
Language: Swift 399 12 1027
Meituan-AutoML/VisionLLaMA
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks
Language: Python 364 23 610
jennyzzt/awesome-open-ended
Awesome Open-ended AI
179 11 019
daniel89710/lightNet-TRT
LightNet-TRT is a high-efficiency and real-time implementation of convolutional neural networks (CNNs) using Edge AI.
Language: C++ 73 5 011
adeeb10abbas/ros2-docker-dev
Run ROS1/2 with GUI support without hassle!
Language: Dockerfile 59 4 28
FeiGeChuanShu/ncnn-android-depth_anything
An Android demo of depth_anything_v1 and depth_anything_v2
Language: C++ 52 2 34
measure-infinity/mulan-code
Language: Python 38 1 20
liuliu/swift-mujoco
Swift Binding for MuJoCo: https://mujoco.org/
Language: Swift 18 2 41