Prof-pengyin
Assistant Professor, City University of Hong Kong. Coding and designing for general artificial intelligence.
City University of Hong Kong, Hong Kong, China
Prof-pengyin's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
facebookresearch/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
bmaltais/kohya_ss
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
google-deepmind/mujoco
Multi-Joint dynamics with Contact. A general-purpose physics simulator.
py-why/dowhy
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
andrewyng/translation-agent
agiresearch/AIOS
AIOS: LLM Agent Operating System
facebookresearch/jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
agiresearch/OpenAGI
OpenAGI: When LLM Meets Domain Experts
BAAI-Agents/Cradle
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle enables agents to master any computer task by supporting strong reasoning abilities, self-improvement, and skill curation in a standardized general environment with minimal requirements.
google-deepmind/mujoco_menagerie
A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.
OSU-NLP-Group/HippoRAG
HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personalized PageRank.
openvla/openvla
OpenVLA: An open-source vision-language-action model for robotic manipulation.
google-deepmind/mujoco_mpc
Real-time behaviour synthesis with MuJoCo, using Predictive Control
google-deepmind/open_x_embodiment
CYHSM/awesome-neuro-ai-papers
Papers from the intersection of deep learning and neuroscience
Improbable-AI/VisionProTeleop
VisionOS app + Python library to stream head, wrist, and finger tracking data from Vision Pro to any robot.
1x-technologies/1xgpt
World modeling challenge for humanoid robots
dingo-actual/infini-transformer
PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)
danijar/daydreamer
DayDreamer: World Models for Physical Robot Learning
Srameo/LE3D
HDR 3D Scene Editing!
Timothyxxx/WorldModelPapers
A collection of papers on the continuing line of work that began with World Models.
kvablack/susie
Code for subgoal synthesis via image editing
kodie-artner/AR-RViz
Unity Project for visualization and control of ROS systems in augmented reality
guiglass/Mocap_Fusion_Gloves
rail-berkeley/soar
Code release for paper "Autonomous Improvement of Instruction Following Skills via Foundation Models" | CoRL 2024
johnrso/spawnnet
SpawnNet: Learning Generalizable Visuomotor Skills from Pre-trained Networks
NUS-LinS-Lab/ManiFM
Official Site for ManiFoundation Model