Pinned Repositories
audio_publisher
The **Audio Publisher Node** is a ROS (Robot Operating System) node designed to capture audio data from a microphone, publish it as a ROS topic, and optionally save it to a WAV file. It also supports playing the received audio in real time.
Bert-VITS2
vits2 backbone with multilingual-bert
DiffusionPolicy-Robotics
Awesome collection of resources and papers on Diffusion Models for Robotic Manipulation.
Grounded-SAM-2
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
stream_vits_zh
本项目基于 vits_chinese,提供一个支持流式输出的中文 VITS 模型接口。
whisper_ros1
This repository contains a ROS1 node for real-time speech-to-text transcription using the **Whisper** model. The node subscribes to an audio stream and publishes the transcribed text to a specified topic.
X3Plus-Embodied
2023 Computer Science Project: An Embodied Intelligence Path Planning System Based on YahboomCar X3 Plus, open-sourcing the practical achievements of the freshman team.
Grounded-SAM-2
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
awesome-embodied-vla-va-vln
A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.
control_your_robot
this project provide a verity of code help you collect data from your robotic arm, have fun!
EmbodiedMind's Repositories
EmbodiedMind/DiffusionPolicy-Robotics
Awesome collection of resources and papers on Diffusion Models for Robotic Manipulation.
EmbodiedMind/stream_vits_zh
本项目基于 vits_chinese,提供一个支持流式输出的中文 VITS 模型接口。
EmbodiedMind/whisper_ros1
This repository contains a ROS1 node for real-time speech-to-text transcription using the **Whisper** model. The node subscribes to an audio stream and publishes the transcribed text to a specified topic.
EmbodiedMind/X3Plus-Embodied
2023 Computer Science Project: An Embodied Intelligence Path Planning System Based on YahboomCar X3 Plus, open-sourcing the practical achievements of the freshman team.
EmbodiedMind/audio_publisher
The **Audio Publisher Node** is a ROS (Robot Operating System) node designed to capture audio data from a microphone, publish it as a ROS topic, and optionally save it to a WAV file. It also supports playing the received audio in real time.
EmbodiedMind/Bert-VITS2
vits2 backbone with multilingual-bert
EmbodiedMind/Grounded-SAM-2
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2