JackalWu2019's Stars
itsnamgyu/block-transformer
Block Transformer: Global-to-Local Language Modeling for Fast Inference (Official Code)
ridgerchu/matmulfreellm
Implementation for MatMul-free LM.
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
openvla/openvla
OpenVLA: An open-source vision-language-action model for robotic manipulation.
MarkFzp/humanplus
[CoRL 2024] HumanPlus: Humanoid Shadowing and Imitation from Humans
siddhanthaldar/BAKU
Code for BAKU: An Efficient Transformer for Multi-Task Policy Learning
roboflow/supervision
We write your reusable computer vision tools. 💜
sanderwood/bgpt
Beyond Language Models: Byte Models are Digital World Simulators
WooooDyy/AgentGym
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
robocasa/robocasa
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
siddrrsh/ambientGPT
roboterax/humanoid-gym
Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real Transfer https://arxiv.org/abs/2404.05695
Jingkang50/PSG4D
4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)
tyz1030/darkgs
[IROS '24 Oral] DarkGS: Building 3DGS in the dark with a torch.
phidatahq/phidata
Build AI Assistants with memory, knowledge and tools.
advaitpaliwal/insight
dora-rs/dora
DORA (Dataflow-Oriented Robotic Architecture) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low latency, composable, and distributed dataflow capabilities. Applications are modeled as directed graphs, also referred to as pipelines.
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
BasedHardware/omi
AI wearables
markokraemer/OACE
xai-org/grok-1
Grok open release
deepgram-devs/deepgram-ai-agent-demo
Deepgram Conversational AI demo
bachittle/open-voice-pilot
Open-source AI for voice control, rivaling Alexa and Siri
Improbable-AI/VisionProTeleop
VisionOS App + Python Library to stream head / wrist / finger tracking data from Vision Pro to any robots.
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
ChatGPTNextWeb/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)