JackalWu2019

JackalWu2019's Stars

itsnamgyu/block-transformer
Block Transformer: Global-to-Local Language Modeling for Fast Inference (Official Code)
Language:Python1308
ridgerchu/matmulfreellm
Implementation for MatMul-free LM.
Language:Python2.9k179
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
Language:Python1.8k108
openvla/openvla
OpenVLA: An open-source vision-language-action model for robotic manipulation.
Language:Python1.1k139
MarkFzp/humanplus
[CoRL 2024] HumanPlus: Humanoid Shadowing and Imitation from Humans
Language:Python53588
siddhanthaldar/BAKU
Code for BAKU: An Efficient Transformer for Multi-Task Policy Learning
Language:Python694
roboflow/supervision
We write your reusable computer vision tools. 💜
Language:Python22.9k1.7k
sanderwood/bgpt
Beyond Language Models: Byte Models are Digital World Simulators
Language:Python30820
WooooDyy/AgentGym
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
Language:Python31938
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Language:Python6.7k594
robocasa/robocasa
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
Language:Python52338
siddrrsh/ambientGPT
Language:TypeScript27621
roboterax/humanoid-gym
Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real Transfer https://arxiv.org/abs/2404.05695
Language:Python706117
Jingkang50/PSG4D
4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)
Language:Python851
tyz1030/darkgs
[IROS '24 Oral] DarkGS: Building 3DGS in the dark with a torch.
Language:Python765
phidatahq/phidata
Build AI Assistants with memory, knowledge and tools.
Language:Python11.2k1.7k
advaitpaliwal/insight
Language:Python31556
dora-rs/dora
DORA (Dataflow-Oriented Robotic Architecture) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low latency, composable, and distributed dataflow capabilities. Applications are modeled as directed graphs, also referred to as pipelines.
Language:Rust1.5k79
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Language:Jupyter Notebook7.5k740
BasedHardware/omi
AI wearables
Language:C3.5k411
markokraemer/OACE
221
xai-org/grok-1
Grok open release
Language:Python49.5k8.3k
deepgram-devs/deepgram-ai-agent-demo
Deepgram Conversational AI demo
Language:TypeScript33193
bachittle/open-voice-pilot
Open-source AI for voice control, rivaling Alexa and Siri
Language:Python12
Improbable-AI/VisionProTeleop
VisionOS App + Python Library to stream head / wrist / finger tracking data from Vision Pro to any robots.
Language:Swift36622
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
7.4k901
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS
Language:Python3.8k650
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python19.6k2.2k
ChatGPTNextWeb/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。
Language:TypeScript75.4k58.9k
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python33.5k3.9k