datamonday's Stars
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
ItzCrazyKns/Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
karpathy/arxiv-sanity-preserver
Web interface for browsing, search and filtering recent arxiv submissions
CrazyBoyM/llama3-Chinese-chat
Llama3、Llama3.1 中文仓库(随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档)
xlang-ai/OpenAgents
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
datawhalechina/tiny-universe
《大模型白盒子构建指南》:一个全手搓的Tiny-Universe
km1994/LLMs_interview_notes
该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题
jackaduma/awesome_LLMs_interview_notes
LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案
haosulab/ManiSkill
SAPIEN Manipulation Skill Framework, a open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.
robocasa/robocasa
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
Auromix/ROS-LLM
ROS-LLM is a framework designed for embodied intelligence applications in ROS. It allows natural language interactions and leverages Large Language Models (LLMs) for decision-making and robot control. With an easy configuration process, this framework allows for swift integration, enabling your robot to operate with it in as little as ten minutes.
UMass-Foundation-Model/3D-VLA
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
mira-space/MiraData
Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"
real-stanford/scalingup
[CoRL 2023] This repository contains data generation and training code for Scaling Up & Distilling Down
zjukg/KG-MM-Survey
Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey
EmbodiedGPT/EmbodiedGPT_Pytorch
seanzhang-zhichen/llama3-chinese
Llama3-Chinese是以Meta-Llama-3-8B为底座,使用 DORA + LORA+ 的训练方法,在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。
google-research/language-table
Suite of human-collected datasets and a multi-task continuous control benchmark for open vocabulary visuolinguomotor learning.
OpenAgentsInc/openagents
Project info & docs
BAAI-DCAI/SpatialBot
The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.
Gary3410/TaPA
[arXiv 2023] Embodied Task Planning with Large Language Models
UMass-Foundation-Model/MultiPLY
Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World
normal-computing/branches
Prototype advanced LLM algorithms for reasoning and planning.
redis-developer/agentic-rag
Complete example of how to build an Agentic RAG architecture with Redis, AWS Bedrock, and LlamaIndex.
beccabai/Data-centric_multimodal_LLM
Survey on Data-centric Large Language Models
mozilla-ai/lm-buddy
Your buddy in the (L)LM space.
UMass-Foundation-Model/HAZARD
HAZARD challenge
zhangfaen/finetune-InternVL2
ioai-tech/data_example
Load and visualize io-data with python scripts.
eundersander/hab_vr_mephisto
A simple Habitat VR app and proof-of-concept for Mephisto integration