thkkk
tanh(k)/Hengkai Tan, Ph.D Student @ TSAIL, THU-CST. Email: bernard.hengk.tan [at] gmail [dot] com
Tsinghua UniversityBeijing
thkkk's Stars
microsoft/mup
maximal update parametrization (µP)
Jamie-Stirling/RetNet
An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"
syncdoth/RetNet
Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent, and chunkwise forward.
lucidrains/q-transformer
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind
kyegomez/RT-2
Democratization of RT-2 "RT-2: New model translates vision and language into action"
OpenInterpreter/open-interpreter
A natural language interface for computers
OpenBMB/ChatDev
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
jeinlee1991/chinese-llm-benchmark
中文大模型能力评测榜单:目前已囊括139个大模型,覆盖chatgpt、gpt-4o、谷歌gemini、Claude3.5、百度文心一言、千问、百川、讯飞星火、商汤senseChat、minimax等商用模型, 以及deepseek-v3、qwen2.5、llama3.1、glm4、书生internLM2.5等开源大模型。不仅提供能力评分排行榜,也提供所有模型的原始输出结果!
shroominic/codeinterpreter-api
👾 Open source implementation of the ChatGPT Code Interpreter
haseeb-heaven/langchain-coder
Web Application that can generate code and fix bugs and run using various LLM's (GPT,Gemini,PALM)
Gepetto/example-robot-data
Set of robot URDFs for benchmarking and developed examples.
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
GT-RIPL/Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
THUDM/CodeGeeX2
CodeGeeX2: A More Powerful Multilingual Code Generation Model
bytedance/lynx-llm
paper: https://arxiv.org/abs/2307.02469 page: https://lynx-llm.github.io/
microsoft/torchscale
Foundation Architecture for (M)LLMs
e2b-dev/awesome-ai-agents
A list of AI autonomous agents
Farama-Foundation/Minari
A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities
deep-floyd/IF
kinghuin/AIGC-progress
Follow the rapid development of AIGC models and applications. | 跟上AIGC模型和应用快速发展的步伐 🚀
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
vimalabs/VIMA
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
ahmetbersoz/chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
microsoft/PromptCraft-Robotics
Community for applying LLMs to robotics and a robot simulator with ChatGPT integration
google-deepmind/mujoco_mpc
Real-time behaviour synthesis with MuJoCo, using Predictive Control
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
joerick/pyinstrument
🚴 Call stack profiler for Python. Shows you why your code is slow!
Kaixhin/PlaNet
Deep Planning Network: Control from pixels by latent planning with learned dynamics