Pinned Repositories
-EANet
External Attention Network
BigData
big data books and papers
OpenPose
A human pose estimation implementation; the website is www.openpose.org
PonyDebugger
Remote network and data debugging for your native iOS app using Chrome Developer Tools
womenguang
oztc's Repositories
oztc/act-plus-plus
Imitation Learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
oztc/Awesome-LLM-Reasoning
Collection of papers and resources on Reasoning in Large Language Models (LLMs), including Chain-of-Thought, Instruction-Tuning, Multimodality.
oztc/Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
oztc/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
oztc/Awesome-Open-Vocabulary
A Survey on Open Vocabulary Learning
oztc/Awesome-Reasoning-Foundation-Models
✨✨ Latest Papers and Benchmarks in Reasoning with Foundation Models
oztc/Awesome-Robotics-Foundation-Models
oztc/ColossalAI
Making large AI models cheaper, faster and more accessible
oztc/daam
Diffusion attentive attribution maps for interpreting Stable Diffusion.
oztc/dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
oztc/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
oztc/E2B
Cloud Runtime for AI Agents
oztc/Efficient-AI-Backbones
Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.
oztc/GiT
Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"
oztc/grok-1
Grok open release
oztc/HALOs
A library with extensible implementations of DPO, KTO, PPO, and other human-centered loss functions (HALOs).
oztc/LayerDiffusion
Transparent Image Layer Diffusion using Latent Transparency
oztc/lit-llam
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
oztc/llama3
The official Meta Llama 3 GitHub site
oztc/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
oztc/LWM
oztc/mamba
Mamba SSM architecture
oztc/Megatron-LM
Ongoing research training transformer models at scale
oztc/ml-aim
This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models
oztc/MOSS-RLHF
MOSS-RLHF
oztc/octo
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
oztc/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
oztc/oztc.github.io
Oz T. Chang's website
oztc/Theseus
Theseus is a modern OS written from scratch in Rust that explores intralingual design: closing the semantic gap between compiler and hardware by maximally leveraging the power of language safety and affine types. Theseus aims to shift OS responsibilities like resource management into the compiler.
oztc/visualnav-transformer
Official code and checkpoint release for "ViNT: A Foundation Model for Visual Navigation".