SuperBruceJia
A Ph.D. Student at @vkola-lab, Boston University. Passionate about Large Language Models (LLMs), Large Multimodal Models (LMMs), and Machine Learning.
Boston UniversityBoston, MA
SuperBruceJia's Stars
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
PaddlePaddle/Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
argilla-io/argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
XiaoxinHe/Awesome-Graph-LLM
A collection of AWESOME things about Graph-Related LLMs.
tensorflow/mesh
Mesh TensorFlow: Model Parallelism Made Easier
laekov/fastmoe
A fast MoE impl for PyTorch
databricks/megablocks
deepseek-ai/DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
HITsz-TMG/UMOE-Scaling-Unified-Multimodal-LLMs
The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"
microsoft/Tutel
Tutel MoE: An Optimized Mixture-of-Experts Implementation
HKUDS/GraphGPT
[SIGIR'2024] "GraphGPT: Graph Instruction Tuning for Large Language Models"
huggingface/huggingface-llama-recipes
allenai/OLMoE
OLMoE: Open Mixture-of-Experts Language Models
ChenLiu-1996/CitationMap
A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.
lmthang/thesis
Thang Luong's thesis on Neural Machine Translation
PKU-DAIR/Hetu
A high-performance distributed deep learning system targeting large-scale and automated distributed training.
for-ai/parameter-efficient-moe
MedicineToken/Medical-Graph-RAG
Medical Graph RAG: Graph RAG for the Medical Data
shawntan/scattermoe
Triton-based implementation of Sparse Mixture of Experts.
mit-han-lab/vila-u
VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
ageitgey/all-podcasts-dataset
A free dataset of (almost) all publicly available podcasts.
learn-anything/podcasts
Awesome Podcasts
Awenbocc/med-vqa
Medical Visual Question Answering via Conditional Reasoning [ACM MM 2020]
batmanlab/MedSyn
Repo for MedSyn: Text-guided Anatomy-aware Synthesis of High-Fidelity 3D CT Images
LiyaoTang/ERDA
All Points Matter: Entropy-Regularized Distribution Alignment for Weakly-supervised 3D Segmentation (NeurIPS 2023)