shixiangsong
I am an undergraduate student majoring in computer science at SJTU.
Shanghai Jiao Tong University, Shanghai
shixiangsong's Stars
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
OpenInterpreter/open-interpreter
A natural language interface for computers
pytorch/torchtitan
A native PyTorch Library for large model training
openai/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
lucidrains/pause-transformer
Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount of time on any token
nomic-ai/contrastors
Train Models Contrastively in Pytorch
sjtug/SJTUBeamer
Beamer template for Shanghai Jiao Tong University
JadyXuan/NTTS
NO TIME TO SLEEP
andyzoujm/representation-engineering
Representation Engineering: A Top-Down Approach to AI Transparency
shiyemin/light-hf-proxy
A light proxy solution for HuggingFace hub.
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
Xuekai-Zhu/key-configuration-of-llms
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
xai-org/grok-1
Grok open release
ShirasawaSama/CefDetectorX
[Upgraded Electron version] Check how many CEFs are on your computer.
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
hkust-nlp/Activation_Decoding
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
EleutherAI/semantic-memorization
wenyan-lang/wenyan
A programming language in Classical Chinese.
RUCAIBox/HaluEval
This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Learner209/artistic-fusion
Artistic Fusion: Revolutionizing Mural Style Transfer with Combined GAN and Diffusion Model Techniques
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
openai/CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
WilliamX1/ChCore
2022 ChCore Lab
codefuse-ai/Awesome-Code-LLM
[TMLR] A curated list of language modeling research for code and related datasets.
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
SJTU-IPADS/OS-Course-Lab
This repository contains the series of operating systems course labs designed by the IPADS lab at Shanghai Jiao Tong University.
Simonwzm/CS3601-ChCore-Lab
ChCore Lab for CS3601, SJTU