hsaest's Stars
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods, covering single- and multi-node GPU setups. Supports default & custom datasets for applications such as summarization and Q&A, and a number of inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps showcasing Meta Llama for WhatsApp & Messenger.
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
jessevig/bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
pytorch/captum
Model interpretability and understanding for PyTorch
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by the Qwen team at Alibaba Cloud.
datawhalechina/learn-nlp-with-transformers
A repository illustrating the usage of Transformers, in Chinese.
lucidrains/gigagan-pytorch
Implementation of GigaGAN, the new SOTA GAN from Adobe, the culmination of nearly a decade of research into GANs.
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
MrYxJ/calculate-flops.pytorch
calflops is designed to calculate FLOPs, MACs, and parameter counts for a wide range of neural networks, such as Linear, CNN, RNN, GCN, and Transformer models (BERT, LLaMA, and other large language models).
QwenLM/Qwen2-Math
A series of math-specific large language models built on Qwen2.
LMD0311/Awesome-World-Model
A collection of papers on world models for autonomous driving.
ChenLiu-1996/CitationMap
A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.
enoche/MMRec
A toolbox for multimodal recommendation, integrating 10+ models.
google-research/self-organising-systems
westlake-repl/Recommendation-Systems-without-Explicit-ID-Features-A-Literature-Review
Paper List of Pre-trained Foundation Recommender Models
lupantech/MathVista
MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts
mathllm/MathCoder
Family of LLMs for mathematical reasoning.
OSU-NLP-Group/GrokkedTransformer
Code for the paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
JFan1997/Awesome_PhD_Opportunities
This repository is used for advertising PhD recruitment opportunities. Contributions are welcome!
Ber666/RAP
Reasoning with Language Model is Planning with World Model
ZrrSkywalker/MathVerse
[ECCV 2024] Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
kohjingyu/search-agents
Code for the paper 🌳 Tree Search for Language Model Agents
xufangzhi/ENVISIONS
A Neural-Symbolic Self-Training Framework
THUDM/VisualAgentBench
Towards Large Multimodal Models as Visual Foundation Agents
siyuyuan/evoagent
Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"
HZQ950419/Math-LLaVA
Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models
PathMMU-Benchmark/PathMMU