YujieLu10
Research intern @AIatMeta FAIR | CS PhD @UCSB NLP | ex-intern at Microsoft Research, Amazon AWS AI | Multimodal Learning | Looking for a full-time position
UC Santa Barbara
Santa Barbara
YujieLu10's Stars
Stability-AI/generative-models
Generative Models by Stability AI
ScrapeGraphAI/Scrapegraph-ai
Python scraper based on AI
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
facebookresearch/metaseq
Repo for external large-scale work
jbhuang0604/awesome-tips
aras-p/UnityGaussianSplatting
Toy Gaussian Splatting visualization in Unity
microsoft/GLIP
Grounded Language-Image Pre-training
deepseek-ai/DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
CStanKonrad/long_llama
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.
lucidrains/flamingo-pytorch
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch
Xnhyacinth/Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
ydyjya/Awesome-LLM-Safety
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the safety implications, challenges, and advancements surrounding these powerful models.
pydot/pydot
Python interface to Graphviz's Dot language
deepseek-ai/DeepSeek-Math
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
BradyFU/Video-MME
✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
jjihwan/FIFO-Diffusion_public
Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024)
OpenGVLab/video-mamba-suite
The suite of modeling video with Mamba
allenai/WildBench
Benchmarking LLMs with Challenging Tasks from Real Users
TIGER-AI-Lab/ImagenHub
A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)
rosewang2008/language_modeling_via_stochastic_processes
Language modeling via stochastic processes. Oral @ ICLR 2022.
alexa/teach
TEACh is a dataset of human-human interactive dialogues to complete tasks in a simulated household environment.
YujieLu10/LLMScore
LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation
YujieLu10/TIP
Multimodal-Procedural-Planning
VegB/iNLG
Implementation of "Visualize Before You Write: Imagination-Guided Open-Ended Text Generation".
WildVision-AI/LMM-Engines
WildVision-AI/WildVision-Bench
YujieLu10/CLAP
VIM-Bench/VIM_TOOL
WildVision-AI/WildVision-Arena
https://huggingface.co/spaces/WildVision/vision-arena
WildVision-Bench/wildvision-bench.github.io