YasmineXXX's Stars
ttengwang/Awesome_Long_Form_Video_Understanding
Awesome papers & datasets specifically focused on long-term videos.
mattneary/attention
Visualizing attention for LLM users
catherinesyeh/attention-exploration
Files for attention exploration in BERT
catherinesyeh/attention-viz
Visualizing query-key interactions in language + vision transformers
LetheSec/HuggingFace-Download-Accelerator
High-speed downloads from a mirror site using HuggingFace's official download tool.
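A minimal sketch of the idea behind this tool: point the official huggingface_hub downloader at a mirror by setting the HF_ENDPOINT variable before the library is imported. The mirror URL and repo id below are illustrative assumptions, not taken from the repository.

```python
# Sketch: route huggingface_hub's official downloader through a mirror.
# HF_ENDPOINT must be set before huggingface_hub is imported.
import os

os.environ["HF_ENDPOINT"] = "https://hf-mirror.com"  # illustrative mirror URL

from huggingface_hub import snapshot_download

local_dir = snapshot_download(repo_id="bert-base-uncased")  # example repo id
print("downloaded to", local_dir)
```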
jacobgil/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
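A hedged sketch of typical pytorch-grad-cam usage following the pattern in its README; the ResNet-50 model, target layer, and class index are arbitrary choices for illustration, and the random input tensor stands in for a preprocessed image batch.

```python
# Sketch: compute a Grad-CAM heatmap for one class of a torchvision ResNet-50.
import torch
from torchvision.models import resnet50
from pytorch_grad_cam import GradCAM
from pytorch_grad_cam.utils.model_targets import ClassifierOutputTarget

model = resnet50(weights="IMAGENET1K_V2").eval()  # downloads pretrained weights
target_layers = [model.layer4[-1]]                # last conv block of ResNet-50

input_tensor = torch.randn(1, 3, 224, 224)        # stand-in for a preprocessed image
targets = [ClassifierOutputTarget(281)]           # ImageNet class 281 ("tabby cat")

cam = GradCAM(model=model, target_layers=target_layers)
grayscale_cam = cam(input_tensor=input_tensor, targets=targets)  # (1, 224, 224) heatmap
print(grayscale_cam.shape)
```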
koalaman/shellcheck
ShellCheck, a static analysis tool for shell scripts
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
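A hedged sketch of Chinese image-text retrieval with Chinese-CLIP, following its README pattern; the image path and candidate captions are placeholders.

```python
# Sketch: score an image against candidate Chinese captions with Chinese-CLIP.
import torch
from PIL import Image
import cn_clip.clip as clip
from cn_clip.clip import load_from_name

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = load_from_name("ViT-B-16", device=device, download_root="./")
model.eval()

image = preprocess(Image.open("example.jpg")).unsqueeze(0).to(device)  # placeholder path
texts = clip.tokenize(["一只猫", "一条狗", "一辆自行车"]).to(device)      # candidate captions

with torch.no_grad():
    image_feat = model.encode_image(image)
    text_feat = model.encode_text(texts)
    # cosine similarity between the image and each caption
    image_feat = image_feat / image_feat.norm(dim=-1, keepdim=True)
    text_feat = text_feat / text_feat.norm(dim=-1, keepdim=True)
    sims = (image_feat @ text_feat.T).squeeze(0)

print(sims.tolist())  # highest score = best-matching caption
```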
aranciokov/ranp
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
jpthu17/EMCL
[NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
maitrix-org/llm-reasoners
A library for advanced large language model reasoning
OpenBioLink/ThoughtSource
A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/
unicamp-dl/ExaRanker
RUC-NLPIR/LLM4IR-Survey
This is the repo for the survey of LLM4IR.
DevSinghSachan/unsupervised-passage-reranking
Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"
stanford-crfm/helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in HEIM (https://arxiv.org/abs/2311.04287) and vision-language models in VHELM (https://arxiv.org/abs/2410.07112).
justjavac/awesome-wechat-weapp
A curated collection of WeChat Mini Program development resources :100:
thakur-nandan/beir-ColBERT
Evaluation of BEIR Datasets using ColBERT retrieval model
beir-cellar/beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use; evaluate your models across 15+ diverse IR datasets.
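A hedged sketch of evaluating a dense retriever on one BEIR dataset, following the library's README; the SciFact dataset and the SentenceTransformers model name are example choices.

```python
# Sketch: download a BEIR dataset and evaluate a bi-encoder with exact search.
from beir import util
from beir.datasets.data_loader import GenericDataLoader
from beir.retrieval import models
from beir.retrieval.evaluation import EvaluateRetrieval
from beir.retrieval.search.dense import DenseRetrievalExactSearch as DRES

# Download and unpack SciFact (one of the 15+ benchmark datasets).
url = "https://public.ukp.informatik.tu-darmstadt.de/thakur/BEIR/datasets/scifact.zip"
data_path = util.download_and_unzip(url, "datasets")
corpus, queries, qrels = GenericDataLoader(data_folder=data_path).load(split="test")

# Exact-search retrieval with a SentenceTransformers bi-encoder.
model = DRES(models.SentenceBERT("msmarco-distilbert-base-tas-b"), batch_size=16)
retriever = EvaluateRetrieval(model, score_function="dot")

results = retriever.retrieve(corpus, queries)
ndcg, _map, recall, precision = retriever.evaluate(qrels, results, retriever.k_values)
print(ndcg)
```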
boheumd/MA-LMM
(CVPR 2024) MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
Vision-CAIR/MiniGPT4-video
Official code for the Goldfish model for long video understanding and MiniGPT4-video for short video understanding
LAION-AI/CLIP_benchmark
CLIP-like model evaluation
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model with performance approaching GPT-4o.
RahulSChand/gpu_poor
Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
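A back-of-envelope memory estimate in the spirit of this calculator. This is a rule-of-thumb sketch, not the tool's exact formula; the 7B Llama-style configuration, fp16 weights, and 1 GB overhead term are illustrative assumptions.

```python
# Sketch: rough VRAM estimate = weights + KV cache + fixed overhead.
def estimate_vram_gb(n_params_b, bytes_per_param, n_layers, hidden_size,
                     context_len, batch_size=1, kv_bytes=2, overhead_gb=1.0):
    weights = n_params_b * 1e9 * bytes_per_param          # model weights
    kv_cache = (2 * n_layers * hidden_size                # K and V per layer
                * context_len * batch_size * kv_bytes)    # per token, per sequence
    return (weights + kv_cache) / 1024**3 + overhead_gb

# ~7B model, fp16 weights, 32 layers, hidden size 4096, 4096-token context
print(round(estimate_vram_gb(7, 2, 32, 4096, 4096), 1), "GB")  # ≈ 16 GB
```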
chenfei-wu/TaskMatrix
Atomic-man007/Awesome_Multimodel_LLM
Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLM). It covers datasets, tuning techniques, in-context learning, visual reasoning, foundational models, and more. Stay updated with the latest advancements.
mbzuai-oryx/Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
mwray/Semantic-Video-Retrieval
Code and benchmarks for the Semantic Video Retrieval Task
MLNLP-World/MyArxiv
A personalized, customizable arXiv template for effectively tracking relevant content, authors, and academic conferences in specific fields.