talha1503's Stars
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
roboflow/supervision
We write your reusable computer vision tools. 💜
voxel51/fiftyone
Refine high-quality datasets and visual AI models
jxnl/instructor
structured outputs for llms
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
alirezadir/Machine-Learning-Interviews
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
roboflow/maestro
streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL
TheShadow29/awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
buoyancy99/diffusion-forcing
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
opendilab/awesome-multi-modal-reinforcement-learning
A curated list of Multi-Modal Reinforcement Learning resources (continually updated)
pratyushasharma/laser
The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
haoranD/Awesome-Embodied-AI
A curated list of awesome papers on Embodied AI and related research/industry-driven resources.
michiyasunaga/dragon
[NeurIPS 2022] DRAGON 🐲: Deep Bidirectional Language-Knowledge Graph Pretraining
facebookresearch/open-eqa
OpenEQA Embodied Question Answering in the Era of Foundation Models
agiresearch/InstructGLM
Language is All a Graph Needs
allenai/ScienceWorld
ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.
zh460045050/V2L-Tokenizer
GAIR-NLP/weak-to-strong-reasoning
jvking/text-games
This repository provides text game simulators for research purposes.
pliang279/HEMM
Holistic evaluation of multimodal foundation models
superhero-7/AltDiffusion
Source code for paper: "AltDiffusion: A multilingual Text-to-Image diffusion model"
thunlp/EmbodiedAIxLLMPapers
Papers on integrating large language models with embodied AI
jvking/reddit-RL-simulator
This repository provides simulator codes for predicting and tracking popular discussion threads on Reddit
amazon-science/ContextualUnderstanding-ContrastiveDecoding
Enhancing contextual understanding in large language models through contrastive decoding
google-research-datasets/visage
Visage contains an image dataset of images with human annotations on whether or not certain attributes are present or depicted in the image. The attribute may either be stereotypical or non-stereotypical w.r.t. to the identity group in the image. It also contains a list of attributes in English along with annotations about whether they are visual.
MLSA-SRM/Python-CLIK
Command Line Interface for Keys is a tool to hide and manage your API keys and Secret/Auth tokens.
cmubig/SCoFT
SCoFT: Self-Contrastive Fine-Tuning for Equitable Image Generation (CVPR 2024)
lwaekfjlk/python-project-template
Template for project development.
microsoft/DOSA
Dataset of of Social Artifacts from Different Indian Geographical Subcultures
cognitiveailab/drrn-scienceworld