avijit9's Stars
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
typst/typst
A new markup-based typesetting system that is powerful and easy to learn.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
gkamradt/langchain-tutorials
Overview and tutorial of the LangChain Library
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
OpenGVLab/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
yenchenlin/nerf-pytorch
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
varunshenoy/GraphGPT
Extrapolating knowledge graphs from unstructured text using GPT-3 🕵️♂️
LLaVA-VL/LLaVA-NeXT
facebookresearch/Detic
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
piergiaj/pytorch-i3d
DAMO-NLP-SG/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
dvlab-research/LLaMA-VID
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
tim-learn/awesome-test-time-adaptation
Collection of awesome test-time (domain/batch/instance) adaptation methods
namuan/dr-doc-search
Converse with book - Built with GPT-3
facebookresearch/LaViLa
Code release for "Learning Video Representations from Large Language Models"
nicknochnack/Llama2RAG
A working example of RAG using LLama 2 70b and Llama Index
facebookresearch/Ego4d
Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset
jrgillick/laughter-detection
kevintsai/Building-and-Evaluating-Advanced-RAG-Applications
Jupyter notebooks for course Building and Evaluating Advanced RAG Applications, taught by Jerry Liu (Co-founder and CEO of LlamaIndex) and Anupam Datta (Co-founder and chief scientist of TruEra).
sayandebroy-csmi/cleanadapt
Reproduced code for Overcoming Label Noise for Source-free Unsupervised Video Domain Adaptation, ICVGIP'22
Sid2697/HOI-Ref
Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"
vturrisi/CO2A
Annusha/xmic
X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024
lmur98/epic_kitchens_affordances
ViLab-UCSD/GeoNet
Repository for accessing and training using GeoNet dataset, published at CVPR 2023.
WesRobbins/CAST
ViLab-UCSD/LaGTran_ICML2024
Code and models for the ICML 2024 paper "Tell, Don`t Show!: Language Guidance Eases Transfer Across Domains in Images and Videos"
ALIKSARKAR/Printed-OCR-for-Extremely-Low-resource-Indic-Languages
This is a official implementation of the following paper