Coronal-Halo's Stars
apple/ml-aim
This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.
byjlw/video-analyzer
A comprehensive video analysis tool that combines computer vision, audio transcription, and natural language processing to generate detailed descriptions of video content. This tool extracts key frames from videos, transcribes audio content, and produces natural language descriptions of the video's content.
meta-llama/llama-models
Utilities intended for use with Llama models.
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
NVlabs/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
mchengny/RWF2000-Video-Database-for-Violence-Detection
A large scale video database for violence detection, which has 2,000 video clips containing violent or non-violent behaviours.
vidharm/vidharm
airtlab/A-Dataset-for-Automatic-Violence-Detection-in-Videos
ajhamdi/ges-splatting
Original reference implementation of "GES : Generalized Exponential Splatting for Efficient Radiance Field Rendering" [CVPR 2024]
yizhongw/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
hanyangyu1021/LMGaussian
code will be available soon
btsmart/splatt3r
Official repository for Splatt3R: Zero-shot Gaussian Splatting from Uncalibrated Image Pairs
NVlabs/ParallelInversion
Parallel Inversion of Neural Radiance Fields for Robust Pose Estimation (ICRA 2023)
andrewyng/aisuite
Simple, unified interface to multiple Generative AI providers
aigc-apps/PAI-RAG
An easy-to-use framework for modular RAG
onnx/onnx
Open standard for machine learning interoperability
tyxsspa/AnyText
Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>
biubug6/Pytorch_Retinaface
Retinaface get 80.99% in widerface hard val using mobilenet0.25.
timesler/facenet-pytorch
Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models
serengil/retinaface
RetinaFace: Deep Face Detection Library for Python
matthias-k/pysaliency
Python Framework for Saliency Modeling and Evaluation
xuebinqin/BASNet
Code for CVPR 2019 paper. BASNet: Boundary-Aware Salient Object Detection
taozh2017/RGBD-SODsurvey
RGB-D Salient Object Detection: A Survey
shadowsocks/shadowsocks-windows
A C# port of shadowsocks
pyg-team/pytorch_geometric
Graph Neural Network Library for PyTorch
graphdeeplearning/graphtransformer
Graph Transformer Architecture. Source code for "A Generalization of Transformer Networks to Graphs", DLG-AAAI'21.
dmlc/dgl
Python package built to ease deep learning on graph, on top of existing DL frameworks.
github/personal-website
Code that'll help you kickstart a personal website that showcases your work as a software developer.
KaiyangZhou/deep-person-reid
Torchreid: Deep learning person re-identification in PyTorch.
vye16/shape-of-motion