pannaqlucky's Stars
modelscope/FunClip
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
16131zzzzzzzz/EveryoneNobel
A flexible framework powered by ComfyUI for generating personalized Nobel Prize images.
DarthReca/depth-any-canopy
meyerls/FruitNeRF
[IROS24] Offical Code for "FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework" - Inegrated into Nerfstudio
SakanaAI/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
initialneil/DEGAS
latentcat/latentbox
A collection of awesome-lists for AI, creativity and art. AI、创意和艺术领域的精选合集。https://latentbox.com
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
lmterryn/ITSMe
localdevices/pyorc
Surface velocity, object tracking, and river flow measurements in an open-source API
supermemoryai/opensearch-ai
SearchGPT / Perplexity clone, but personalised for you.
sairajk/easi-tex
[SIGGRAPH 2024] "EASI-Tex: Edge-Aware Mesh Texturing from Single Image", ACM Transactions on Graphics.
pooya-mohammadi/yolov5-gradcam
Visualizing Yolov5's layers using GradCam
bbycroft/llm-viz
3D Visualization of an GPT-style LLM
TencentARC/CustomNet
TencentARC/InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
z1069614715/objectdetection_script
一些关于目标检测的脚本的改进思路代码,详细请看readme.md
lutzroeder/netron
Visualizer for neural network, deep learning and machine learning models
onnx/models
A collection of pre-trained, state-of-the-art models in the ONNX format
zetane/viewer
ML models and internal tensors 3D visualizer
Cysu/open-reid
Open source person re-identification library in python
liuzywen/A-cross-modal-edge-guided-salient-object-detection-for-RGB-D-image
MCG-NJU/CMPT
[IJCV 2021] Cross-Modal Pyramid Translation for RGB-D Scene Recognition
frhf/cross-modal-distillation-reidentification
Code for Paper: Cross-Modal Distillation for Person Re-identification in RGB-Depth
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
junyanz/BicycleGAN
Toward Multimodal Image-to-Image Translation
roboflow/maestro
streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL
open-mmlab/Multimodal-GPT
Multimodal-GPT
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.