pannaqlucky

pannaqlucky's Stars

modelscope/FunClip
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
Language:Python4k442
16131zzzzzzzz/EveryoneNobel
A flexible framework powered by ComfyUI for generating personalized Nobel Prize images.
Language:Python1.2k79
DarthReca/depth-any-canopy
Language:Python10
meyerls/FruitNeRF
[IROS24] Offical Code for "FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework" - Inegrated into Nerfstudio
Language:Python28536
SakanaAI/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
Language:Jupyter Notebook8.6k1.2k
initialneil/DEGAS
1063
latentcat/latentbox
A collection of awesome-lists for AI, creativity and art. AI、创意和艺术领域的精选合集。https://latentbox.com
Language:TypeScript1.3k122
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook15.5k1.4k
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook13.5k1.3k
lmterryn/ITSMe
Language:R428
localdevices/pyorc
Surface velocity, object tracking, and river flow measurements in an open-source API
Language:Python14733
supermemoryai/opensearch-ai
SearchGPT / Perplexity clone, but personalised for you.
Language:TypeScript993141
sairajk/easi-tex
[SIGGRAPH 2024] "EASI-Tex: Edge-Aware Mesh Texturing from Single Image", ACM Transactions on Graphics.
Language:Python1149
pooya-mohammadi/yolov5-gradcam
Visualizing Yolov5's layers using GradCam
Language:Jupyter Notebook29246
bbycroft/llm-viz
3D Visualization of an GPT-style LLM
Language:TypeScript4.2k466
TencentARC/CustomNet
Language:Python26810
TencentARC/InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
Language:Python3.5k385
z1069614715/objectdetection_script
一些关于目标检测的脚本的改进思路代码，详细请看readme.md
Language:Python5.6k495
lutzroeder/netron
Visualizer for neural network, deep learning and machine learning models
Language:JavaScript29k2.8k
onnx/models
A collection of pre-trained, state-of-the-art models in the ONNX format
Language:Jupyter Notebook8.1k1.4k
zetane/viewer
ML models and internal tensors 3D visualizer
Language:Python1.3k133
Cysu/open-reid
Open source person re-identification library in python
Language:Python1.3k351
liuzywen/A-cross-modal-edge-guided-salient-object-detection-for-RGB-D-image
Language:PostScript11
MCG-NJU/CMPT
[IJCV 2021] Cross-Modal Pyramid Translation for RGB-D Scene Recognition
Language:Python71
frhf/cross-modal-distillation-reidentification
Code for Paper: Cross-Modal Distillation for Person Re-identification in RGB-Depth
Language:Python33
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
Language:Python1.7k85
junyanz/BicycleGAN
Toward Multimodal Image-to-Image Translation
Language:Python1.5k254
roboflow/maestro
streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL
Language:Python1.4k104
open-mmlab/Multimodal-GPT
Multimodal-GPT
Language:Python1.5k127
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
Language:Python3.8k288