Pinned Repositories
3D-Registration-with-Maximal-Cliques
Source code of CVPR 2023 paper
912-notes
清华大学计算机系考研专业课 (912) 笔记
AgentGym
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
aimet
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
aitools
tools for onnx check, compile profile & accuracy, and other functions
Android-MobileFaceNet-MTCNN-FaceDeSpoofing
Use tensorflow Lite on Android platform, integrated face detection (MTCNN), face anti spoofing (ECCV2018-FaceDeSpoofing) and face comparison (MobileFaceNet use InsightFace loss)
ANN
Artificial-Neural-Network
deepstream-jetson
for gstreamer launch
GeoFormer
GeoFormer for Homography Estimation
MobileNet-SSD-TensorRT
Accelerate mobileNet-ssd with tensorRT
bingxinhu's Repositories
bingxinhu/GeoFormer
GeoFormer for Homography Estimation
bingxinhu/3D-Registration-with-Maximal-Cliques
Source code of CVPR 2023 paper
bingxinhu/AgentGym
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
bingxinhu/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
bingxinhu/DINO-X-API
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
bingxinhu/dvsann
数据格式处理
bingxinhu/event-based_vision_resources
tsinghua brain like & nan lake
bingxinhu/llama
Port of Facebook's LLaMA model in C/C++
bingxinhu/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
bingxinhu/MA-LMM
(2024CVPR) MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
bingxinhu/MagicDrive
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
bingxinhu/Masked-Spiking-Transformer
[ICCV-23] Masked Spiking Transformer
bingxinhu/mlx
MLX: An array framework for Apple silicon
bingxinhu/mlx-examples
Examples in the MLX framework
bingxinhu/neuroglancer
WebGL-based viewer for volumetric data
bingxinhu/notebooks
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
bingxinhu/omniglue
Code release for CVPR'24 submission 'OmniGlue'
bingxinhu/ROCm
AMD ROCm™ Software - GitHub Home
bingxinhu/SNN-
bingxinhu/snntorch
Deep and online learning with spiking neural networks in Python
bingxinhu/spikingjelly
SpikingJelly is an open-source deep learning framework for Spiking Neural Network (SNN) based on PyTorch.
bingxinhu/stable-diffusion-webui
Stable Diffusion web UI
bingxinhu/system_architect
💯2024年 系统架构设计师(软考高级)备考资源库+配套免费刷题软件。PC版免费刷题软件:https://ruankaodaren.com
bingxinhu/Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
bingxinhu/Tracking-Anything-with-DEVA
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
bingxinhu/ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
bingxinhu/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
bingxinhu/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
bingxinhu/WasmEdge
WasmEdge is a lightweight, high-performance, and extensible WebAssembly runtime for cloud native, edge, and decentralized applications. It powers serverless apps, embedded functions, microservices, smart contracts, and IoT devices.
bingxinhu/XMem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model