bingxinhu

Pinned Repositories

3D-Registration-with-Maximal-Cliques
Source code of CVPR 2023 paper
Language:C++0 0 00
912-notes
清华大学计算机系考研专业课 (912) 笔记
Language:C++00
AgentGym
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
Language:Python0 0 00
aimet
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
Language:Python0 1 00
aitools
tools for onnx check, compile profile & accuracy, and other functions
00
Android-MobileFaceNet-MTCNN-FaceDeSpoofing
Use tensorflow Lite on Android platform, integrated face detection (MTCNN), face anti spoofing (ECCV2018-FaceDeSpoofing) and face comparison (MobileFaceNet use InsightFace loss)
Language:Java00
ANN
Artificial-Neural-Network
0 2 00
deepstream-jetson
for gstreamer launch
10
GeoFormer
GeoFormer for Homography Estimation
Language:Python10
MobileNet-SSD-TensorRT
Accelerate mobileNet-ssd with tensorRT
Language:C++1 1 00

bingxinhu's Repositories

bingxinhu/GeoFormer
GeoFormer for Homography Estimation
Language:Python10
bingxinhu/3D-Registration-with-Maximal-Cliques
Source code of CVPR 2023 paper
Language:C++0 0 00
bingxinhu/AgentGym
Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
Language:Python0 0 00
bingxinhu/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
bingxinhu/DINO-X-API
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
bingxinhu/dvsann
数据格式处理
bingxinhu/event-based_vision_resources
tsinghua brain like & nan lake
0 0
bingxinhu/llama
Port of Facebook's LLaMA model in C/C++
bingxinhu/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
bingxinhu/MA-LMM
(2024CVPR) MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
bingxinhu/MagicDrive
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
bingxinhu/Masked-Spiking-Transformer
[ICCV-23] Masked Spiking Transformer
Language:Python0 0
bingxinhu/mlx
MLX: An array framework for Apple silicon
bingxinhu/mlx-examples
Examples in the MLX framework
Language:Python0 0
bingxinhu/neuroglancer
WebGL-based viewer for volumetric data
bingxinhu/notebooks
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
bingxinhu/omniglue
Code release for CVPR'24 submission 'OmniGlue'
Language:Python0 0
bingxinhu/ROCm
AMD ROCm™ Software - GitHub Home
bingxinhu/SNN-
Language:Python
bingxinhu/snntorch
Deep and online learning with spiking neural networks in Python
bingxinhu/spikingjelly
SpikingJelly is an open-source deep learning framework for Spiking Neural Network (SNN) based on PyTorch.
bingxinhu/stable-diffusion-webui
Stable Diffusion web UI
bingxinhu/system_architect
💯2024年系统架构设计师（软考高级）备考资源库+配套免费刷题软件。PC版免费刷题软件：https://ruankaodaren.com
Language:HTML0 0
bingxinhu/Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Language:Python
bingxinhu/Tracking-Anything-with-DEVA
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
Language:Python
bingxinhu/ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
Language:Python0 0
bingxinhu/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
bingxinhu/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
bingxinhu/WasmEdge
WasmEdge is a lightweight, high-performance, and extensible WebAssembly runtime for cloud native, edge, and decentralized applications. It powers serverless apps, embedded functions, microservices, smart contracts, and IoT devices.
bingxinhu/XMem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model