Pinned Repositories
aerial_mapper
Real-time Dense Point Cloud, Digital Surface Map (DSM) and (Ortho-)Mosaic Generation for UAVs
agentsflow
Drag & drop UI to build and run a flow of autogen AI agents
AirSim
Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research
angular-cesium
JavaScript library for creating map based web apps using Cesium and Angular
antvis.github.io
🔜 AntV 新站点!
Anything-3D
Segment-Anything + 3D. Let's lift anything to 3D.
kpi
kpi: kobo-api
MobileVLM
Strong and Open Vision Language Assistant for Mobile Devices
models
Models and examples built with TensorFlow
UE-GeoViewer
A plugin for Unreal Engine that overlays real world maps into the world
tfgbestneal's Repositories
tfgbestneal/MobileVLM
Strong and Open Vision Language Assistant for Mobile Devices
tfgbestneal/CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
tfgbestneal/ComfyUI
The most powerful and modular stable diffusion GUI with a graph/nodes interface.
tfgbestneal/DetectSegPlatform
YoloWorld & Flask
tfgbestneal/donkeycar
Open source hardware and software platform to build a small scale self driving car.
tfgbestneal/DriveLM
DriveLM: Driving with Graph Visual Question Answering
tfgbestneal/Focal_TSMP
Deep learning for vegetation health prediction and agricultural drought assessment from a regional climate simulation
tfgbestneal/Gemini
Google Gemini AI model w/speech recognition and voice.
tfgbestneal/gpt-assistant-android
免费的ChatGPT API的安卓语音助手,可用音量键唤起并进行语音交流,支持联网、Vision拍照识图、提问模板等功能 | A free ChatGPT API voice assistant for Android, activated via volume keys for voice interaction, supporting features such as network connectivity, Vision photo recognition, and question templates.
tfgbestneal/groundingLMM
Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
tfgbestneal/InternLM-XComposer
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
tfgbestneal/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源模型
tfgbestneal/jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
tfgbestneal/libosmscout
Libosmscout is a C++ library for offline map rendering, routing and location lookup based on OpenStreetMap data
tfgbestneal/MetaGPT
🌟 The Multi-Agent Framework: Given one line Requirement, return PRD, Design, Tasks, Repo
tfgbestneal/MiniGPT4Qwen
Cleaned Lavis + DeepSpeed Support! Align MiniGPT4 with Qwen-Chat LLM. I just use 18.8k high-quality instruction-tuning data(Bi-lingual, from minigpt4 and llava). Just fine-tune the projection layer.
tfgbestneal/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
tfgbestneal/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
tfgbestneal/RoadVision
Revolutionizing navigation with AR and MapKit integration, this iOS app offers immersive, real-time directions and customizable UI for an intuitive experience. #iOSDevelopment #AugmentedReality #MapKit #SwiftUI #Innovation
tfgbestneal/SAMJS
tfgbestneal/screenshot-to-code
Drop in a screenshot and convert it to clean HTML/Tailwind/JS code
tfgbestneal/Segment-and-Track-Anything
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
tfgbestneal/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
tfgbestneal/Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
tfgbestneal/SegmentAnything3D
SAM3D: Segment Anything in 3D Scenes
tfgbestneal/SpeechAgents
SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems
tfgbestneal/tracking_ros
ROS compatible package for object tracking based on SAM, Cutie, GroundingDINO, YOLO-World, VLPart and DEVA
tfgbestneal/U-2-Net
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
tfgbestneal/X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
tfgbestneal/YOLOV8_SAM
yolov8 model with SAM meta