Pinned Repositories
opencv
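A collection of computer vision applications written in Python on top of the OpenCV library. It will be updated and expanded from time to time as my study of CV deepens.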
anygrasp_sdk
Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
bigdata
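A collection of real-time stream computing and offline batch processing applications built on big data technologies such as the Hadoop and Spark ecosystems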
clash-for-linux
clash-for-linux
CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
cloudpan189-go
Command-line client (CLI) for 天翼云盘 (Tianyi Cloud Drive), implemented in Go
CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
CoreNLP
A natural language analysis tool written in Java. I made some modifications on top of Stanford CoreNLP.
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
yangle9567's Repositories
yangle9567/anygrasp_sdk
yangle9567/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
yangle9567/clash-for-linux
clash-for-linux
yangle9567/CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
yangle9567/cloudpan189-go
Command-line client (CLI) for 天翼云盘 (Tianyi Cloud Drive), implemented in Go
yangle9567/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
yangle9567/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
yangle9567/dora
DORA (Dataflow-Oriented Robotic Application) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low latency, composable, and distributed dataflow capabilities. Applications are modeled as directed graphs, also referred to as pipelines.
yangle9567/FATE
An Industrial Grade Federated Learning Framework
yangle9567/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
yangle9567/google-research
Google Research
yangle9567/GPT4o
Community Open Source Implementation of GPT4o in PyTorch
yangle9567/lerobot
🤗 LeRobot: End-to-end Learning for Real-World Robotics in PyTorch
yangle9567/MetaGPT
🌟 The Multi-Agent Framework: Given one line Requirement, return PRD, Design, Tasks, Repo
yangle9567/middleware
TrueNAS CORE/Enterprise/SCALE Middleware Git Repository
yangle9567/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
yangle9567/mobile-aloha
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
yangle9567/Octopus
🐙Octopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming.
yangle9567/open-interpreter
OpenAI's Code Interpreter in your terminal, running locally
yangle9567/PALM-E
Implementation of "PaLM-E: An Embodied Multimodal Language Model"
yangle9567/prometheus
The Prometheus monitoring system and time series database.
yangle9567/scale-build
TrueNAS SCALE Build System
yangle9567/Telechat
yangle9567/ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
yangle9567/vlm_arm
Robotic arm + large model + multimodality = an embodied agent for human-robot collaboration
yangle9567/VLM_survey
Vision-Language Models for Vision Tasks: A Survey
yangle9567/VoxPoser
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
yangle9567/webui
TrueNAS Angular UI
yangle9567/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
yangle9567/Whisper-Finetune
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment