Pinned Repositories
AudioNotes
快速提取音视频内容,整理成一份结构化的markdown笔记
CAMA
Official Implementation of A Vision-Centric Approach for Static Map Element Annotation
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
gdal
GDAL is an open source X/MIT licensed translator library for raster and vector geospatial data formats. This is a mirror of the GDAL Subversion repository.
InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型
MoneyPrinterTurbo
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
ollama
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
pluto
PLUTO: Push the Limit of Imitation Learning-based Planning for Autonomous Driving
telecarla
TELECARLA: An Open Source Extension of the CARLA Simulator for Teleoperated Driving Research Using Off-the-Shelf Components
teleoperated_driving
wangyankecn's Repositories
wangyankecn/AudioNotes
快速提取音视频内容,整理成一份结构化的markdown笔记
wangyankecn/CAMA
Official Implementation of A Vision-Centric Approach for Static Map Element Annotation
wangyankecn/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
wangyankecn/gdal
GDAL is an open source X/MIT licensed translator library for raster and vector geospatial data formats. This is a mirror of the GDAL Subversion repository.
wangyankecn/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型
wangyankecn/MoneyPrinterTurbo
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
wangyankecn/ollama
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
wangyankecn/pluto
PLUTO: Push the Limit of Imitation Learning-based Planning for Autonomous Driving
wangyankecn/telecarla
TELECARLA: An Open Source Extension of the CARLA Simulator for Teleoperated Driving Research Using Off-the-Shelf Components
wangyankecn/teleoperated_driving
wangyankecn/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
wangyankecn/VMA
A general map auto annotation framework based on MapTR, with high flexibility in terms of spatial scale and element type