wangyankecn

Pinned Repositories

AudioNotes
快速提取音视频内容，整理成一份结构化的markdown笔记
Language:Python0 0 00
CAMA
Official Implementation of A Vision-Centric Approach for Static Map Element Annotation
Language:Python0 0 00
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python0 0 00
gdal
GDAL is an open source X/MIT licensed translator library for raster and vector geospatial data formats. This is a mirror of the GDAL Subversion repository.
Language:C++0 0 00
InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型
Language:Python0 0 00
MoneyPrinterTurbo
利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.
Language:Python0 0 00
ollama
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
Language:Go0 0 00
pluto
PLUTO: Push the Limit of Imitation Learning-based Planning for Autonomous Driving
Language:Python0 0 00
telecarla
TELECARLA: An Open Source Extension of the CARLA Simulator for Teleoperated Driving Research Using Off-the-Shelf Components
Language:C++00
teleoperated_driving
Language:Dockerfile0 0 00

wangyankecn's Repositories

wangyankecn/AudioNotes
快速提取音视频内容，整理成一份结构化的markdown笔记
Language:Python0 0 00
wangyankecn/CAMA
Official Implementation of A Vision-Centric Approach for Static Map Element Annotation
Language:Python0 0 00
wangyankecn/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Language:Python0 0 00
wangyankecn/gdal
GDAL is an open source X/MIT licensed translator library for raster and vector geospatial data formats. This is a mirror of the GDAL Subversion repository.
Language:C++0 0 00
wangyankecn/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型
Language:Python0 0 00
wangyankecn/MoneyPrinterTurbo
利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.
Language:Python0 0 00
wangyankecn/ollama
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
Language:Go0 0 00
wangyankecn/pluto
PLUTO: Push the Limit of Imitation Learning-based Planning for Autonomous Driving
Language:Python0 0 00
wangyankecn/telecarla
TELECARLA: An Open Source Extension of the CARLA Simulator for Teleoperated Driving Research Using Off-the-Shelf Components
Language:C++00
wangyankecn/teleoperated_driving
Language:Dockerfile0 0 00
wangyankecn/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Language:Python0 0
wangyankecn/VMA
A general map auto annotation framework based on MapTR, with high flexibility in terms of spatial scale and element type