Pinned Repositories
ALLaVA
Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model
athena
An open-source implementation of a sequence-to-sequence based speech processing engine
audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch
automated-prompt-engineering-from-scratch
A repo with an automated prompt engineering workflow from scratch. It leverages the OPRO technique.
CLAP
Contrastive Language-Audio Pretraining
FT-w2v2-ser
Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Jackwaterveg's Repositories
Jackwaterveg/ALLaVA
Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model
Jackwaterveg/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Jackwaterveg/automated-prompt-engineering-from-scratch
A repo with an automated prompt engineering workflow from scratch. It leverages the OPRO technique.
Jackwaterveg/Awesome-AITools
A collection of AI-related utilities. Issues and pull requests are welcome.
Jackwaterveg/awesome-auto-alignment
Collection of papers for scalable automated alignment.
Jackwaterveg/awesome-LLMs-In-China
Large models in China
Jackwaterveg/Awesome-Model-Merging
A curated list of Model Merging methods.
Jackwaterveg/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Jackwaterveg/ChineseWebText
Jackwaterveg/CodeGeeX
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Jackwaterveg/codeshell-vscode
An intelligent coding assistant plugin for Visual Studio Code, developed based on CodeShell
Jackwaterveg/dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
Jackwaterveg/EasyInstruct
[ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.
Jackwaterveg/gpt_academic
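A practical interactive interface for large language models such as GPT and GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other languages; PDF/LaTeX paper translation and summarization; parallel queries to multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen (Qwen), deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.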
Jackwaterveg/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. A commercially usable open-source multimodal chat model with performance approaching GPT-4o.
Jackwaterveg/llama
Inference code for LLaMA models
Jackwaterveg/LLaMA-Factory
Unify Efficient Fine-tuning of 100+ LLMs
Jackwaterveg/llava-phi
Jackwaterveg/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Jackwaterveg/LLMDataHub
A quick guide to trending datasets, especially for instruction fine-tuning
Jackwaterveg/MINT-1T
MINT-1T: A one trillion token multimodal interleaved dataset.
Jackwaterveg/mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
Jackwaterveg/NeMo-Aligner
Scalable toolkit for efficient model alignment
Jackwaterveg/nougat
Implementation of Nougat: Neural Optical Understanding for Academic Documents
Jackwaterveg/OLMo
Modeling, training, eval, and inference code for OLMo
Jackwaterveg/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Jackwaterveg/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Jackwaterveg/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Jackwaterveg/VILA
VILA - a multi-image visual language model with training, inference, and evaluation recipes, deployable from cloud to edge (Jetson Orin and laptops)
Jackwaterveg/WizardLM
LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath