kemuscollins's Stars
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
labring/FastGPT
FastGPT is a knowledge-based platform built on LLMs. It offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you develop and deploy complex question-answering systems without extensive setup or configuration.
cs-lazy-tools/ChatGPT-On-CS
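An LLM-based intelligent customer-service chat tool. It supports integration with platforms such as WeChat, Pinduoduo, Qianniu, Bilibili, Douyin enterprise accounts, Douyin, Doudian, Weibo chat, Xiaohongshu professional account operations, Xiaohongshu, and Zhihu. You can choose GPT3.5, GPT4.0, or 懒人百宝箱 (more platforms will be supported later). It handles text, voice, and images, accesses external resources such as the operating system and the internet through plugins, and supports building custom enterprise AI applications on your own knowledge base.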
LLM-Testing/LLM4SoftwareTesting
lonePatient/awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models
codefuse-ai/Test-Agent
An agent that empowers software testing with LLMs; an industry first in China
Zeyi-Lin/LLM-Finetune
Large language model fine-tuning: instruction fine-tuning for Qwen2 and GLM4
jeinlee1991/chinese-llm-benchmark
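Chinese LLM capability benchmark leaderboard. It currently covers 115 large models, including commercial models such as chatgpt, gpt4o, Baidu ERNIE Bot (文心一言), Alibaba Tongyi Qianwen, iFlytek Spark, SenseTime senseChat, and minimax, as well as open-source models such as Baichuan, qwen2, glm4, yi, internLM2, and llama3, evaluated across multiple capability dimensions. It provides not only a capability score leaderboard but also the raw outputs of every model.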
HqWu-HITCS/Awesome-Chinese-LLM
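A curated collection of open-source Chinese large language models, focusing on models that are relatively small, can be deployed privately, and are inexpensive to train. It covers base models, domain-specific fine-tunes and applications, datasets, and tutorials.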
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by the Qwen team, Alibaba Cloud.
charent/ChatLM-mini-Chinese
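A small 0.2B Chinese dialogue model (ChatLM-Chinese-0.2B). All code for the full pipeline is open source: dataset sources, data cleaning, tokenizer training, model pretraining, SFT instruction fine-tuning, and RLHF optimization. It supports SFT fine-tuning for downstream tasks and includes a fine-tuning example for triple information extraction.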
shibing624/text2vec
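text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.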
jamesmcroft/document-data-extraction-prompt-flow-evaluation
This sample demonstrates how to use GPT-4o with Vision to extract structured JSON data from PDF documents and evaluate them with Azure AI Studio and Prompt Flow
microsoft/GPT4Vision-Robot-Manipulation-Prompts
This repository provides the sample code designed to interpret human demonstration videos and convert them into high-level tasks for robots.
mshumer/gpt-prompt-engineer
ishan0102/vimGPT
Browse the web with GPT-4V and Vimium
Nikhil-Kulkarni/qa-gpt
Automate UI testing + functionality testing with GPT-4 Vision
lucgagan/auto-playwright
Automating Playwright steps using ChatGPT.
xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
ggerganov/llama.cpp
LLM inference in C/C++
huggingface/text-generation-inference
Large Language Model Text Generation Inference
OpenNMT/CTranslate2
Fast inference engine for Transformer models
huggingface/candle
Minimalist ML framework for Rust
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
turboderp/exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
andreidobrinski/react-wavesurfer-demo
Shirtiny/shWave
Subtitle timeline and audio waveform. A timeline that displays the audio waveform and lets you edit subtitles.
evan-moon/simple-waveform-visualizer
JS Audio API playground