lvchakele

in beijingchina

lvchakele's Stars

solidglue/Recommender_System
推荐系统入门指南，全面介绍了工业级推荐系统的理论知识（王树森推荐系统公开课-基于小红书的场景讲解工业界真实的推荐系统），如何基于TensorFlow2训练模型，如何实现高性能、高并发、高可用的Golang推理微服务。Comprehensively introduced the theory of industrial recommender system, how to trainning models based on TensorFlow2, how to implement the high-performance、high-concurrency and high-available inference services base on Golang.
Language:Jupyter Notebook40433
solidglue/DNN_for_YouTube_Recommendations
YouTube推荐系统深度学习召回排序算法, Deep Neural Networks for YouTube Recommendations. YouTubeDNN.
Language:Jupyter Notebook108
qiangmzsx/Software-Engineering-at-Google
《Software Engineering at Google》的中英文对译版本
Language:HTML4.1k513
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
Language:Python35.8k5.1k
labring/FastGPT
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.
Language:TypeScript17.2k4.6k
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
Language:TypeScript47k6.7k
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Language:Python18.5k1.9k
theevann/streamlit-audiorecorder
Audio recorder for streamlit
Language:Python12621
lobehub/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT/ Claude application.
Language:TypeScript42.1k9.5k
zhayujie/chatgpt-on-wechat
基于大模型搭建的聊天机器人，同时支持微信公众号、企业微信应用、飞书、钉钉等接入，可选择GPT3.5/GPT-4o/GPT-o1/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI，能处理文本、语音和图片，访问操作系统和互联网，支持基于自有知识库进行定制企业智能客服。
Language:Python30.3k8k
QuivrHQ/quivr
Open-source RAG Framework for building GenAI Second Brains 🧠 Build productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Efficient retrieval augmented generation framework
Language:Python36.2k3.5k
chainer/chainer
A flexible framework of neural networks for deep learning
Language:Python5.9k1.4k
GaiZhenbiao/ChuanhuChatGPT
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
Language:Python15.2k2.3k
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
Language:Python17.8k1.7k
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
Language:Python4.8k389
LlamaFamily/Llama-Chinese
Llama中文社区，Llama3在线体验和微调模型已开放，实时汇总最新Llama3学习资料，已将所有代码更新适配Llama3，构建最好的中文Llama大模型，完全开源可商用
Language:Python13.7k1.2k
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
Language:Python3.7k313
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Language:Shell8.6k539
unslothai/unsloth
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Language:Python16.4k1.1k
2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python31.2k3.4k
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python36.6k4.5k
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
Language:C++5.8k889
microsoft/DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
Language:Python1.9k174
bentoml/OpenLLM
Run any open-source LLMs, such as Llama 3.1, Gemma, as OpenAI compatible API endpoint in the cloud.
Language:Python9.8k626
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++8.3k932
huggingface/text-generation-inference
Large Language Model Text Generation Inference
Language:Python8.9k1k
InternLM/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
Language:Python6.3k441
jina-ai/jina
☁️ Build multimodal AI applications with cloud-native stack
Language:Python21k2.2k
ymcui/Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
Language:Python7.1k578
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
Language:Python7.7k844

lvchakele

lvchakele's Stars

solidglue/Recommender_System

solidglue/DNN_for_YouTube_Recommendations

qiangmzsx/Software-Engineering-at-Google

run-llama/llama_index

labring/FastGPT

langgenius/dify

infiniflow/ragflow

theevann/streamlit-audiorecorder

lobehub/lobe-chat

zhayujie/chatgpt-on-wechat

QuivrHQ/quivr

chainer/chainer

GaiZhenbiao/ChuanhuChatGPT

microsoft/graphrag

THUDM/GLM-4

LlamaFamily/Llama-Chinese

modelscope/ms-swift

QwenLM/Qwen2.5

unslothai/unsloth

2noise/ChatTTS

lm-sys/FastChat

NVIDIA/FasterTransformer

microsoft/DeepSpeed-MII

bentoml/OpenLLM

NVIDIA/TensorRT-LLM

huggingface/text-generation-inference

InternLM/InternLM

jina-ai/jina

ymcui/Chinese-LLaMA-Alpaca-2

axolotl-ai-cloud/axolotl