Pinned Repositories
baby-llm
RedisXANN
Redis x ANN: vector similarity search (VSS)
RedisXLM
Redis x language model inference (loads a trained model); sizes (tiny|t, small|s, medium|m, large|l) with quantization. NOTE: the language model is embedded in Redis and is available in the stand-alone version only.
RedisXSlot
Redis x slot module: asynchronous block migrate/restore that avoids or reduces blocking of other commands.
craftsman
A craftsman who tries to give the best practicable solution for the business.
doraemon-nb
IPython notebooks for sample experiments and trying out ideas.
geo
PHP extension (map-related operations)
iowrapper
io_uring library or syscall wrapper, with some benchmarks for experimental study of io_uring.
perf-book-cn
https://github.com/dendibakh/perf-book as an online GitBook e-book, translated into Chinese as raw Markdown documents.
weedge's Repositories
weedge/perf-book
The book "Performance Analysis and Tuning on Modern CPU", translated into Chinese using ChatGPT/Gemini, although a translated edition already exists. (Follows upstream updates.)
weedge/AutoCompressors
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
weedge/baby-db
weedge/baize-chatbot
Let ChatGPT teach your own chatbot in hours with a single GPU!
weedge/ChatLM-mini-Chinese
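A small 0.2B Chinese dialogue model (ChatLM-Chinese-0.2B). Open-sources the full code for the whole pipeline: dataset sources, data cleaning, tokenizer training, model pre-training, SFT instruction fine-tuning, and RLHF optimization. Supports SFT fine-tuning for downstream tasks, with a fine-tuning example for triple information extraction.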
weedge/Chinese-LLaMA-Alpaca-2
Chinese LLaMA-2 & Alpaca-2 LLMs (phase-two project) with 64K long-context models
weedge/Chinese-Mixtral
Chinese Mixtral mixture-of-experts LLMs (Chinese Mixtral MoE LLMs)
weedge/dify
An Open-Source Assistants API and GPTs alternative. Dify.AI is an LLM application development platform. It integrates the concepts of Backend as a Service and LLMOps, covering the core tech stack required for building generative AI-native applications, including a built-in RAG engine. >>> use this to upgrade my speakerpi robot <<<
weedge/distributed-llama
Run LLMs on weak devices or make powerful devices even more powerful by distributing the workload and dividing the RAM usage.
weedge/dolma
Data and tools for generating and inspecting OLMo pre-training data.
weedge/FlagEmbedding
Dense Retrieval and Retrieval-augmented LLMs, see embedding/reranker model
weedge/generative-ai
Sample code and notebooks for Generative AI on Google Cloud
weedge/GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
weedge/hello-algo
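"Hello Algo": a data structures and algorithms tutorial with animated illustrations and one-click runnable code, supporting Python, C++, Java, C#, Go, Swift, JS, TS, Dart, Rust, C, Zig, and other languages. English edition ongoing.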
weedge/highway
Performance-portable, length-agnostic SIMD with runtime dispatch
weedge/HuixiangDou
HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance; review this for my RAG pipeline
weedge/llama_cpp-rs
High-level, optionally asynchronous Rust bindings to llama.cpp
weedge/llm-applications
A comprehensive guide to building RAG-based LLM applications for production. see: RAG
weedge/lmql
A language for constraint-guided and efficient LLM programming.
weedge/makeMoE
From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
weedge/makemore
An autoregressive character-level language model for making more things
weedge/MiniCPM
MiniCPM-2B: an end-side LLM that outperforms Llama2-13B.
weedge/nlp_course
YSDA course in Natural Language Processing
weedge/OLMo
Modeling, training, eval, and inference code for OLMo
weedge/perf-ninja
This is an online course where you can learn and master the skill of low-level performance analysis and tuning.
weedge/Phi2-mini-Chinese
Phi2-Chinese-0.2B: train your own small Chinese Phi2 chat model from scratch, with support for plugging into langchain to load a local knowledge base for retrieval-augmented generation (RAG).
weedge/RAG-Survey
weedge/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
weedge/unlimiformer
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
weedge/weedge