weedge

The Pursuit of Happyness

china

Pinned Repositories

baby-llm
Language:Jupyter Notebook2 3 00
RedisXANN
redis x ANN vss
Language:Rust1 2 00
RedisXLM
redis x language model inference (load trained model); size (tiny|t, small|s, medium|m, large|l) with quantization; NOTE: redis embedded language model, available for stand-alone version only
Language:C0 2 00
RedisXSlot
redis x slot module, async block migrate/restore, don't block or less block other cmd.
Language:C2 2 00
craftsman
a craftsman, try best to give a best practicable solution for biz~
Language:Go1 3 00
doraemon-nb
ipython notebooks do some sample experiments , make some idea
Language:Jupyter Notebook7 2 00
geo
php 扩展(地图相关的操作)
Language:C++2 3 00
iowrapper
io_uring lib or syscall wrapper; add some benchmark for io_uring experimental study
Language:C2 2 00
perf-book-cn
https://github.com/dendibakh/perf-book gitbook在线电子书，翻译成中文原始markdown文档
Language:Python58 2 07

weedge's Repositories

weedge/perf-book
The book "Performance Analysis and Tuning on Modern CPU" 使用chatGPT/gemini 翻译成中文，虽然已经有翻译的书籍。(follow更新)
Language:TeX1 1 0
weedge/AutoCompressors
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
Language:Python1 0
weedge/baby-db
Language:C
weedge/baize-chatbot
Let ChatGPT teach your own chatbot in hours with a single GPU!
Language:Python1 0
weedge/ChatLM-mini-Chinese
中文对话0.2B小模型（ChatLM-Chinese-0.2B），开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调，给出三元组信息抽取微调示例。
weedge/Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
Language:Python1 0
weedge/Chinese-Mixtral
中文Mixtral混合专家大模型（Chinese Mixtral MoE LLMs）
1 0
weedge/dify
An Open-Source Assistants API and GPTs alternative. Dify.AI is an LLM application development platform. It integrates the concepts of Backend as a Service and LLMOps, covering the core tech stack required for building generative AI-native applications, including a built-in RAG engine. >>> use this to upgrade my speakerpi robot <<<
Language:Python1 0
weedge/distributed-llama
Run LLMs on weak devices or make powerful devices even more powerful by distributing the workload and dividing the RAM usage.
Language:C++
weedge/dolma
Data and tools for generating and inspecting OLMo pre-training data.
weedge/FlagEmbedding
Dense Retrieval and Retrieval-augmented LLMs, see embedding/reranker model
Language:Python1 0
weedge/generative-ai
Sample code and notebooks for Generative AI on Google Cloud
Language:Jupyter Notebook1 0
weedge/GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
weedge/hello-algo
《Hello 算法》：动画图解、一键运行的数据结构与算法教程，支持 Python, C++, Java, C#, Go, Swift, JS, TS, Dart, Rust, C, Zig 等语言。English edition ongoing
Language:Java1 0
weedge/highway
Performance-portable, length-agnostic SIMD with runtime dispatch
weedge/HuixiangDou
HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance; review this for my RAG pipeline
Language:Python1 0
weedge/llama_cpp-rs
High-level, optionally asynchronous Rust bindings to llama.cpp
Language:Rust1 0
weedge/llm-applications
A comprehensive guide to building RAG-based LLM applications for production. see: RAG
Language:Jupyter Notebook1 0
weedge/lmql
A language for constraint-guided and efficient LLM programming.
Language:Python1 0
weedge/makeMoE
From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
Language:Jupyter Notebook1 0
weedge/makemore
An autoregressive character-level language model for making more things
Language:Python1 0
weedge/MiniCPM
MiniCPM-2B: An end-side LLM outperforms Llama2-13B.
Language:Python1 0
weedge/nlp_course
YSDA course in Natural Language Processing
Language:Jupyter Notebook1 0
weedge/OLMo
Modeling, training, eval, and inference code for OLMo
weedge/perf-ninja
This is an online course where you can learn and master the skill of low-level performance analysis and tuning.
Language:C++
weedge/Phi2-mini-Chinese
Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型，支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.
weedge/RAG-Survey
weedge/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python1 0
weedge/unlimiformer
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
Language:Python1 0
weedge/weedge