wallon-ai's Stars
xai-org/grok-1
Grok open release
QuivrHQ/quivr
Open-source RAG Framework for building GenAI Second Brains 🧠 Build productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Efficient retrieval augmented generation framework
microsoft/autogen
A programming framework for agentic AI 🤖
meta-llama/llama3
The official Meta Llama 3 GitHub site
joaomdmoura/crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
mistralai/mistral-inference
Official inference library for Mistral models
microsoft/promptflow
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
QwenLM/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
ymcui/Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
openai/transformer-debugger
baidu/Familia
A Toolkit for Industrial Topic Modeling
microsoft/promptbench
A unified evaluation framework for large language models
CStanKonrad/long_llama
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.
OpenLMLab/MOSS-RLHF
MOSS-RLHF
McGill-NLP/llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
Duxiaoman-DI/XuanYuan
轩辕:度小满中文金融对话大模型
multimodal-art-projection/MAP-NEO
Muennighoff/sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
jzbjyb/FLARE
Forward-Looking Active REtrieval-augmented generation (FLARE)
abacusai/Long-Context
This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and benchmark tasks that evaluate a model’s information retrieval capabilities with context expansion. We also include key experimental results and instructions for reproducing and building on them.
FranxYao/Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
HyperGAI/HPT
HPT - Open Multimodal LLMs from HyperGAI
bigscience-workshop/data-preparation
Code used for sourcing and cleaning the BigScience ROOTS corpus
Chinese-Tiny-LLM/Chinese-Tiny-LLM
zhongwanjun/MemoryBank-SiliconFriend
Source code and demo for memory bank and SiliconFriend
ZhuiyiTechnology/GAU-alpha
基于Gated Attention Unit的Transformer模型(尝鲜版)
wzzzd/FAQ_system
FAQ智能问答系统。实现FAQ的问题-模板匹配功能。部署轻量级的Web服务应用。
jiahe7ay/infini-mini-transformer
This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and training code.