Real-bojack's Stars
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
openai/swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
aiwaves-cn/agents
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
SamuelSchmidgall/AgentClinic
Agent benchmark for medical diagnosis
OpenBMB/XAgent
An Autonomous LLM Agent for Complex Task Solving
gangiswag/llm-reranker
Marker-Inc-Korea/AutoRAG
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
BBuf/how-to-optim-algorithm-in-cuda
How to optimize various algorithms in CUDA.
Bruce-Lee-LY/cuda_hgemm
Several optimization methods for half-precision general matrix multiplication (HGEMM) using Tensor Cores with the WMMA API and MMA PTX instructions.
percent4/embedding_rerank_retrieval
This project evaluates recall techniques and their algorithmic effectiveness for the Retrieve stage of RAG. The main framework used is LlamaIndex.
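Since the experiments are built on LlamaIndex, the retrieve stage being evaluated roughly corresponds to a setup like the following minimal sketch (illustrative only, not code from this repo; assumes `pip install llama-index`, documents in a local `data/` directory, and a default embedding model configured, e.g. an OpenAI key):

```python
# Minimal LlamaIndex retrieval sketch (illustrative; not taken from this repo).
from llama_index.core import SimpleDirectoryReader, VectorStoreIndex

documents = SimpleDirectoryReader("data").load_data()   # load raw documents
index = VectorStoreIndex.from_documents(documents)      # build a vector index (uses the configured embedding model)
retriever = index.as_retriever(similarity_top_k=5)      # the retrieve-stage object under evaluation

nodes = retriever.retrieve("What does reranking add to RAG?")
for node in nodes:
    print(node.score, node.node.get_content()[:80])
```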
AnswerDotAI/rerankers
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
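A rough sketch of that unified API is shown below; the model name is just an example and the exact result fields should be checked against the repo's README:

```python
# Hedged sketch of the rerankers unified API.
from rerankers import Reranker

# Load a cross-encoder reranker; the model name here is only an example.
ranker = Reranker("cross-encoder/ms-marco-MiniLM-L-6-v2", model_type="cross-encoder")

results = ranker.rank(
    query="What is retrieval-augmented generation?",
    docs=[
        "RAG combines a retriever with a generator.",
        "CUDA is a parallel computing platform.",
    ],
)
for r in results.top_k(2):
    print(r.score, r.document.text)
```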
HeKun-NVIDIA/CUDA-Programming-Guide-in-Chinese
This is a Chinese translation of the CUDA programming guide
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
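A hedged sketch of how these kernels are typically swapped into a Hugging Face model (class name per the project's README; the model path is a placeholder and a supported architecture is assumed):

```python
# Drop-in replacement for AutoModelForCausalLM that patches supported layers
# (e.g. RMSNorm, RoPE, fused cross-entropy) with Liger's Triton kernels.
# Assumes `pip install liger-kernel transformers`; path is a placeholder.
from liger_kernel.transformers import AutoLigerKernelForCausalLM

model = AutoLigerKernelForCausalLM.from_pretrained("path/to/your/model")
```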
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
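A minimal embedding sketch with the BGE models this repo ships (model name is an example; embeddings are assumed to be normalized so a dot product acts as cosine similarity):

```python
# Sketch of dense retrieval scoring with FlagEmbedding; assumes `pip install FlagEmbedding`.
from FlagEmbedding import FlagModel

model = FlagModel("BAAI/bge-base-en-v1.5", use_fp16=True)
queries = ["what does a reranker do?"]
passages = ["A reranker rescores retrieved passages with a cross-encoder."]

q_emb = model.encode_queries(queries)   # query-side encoding
p_emb = model.encode(passages)          # passage-side encoding
scores = q_emb @ p_emb.T                # cosine-style similarity
print(scores)
```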
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
xgqdut2016/cuda_code
Easy-to-follow CUDA example code.
DefTruth/CUDA-Learn-Notes
📚150+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
nvixnu/pmpp__programming_massively_parallel_processors
Examples and exercises from the book Programming Massively Parallel Processors: A Hands-on Approach by David B. Kirk and Wen-mei W. Hwu (Third Edition).
NVIDIA/cuda-samples
Samples for CUDA developers that demonstrate features in the CUDA Toolkit.
gpu-mode/lectures
Material for gpu-mode lectures
facebookresearch/mae
PyTorch implementation of MAE: https://arxiv.org/abs/2111.06377
abetlen/llama-cpp-python
Python bindings for llama.cpp
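A minimal usage sketch of these bindings (the GGUF path is a placeholder):

```python
# Minimal llama-cpp-python chat completion; model path is a placeholder.
from llama_cpp import Llama

llm = Llama(model_path="./models/your-model.gguf", n_ctx=2048)
out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Explain KV caching in one sentence."}],
    max_tokens=64,
)
print(out["choices"][0]["message"]["content"])
```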
kwai/Megatron-Kwai
[USENIX ATC '24] Accelerating the Training of Large Language Models using Efficient Activation Rematerialization and Optimal Hybrid Parallelism
microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including BERT & GPT-2.
microsoft/DeepSpeedExamples
Example models using DeepSpeed
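Most of these examples follow the standard DeepSpeed training pattern; a hedged minimal sketch (config values are illustrative, and the script would normally be launched with the `deepspeed` launcher):

```python
# Hedged minimal DeepSpeed setup (illustrative config, not from this repo).
import torch
import deepspeed

model = torch.nn.Linear(1024, 1024)  # stand-in for a real transformer model
ds_config = {
    "train_batch_size": 8,
    "fp16": {"enabled": True},
    "zero_optimization": {"stage": 2},
    "optimizer": {"type": "AdamW", "params": {"lr": 1e-4}},
}

# deepspeed.initialize wraps the model with ZeRO partitioning, mixed precision, etc.
engine, optimizer, _, _ = deepspeed.initialize(
    model=model, model_parameters=model.parameters(), config=ds_config
)
```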
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
labring/FastGPT
FastGPT is a knowledge-based platform built on LLMs that offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without extensive setup or configuration.
xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
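The "single line" claim usually comes down to pointing an OpenAI-compatible client at a local Xinference endpoint; in the sketch below the host, port, and model name are assumptions:

```python
# Point the standard OpenAI client at a local Xinference server
# (default port assumed to be 9997; model must already be launched in Xinference).
from openai import OpenAI

client = OpenAI(base_url="http://localhost:9997/v1", api_key="not-needed")
resp = client.chat.completions.create(
    model="qwen2-instruct",  # example name of a model launched via Xinference
    messages=[{"role": "user", "content": "Hello!"}],
)
print(resp.choices[0].message.content)
```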
ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
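A quick sketch using the official Ollama Python client (assumes `ollama serve` is running locally, the `ollama` pip package is installed, and the model has been pulled, e.g. `ollama pull llama3.3`):

```python
# Chat with a locally served model via the Ollama Python client.
import ollama

response = ollama.chat(
    model="llama3.3",
    messages=[{"role": "user", "content": "Summarize what a GGUF file is."}],
)
print(response["message"]["content"])
```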