bettercallcaleb

Pinned Repositories

ai-clone-whatsapp
Create an AI clone of yourself from your WhatsApp chats (using Mistral 7B)
Language:Python0 0 00
aider
aider is AI pair programming in your terminal
Language:Python0 0 00
Anima
33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU
Language:Jupyter Notebook0 0 00
AutoRAG
RAG AutoML Tool - Find optimal RAG pipeline for your own data.
Language:Python0 0 00
BCEmbedding
Netease Youdao's open-source embedding and reranker models for RAG products.
Language:Python0 0 00
BiLLM
BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
Language:Python0 0 00
DistiLlama
Chrome Extension to Summarize Web Pages Using locally running LLMs
Language:TypeScript1 0 00
lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Language:Python1 0 00
QuaRot
Code for QuaRot, an end-to-end 4-bit inference of large language models.
Language:Python1 0 00
R2R
The framework for fast development and deployment of RAG backends.
Language:Python1 0 00

bettercallcaleb's Repositories

bettercallcaleb/QuaRot
Code for QuaRot, an end-to-end 4-bit inference of large language models.
Language:Python1 0 00
bettercallcaleb/R2R
The framework for fast development and deployment of RAG backends.
Language:Python1 0 00
bettercallcaleb/ai-clone-whatsapp
Create an AI clone of yourself from your WhatsApp chats (using Mistral 7B)
Language:Python0 0 00
bettercallcaleb/aider
aider is AI pair programming in your terminal
Language:Python0 0 00
bettercallcaleb/AutoRAG
RAG AutoML Tool - Find optimal RAG pipeline for your own data.
Language:Python0 0 00
bettercallcaleb/BCEmbedding
Netease Youdao's open-source embedding and reranker models for RAG products.
Language:Python0 0 00
bettercallcaleb/BiLLM
BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
Language:Python0 0 00
bettercallcaleb/ComfyUI-J
Jannchie's ComfyUI custom nodes.
Language:Python0 0
bettercallcaleb/CustomGPT-Google-Sheets-RAG
Allows you to use Google Sheets to store and retrieve data from your custom GPT
Language:JavaScript0 0
bettercallcaleb/DreamGenTrain
Language:Python0 0
bettercallcaleb/fltr
Like grep but for natural language questions. Based on Mixtral 8x7B.
Language:Rust0 0
bettercallcaleb/free-reddit-comments-nuke
a free python tool that nukes all your reddit comments
Language:Python1 0
bettercallcaleb/GPTFast
Accelerate your Hugging Face Transformers 6-7x. Native to Hugging Face and PyTorch.
Language:Python0 0
bettercallcaleb/InfLLM
The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"
Language:Python0 0
bettercallcaleb/jen-ai
A simple speech-to-text and text-to-speech AI chatbot that can be run fully offline.
Language:Python0 0
bettercallcaleb/LLaMA-Factory
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
Language:Python0 0
bettercallcaleb/llama_index
LlamaIndex (formerly GPT Index) is a data framework for your LLM applications
Language:Python0 0
bettercallcaleb/llm-scraper
Turn any webpage into structured data using LLMs
Language:TypeScript0 0
bettercallcaleb/LongLM
LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
Language:Python0 0
bettercallcaleb/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
bettercallcaleb/LWM
Language:Python0 0
bettercallcaleb/marker
Convert PDF to markdown quickly with high accuracy
Language:Python0 0
bettercallcaleb/MicroLlama
Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget
Language:Python0 0
bettercallcaleb/mistral.rs
Blazingly fast LLM inference.
Language:Rust0 0
bettercallcaleb/Open-Ollama-RAG-ChatApp
Retrieval-Augmented Generation Chat Bot using Ollama, Langchain and Gradio.
Language:Jupyter Notebook0 0
bettercallcaleb/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Language:Python0 0
bettercallcaleb/sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with LLMs faster and more controllable.
Language:Python0 0
bettercallcaleb/summarize
Video summarization from multiple sources (YouTube, Dropbox, Google Drive, local files) using multiple LLM endpoints (OpenAI, Groq, LM-studio).
Language:Jupyter Notebook0 0
bettercallcaleb/SWE-agent
SWE-agent: Agent Computer Interfaces Enable Software Engineering Language Models
Language:Python0 0
bettercallcaleb/talk-llama-fast
Port of OpenAI's Whisper model in C/C++, fast and with xtts
Language:C0 0