ekshaks
Founder OffNote Labs | Gen AI Consulting | Researcher: Deep Learning, Large Language Models
OffNote LabsBangalore
ekshaks's Stars
open-webui/open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
karpathy/llm.c
LLM training in simple, raw C/CUDA
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
squidfunk/mkdocs-material
Documentation that simply works
VikParuchuri/marker
Convert PDF to markdown quickly with high accuracy
princeton-nlp/SWE-agent
[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.
quickwit-oss/tantivy
Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust
bentoml/OpenLLM
Run any open-source LLMs, such as Llama 3.1, Gemma, as OpenAI compatible API endpoint in the cloud.
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
huggingface/chat-ui
Open source codebase powering the HuggingChat app
nilsherzig/LLocalSearch
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
huggingface/parler-tts
Inference and training library for high-quality TTS models.
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
unum-cloud/usearch
Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍
aurelio-labs/semantic-router
Superfast AI decision making and intelligent processing of multi-modal data.
X-PLUG/mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
RahulSChand/gpu_poor
Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
AnswerDotAI/rerankers
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
lean-dojo/LeanCopilot
LLMs as Copilots for Theorem Proving in Lean
naver/splade
SPLADE: sparse neural search (SIGIR21, SIGIR22)
grycap/scar
Serverless Container-aware ARchitectures (e.g. Docker in AWS Lambda)
explosion/srsly
🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)
Holmeswww/AgentKit
An intuitive LLM prompting framework for multifunctional agents, by explicitly constructing a complex "thought process" from simple natural language prompts.
EQ-bench/EQ-Bench
A benchmark for emotional intelligence in large language models
vast-ai/vast-python
Vast.ai python and cli api client
HKUDS/HiGPT
[KDD'2024] "HiGPT: Heterogenous Graph Language Models"
r-three/phatgoose
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
GuanSuns/LLMs-World-Models-for-Planning
The source code of the paper "Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Planning".
hamelsmu/ft-drift
Check for data drift between two OpenAI multi-turn chat jsonl files.