coffepowered's Stars
princeton-nlp/LitSearch
[EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search
harpribot/awesome-information-retrieval
A curated list of awesome information retrieval resources
lepture/mistune
A fast yet powerful Python Markdown parser with renderers and plugins.
AI-Commandos/RAGMeUp
Generic rag framework to apply the power of LLMs on any given dataset
Hugo-Dz/on-device-transcription
A ready-to-use, minimal app that converts any speech into text.
AnswerDotAI/fasthtml
The fastest way to create an HTML app
Pints-AI/1.5-Pints
A compact LLM pretrained in 9 days by using high quality data
paradedb/paradedb
Postgres for Search and Analytics
rashadphz/farfalle
🔍 AI search engine - self-host with local or cloud LLMs
ItzCrazyKns/Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
berkerdemirel/GPT-from-scratch
Re-implementation of Andrej Karpathy's nanoGPT
satwikkansal/wtfpython
What the f*ck Python? 😱
n4ze3m/dialoqbase
Create chatbots with ease
kba/awesome-ocr
Links to awesome OCR projects
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
srush/Tensor-Puzzles
Solve puzzles. Improve your pytorch.
BobaZooba/xllm
🦖 X—LLM: Cutting Edge & Easy LLM Finetuning
galatolofederico/cerbero-7b
Cerbero-7b is the first 100% Free and Open Source Italian Large Language Model (LLM) ready to be used for research or commercial applications.
wangcunxiang/LLM-Factuality-Survey
The repository for the survey paper <<Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity>>
promptslab/LLMtuner
FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)
Nixtla/tsfeatures
Calculates various features from time series data. Python implementation of the R package tsfeatures.
predibase/llm_distillation_playbook
Best practices for distilling large language models.
Mozilla-Ocho/llamafile
Distribute and run LLMs with a single file.
Mooler0410/LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
huggingface/large_language_model_training_playbook
An open collection of implementation tips, tricks and resources for training large language models
minimaxir/simpleaichat
Python package for easily interfacing with chat apps, with robust features and minimal code complexity.
tomaarsen/SpanMarkerNER
SpanMarker for Named Entity Recognition
chakki-works/seqeval
A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
agamm/semantic-split
A Python library to chunk/group your texts based on semantic similarity.
kagisearch/vectordb
A minimal Python package for storing and retrieving text using chunking, embeddings, and vector search.