itayle's Stars
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
antonmedv/fx
Terminal JSON viewer & processor
deepset-ai/haystack
:mag: AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
piskvorky/gensim
Topic Modelling for Humans
FMInference/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
facebookresearch/hydra
Hydra is a framework for elegantly configuring complex applications
OpenAccess-AI-Collective/axolotl
Go ahead and axolotl questions
interstellard/chatgpt-advanced
WebChatGPT: A browser extension that augments your ChatGPT prompts with web results.
MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
TimDettmers/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
Deci-AI/super-gradients
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
promptslab/Promptify
Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research
stanford-futuredata/ColBERT
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
google/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
ekzhu/datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
castorini/pyserini
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
google-research/language
Shared repository for open-sourced projects from the Google AI Language team.
beir-cellar/beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
IntelLabs/fastRAG
Efficient Retrieval Augmentation and Generation Framework
sebastian-hofstaetter/teaching
Open-Source Information Retrieval Courses @ TU Wien
HazyResearch/manifest
Prompt programming with FMs.
microsoft/task_oriented_dialogue_as_dataflow_synthesis
Code to reproduce experiments in the paper "Task-Oriented Dialogue as Dataflow Synthesis" (TACL 2020).
allenai/acl2022-zerofewshot-tutorial
NNLP-IL/Hebrew-Resources
A comprehensive list of Hebrew NLP resources.
HKUNLP/icl-ceil
[ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.
microsoft/semantic_parsing_with_constrained_lm
Code to reproduce experiments in the paper "Constrained Language Models Yield Few-Shot Semantic Parsers" (EMNLP 2021).
OhadRubin/EPR
telepathylabsai/OpenDF
Code to reproduce LREC Paper Simplifying Semantic Annotations of SMCalFlow
microsoft/compositional-generalization-span-level-attention
code for the NAACL 2021 paper Compositional Generalization for Neural Semantic Parsing via Span-level Supervised Attention by Microsoft Semantic Machines.