text-retrieval

There are 38 repositories under text-retrieval topic.

  • HanXinzi-AI/awesome-NLP-resources

    a collection of NLP projects&tools. 自然语言处理方向项目和工具集合。

  • arian-askari/ChatGPT-RetrievalQA-CIKM2023

    A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses with the possibility to training/evaluating on real human responses.

    Language:Jupyter Notebook141507
  • jiepujiang/LuceneTutorial

    A simple tutorial of Lucene for LIS 501 Introduction to Text Mining students at the University of Wisconsin-Madison (Fall 2021).

    Language:Java742016
  • ElmiraGhorbani/chatgpt-long-term-memory

    The ChatGPT Long Term Memory package is a powerful tool designed to empower your projects with the ability to handle a large number of simultaneous users and external sources.

    Language:Python56213
  • miccunifi/Cross-the-Gap

    [ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion

    Language:Python531120
  • wjpoom/SPEC

    [CVPR 2024] The official implementation of paper "synthesize, diagnose, and optimize: towards fine-grained vision-language understanding"

    Language:Jupyter Notebook48230
  • GoodAI/goodai-ltm

    A Python library for long-term memory in language models. Improve conversational scenarios and create autonomous learning agents with enhanced context.

    Language:Python333510
  • lxucs/commoncrawl-warc-retrieval

    Python tools to retrieve text from CommonCrawl WARC files based on cdx index.

    Language:Python18113
  • usnistgov/trec-browser

    Metadata browser of TREC

    Language:Jupyter Notebook9202
  • jarvis0/image-search

    🌄 Search images through text by writing a caption or a description. You will be intelligently assisted while typing.

    Language:Jupyter Notebook8101
  • kasselouris/Complex-Data-Management

    Spatial Data and Networks, Efficient Text Retrieval Methods, Ultra Fast Database Operators, Top-K Queries

    Language:Python8100
  • anhquan075/CS336-legal-text-retrieval

    CS336 Final Project - Vietnamese Legal Text Retrieval

    Language:Python7110
  • Bladefidz/data-mining

    Fundamental of Data Mining: Study case and implementations.

    Language:Jupyter Notebook6302
  • SavanK/FakeNewsChallenge

    Combating fake news problem

    Language:Java5100
  • soulteary/text-retrieval-example

    Let's talk about text retrieval.

    Language:Go5102
  • MChatzakis/DIS-TextRetrieval

    A Text Retrieval Approach Using BM25+ and BERT Reranking

    Language:Jupyter Notebook4100
  • nhtlongcs/elastic-search-docker

    This is a docker compose file for running elastic search in a docker container. It is based on the official elastic search docker image

    Language:Python410
  • AdithyaSanyal/Voice-based-Personal-Assistant

    A voice based personal assistant which has different functionalities right from voice based text, image retrieval, a chatbot to a text summarizer and an automatic question generator. Made by amalgamating different concepts of NLP and Machine Learning together

    Language:Python2100
  • mhasnat/LPM_CityDB

    This repository provides the materials to experiment with the CityDB dataset for License Plate Matching (LPM)

    Language:Jupyter Notebook2100
  • rohinb2/hqbot

    A simple bot for HQ trivia that uses OpenCV.

    Language:Python2300
  • tkhan11/NLP-based-Text-Retrieval

    Natural Language Processing Based Text Retrieval System in Python

    Language:Python2200
  • Farhaj499/RAG_with_ChromaDB

    This project implements an Extractive Question Answering (EQA) system that extracts answers from a set of downloaded text files based on user queries.

    Language:Jupyter Notebook1100
  • gorjanradevski/macchina

    Codebase for "Self-supervised context-aware Covid-19 document exploration through atlas grounding" as well as links to the tools mentioned in the paper. Work done within ESAT-PSI at KU Leuven.

    Language:Python1320
  • hymn-ing/text-retrieval-by-posting-list

    A simple text retrieval manifested by posting list, supporting boolean retrieval on zone index, ranked retrieval.

    Language:C++1100
  • JalajVora/Text-Analytics-with-Multi-Class-and-Imbalanced-Learning

    Genre Identification task along with Text Analytics with Multi-Class and Imbalanced Learning on Gutenberg Corpus

    Language:HTML1100
  • Mounir-charef/sentiment-analysis-binary-rating

    This project utilizes machine learning and deep learning techniques to perform sentiment analysis on text reviews, automatically categorizing them as positive or negative. It provides valuable insights into user opinions and emotions expressed in textual data.

    Language:Jupyter Notebook110
  • sarthak268/Multimedia-Computing-and-Applications

    This repository contains code for all assignments in the Multimedia Computing and Applications (CSE563) course.

    Language:Python100
  • trungdangtapcode/Flashcard-Recommendation-Extension

    Flashcard recommendation based on BEIT and TF score and can be used as normal flashcard extension

    Language:TypeScript1100
  • Adarsh-sophos/Smart-Library

    Identifying Books on Library Shelves using Supervised Deep Learning.

    Language:Jupyter Notebook0102
  • bnvulpe/code-extractor

    Transforming images into code at a click. Upload a photo or screenshot and copy the code to your script in seconds!

    Language:HTML0100
  • iiTzMohit/YTQueryBot

    YTQueryBot is a web app that answers questions about YouTube videos. It uses Streamlit for the UI, LangChain for transcript processing, and OpenAI for generating responses from video data.

    Language:Python0100
  • rootguillen/Patent-Search-System-with-Gradio

    Developed by Gyudong HAN, Counsellor, WIPO (gyudong.han@wipo.int). Developed this system with reference to the general text retrieval system which was uploaded together with the video clip named "LangChain Retrieval QA Over Multiple Files with ChromaDB". I only added the implementation of Gradio for its UI.

    Language:Jupyter Notebook0101
  • vedaant00/uhsr

    UHSR (Unified Hyperbolic Spectral Retrieval) is a next-generation hybrid text retrieval framework that combines BM25 (Lexical Search) with FAISS/Pinecone (Semantic Search), enhanced by Spectral Re-Ranking & AI-Powered Reranking. It supports multiple similarity metrics, provides interpretable normalized scores, & is designed for scalability & speed.

    Language:Python0100
  • VedangW/upr-kilt

    Unsupervised Passage Retrieval for Question Answering, Fact Checking, and Entity Linking on the KILT benchmark using the T5 language model series.

    Language:Python0100
  • avojak/oise

    Open IRC Search Engine

    Language:Java20
  • LeonardoSaccotelli/Numerical-Methods-For-Computer-Science

    Basic and advanced linear algebra and numerical problems, numerical algorithms, and techniques with multiple applications in the field of Computer Science.

    Language:Jupyter Notebook10