data-indexing

There are 15 repositories under data-indexing topic.

  • cocoindex

    cocoindex-io/cocoindex

    Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it!

    Language:Rust3.3k20225262
  • saturn-lab/BDMI-2019A

    Big Data and Machine Intelligence Course in Autumn 2019.

    Language:Jupyter Notebook359326
  • cocoindex-io/patient-intake-extraction

    Patient Intake Form Extraction using llm

    Language:Python132
  • TopTrenDev/solana-dex-data-indexer-substream

    🧠 Solana DEX Swap Data Indexer Substream-powered swap indexer for Solana — supports Pump.fun, PumpSwap, BonkFun, Meteora, Raydium, Orca & more. ⚡📊🔥 Designed for real-time trade analytics, MEV research, and on-chain data pipelines. 📡

    Language:Rust9
  • most-inesctec/I2Bplus-tree

    :evergreen_tree: Improved Interval B+ tree implementation, in TS :evergreen_tree:

    Language:TypeScript7140
  • fshnkarimi/Similar-Paper-Reccomendation

    This repository contains an application designed to recommend scientific papers that are most similar to a given input paragraph. The application uses the llama and weaviate libraries to achieve this.

    Language:Jupyter Notebook5100
  • tangentlin/indexed-collection

    A zero-dependency library of classes that make filtering, sorting and observing changes to arrays easier and more efficient.

    Language:TypeScript4110
  • datafast-network/datafast-runtime

    Datafast Runtime is a high-performance subgraph processing runtime which is written from scratch and designed to handle subgraphs with unparalleled speed & storage-efficiency

    Language:Rust30350
  • dappros/rag_demos

    Examples of RAG (Retrieval-Augmented Generation) with Ethora, LangChain, and OpenAI. Build knowledge-based AI assistants fast. Powered by Ethora Chat Component.

    Language:Python1
  • iron-hope-shop/bords-portfolio

    BORDS is an open-access reaction search engine that leverages Google's Open Reaction Database to provide ultra-fast, comprehensive access to millions of chemical reactions. Built with a modern cloud stack, it streamlines reaction data extraction, transformation, and indexing for researchers in chemistry and related fields.

    Language:JavaScript10
  • Md-Emon-Hasan/Vector-Database

    Designed to store and retrieve high-dimensional data, such as embeddings, efficiently. It enables fast similarity searches by leveraging techniques.

    Language:Jupyter Notebook113
  • paocarvajal1912/Forecasting_Net_Prophet

    Time series analysis showing trend, seasonality, and periodicity decomposition; and forecasting using Facebook Prophet. The analysis makes extensive use of indexing data tools and of the Pandas and Datetime libraries.

    Language:Jupyter Notebook1200
  • SciGaP/seagrid-data

    System for Managing the data generated by the SEAGrid Science Gateway

    Language:Java1916
  • ahenrij/univ-rennes1-m2-inv-search-engine

    Python implementation of a TF-IDF/cosine based search engine

    Language:Python0100
  • Atiq-Data/Modern_Data_Warehouse

    A comprehensive guide to building a modern data warehouse using medallion Data Warehouse Architecture with SQL Server, including ETL processes, data modeling, and analytics.

    Language:TSQL