data-indexing
There are 15 repositories under the data-indexing topic.
cocoindex-io/cocoindex
Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it!
saturn-lab/BDMI-2019A
Big Data and Machine Intelligence Course in Autumn 2019.
cocoindex-io/patient-intake-extraction
Patient intake form extraction using an LLM
TopTrenDev/solana-dex-data-indexer-substream
🧠 Solana DEX Swap Data Indexer: a Substream-powered swap indexer for Solana that supports Pump.fun, PumpSwap, BonkFun, Meteora, Raydium, Orca & more. ⚡📊🔥 Designed for real-time trade analytics, MEV research, and on-chain data pipelines. 📡
most-inesctec/I2Bplus-tree
🌲 Improved Interval B+ tree implementation, in TypeScript 🌲
fshnkarimi/Similar-Paper-Reccomendation
This repository contains an application designed to recommend scientific papers that are most similar to a given input paragraph. The application uses the llama and weaviate libraries to achieve this.
tangentlin/indexed-collection
A zero-dependency library of classes that make filtering, sorting and observing changes to arrays easier and more efficient.
datafast-network/datafast-runtime
Datafast Runtime is a high-performance subgraph processing runtime, written from scratch and designed to handle subgraphs with unparalleled speed and storage efficiency.
dappros/rag_demos
Examples of RAG (Retrieval-Augmented Generation) with Ethora, LangChain, and OpenAI. Build knowledge-based AI assistants fast. Powered by Ethora Chat Component.
iron-hope-shop/bords-portfolio
BORDS is an open-access reaction search engine that leverages Google's Open Reaction Database to provide ultra-fast, comprehensive access to millions of chemical reactions. Built with a modern cloud stack, it streamlines reaction data extraction, transformation, and indexing for researchers in chemistry and related fields.
Md-Emon-Hasan/Vector-Database
A vector database designed to store and retrieve high-dimensional data, such as embeddings, efficiently. It enables fast similarity searches.
paocarvajal1912/Forecasting_Net_Prophet
Time series analysis with trend, seasonality, and periodicity decomposition, plus forecasting with Facebook Prophet. The analysis makes extensive use of data indexing tools and of the pandas and datetime libraries.
SciGaP/seagrid-data
System for managing the data generated by the SEAGrid Science Gateway
ahenrij/univ-rennes1-m2-inv-search-engine
Python implementation of a TF-IDF/cosine-based search engine
Atiq-Data/Modern_Data_Warehouse
A comprehensive guide to building a modern data warehouse using the medallion architecture with SQL Server, including ETL processes, data modeling, and analytics.