chunking
There are 159 repositories under the chunking topic.
jiesutd/NCRFpp
NCRF++, a Neural Sequence Labeling Toolkit. Easy to use for any sequence labeling task (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
systemd/casync
Content-Addressable Data Synchronization Tool
folbricht/desync
Alternative casync implementation
26hzhang/neural_sequence_labeling
A TensorFlow implementation of a Neural Sequence Labeling model that can tackle sequence labeling tasks such as POS Tagging, Chunking, NER, Punctuation Restoration, etc.
microsoft/rag-experiment-accelerator
The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate experiments and evaluations using Azure Cognitive Search and the RAG pattern.
jordicenzano/go-ts-segmenter
Live TS segmenter and HLS manifest creation in Go
Safakan/TalkWithYourFiles
An LLM GUI application that lets you interact with your files, offering dynamic parameters that can modify response behavior at runtime.
xtabbas/The-Ultimate-Boilerplate
webpack 2, react hotloader 3, react router v4, code splitting and more
Sammyjo20/laravel-chunkable-jobs
📑 Split Laravel jobs into multiple separate job chunks
esastack/esa-restclient
An asynchronous event-driven HTTP client based on netty.
Koziev/GrammarEngine
Grammatical Dictionary of the Russian Language (+ English, Japanese, etc.)
ronomon/deduplication
Fast multi-threaded content-dependent chunking deduplication for Buffers in C++ with a reference implementation in JavaScript. Ships with extensive tests, a fuzz test and a benchmark.
umarbutler/semchunk
A fast and lightweight pure Python library for splitting text into semantically meaningful chunks.
bnosac/crfsuite
Labelling Sequential Data in Natural Language Processing with R - using CRFsuite
howardyclo/grammar-pattern
Extract and align grammar patterns from English sentences.
DanEngelbrecht/longtail
Incremental asset delivery library
iscc/fastcdc-py
FastCDC implementation in Python https://pypi.org/project/fastcdc/
Alkl58/NotEnoughAV1Encodes-Qt
Linux GUI for AV1 Encoders
dcarpintero/llamaindexchat
LLM Chatbot w/ Retrieval Augmented Generation using Llamaindex
nftstorage/carbites
🚗 🚙 🚕 Chunking for CAR files. Split a single CAR into multiple CARs.
DanEngelbrecht/golongtail
Command line front end for longtail synchronization tool
indyjo/cafs
Content-Addressable File System (used by BitWrk)
khoih-prog/AsyncWebServer_STM32
AsyncWebServer for STM32 using built-in LAN8742A Ethernet. The library currently works on STM32 boards such as the Nucleo-144 F767ZI. Now supports using CString to save heap when sending very large data.
Zabuzard/FastCDC4J
Fast and efficient content-defined chunking for data deduplication. Java implementation of FastCDC as library.
R3X-G1L6AME5H/Godot-LOD-Manager
A simple LOD and Chunking Solution for Godot.
remram44/cdchunking-rs
Content-Defined Chunking for Rust
stevezheng23/sequence_labeling_tf
Sequence Labeling in TensorFlow
DennisSmuda/godot-chunking-system
Demo of how to make a 2D grid-based map with FastNoise and infinite movement in every direction. Uses multithreading to load/unload chunks of the map! 🌎
saltyrtc/chunked-dc-js
Binary chunking that can be reassembled out-of-order.
IPRIT/md-svg-vue
Material design icons by Google for Vue.js & Nuxt.js (server side support & inline svg with path)
jparkerweb/semantic-chunking
🍱 semantic-chunking ⇢ semantically create chunks from large documents for passing to LLM workflows
michaeljs1990/jmem
Break up huge JSON arrays into manageable sizes.
lancopku/SAPO
C# code for "Towards Easier and Faster Sequence Labeling for Natural Language Processing: A Search-based Probabilistic Online Learning Framework (SAPO)" (Information Sciences)
linuxscout/mishtar
Mishtar: Named and temporal entities chunker
fd0/split
Split large files into smaller ones using deterministic Content Defined Chunking
raffidil/react-chunked-uploader
A react hook for uploading large files that need chunking.