chunking

There are 159 repositories under chunking topic.

jiesutd/NCRFpp
NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
Language:Python1.9k 60 172447
systemd/casync
Content-Addressable Data Synchronization Tool
Language:C1.5k 82 108117
folbricht/desync
Alternative casync implementation
Language:Go323 15 12344
26hzhang/neural_sequence_labeling
A TensorFlow implementation of Neural Sequence Labeling model, which is able to tackle sequence labeling tasks such as POS Tagging, Chunking, NER, Punctuation Restoration and etc.
Language:Python235 8 1547
microsoft/rag-experiment-accelerator
The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate the process of conducting experiments and evaluations using Azure Cognitive Search and RAG pattern.
Language:Python107 22 22236
jordicenzano/go-ts-segmenter
Live TS segmenter and HLS manifest creation in Go
Language:Go91 9 313
Safakan/TalkWithYourFiles
An LLM GUI application; enables you to interact with your files, offering dynamic parameters that can modify response behavior during runtime.
Language:Python86 4 113
xtabbas/The-Ultimate-Boilerplate
webpack 2, react hotloader 3, react router v4, code splitting and more
Language:JavaScript85 6 08
Sammyjo20/laravel-chunkable-jobs
📑 Split Laravel jobs into multiple separate job chunks
Language:PHP83 3 23
esastack/esa-restclient
An asynchronous event-driven HTTP client based on netty.
Language:Java82 3 3422
Koziev/GrammarEngine
Грамматический Словарь Русского Языка (+ английский, японский, etc)
Language:C++73 9 1819
ronomon/deduplication
Fast multi-threaded content-dependent chunking deduplication for Buffers in C++ with a reference implementation in Javascript. Ships with extensive tests, a fuzz test and a benchmark.
Language:JavaScript71 5 59
umarbutler/semchunk
A fast and lightweight pure Python library for splitting text into semantically meaningful chunks.
Language:Python71 2 36
bnosac/crfsuite
Labelling Sequential Data in Natural Language Processing with R - using CRFsuite
Language:C62 8 2011
howardyclo/grammar-pattern
Extract and align grammar patterns from English sentences.
Language:Python49 6 210
DanEngelbrecht/longtail
Incremental asset delivery library
Language:C46 7 157
iscc/fastcdc-py
FastCDC implementation in Python https://pypi.org/project/fastcdc/
Language:Python43 5 1317
Alkl58/NotEnoughAV1Encodes-Qt
Linux GUI for AV1 Encoders
Language:Python30 3 02
dcarpintero/llamaindexchat
LLM Chatbot w/ Retrieval Augmented Generation using Llamaindex
Language:Python29 2 05
nftstorage/carbites
🚗 🚙 🚕 Chunking for CAR files. Split a single CAR into multiple CARs.
Language:JavaScript24 2 147
DanEngelbrecht/golongtail
Command line front end for longtail synchronization tool
Language:Go23 5 446
indyjo/cafs
Content-Addressable File System (used by BitWrk)
Language:Go19 2 02
khoih-prog/AsyncWebServer_STM32
AsyncWebServer for STM32 using builtin LAN8742A Ethernet. This AsyncWebServer Library for STM32 is currently working on STM32 boards, such as Nucleo-144 F767ZI, etc., using builtin LAN8742A Ethernet. Now support using CString to save heap to send very large data
Language:C19 4 64
Zabuzard/FastCDC4J
Fast and efficient content-defined chunking for data deduplication. Java implementation of FastCDC as library.
Language:Java19 2 04
R3X-G1L6AME5H/Godot-LOD-Manager
A simple LOD and Chunking Solution for Godot.
Language:GDScript18 1 22
remram44/cdchunking-rs
Content-Defined Chunking for Rust
Language:Rust18 3 54
stevezheng23/sequence_labeling_tf
Sequence Labeling in Tensorflow
Language:Python18 4 31
DennisSmuda/godot-chunking-system
Demo on how to make a 2D grid-based map with FastNoise and infinite movement in every Direction. Uses multithreading to load/unload chunks of the map! 🌎
Language:GDScript16 3 42
saltyrtc/chunked-dc-js
Binary chunking that can be reassembled out-of-order.
Language:TypeScript16 4 153
IPRIT/md-svg-vue
Material design icons by Google for Vue.js & Nuxt.js (server side support & inline svg with path)
Language:JavaScript15 1 32
jparkerweb/semantic-chunking
🍱 semantic-chunking ⇢ semantically create chunks from large document for passing to LLM workflows
Language:JavaScript15 1 00
michaeljs1990/jmem
Break up huge JSON arrays into manageable sizes.
Language:PHP15 4 13
lancopku/SAPO
C# code for "Towards Easier and Faster Sequence Labeling for Natural Language Processing: A Search-based Probabilistic Online Learning Framework (SAPO)" (Information Sciences)
Language:C#13 6 05
linuxscout/mishtar
Mishtar: Named and temporal entities chunker
Language:Python13 4 12
fd0/split
Split large files into smaller ones using deterministic Content Defined Chunking
Language:Go11 1 0
raffidil/react-chunked-uploader
A react hook for uploading large files that need chunking.
Language:TypeScript11 2 00

chunking

jiesutd/NCRFpp

systemd/casync

folbricht/desync

26hzhang/neural_sequence_labeling

microsoft/rag-experiment-accelerator

jordicenzano/go-ts-segmenter

Safakan/TalkWithYourFiles

xtabbas/The-Ultimate-Boilerplate

Sammyjo20/laravel-chunkable-jobs

esastack/esa-restclient

Koziev/GrammarEngine

ronomon/deduplication

umarbutler/semchunk

bnosac/crfsuite

howardyclo/grammar-pattern

DanEngelbrecht/longtail

iscc/fastcdc-py

Alkl58/NotEnoughAV1Encodes-Qt

dcarpintero/llamaindexchat

nftstorage/carbites

DanEngelbrecht/golongtail

indyjo/cafs

khoih-prog/AsyncWebServer_STM32

Zabuzard/FastCDC4J

R3X-G1L6AME5H/Godot-LOD-Manager

remram44/cdchunking-rs

stevezheng23/sequence_labeling_tf

DennisSmuda/godot-chunking-system

saltyrtc/chunked-dc-js

IPRIT/md-svg-vue

jparkerweb/semantic-chunking

michaeljs1990/jmem

lancopku/SAPO

linuxscout/mishtar

fd0/split

raffidil/react-chunked-uploader