VikParuchuri's Stars
karpathy/LLM101n
LLM101n: Let's build a Storyteller
astral-sh/uv
An extremely fast Python package and project manager, written in Rust.
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
cvat-ai/cvat
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
MadcowD/ell
A language model programming library.
asg017/sqlite-vec
A vector search SQLite extension that runs anywhere!
bugbakery/audapolis
an editor for spoken-word audio with automatic transcription
sustcsonglin/flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
ucbepic/docetl
A system for agentic LLM-powered data processing and ETL
illuin-tech/colpali
The code used to train and run inference with the ColPali architecture.
Surfer-Org/Protocol
Open-source framework for exporting and building applications off of your personal data.
yandex/YaFSDP
YaFSDP: Yet another Fully Sharded Data Parallel
EleutherAI/cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
tobymao/saq
Simple Async Queues
BobMcDear/attorch
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
FudanVI/benchmarking-chinese-text-recognition
This repository contains datasets and baselines for benchmarking Chinese text recognition.
hsfzxjy/handwriter.ttf
Handwriting synthesis with Harfbuzz WASM.
HazyResearch/pdftotree
:evergreen_tree: A tool for converting PDF into hOCR with text, tables, and figures being recognized and preserved.
dailenson/One-DM
Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation
ZZZHANG-jx/Recommendations-Document-Image-Processing
This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadow, dewarping, deblur, and binarization.
satbyy/go-noto-universal
Noto fonts go universal! Download pan-Unicode, merged Noto fonts according to time of usage (current, ancient) or geographical region (South Asia, SE Asia, Africa-MiddleEast, Europe-Americas).
khuangaf/Awesome-Chart-Understanding
A curated list of recent and past chart understanding work based on our survey paper: From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models.
AnswerDotAI/cold-compress
Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of GPT-Fast, a simple, PyTorch-native generation codebase.
FuxiaoLiu/MMC
[NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning
cloneofsimo/min-fsdp
X-rayLaser/pytorch-handwriting-synthesis-toolkit
Handwriting generation and handwriting synthesis as described in Alex Graves's paper https://arxiv.org/abs/1308.0850. Pytorch implementation.
orasik/parsevision
Parse vision is an open source tool to visualise what OCR is parsing in a PDF document to help developers and product teams identify if the parsing has missed some vital information from the document.
2OsZI4ISYd/stepcutis
OCR of PDFs through a cocktail of document analysis models