xtyrrell

just cruising through cyberspace

Cape Town, South Africa

xtyrrell's Stars

rapid7/metasploit-framework
Metasploit Framework
Language:Ruby33.8k 2k 6k13.9k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python19.5k 160 1.5k2.2k
exaloop/codon
A high-performance, zero-overhead, extensible Python compiler using LLVM
Language:C++15k 139 412517
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Language:Python14.7k 262 2072.6k
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Language:Python12.1k 99 530846
tensorflow/playground
Play with neural networks!
Language:TypeScript11.9k 476 1202.5k
Doriandarko/claude-engineer
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabilities of a large language model with practical file system operations and web search functionality.
Language:Python9.1k 130 123965
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Language:Python5.7k 47 298465
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python5.6k 50 559439
open-mmlab/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Language:Python4.3k 58 897744
mindee/doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Language:Python3.6k 43 361424
lm-sys/RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
Language:Python3k 25 46227
cstate/cstate
🔥 Open source static (serverless) status page. Uses hyperfast Go & Hugo, minimal HTML/CSS/JS, customizable, outstanding browser support (IE8+), preloaded CMS, read-only API, badges & more.
Language:HTML2.5k 16 182232
mediar-ai/screenpipe
24/7 local AI screen & mic recording. Build AI apps that have the full context. Works with Ollama. Alternative to Rewind.ai. Open. Secure. You own your data. Rust.
Language:Rust2.3k 20 165132
atopile/atopile
Design circuit boards with code! ✨ Get software-like design reuse 🚀, validation, version control and collaboration in hardware; starting with electronics ⚡️
Language:Python1.9k 15 115110
sinaatalay/rendercv
The engine of the RenderCV App
Language:Python1.8k 8 102119
DAGWorks-Inc/burr
Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.
Language:Python1.1k 9 9461
mezbaul-h/june
Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit
Language:Python690 6 842
Yuliang-Liu/MultimodalOCR
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
Language:Python444 14 2829
3rd/tsdiagram
Create diagrams and plan your code with TypeScript.
Language:TypeScript415 10 1315
rectanglehq/Shapeshift
Transform JSON objects using vector embeddings
Language:TypeScript402 3 011
xiaoyu258/DocProj
Document Rectification and Illumination Correction using a Patch-based CNN
Language:Python339 13 3086
ZZZHANG-jx/DocRes
[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
Language:Python295 6 1628
ZZZHANG-jx/Recommendations-Document-Image-Processing
This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadow, dewarping, deblur, and binarization.
154 7 59
yfzhang114/SliME
✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models
Language:Python131 4 87
kyegomez/Python-Package-Template
A easy, reliable, fluid template for python packages complete with docs, testing suites, readme's, github workflows, linting and much much more
Language:Shell126 2 025
jeromewu/tesseract.js-video
An example app to recognize video clip with tesseract.js
Language:HTML123 2 236
andreluisjunqueira/react-native-document-scanner-android
Document scanner android, feature live detection, auto-capture, perspective correction :vibration_mode: :camera: -- :trophy:
Language:Java82 8 2932
ParadoxZW/LLaVA-UHD-Better
A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo
Language:Python31 4 53
fahmiaziz98/receipt_parsing
receipt parsing using donut model, next we will add using LLM + OCR or VLM
Language:Jupyter Notebook5 1 02

xtyrrell

xtyrrell's Stars

rapid7/metasploit-framework

haotian-liu/LLaVA

exaloop/codon

openai/evals

OpenBMB/MiniCPM-V

tensorflow/playground

Doriandarko/claude-engineer

clovaai/donut

OpenGVLab/InternVL

open-mmlab/mmocr

mindee/doctr

lm-sys/RouteLLM

cstate/cstate

mediar-ai/screenpipe

atopile/atopile

sinaatalay/rendercv

DAGWorks-Inc/burr

mezbaul-h/june

Yuliang-Liu/MultimodalOCR

3rd/tsdiagram

rectanglehq/Shapeshift

xiaoyu258/DocProj

ZZZHANG-jx/DocRes

ZZZHANG-jx/Recommendations-Document-Image-Processing

yfzhang114/SliME

kyegomez/Python-Package-Template

jeromewu/tesseract.js-video

andreluisjunqueira/react-native-document-scanner-android

ParadoxZW/LLaVA-UHD-Better

fahmiaziz98/receipt_parsing