zye1996's Stars
meilisearch/meilisearch
A lightning-fast search API that fits effortlessly into your apps, websites, and workflow
slidevjs/slidev
Presentation Slides for Developers
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
BuilderIO/gpt-crawler
Crawl a site to generate knowledge files to create your own custom GPT from a URL
VikParuchuri/marker
Convert PDF to markdown quickly with high accuracy
ShishirPatil/gorilla
Gorilla: An API store for LLMs
imputnet/cobalt
save what you love
VikParuchuri/surya
OCR, layout analysis, reading order, line detection in 90+ languages
vercel/ai
Build AI-powered applications with React, Svelte, Vue, and Solid
PKU-YuanGroup/ChatLaw
ChatLaw: A Powerful LLM Tailored for the Chinese Legal Domain. A Chinese legal large language model
IDEA-Research/GroundingDINO
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
ThioJoe/YT-Spammer-Purge
Allows you to easily scan for and delete scam comments using several methods.
FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
baaivision/EVA
EVA Series: Visual Representation Fantasies from BAAI
Eladlev/AutoPrompt
A framework for prompt tuning using Intent-based Prompt Calibration
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
autodistill/autodistill
Images to inference with no labeling (use foundation models to train supervised models).
kyegomez/BitNet
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
jiaweizzhao/GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
michaelfeil/infinity
Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.
frotms/PaddleOCR2Pytorch
PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)
breezedeus/CnSTD
CnSTD: A Python 3 package based on PyTorch/MXNet for Chinese/English Scene Text Detection, Mathematical Formula Detection (MFD), and Layout Analysis
BobLd/DocumentLayoutAnalysis
Document Layout Analysis resources and repos for development with PdfPig.
gabriben/awesome-generative-information-retrieval
google-research/omniglue
Code release for CVPR'24 submission 'OmniGlue'
AI21Labs/in-context-ralm
LeapLabTHU/EfficientTrain
1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundation visual backbones.
linanqiu/reddit-dataset
Dataset of threads and comments from reddit
ChiYeungLaw/LexLIP-ICCV23
Official Code for the ICCV23 Paper: "LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval"
dondongwon/LPMDataset