Pinned Repositories
cvframework
DSH_tensorflow
Implementation of "Deep Supervised Hashing for Fast Image Retrieval" (CVPR 2016)
EducationSystem
faiss
A library for efficient similarity search and clustering of dense vectors.
HRNet-Facial-Landmark-Detection
This is an official implementation of facial landmark detection for our TPAMI paper "Deep High-Resolution Representation Learning for Visual Recognition". https://arxiv.org/abs/1908.07919
Image_Process
Implements some image processing functions.
K-means-algorithm
This is a kind of clustering algorithm.
Perceptron-Algorithm
It includes training discriminant functions with the perceptron algorithm (ex1) and classifying samples with the trained discriminant functions (ex2).
tensorrt_inference
tf-magnet-loss
TensorFlow implementation of Magnet Loss from "Metric Learning for Adaptive Density Discrimination"
berooo's Repositories
berooo/AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
berooo/awesome-document-understanding
A curated list of resources on the topic of Document Understanding (DU)
berooo/Awesome-Knowledge-Distillation-of-LLMs
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.
berooo/baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
berooo/beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
berooo/CAN
When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition (ECCV’2022 Poster).
berooo/ChineseNLPCorpus
Chinese natural language processing datasets, material for everyday experiments. Additions and merge requests are welcome.
berooo/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
berooo/DocBank
DocBank: A Benchmark Dataset for Document Layout Analysis
berooo/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
berooo/ERNIE-Layout-Pytorch
An unofficial PyTorch implementation of ERNIE-Layout, which was originally released through PaddleNLP.
berooo/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
berooo/GitHub520
😘 Makes you "love" GitHub by fixing broken images and slow loading when accessing it. (No installation required.)
berooo/GPT2-Chinese
Chinese version of GPT2 training code, using BERT tokenizer.
berooo/i-Code
berooo/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
berooo/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
berooo/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
berooo/llm-rankers
Zero-shot Document Ranking with Large Language Models.
berooo/MNBVC
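MNBVC (Massive Never-ending BT Vast Chinese corpus): a very large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wikis, classical poetry, song lyrics, product descriptions, jokes, embarrassing stories, chat logs, and so on.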
berooo/MRAG
Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"
berooo/nougat
Implementation of Nougat: Neural Optical Understanding for Academic Documents
berooo/open-llms
📋 A list of open LLMs available for commercial use.
berooo/open-mllms
Open multimodal LLMs
berooo/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
berooo/self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
berooo/TabRecSet
A large scale camera-taken table detection and recognition dataset.
berooo/tabula
Tabula is a tool for liberating data tables trapped inside PDF files
berooo/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
berooo/WanJuan1.0
WanJuan 1.0 multimodal corpus