Pinned Repositories
pdf_parser
All-in-one PDF parser toolkit
pdffigures2
Given a scholarly PDF, extract figures, tables, captions, and section titles.
biomedical
Tools for curating biomedical training data for large-scale language modeling
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
sciol-wacv-2024
Code and resources for the WACV 2024 paper: SciOL and MuLMS-Img: Introducing A Large-Scale Multimodal Scientific Dataset and Models for Image-Text Tasks in the Scientific Domain
sciparser
PDF parsing toolkit for preparing academic text corpora
BELLE
BELLE: Be Everyone's Large Language model Engine (open-source Chinese conversational large language model)
MiniGPT-4
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Davidwhw's Repositories
Davidwhw doesn’t have any repository yet.