yiyibooks's Stars
1c7/chinese-independent-developer
👩🏿‍💻👨🏾‍💻👩🏼‍💻👨🏽‍💻👩🏻‍💻 List of independent developer projects: sharing what everyone is working on
VikParuchuri/marker
Convert PDF to markdown quickly with high accuracy
coolwanglu/pdf2htmlEX
Convert PDF to HTML without losing text or format.
LibreTranslate/LibreTranslate
Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to set up.
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
microsoft/TypeChat
TypeChat is a library that makes it easy to build natural language interfaces using types.
kpdecker/jsdiff
A JavaScript text differencing implementation.
bentrevett/pytorch-seq2seq
Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
argosopentech/argos-translate
Open-source offline translation library written in Python
pdf2htmlEX/pdf2htmlEX
Convert PDF to HTML without losing text or format.
flitbit/diff
JavaScript utility for calculating deep difference, capturing changes, and applying changes across objects; for Node.js and the browser.
meta-llama/PurpleLlama
Set of tools to assess and improve LLM security.
ljinkai/weekly
A weekly newsletter on monetizing independently developed products, published every Friday.
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
breezedeus/Pix2Text
An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
Bistutu/FluentRead
A context-aware AI translation engine that gives websites friendlier translations, so that everyone can read as if in their native language.
ruanyf/articles
personal articles
brucemiller/LaTeXML
LaTeXML: a TeX and LaTeX to XML/HTML/ePub/MathML translator.
k2-fsa/icefall
dginev/ar5iv
A web service offering HTML5 articles from arXiv.org as converted with latexml
melodysdreamj/WizardVicunaLM
LLM that combines the principles of wizardLM and vicunaLM
wq2012/SpectralCluster
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
jorisschellekens/borb-examples
fe1ixxu/ALMA
State-of-the-art LLM-based translation models.
RLHFlow/Online-RLHF
A recipe for online RLHF and online iterative DPO.
awslabs/pptod
Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System (ACL 2022)
roedoejet/g2p
Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!
salesforce/botsim
BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots
arXiv/html_feedback
Supports a student project developing a UI for feedback on arXiv articles rendered as html.