alexriggio/BERT-LoRA-TensorRT
This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Rank Approximation (LoRA). The models are optimized for high performance using NVIDIA's TensorRT.
Jupyter NotebookApache-2.0
Stargazers
- AndrewHogenson
- AnirudhMaiyaUniversity of Colorado Boulder
- cm2435stealth
- dwolvek
- GavinYGM
- goodboyyes2009
- hilam
- jblagojaMacedonia
- jphgxq
- KollinRT
- laifiMunich
- LiuShisan23Edinburgh
- majorization
- mbzuaiciao
- monk1337@saamaresearch
- mtanigSaitama, Japan
- panos-spanAthens
- patteg21Charlotte, NC
- phamvanlinh143Viettel AI
- sgaserettoParaguay
- silverisland
- sufe-zcz
- tanshuai0219Shanghai Jiao Tong University
- Taurids
- TDL77
- VenusTZZ
- VeritasYinPurdue University
- vhxs@JHUAPL
- vinhtran2611HCMUT
- wangzhanxd
- xhwSkhizeinChina Beijing
- xinxinlaoshiSeattle
- Yangjianxiao0203
- yunxingluaaron
- yw4180
- ztb-35Louisiana State University