tunib-ai/parallelformers
Parallelformers: An Efficient Model Parallelization Toolkit for Deployment
PythonApache-2.0
Issues
- 4
Cross-node inference
#48 opened by BDHU - 0
Use this library for CNN networks like Unet
#55 opened by cporrasn - 3
Support for LLaMA
#50 opened by IzzetYoung - 0
Title: RuntimeError: Timed out initializing process group in store based barrier
#54 opened by hugocool - 0
Still in development?
#53 opened by codeananda - 1
RuntimeError: CUDA error: peer access is not supported between these two devices
#44 opened by Dorcoh4 - 2
freeze_support()
#22 opened by psinha30 - 0
freeze_support()
#52 opened by vinnitu - 0
Support for Falcon-7B and Falcon-40B models
#51 opened by mahdyshabeeb - 1
- 6
Error using google/UL2 model
#29 opened by dnhkng - 2
[Feature Request] Add Bloom to the Auto Policy
#36 opened by airsplay - 1
OSError: [Errno 9] Bad file descriptor
#42 opened by aws-stdun - 0
Speed up results serialization
#46 opened by mkardas - 0
Add Vision Encoder Decoder model to parallelformers
#45 opened by gagan3012 - 0
Bug with T511b inference
#43 opened by ZeyiLiao - 4
A bug with `n_fused`
#41 opened by JiayiFeng - 1
torch no_grad
#40 opened by zelcookie - 0
INT8 support
#39 opened by volkerha - 0
Support Codegen 12B
#38 opened by Tiiiger - 0
Can you please add Question Answering models like LayoutLMv2ForQuestionAnswering
#35 opened by sujit420 - 2
Can you please add support for gpt_neox
#34 opened by tahercoolguy - 3
Support for GPT2-XL
#33 opened by snoop2head - 1
GPT2 parallelism does not work on the Tesla K80
#27 opened by 0x7o - 1
- 1
- 12
- 1
Support for OPT
#30 opened by mrzjy - 2
- 0
EncoderDecoder support
#26 opened by d-miketa - 0
Recommended way for cleaning up?
#24 opened by creatorrr - 6
AttributeError: Can't get attribute 'MegatronPolicy' on <module '__main__' (built-in)>
#20 opened by Oaklight - 4
- 1
- 6
다중 Model 로드 방법
#18 opened by Don9wanKim - 29
AssertionError: Model should be on CPU before parallelization. It is more memory-efficient.
#16 opened by juliensalinas - 13
KoGPT3와 연동시 품질 이슈
#17 opened by BangDaeng - 11
Support for GPT-J
#4 opened by andreamad8 - 0
- 1
- 1
How can I parallelize the MegatronBertModel?
#14 opened by kajyuuen - 2
docker support
#3 opened by hyunwoongko - 0
Bug about `AlbertModel`
#5 opened by hyunwoongko