Issues
- 2
How to inference BLOOM-176B by multi-node multi-card?
#189 opened by vicwer - 7
关于示例代码版本落后,无法运行等问题About the example code version backward, can not run and other issues
#224 opened by AntyRia - 0
[bug] The InferenceEngine used in the example for distributed inference cannot be imported.
#223 opened by flybird11111 - 1
Support GPT BigCode (bigcode/starcoder, bigcode/gpt_bigcode-santacoder, etc.)
#218 opened by liulhdarks - 0
OPT-125m problem
#222 opened by Yummy813 - 0
Location of logs
#221 opened by Yummy813 - 0
- 0
- 1
EnergonAI running OPT reasoning example: When encountering a client request, the server is blocked and cannot return the result
#215 opened by colynhn - 2
- 0
Where is InferenceEngine definition?
#216 opened by yehx1 - 2
question about load model state_dict in multi-gpus
#213 opened by irasin - 1
Is there an example of the http client?
#214 opened by frankxyy - 1
Concrete doc of this project
#212 opened by frankxyy - 3
can't find server.sh
#160 opened by zhangyilalala - 4
_pickle.UnpicklingError: invalid load key, '{'.
#211 opened by RundongCao - 11
- 3
- 2
OPT inference
#198 opened by Joanna-0421 - 2
Where is InferenceEngine definition?
#210 opened by liujuncn - 4
an error caused by running the example of the opt
#206 opened by LemonSqi - 2
- 1
How to use dynamic batch features
#199 opened by hudengjunai - 2
OPT demo TEST
#203 opened by Batizhao8899 - 3
fail to install EnergonAI
#204 opened by NewDriverLee - 1
- 2
miss cache error when pose generation opt
#201 opened by tycallen - 1
- 1
Doesn't run gpt reference?
#196 opened by YuchengWang - 1
Does not support Cuda 10.2 ?
#193 opened by 0-1CxH - 2
- 3
Can not start the Bloom server
#191 opened by SAI990323 - 2
- 2
Failed to load OPT-30B checkpoint
#183 opened by ericxsun - 1
Support OPT-IML model
#184 opened by ericxsun - 1
- 0
trpc.rpc_sync consumed most time
#175 opened by fanlongbd - 0
- 1
- 0
need guidelines on converting OPT-17B checkpoint
#161 opened by gulzainali98 - 1
- 1
[RFC] Async engine and pipeline based on RPC
#151 opened by ver217 - 1
num_beams for beam search
#138 opened by shammmmmless - 1
inference of pre-trained model
#125 opened by Emerald01 - 0
Remove hard code directory path
#110 opened by feifeibear - 0
Provide a docker service
#106 opened by feifeibear - 1
OPT inference generate example
#102 opened by virgulvirgul - 1
Missing energonai_linear_func in setup.py
#100 opened by xgreat8 - 1
Connection refused on docker exposed port
#99 opened by xgreat8 - 0
[Feature]: Automatic Pipeline Parallelism
#89 opened by dujiangsu