huggingface/optimum-habana
Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
Python · Apache-2.0 license
Issues
Error when running chatglm3_6b: NotImplementedError: Unknown device for graph fuser
#1477 opened by BaideBear - 9
Pretrain with LLama Model - num_samples=0 Error
#1396 opened by saisuryateja1436 - 2
RuntimeError: [Rank:0] FATAL ERROR :: MODULE:PT_DEVMEM Allocation failed for size::234881024 (224)MB #2
#1469 opened by James-Lu-none - 1
Error when running llama2_fine_tuning_inference & Intel_Gaudi_Fine_Tuning examples
#1467 opened by epage480 - 1
Could not find a version that satisfies the requirement networkx==3.3 in Python3.9
#1458 opened by James-Lu-none - 2
Qwen1.5-14B finetune error
#1336 opened by Zjq9409 - 4
Performance for summarization task on BART is low after latest Transformer 4.40 upgrade
#1144 opened by astachowiczhabana - 1
Heavy IO in multi-node example
#1152 opened by rofinn - 4
Tokenizer error: eos_token_id not found error - Incorrect assignment of variables
#1326 opened by premmotgi - 6
Llama 3.1 Support -- Rope_scaling issue
#1154 opened by AzeezIsh - 1
Not able to get good performance for diffusion models when doing single image inference with batch size 1
#1195 opened by basantaxpatra - 1
ValidateSyncInputTensors tensor_data is empty
#1241 opened by xinsu626 - 1
quantization FP8 error
#1438 opened by aitss2017 - 2
bigscience / bloomz-7b1 finetune error
#1426 opened by 11989890 - 7
transformers_future: contrastive search failing with Incompatible input shapes, broadcast not possible.
#1385 opened by vidyasiv - 6
Data parallelism or tensor parallelism? How can i know that and is there a chance i can shift in between these too?
#1242 opened by venkycreator - 2
Info needed about stable diffusion 3 support
#1127 opened by dkiran1 - 2
does the optimum-habana support sdxl controlnet image to image pipeline?
#1103 opened by basantaxpatra - 1
Is there example of FP8 train LLM, pre-train or fine-tune
#1073 opened by harborn - 2
AttributeError: module 'transformers.generation.stopping_criteria' has no attribute 'MaxNewTokensCriteria'
#1372 opened by hemanthkotaprolu - 7
Dataset v3.0.0 deprecates tasks and cause CI failures
#1341 opened by vidyasiv - 5
Flash attention not supported in run_clm.py
#1318 opened by aitss2017 - 6
RuntimeError: shape '[-1, 0]' is invalid for input of size 134152192 for Mistral-7B finetune
#1311 opened by aitss2017 - 3
runwayml/stable-diffusion-v1-5 no longer exists
#1305 opened by vidyasiv - 4
CodeGen inference error "synNodeCreateWithId failed for node: batch_gemm with synStatus 26"
#1314 opened by caijimin - 3
stable-diffusion-2-1-base BF16 on Gaudi2D get RuntimeError: synNodeCreateWithId failed for node: spatial_convolution with synStatus 26 [Generic failure].
#1293 opened by KiwiHana - 12
CLIP contrastive image-text inference can't run on hpu with gaudi-docker/1.17.0
#1216 opened by caijimin - 4
AWQ is not working
#1240 opened by endomorphosis - 6
Qwen2-72B inference on 8x Gaudi2 gets OOM issue due to missing meta-device support on model loading
#1112 opened by LeoZhao-Intel - 6
Quantization failed
#1237 opened by endomorphosis - 1
CLIP contrastive image-text inference error
#1179 opened by caijimin - 2
Batch size beyond 16 is throwing an error
#1177 opened by venkycreator - 2
llava inference works incorrectly if adapt_transformers_to_gaudi called after transformers import
#1176 opened by mattolson93 - 2 (see the import-order sketch after this list)
The name of the "is_greedy_or_beam_and_bucket" variable does not seem to match the logic
#1166 opened by Jing1Ling - 3
TypeError: GaudiPhiForCausalLM.forward() got an unexpected keyword argument 'reuse_cache'
#1036 opened by eduand-alvarez - 1
Unable to run protein-folding example with the latest release: optimum-habana-1.11.1
#1014 opened by mgujral - 1
--report_to tensorboard not working on multiple HPUs
#1021 opened by 12010486 - 4
Add support on Whisper inference on HPU
#1047 opened by Spycsh
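
Several of the issues above concern how optimum-habana patches 🤗 Transformers for HPU; #1176 in particular reports that llava inference misbehaves when adapt_transformers_to_gaudi is called after transformers has already been imported. A minimal sketch of the ordering the issue title implies, assuming the public adapt_transformers_to_gaudi entry point and an illustrative llava checkpoint (neither taken from the issue thread itself):

```python
# Sketch only: apply the Gaudi patches before touching any transformers model
# classes, which is the ordering issue #1176 suggests is required for llava.
from optimum.habana.transformers.modeling_utils import adapt_transformers_to_gaudi

adapt_transformers_to_gaudi()  # patch transformers for HPU first

# Import and build the model only after patching, so the Gaudi-adapted
# implementations are the ones actually used at inference time.
from transformers import AutoProcessor, LlavaForConditionalGeneration

checkpoint = "llava-hf/llava-1.5-7b-hf"  # illustrative checkpoint, not from the issue
model = LlavaForConditionalGeneration.from_pretrained(checkpoint)
processor = AutoProcessor.from_pretrained(checkpoint)
```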