huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Python · Apache-2.0
Issues
Running out of memory while finetuning and inferencing VideoMAE due to which script is being killed.
#30939 opened - 3
Trained tokenizer has broken encoding for cyrillic
#30937 opened - 1
VisEncoderDecoderModel generate text incomplete when predict image with long text label
#30931 opened - 3
CLIPTokenizerFast cause memory leak
#30930 opened - 4
Libraries import missing, unable to load image for inference and not able to load pipeline with the trained model
#30927 opened - 5
`center_crop` outputs wrong sized array if provided with odd-numbered dimensions smaller than requested crop size
#30922 opened - 3
[BUG] Offline loading of non-safe tensors fails
#30920 opened - 1
RuntimeError: expected mat1 and mat2 to have the same dtype, but got: float != c10::BFloat16
#30914 opened - 8
DeformableDETR two stage not support bfloat16
#30906 opened - 0
Chatdoctor
#30904 opened - 8
Using context length of 60, trying to predict next 7 days Close price. Error is : Lags cannot go further than history length
#30903 opened - 10
The implementation of ROPE seems a little off for k
#30900 opened - 6
Logging to wandb breaks FSDP in 4.41.0
#30895 opened - 6
transformers 4.41.0 breaks generate() for T5
#30892 opened - 5
Add IRIS
#30882 opened - 2
Wav2vec2 model has unknown attributes weight_g/weight_v when DeepSpeed ZeRO-3 is enabled
#30881 opened - 2
Kosmos-2.5 implementation in transformers
#30877 opened - 2
Owlv2 model keeps crashing
#30874 opened - 7
Improving memory efficiency further 🚀
#30860 opened - 3
Enabling device_map="auto" for Video-LLaVA
#30858 opened - 3
scores_for_ground_truths Error for deepset/roberta-base-squad2 model and squad_v2 dataset
#30856 opened - 2
[BLIP2] BLIP2QFormerLayer is missing the self.intermediate parameter, which makes training from scratch impossible
#30846 opened - 22
Significant performance degradation with multi-GPU training on newer torch/transformers
#30840 opened - 0
Cannot import name 'WhisperForAudioClassification' - Already installed transformers==4.40.2
#30834 opened - 5