huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
PythonApache-2.0
Issues
- 3
`PreTrainedTokenizerFast._batch_encode_plus()` got an unexpected keyword argument `'split_special_tokens'`
#30685 opened by fahadh4ilyas - 6
KV cache with CPU offloading
#30704 opened by n17s - 9
LLM inference with static kv-cache example gives different generations depending on the batch examples
#30670 opened by jpiabrantes - 4
- 0
Whisper assistant decoding not working with pipeline
#30611 opened by kamilakesbi - 3
Error while runing T5 trainer: TypeError: argument 'ids': 'list' object cannot be interpreted as an integer
#30712 opened by Aml-Hassan-Abd-El-hamid - 2
- 4
- 11
error when using PPO in Gemma
#30605 opened by mostafamdy - 3
Add Wav2Vec2BertProcessorWithLM
#30671 opened by FredHaa - 6
Error while moving model to GPU `NotImplementedError: Cannot copy out of meta tensor; no data!`
#30703 opened by goelayu - 7
Evaluate trainer on Code-Switched Speech fails with "ValueError: Multiple languages detected when trying to predict the most likely target language for transcription."
#30654 opened by sproocht - 3
- 4
from_pretrained torch_dtype DO NOT affect model buffers
#30709 opened by Chandler-Bing - 3
- 6
default max value of max_new_token
#30666 opened by Navanit-git - 2
Error with tf-keras when trying to geneate random seeds
#30711 opened by fabiancpl - 14
Setting compute_metrics in Trainer with Idefics2ForConditionalGeneration leads to AttributeError: 'DynamicCache' object has no attribute 'detach'
#30631 opened by EloiEynard - 1
- 5
- 8
Add static cache support for Whisper
#30707 opened by mobicham - 5
TypeError: WhisperForConditionalGeneration.forward() got an unexpected keyword argument 'model'
#30616 opened by kadirnar - 2
Question about quantized model with zero3
#30663 opened by mxjmtxrm - 2
LLama-3 8B - can't match MMLU performance
#30694 opened by gioaca00 - 0
Refusal rejection removal as a feature
#30705 opened by KnutJaegersberg - 1
Pure Python `PreTrainedTokenizer` is Broken
#30696 opened by daskol - 5
Add Prismatic VLMs to Transformers
#30638 opened by siddk - 1
CLIP Training Example Bug - Overfitting
#30682 opened by humanely - 0
DDP error with load_best_model_at_end enabled
#30702 opened by zhiyuanhhh - 2
- 0
- 6
Cannot save HQQ quantized model.
#30689 opened by mxjmtxrm - 0
More memory consumption than litgpt
#30629 opened by getao - 2
- 3
Cannot copy out of meta tensor; no data! for SwinV2ForImageClassification
#30661 opened by ethvedbitdesjan - 0
FutureWarning about resume_download is raised after huggingface-hub 0.23.0 release
#30618 opened by albertvillanova - 2
model_max_length default parameters are missing in transformers>=4.40.0
#30643 opened by helpmefindaname - 2
[Phi-3-mini-128k-instruct] Difference in slow and fast tokenization after adding new tokens
#30660 opened by jpmann - 2
can't import phi3config etc.
#30659 opened by tsw123678 - 1
[i18n-<languageCode>] Translating docs to <languageName>
#30665 opened by Ggjkfkg - 1
Error During Training with PatchTSMixerForTimeSeriesClassification for Time Series Classification
#30614 opened by tdg2088 - 1
Urdu Encoding Issue in Hugging Face Tokenizer
#30636 opened by El-chapo-007 - 4
DPT implementation contains unused parameters
#30633 opened by ducha-aiki - 1
Wav2Vec2ForCTC weight mismatch
#30628 opened by MahmoudAshraf97 - 2
Cannot convert llama 3 model to hf
#30604 opened by Bedoshady - 2
Remove pipelines, chatformatters, templates etc --> Replace with simple generator function / manual string interpolation ---> Just have one standardized way for building datasets and running inference
#30625 opened by bdytx5 - 1
HTML Files Keep on Loading
#30626 opened by IsaacZachary - 1
Error During Training with PatchTSMixerForTimeSeriesClassification for Time Series Classification
#30609 opened by tdg2088 - 4
- 2