Issues
Any plans for Llama 3?
#1583 opened by awsaf49 - 2
Preprocessor does not respect sequence_length
#1627 opened by 52631 - 2
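For context on the expected behavior behind this report: a preprocessor configured with a `sequence_length` should pad or truncate every output to exactly that length. A minimal toy sketch of that contract (illustrative only, not the keras_nlp implementation):

```python
def pad_or_truncate(token_ids, sequence_length, pad_id=0):
    """Force a list of token ids to exactly `sequence_length` items."""
    if len(token_ids) >= sequence_length:
        # Truncate inputs that are too long.
        return token_ids[:sequence_length]
    # Pad inputs that are too short.
    return token_ids + [pad_id] * (sequence_length - len(token_ids))

print(pad_or_truncate([5, 6, 7, 8, 9], 3))  # [5, 6, 7]
print(pad_or_truncate([5, 6], 4))           # [5, 6, 0, 0]
```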
Distributed batch size not calculated correctly
#1630 opened by natbprice - 0
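The arithmetic at stake in reports like this: under data parallelism, the global batch size should equal the per-replica batch size times the number of replicas. A hedged sketch of that relationship (toy code, not the library's logic):

```python
def per_replica_batch_size(global_batch_size, num_replicas):
    """Split a global batch evenly across replicas, or fail loudly."""
    if global_batch_size % num_replicas != 0:
        raise ValueError(
            f"Global batch size {global_batch_size} is not divisible "
            f"by {num_replicas} replicas."
        )
    return global_batch_size // num_replicas

print(per_replica_batch_size(128, 8))  # 16
```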
Cannot export a slightly customized XLMRoberta model from keras_nlp
#1629 opened by YangIsNotAvailable - 0
Add support for `PaliGemma`
#1626 opened by awsaf49 - 1
DebertaV3MaskedLM example doesn't work
#1622 opened by mrektor - 0
unable to diagnose OOM
#1628 opened by josharian - 0
403 KaggleApiHTTPError while running GemmaCausalLM
#1625 opened by nashschool - 1
Any plans for more Llama type models?
#1587 opened by pass-lin - 4
Issue instantiating a keras_nlp.models.Backbone from a model preset of Hugging Face handles
#1574 opened by RandomWalkie - 4
Gemma Model Storing and Loading after Fine tuning
#1482 opened by kreouzisv - 2
GemmaBackbone.get_layout_map broken for gemma_2b_en
#1613 opened by josharian - 3
Issue when fine-tuning Albert - Resource localhost/_0_SentencepieceOp/N10tensorflow4text12_GLOBAL__N_121SentencepieceResourceE does not exist.
#1573 opened by deathsaber - 8
I installed keras-nlp in PyCharm IDE, when I run
#1426 opened by said-ml - 3
Cannot reproduce results from notebook on Colab
#1592 opened by jespernwulff - 12
keras-nlp insists I use the (buggy) Tensorflow 2.16.1 which does not work with my GPU
#1519 opened by nas-mouti - 7
[RfC] Ideas for better Hugging Face Hub integration
#1529 opened by Wauplin - 2
Any plans for QLora?
#1537 opened by asmith26 - 1
Update ByteTokenizer to remove TensorFlow dependency
#1469 opened by stereoplegic - 0
cannot import name 'CachedMultiHeadAttention' from partially initialized module 'keras_nlp.src.layers.modeling.cached_multi_head_attention' (most likely due to a circular import)
#1427 opened by anilmamidwar15021991 - 6
Samplers in Gemma model
#1588 opened by mostafamdy - 0
Any plans for more Llama 3?
#1586 opened by pass-lin - 3
Add Electra Weights to Kaggle Models
#1422 opened by pranavvp16 - 4
Data-Parallel Training with KerasNLP and tf.distribute example dataset problem
#1504 opened by sitamgithub-MSIT - 3
Make the local variable per_token_loss in the score method global, so that we can modify the loss function.
#1539 opened by deveshklt - 2
`SentencePieceTokenizer` inside a `keras.models.Model` fails to be reconstructed during `keras.saving.load_model()`
#1522 opened by briango28 - 0
Add grok-1
#1525 opened by innat - 3
Feature Request: Transformer Debugger - Debugging and controlling the behavior of transformer based LLM models.
#1513 opened by abhaskumarsinha - 3
Add Mistral 0.2 models as possible presets
#1515 opened by borisdayma - 1
Gemma discrepancies
#1494 opened by awsaf49 - 5
Question about Gemma tensor parallel sharding policy
#1464 opened by AIGideon - 5
Model weights contributions?
#1463 opened by deep-diver - 0
Keras_NLP and Kaggle Hub: Are models allowed without weights in Kaggle Hub?
#1433 opened by abhaskumarsinha - 4
How to add a serialized model and weights of a keras model to keras-nlp?
#1479 opened by abhaskumarsinha - 2
Add CLIP tokenizer to Keras NLP
#1453 opened by divyashreepathihalli - 1
Add `oov_token` Argument to `BytePairTokenizer`
#1466 opened by abuelnasr0 - 2
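To illustrate the feature being requested: an `oov_token` argument typically makes a tokenizer map unknown tokens to a designated out-of-vocabulary id instead of failing. A toy sketch of that behavior (hypothetical names, not the BytePairTokenizer API):

```python
class ToyTokenizer:
    def __init__(self, vocab, oov_token="[UNK]"):
        self.vocab = dict(vocab)
        self.oov_token = oov_token
        # Assign the OOV token an id if it is not already in the vocab.
        self.vocab.setdefault(oov_token, len(self.vocab))

    def token_to_id(self, token):
        # Unknown tokens fall back to the OOV id rather than raising.
        return self.vocab.get(token, self.vocab[self.oov_token])

tok = ToyTokenizer({"hello": 0, "world": 1})
print(tok.token_to_id("hello"))    # 0
print(tok.token_to_id("missing"))  # 2 (the [UNK] id)
```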
ContrastiveSampler lacks a seed param, while the docstring states it has one
#1481 opened by martin-gorner - 4
Any guide how to use tools/gemma/run_gemma_xla.py?
#1461 opened by deep-diver - 2
Mistral kills the process by taking too much RAM
#1458 opened by deep-diver - 26
Preset and doc for Mistral (multilingual)
#1418 opened by federicoparra - 0
Issue with `BytePairTokenizer`
#1435 opened by abuelnasr0 - 0