Issues
mistral-chat CUDA out of memory error
#14 opened by yuvalshachaf - 7
CUDA out of memory during training
#69 opened by CodeWithOz - 0
[BUG: TRL support]
#103 opened by moghadas76 - 0
[BUG: does it support ministral? 3B or above?
#102 opened by ArtificialZeng - 0
[BUG: utils.validate_data has inconsistent use of tabs and spaces in indentation which leads to crashing]
#95 opened by Hackerbone - 11
[BUG: Error during training
#85 opened by Chasapas - 0
Mistral Small support
#101 opened by win4r - 1
[BUG: ValueError: setting an array element with a sequence. The requested array has an inhomogeneous shape after 1 dimensions. The detected shape was (32768,) + inhomogeneous part.
#99 opened by nvnk3 - 7
[BUG] Error with vllm
#93 opened by C3po-D2rd2 - 1
[BUG: Instruction FT (train.py) crashes on AWS g4dn.12xlarge (4 T4s) without precise indication of reasons]
#92 opened by leloss - 2
I have finetuned mistral-instruct-v0.3 but the fine-tuned model got worse. It does not even answer what the original v0.3 can answer.
#82 opened by ZTAP0011 - 0
[BUG: after training the model, I am not able to merge the model and run inference on it
#89 opened by kiranshivaraju - 2
Fail to finetune with several GPUs
#71 opened by banalg - 0
[NOT A BUG] : Is it possible to run the current validation/training scripts on a ray cluster?
#88 opened by SaiKrishnaBala - 3
How to use utils.extend_model_vocab?
#12 opened by CrispStrobe - 0
Mistral-Finetune creates consolidated.safetensors for mixtral 8x7b instruct v0.1, but mistral-chat fails at inference because it complains that the loaded LoRA weights file is missing an expected key for one of the model layers.
#75 opened by tensimixt - 1
`flshattF@v2.3.6` is not supported due to requires device with capability > (8, 0) but your GPU has capability (6, 1) (too old)
#79 opened by kamlesh0606 - 1
[BUG]: The _parse_available_tools method does not return all the defined tools.
#77 opened by matheus-prandini - 3
Cannot use wandb logging
#24 opened by jagilley - 1
data validation issue
#66 opened by noviljohnson - 3
How to convert the model to GGUF after fine-tuning?
#64 opened by bensonbs - 0
Finetune 22B Codestral model
#68 opened by salaki - 0
Google Colab error
#25 opened by silvacarl2 - 0
How to do early stopping
#33 opened by sujoyrc - 0
Tensorboard integration
#32 opened by SaiKrishnaBala - 0
mixtral 8x7b and 8x22b please
#11 opened by ehartford - 0
Video tutorial?
#6 opened by gileneusz - 1
`ValueError: min() arg is an empty sequence` in `utils.validate_data` script
#17 opened by aymenkrifa - 1
Avoid symlink `tests/fixtures`
#18 opened by DavidFarago - 0
Error with Python version 3.9
#9 opened by thomaspernet