pytorch - How to fine-tune merlinite 7B model in Python

I am new to LLM programming in Python and I am trying to fine-tune the instructlab/merlinite-7b-lab model on my Mac M1. My goal is to teach this model to a new music composer Xenobi Amilen I have invented.

The text of this composer is here.

Using the new Ilab CLI from RedHat I created this training set for the model. It is a JSONL file with 100 questions/answers about the invented composer.

I wrote this Python script to train the model. I tested all the parts related to the tokenizer, datasets and it seems to work. However, the final train got this error:

RuntimeError: Placeholder storage has not been allocated on MPS device!
  0%|          | 0/75 [00:00<?, ?it/s]

I found a lot of articles about this error on Google and also StackOverflow like this, for example. The problem seems that in addition to the model I have to send to mps also the input parameters, but it's not clear to me how to change my code to do that.

I tried several fixes but had no luck. Can anyone can help?

edited Jul 3 at 21:15

asked Jun 30 at 21:37

Salvatore D'angelo

1,1093 gold badges19 silver badges43 bronze badges

Anyone can help on this?
– Salvatore D'angelo
Commented Jul 3 at 21:15
Did you try setting model.to(device)? Trainer should use the device of the model. Does it work if you run on cpu instead of the mps?
– Shaido
Commented Jul 5 at 7:31
First question: yes. I attached the code. Second question: yes. Same error even if I use cpu.
– Salvatore D'angelo
Commented Jul 5 at 14:16

Add a comment |

Collectives™ on Stack Overflow

How to fine-tune merlinite 7B model in Python

0

Browse other questions tagged
python
pytorch
huggingface-transformers
large-language-model
huggingface-tokenizers
or ask your own question.

Linked

Hot Network Questions

Collectives™ on Stack Overflow

0

Know someone who can answer? Share a link to this question via email, Twitter, or Facebook.

Browse other questions tagged pythonpytorchhuggingface-transformerslarge-language-modelhuggingface-tokenizers or ask your own question.

Linked

Browse other questions tagged
python
pytorch
huggingface-transformers
large-language-model
huggingface-tokenizers
or ask your own question.