All Questions

0 votes
0 answers
49 views

How to generate output of HuggingFace PEFT model with previous message history as context?

I am trying to generate text from my fine-tuned Llama3 model, which I load with PEFT's AutoPeftModelForCausalLM, while also passing in previous message history. This is how I am currently generating ...
Avik Malladi
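For orientation, a minimal sketch of one common way to do this, passing the prior turns through the tokenizer's chat template (the adapter path and the messages are placeholders, and the approach assumes the tokenizer ships a chat template):

    from peft import AutoPeftModelForCausalLM
    from transformers import AutoTokenizer

    # Hypothetical adapter directory; substitute the fine-tuned checkpoint.
    model = AutoPeftModelForCausalLM.from_pretrained("my-llama3-adapter", device_map="auto")
    tokenizer = AutoTokenizer.from_pretrained("my-llama3-adapter")

    # Previous message history as role/content dicts, oldest first.
    messages = [
        {"role": "user", "content": "What is LoRA?"},
        {"role": "assistant", "content": "A parameter-efficient fine-tuning method."},
        {"role": "user", "content": "How does it differ from full fine-tuning?"},
    ]
    input_ids = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    output = model.generate(input_ids, max_new_tokens=200)
    # Decode only the newly generated tokens.
    print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))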
0 votes
0 answers
30 views

RuntimeError: cutlassF: no kernel found to launch! while generating text using fine-tuned PEFT model

peft_model_id = "/finetuned_deep_seek/transformers/deepseek_finetuned/1" peft_model = AutoModelForCausalLM.from_pretrained(peft_model_id) peft_model = PeftModel.from_pretrained(model, ...
sriram anush
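This error usually means the fused attention kernels do not support the current dtype or GPU. A hedged workaround, assuming a recent transformers version (the checkpoint path is the asker's own), is to load with an explicit dtype and the eager attention implementation:

    import torch
    from transformers import AutoModelForCausalLM

    peft_model_id = "/finetuned_deep_seek/transformers/deepseek_finetuned/1"
    # attn_implementation="eager" (transformers >= 4.36) bypasses the fused
    # SDPA/cutlass kernels that fail on unsupported dtype/GPU combinations.
    peft_model = AutoModelForCausalLM.from_pretrained(
        peft_model_id,
        torch_dtype=torch.float16,   # pick a dtype your GPU supports
        attn_implementation="eager",
    )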
0 votes
0 answers
78 views

Creating a dataset for fine-tuning an LLM using PEFT and SFTTrainer?

I have a dataset of 1000 records with 3 columns, "question", "step by step answer", and "single word answer", in CSV format. I tried to fine-tune an LLM (Gemma) on this dataset ...
sriram anush
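One sketch of turning such a CSV into something trl's SFTTrainer can consume; the file path and prompt template are assumptions, the column names come from the question:

    from datasets import load_dataset

    ds = load_dataset("csv", data_files="qa.csv", split="train")

    def to_text(example):
        # Fold the three columns into one training string; the template
        # is illustrative and should match the target model's format.
        return {
            "text": f"Question: {example['question']}\n"
                    f"Reasoning: {example['step by step answer']}\n"
                    f"Answer: {example['single word answer']}"
        }

    ds = ds.map(to_text)
    # ds can then be passed to trl's SFTTrainer as train_dataset, with
    # "text" as the field it packs into training sequences.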
3 votes
1 answer
2k views

AttributeError: 'TrainingArguments' object has no attribute 'model_init_kwargs'

While fine-tuning the Gemma-2B model using QLoRA, I'm getting the error AttributeError: 'TrainingArguments' object has no attribute 'model_init_kwargs'. Code: loading the libraries: from enum import Enum from ...
Tarun
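In recent trl versions SFTTrainer reads SFT-specific fields such as model_init_kwargs from an SFTConfig, so passing a plain transformers TrainingArguments triggers exactly this AttributeError. A minimal sketch of the swap (values are placeholders):

    from trl import SFTConfig, SFTTrainer

    # SFTConfig subclasses TrainingArguments and adds the fields
    # (model_init_kwargs among them) that SFTTrainer looks up.
    args = SFTConfig(output_dir="out", per_device_train_batch_size=2)

    # trainer = SFTTrainer(model=model, args=args, train_dataset=train_ds)
    # 'model' and 'train_ds' stand in for the Gemma model and dataset.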
0 votes
0 answers
239 views

Issues fine-tuning a LoRA of 8-bit Llama3 on a custom dataset

I have been trying to fine-tune a QLoRA version of the Llama3-8B-IT model in a Kaggle notebook on a custom dataset of about 44 questions. However, I am not getting good results in all of the responses. The ...
APaul31
1 vote
1 answer
627 views

How to fix error `OSError: <model> does not appear to have a file named config.json.` when loading custom fine-tuned model?

Preface: I am new to implementing NLP models. I have successfully fine-tuned LLaMA 3-8B variants with QLoRA and uploaded them to HuggingFace. The directories are filled with these files: - ....
sempraEdic
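A QLoRA upload typically contains only adapter_config.json and the adapter weights, not the base model's config.json, which is what this OSError complains about. One common fix (the repo name is a placeholder) is to load through PEFT so the base model is fetched automatically:

    from peft import AutoPeftModelForCausalLM

    # Reads adapter_config.json, downloads the base model it references,
    # and attaches the fine-tuned adapter on top.
    model = AutoPeftModelForCausalLM.from_pretrained("user/my-qlora-adapter")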
1 vote
0 answers
113 views

PEFT model from checkpoint leading to size mismatch

I have trained a PEFT model and saved it to Hugging Face. Now I want to merge it with the base model. I have used the following code: from peft import PeftModel, PeftConfig, AutoPeftModelForCausalLM from ...
Sandun Tharaka
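For reference, a minimal merge sketch (model ids are placeholders); a size mismatch at this step often means the base checkpoint is not the exact model the adapter was trained on:

    from peft import PeftModel
    from transformers import AutoModelForCausalLM

    # The base checkpoint must match the one used for training;
    # a different size or variant is a common cause of shape mismatches.
    base = AutoModelForCausalLM.from_pretrained("base-model-id")
    model = PeftModel.from_pretrained(base, "user/my-adapter")
    merged = model.merge_and_unload()   # fold the LoRA weights into the base
    merged.save_pretrained("merged-model")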
0 votes
1 answer
134 views

half() is not supported for quantized model when using a fine-tuned model

I have fine-tuned a Llama-3 model (model_name="meta-llama/Meta-Llama-3-8B") in the standard way per this notebook: https://colab.research.google.com/drive/1Zmaceu65d7w4Tcd-cfnZRb6k_Tcv2b8g?usp=...
M80
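half() rewrites parameter dtypes, which bitsandbytes-quantized weights do not allow. A hedged sketch of the usual alternatives (the model id is the question's, other values are assumptions):

    import torch
    from transformers import AutoModelForCausalLM, BitsAndBytesConfig

    # Option 1: if fp16 is the goal, load unquantized at that dtype.
    model_fp16 = AutoModelForCausalLM.from_pretrained(
        "meta-llama/Meta-Llama-3-8B", torch_dtype=torch.float16
    )

    # Option 2: keep the quantization and skip .half(); the compute
    # dtype is set via the quantization config instead.
    quant = BitsAndBytesConfig(load_in_4bit=True,
                               bnb_4bit_compute_dtype=torch.float16)
    model_4bit = AutoModelForCausalLM.from_pretrained(
        "meta-llama/Meta-Llama-3-8B", quantization_config=quant
    )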
0 votes
0 answers
9 views

Can I add a composition block of different adapter types in the adapterhub library?

I am using the adapterhub library to implement PEFT methods for fine-tuning. I have a question, please: can I stack prefix tuning with a bottleneck adapter? Something like a config union, but as ...
Rawhani
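The adapters library (the successor to adapter-transformers) exposes ConfigUnion for this kind of stacking; a sketch assuming that library and a placeholder base model:

    from adapters import AutoAdapterModel, ConfigUnion, PrefixTuningConfig, SeqBnConfig

    model = AutoAdapterModel.from_pretrained("bert-base-uncased")

    # ConfigUnion combines several methods into one named adapter setup:
    # prefix tuning stacked with a sequential bottleneck adapter.
    config = ConfigUnion(PrefixTuningConfig(), SeqBnConfig())
    model.add_adapter("prefix_plus_bottleneck", config=config)
    model.train_adapter("prefix_plus_bottleneck")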
0 votes
1 answer
252 views

Difference between GGUF and LoRA

Does the GGUF format perform model quantization even though the model is already quantized with LoRA? Hello! I'm new to LLMs, and I've fine-tuned the CodeLlama model on Kaggle using LoRA. I've merged and ...
Samar
0 votes
0 answers
36 views

KeyError: 'input_ids' arises when I use prompt-tuned CodeT5 for inference

I successfully prompt-tuned CodeT5; however, I can't use the fine-tuned model for inference. It shows KeyError: 'input_ids': Traceback (most recent call last): File "/home/liangpeng/project/...
Mabel
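A KeyError on 'input_ids' at inference time usually means the model received raw text or an untokenized dict; a sketch of tokenizing first (the adapter path is a placeholder, and CodeT5 being encoder-decoder motivates the Seq2Seq classes):

    from peft import AutoPeftModelForSeq2SeqLM
    from transformers import AutoTokenizer

    model = AutoPeftModelForSeq2SeqLM.from_pretrained("my-codet5-prompt-tuned")
    tokenizer = AutoTokenizer.from_pretrained("Salesforce/codet5-base")

    inputs = tokenizer("summarize: def add(a, b): return a + b",
                       return_tensors="pt")
    # Unpacking the tokenized dict supplies the 'input_ids' the model expects.
    outputs = model.generate(**inputs, max_new_tokens=64)
    print(tokenizer.decode(outputs[0], skip_special_tokens=True))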
0 votes
0 answers
26 views

Seeking a more efficient way to use caching to train my LoRA model

When I was fine-tuning a Llama-2 model using LoRA, I came across a problem. The instruction dataset goes something like this: "Here's the background to the problem... (1000 identical words)... Now ...
Forrest
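One low-tech option, sketched below with placeholder text, is to tokenize the shared background once and concatenate token ids per example; this does not reuse attention states across examples (that would require past_key_values plumbing), but it removes the repeated tokenization of the 1000 identical words:

    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")

    # Tokenize the shared background a single time...
    background = "Here's the background to the problem..."  # the ~1000 shared words
    prefix_ids = tokenizer(background, add_special_tokens=False)["input_ids"]

    def encode(suffix):
        # ...then tokenize only the part that varies per example.
        suffix_ids = tokenizer(suffix, add_special_tokens=False)["input_ids"]
        return {"input_ids": prefix_ids + suffix_ids}

    example = encode("Now answer question 1.")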
1 vote
0 answers
106 views

What is the difference between merging LoRA weights with the base model and not merging them in LLaMA-2 (LLM)?

The question is regarding LLMs (large language models). I want to understand it from the LLaMA-2 perspective. Can someone explain why the final outcome is almost the same without combining the weights? Additionally, ...
XGB
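The outcomes match because merging only precomputes the same algebra: the adapted layer computes y = x(W + (alpha/r)BA)^T whether or not BA has been folded into W. A tiny check in torch (dimensions arbitrary):

    import torch

    d, r, alpha = 16, 4, 8
    x = torch.randn(2, d)
    W = torch.randn(d, d)
    A = torch.randn(r, d)   # LoRA down-projection
    B = torch.randn(d, r)   # LoRA up-projection
    scale = alpha / r

    # Unmerged: base path plus the low-rank side path.
    y_unmerged = x @ W.T + scale * (x @ A.T) @ B.T
    # Merged: the low-rank update folded into the weight first.
    y_merged = x @ (W + scale * (B @ A)).T

    print(torch.allclose(y_unmerged, y_merged, atol=1e-5))  # True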
1 vote
2 answers
1k views

Hugging Face Transformers train function throwing Device() received an invalid combination of arguments

I was trying to train a model with PEFT QLoRA training. The LoRA config and PEFT training args are like below: lora_config = LoraConfig( r=8, lora_alpha=16, target_modules=[ "...
Syed Mohammad Fahim Abrar
1 vote
0 answers
377 views

How do I save a Hugging Face LLM into shards?

I am following the fine-tuning guide on the following website: https://www.labellerr.com/blog/hands-on-with-fine-tuning-llm/ I have successfully fine-tuned the Falcon-7b model on a dataset from ...
Muhammad Omar Farooq
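For reference, save_pretrained accepts a max_shard_size argument that splits the checkpoint into shards plus an index file; a sketch (the model id and shard size are placeholders):

    from transformers import AutoModelForCausalLM

    model = AutoModelForCausalLM.from_pretrained("tiiuae/falcon-7b")  # or your fine-tuned dir
    # Writes ~2 GB shards plus a weight-index json that from_pretrained
    # reassembles transparently on load.
    model.save_pretrained("falcon-7b-sharded", max_shard_size="2GB")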
