
All Questions

0 votes
0 answers
24 views

Vertex AI Studio: fine-tuned chat-bison@002 returns results that are not in the training data

I have a training dataset of about 1,500 samples in one JSONL file. I tried to fine-tune the chat-bison@002 model, but none of the answers to the test prompts is the desired one. Even when I try to copy a short ...
asked by nogias
0 votes
1 answer
44 views

Different results for the same epoch using different number of total epochs

I am training a machine learning model for an STS task using the Sentence Transformers library. When I was testing it, I noticed that my model generated different results for the same number of epochs ...
asked by Hígor Hahn
0 votes
0 answers
36 views

What's the correct data structure and format to fine-tune an OpenAI assistant as a vector file?

I'm trying to fine-tune an assistant based on the gpt-3.5-turbo model to answer with numbers and bullets, but the bullets never show up. I created both docx and txt files with different data formats, like: Title: ...
asked by PHP User
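For context on questions like the one above: fine-tuning gpt-3.5-turbo takes a JSONL training file in OpenAI's chat format, one JSON object with a "messages" list per line. The example content below (system prompt, list items) is illustrative only:

```python
import json

# One training example in the chat format OpenAI's fine-tuning API expects
# for gpt-3.5-turbo: each line of the .jsonl file is an object holding a
# "messages" list of system/user/assistant turns.
example = {
    "messages": [
        {"role": "system", "content": "You format answers as numbered lists."},
        {"role": "user", "content": "List three cleaning steps."},
        {"role": "assistant", "content": "1. Dust surfaces\n2. Vacuum floors\n3. Mop"},
    ]
}

line = json.dumps(example)  # one line of the .jsonl training file
parsed = json.loads(line)
assert parsed["messages"][2]["role"] == "assistant"
```

Note that list formatting in the desired output (numbers, bullets) has to appear verbatim in the assistant turns of the training examples; the file format itself carries no styling.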
0 votes
0 answers
39 views

How To Train GPT-3 On Different Datasets For Different Clients

I am planning on building an AI SaaS product that would focus on CRM and customer support. However, the bot should know about the company it is answering for. I checked their docs and did ...
asked by Bob joe12
0 votes
1 answer
53 views

Retrieving relevant documents for specific queries

I am trying to retrieve the top 5 relevant documents related to a user's query using the RAG-Token model. I'm using a custom knowledge base and I tried adjusting the retrieval parameters. This is the ...
asked by Rhett
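The retrieval step this question asks about can be illustrated with a toy top-k similarity search in pure Python. This is only a sketch of the idea: the actual RAG-Token retriever uses a dense DPR index over learned embeddings, and `cosine`/`top_k` here are hypothetical helpers:

```python
import math

# Toy top-k retrieval: score every document embedding against the query
# embedding with cosine similarity and return the k best indices.
def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def top_k(query_vec, doc_vecs, k=5):
    scored = [(cosine(query_vec, d), i) for i, d in enumerate(doc_vecs)]
    return [i for _, i in sorted(scored, reverse=True)[:k]]
```

In a real RAG pipeline the same query/document scoring happens inside an approximate-nearest-neighbour index rather than a Python loop, but the tuning knobs (k, similarity threshold) play the same role.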
-2 votes
1 answer
44 views

Training data for ChatGPT won't work correctly

I am trying to train a model in ChatGPT (gpt-3.5-turbo-1106) for my cleaning business. Based on the documentation, I created and uploaded the training data successfully, but the answer I am getting ...
asked by user3570022
0 votes
0 answers
17 views

"Exception has occurred" error when accessing a saved model

I receive this error when calling my saved fine-tuned GPT-2 model: Exception has occurred: OSError Error no file named pytorch_model.bin, tf_model.h5, model.ckpt.index or flax_model.msgpack found in ...
asked by Bayan
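The OSError in this question means `from_pretrained()` found no weight file in the directory it was given, usually because the model was never saved with `save_pretrained()`. A stdlib-only sketch of the check (the file names come from the error message itself; `has_weights` is a hypothetical helper, and `model.safetensors` is the newer default alongside `pytorch_model.bin`):

```python
from pathlib import Path

# Weight file names that Transformers' from_pretrained() looks for.
WEIGHT_FILES = {
    "pytorch_model.bin", "model.safetensors", "tf_model.h5",
    "model.ckpt.index", "flax_model.msgpack",
}

def has_weights(model_dir: str) -> bool:
    """Return True if the directory contains at least one weight file."""
    return any((Path(model_dir) / name).exists() for name in WEIGHT_FILES)

# Typical fix: after training, call model.save_pretrained("my_model_dir")
# and load with GPT2LMHeadModel.from_pretrained("my_model_dir").
```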
-1 votes
1 answer
88 views

Limited Labeled Data for Fine-tuning LLMs?

Fine-tuning LLMs for specific domains is attractive, but what about scenarios with limited labeled data? Can unlabeled data or alternative approaches be effective? Looking for insights on best ...
asked by kodexolabs
0 votes
0 answers
135 views

Most efficient way to fine-tune SDXL for a range of products

I want to fine-tune SDXL using LoRA on a range of products so that SDXL can generate images of those products later on. I have many products. What is the most efficient way to fine-tune? Do I just train ...
asked by DevEnma
0 votes
1 answer
2k views

Fine-tuning: llama-2-13b-chat

For fine-tuning of large language models (Llama 2), what should be the format (.txt/.json/.csv) and structure (e.g. an Excel or Docs file, prompt/response pairs, or instruction/output pairs) ...
asked by aiwesee
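One common answer to format questions like this one: a plain-text JSONL file with one instruction/output object per line. The field names below are one widespread convention (Alpaca-style), not something Llama 2 itself mandates, and the record content is illustrative:

```python
import json

# A single supervised fine-tuning record in the instruction/output style
# commonly used with Llama-family chat models.
record = {
    "instruction": "Summarise the refund policy.",
    "output": "Refunds are issued within 14 days of purchase.",
}

# JSONL means exactly one JSON object per line: serialise each record,
# write one per line, and verify it round-trips.
line = json.dumps(record)
parsed = json.loads(line)
assert set(parsed) == {"instruction", "output"}
```

Whatever the field names, the key point is that each line is a self-contained JSON object; spreadsheet or docx formats have to be converted to this shape before training.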
0 votes
1 answer
438 views

Google Colab Free Tier: Code Stops at 51,000 Examples While Fine-tuning Llama 2 with a Custom Dataset

I'm encountering an issue while fine-tuning Llama 2 on Google Colab using a custom dataset. The code halts at exactly 51,000 examples during the training process, even though my dataset contains 61,...
asked by CreekSi0
0 votes
1 answer
1k views

I am attempting to fine-tune Stable Diffusion with Dreambooth on myself (my face and body)

I am attempting to fine-tune Stable Diffusion with Dreambooth on myself (my face and body), but the results are not satisfactory. I am seeking guidance on the best way to fine-tune Stable ...
asked by Arthur Hakobyan
1 vote
0 answers
126 views

Transfer learning (or fine-tuning) a pre-trained model on non-text data

I am currently fine-tuning a sentiment-analysis BERT-based model using the PyTorch Trainer from Hugging Face. So far, so good. I have easily managed to fine-tune the model on my text data. However, I'd ...
asked by corvusMidnight
4 votes
2 answers
2k views

What are the differences between adapter tuning and prefix tuning? [closed]

I am trying to understand the concept of adapter-tuning, prompt-tuning, and prefix-tuning in the context of few-shot learning. It appears to me that I can apply prompt tuning to a black box language ...
asked by Exploring
5 votes
3 answers
8k views

What are the differences between fine tuning and few shot learning?

I am trying to understand the concept of fine-tuning and few-shot learning. I understand the need for fine-tuning. It is essentially tuning a pre-trained model to a specific downstream task. However, ...
asked by Exploring
