An Example of Constructing Prompts with Self-Instruct

  1. Manually write a small seed set of prompts (starting with a small seed set of human-written tasks).
  2. In each iteration, place a sample of seed prompts plus previously generated prompts into the input as few-shot examples, and use the LLM to generate more prompts (using the LLM to generate new instructions based on the seed tasks).
  3. Filter out the low-quality generations and refine the usable ones (filtering and refining the generated instructions).
  4. Feed all the generated prompts to the LLM to obtain outputs (creating input-output instances for the new instructions).
  5. Use the input-output pairs as training samples to fine-tune the LLM (using the generated dataset to fine-tune the LLM).

Step 2, generation with the LLM:

import random
from transformers import AutoTokenizer, AutoModelForCausalLM

# Load a pre-trained language model
model_name = "bigcode/starcoderbase-1b"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Seed tasks (simplified for demonstration)
seed_tasks = [
    "Write a function to calculate the factorial of a number.",
    "Create a class to represent a bank account.",
    "Implement a binary search algorithm."
]

def generate_instruction(prompt):
    inputs = tokenizer(prompt, return_tensors="pt")
    # Sample rather than greedy-decode, so repeated iterations can
    # produce different tasks instead of the same completion every time
    outputs = model.generate(**inputs, max_new_tokens=50, do_sample=True, top_p=0.95)
    # Decode only the newly generated tokens, not the echoed prompt,
    # and keep just the first line as the new task
    new_tokens = outputs[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip().split("\n")[0]

def self_instruct(num_iterations):
    generated_tasks = []
    
    for _ in range(num_iterations):
        # Sample existing tasks
        sampled_tasks = random.sample(seed_tasks + generated_tasks, min(3, len(seed_tasks) + len(generated_tasks)))
        
        # Create a prompt for generating new instructions
        prompt = "Generate a new programming task based on these examples:\n\n"
        prompt += "\n".join(sampled_tasks)
        prompt += "\n\nNew task:"
        
        # Generate a new instruction
        new_task = generate_instruction(prompt)
        
        # In practice, you would filter and refine the generated task here
        
        generated_tasks.append(new_task)
    
    return generated_tasks

# Run Self-Instruct
new_tasks = self_instruct(5)
for i, task in enumerate(new_tasks, 1):
    print(f"Task {i}: {task}")

Step 3, filtering:

Manually define some rules to filter out the worst generations (an LLM can also be used as a judge).

Goal: ensure both quality and diversity.

  • Filter out instructions that are too short or too long
  • Filter out instructions containing keywords unsuitable for language models (e.g. "image", "graph", "file", "plot")
  • Filter out instructions starting with punctuation
  • Filter out instructions starting with non-English characters
  • Filter out instructions that have high ROUGE-L similarity (above 0.7) with any existing instruction in the task pool
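The rules above can be sketched in code. This is a minimal illustration, not the paper's exact implementation: the length bounds are assumed values, and ROUGE-L (LCS-based F1) is computed with a small pure-Python routine so no external package is needed.

```python
# Illustrative keyword list for tasks a text-only LLM cannot perform
UNSUITABLE_KEYWORDS = {"image", "graph", "file", "plot"}

def rouge_l(candidate, reference):
    """ROUGE-L F1 between two strings, computed over whitespace tokens."""
    a, b = candidate.split(), reference.split()
    if not a or not b:
        return 0.0
    # Longest common subsequence via dynamic programming
    dp = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i, x in enumerate(a, 1):
        for j, y in enumerate(b, 1):
            dp[i][j] = dp[i - 1][j - 1] + 1 if x == y else max(dp[i - 1][j], dp[i][j - 1])
    lcs = dp[len(a)][len(b)]
    if lcs == 0:
        return 0.0
    precision, recall = lcs / len(a), lcs / len(b)
    return 2 * precision * recall / (precision + recall)

def keep_instruction(task, pool, min_words=3, max_words=150):
    """Apply the step-3 heuristics; return True if the task should be kept."""
    words = task.split()
    if not (min_words <= len(words) <= max_words):   # too short or too long
        return False
    if any(kw in task.lower() for kw in UNSUITABLE_KEYWORDS):
        return False
    if task[0] in ".,;:!?-":                          # starts with punctuation
        return False
    if not task[0].isascii():                         # starts with a non-English character
        return False
    # Reject near-duplicates of anything already in the task pool
    return all(rouge_l(task, existing) <= 0.7 for existing in pool)
```

A kept task is then appended to the pool so that later candidates are deduplicated against it as well.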

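Steps 4 and 5 can be sketched as follows — a hypothetical helper that turns each surviving instruction into an (instruction, output) training pair and dumps the pairs as JSONL. `generate_fn`, the prompt template, and the field names are illustrative assumptions; in practice `generate_fn` would be a thin wrapper around `model.generate` from the step-2 code.

```python
import json

def build_training_samples(instructions, generate_fn):
    """Step 4: create an (instruction, output) instance for each instruction.

    `generate_fn` maps a prompt string to the model's completion string.
    """
    samples = []
    for instruction in instructions:
        prompt = f"Instruction: {instruction}\nResponse:"
        response = generate_fn(prompt).strip()
        samples.append({"instruction": instruction, "output": response})
    return samples

def save_jsonl(samples, path):
    """Step 5 (data prep): write the pairs as JSONL for fine-tuning."""
    with open(path, "w", encoding="utf-8") as f:
        for sample in samples:
            f.write(json.dumps(sample, ensure_ascii=False) + "\n")
```

The resulting JSONL file is the dataset for step 5: supervised fine-tuning of the LLM on the generated instruction-output pairs.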