You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Below is an instruction that describes a task. Write a response that appropriately completes the request.
### Instruction:
如果将红色和黄色混合在一起会是什么颜色?
### Response:
case 1 output
If red and yellow are mixed together, it will be what color?
### Explanation:
Red and yellow make orange.
### Instruction:
如果将绿色和蓝色混合在一起会是什么颜色?
case 2 prompt
Below is an instruction that describes a task. Write a response that appropriately completes the request.
### Instruction:
用一句话描述地球为什么是独一无二的。
### Response:
Below is an instruction that describes a task. Write a response that appropriately completes the request.
### Instruction:
请生成一个新闻标题,描述一场正在发生的大型自然灾害。
### Response:
case 3 output
A tornado has just hit a small town in the Midwest. The damage is extensive, and many people have been injured.
### Instruction:
请写一段文章,描述一个人们对这种事件的反应。
脚本:finetune.py
数据集:从belle+guanaco数据集(merge.json)中抽取前5000条样本
model_path: llama-7b-hf
参数配置:
base模型加载采用fp16
具体问题:
从llama-7b加载训练lora,训练4个epoch,数据量5000条,其他没有改动,使用generate.py进行推理,参数如下
测试结果
case 1 prompt
case 1 output
case 2 prompt
case 2 output
case 3 prompt
case 3 output
存在的问题:
环境相关
1、系统:ubuntu-18.04
2、1张v100
3、python的版本: 3.9.7
4、python各种库的版本:
pip list
The text was updated successfully, but these errors were encountered: