Add reft method #8723

TranscenderNing · 2024-07-07T08:43:05Z

PR types

New feature

PR changes

Add reft in paddlenlp/peft/reft
reft
├── pareft
│ ├── config.py reft配置文件，继承pavenv.config
│ ├── dataset.py reft数据处理，reft方法输入会有一个新的intervention_locations字段，表示干预的token的位置，例如f5+l5表示干预输入的前5个tokne和后5个token
│ ├── init.py
│ ├── interventions.py 干预网络
│ ├── reft_model.py 创建reft模型，继承pavenv.interventableModel
│ ├── reft_trainer.py 重写compute_loss方法，reft方法需要根据配置中的position参数干预对应位置token的隐藏表示
│ └── utils.py 工具类
└── pavenv
├── init.py
└── models
├── basic_utils.py 基础的工具类
├── configuration_intervenable_model.py 创建干预方法的配置
├── constants.py 常量
├── init.py
├── intervenable_base.py 这个是方法实现的主要类，在该类中模型添加orward_post_hook，在前向传播过程中hook中提取干预位置的向量，将提取的向量输入干预模型，将干预模型的结果替换对应位置的向量
├── intervenable_modelcard.py 模型配置的信息
├── interventions.py 所有干预网络的父类
├── llama
│ └── modelings_intervenable_llama.py llama模型的基础配置
└── modeling_utils.py 模型的一些工具类

Description

paddle-bot · 2024-07-07T08:43:10Z

Thanks for your contribution!

CLAassistant · 2024-07-07T08:43:11Z

All committers have signed the CLA.

codecov · 2024-07-07T09:13:11Z

Codecov Report

Attention: Patch coverage is 19.14043% with 1618 lines in your changes missing coverage. Please review.

Project coverage is 54.77%. Comparing base (ee4944e) to head (2975154).

❗ Current head 2975154 differs from pull request most recent head fdfb8c5

Please upload reports for the commit fdfb8c5 to get more accurate results.

Files	Patch %	Lines
paddlenlp/reft/pavenv/models/intervenable_base.py	9.88%	711 Missing ⚠️
paddlenlp/reft/pareft/dataset.py	18.29%	259 Missing ⚠️
paddlenlp/reft/pavenv/models/modeling_utils.py	14.22%	193 Missing ⚠️
paddlenlp/reft/pavenv/models/intervention_utils.py	15.65%	97 Missing ⚠️
paddlenlp/reft/pavenv/models/interventions.py	37.81%	74 Missing ⚠️
.../pavenv/models/configuration_intervenable_model.py	12.98%	67 Missing ⚠️
paddlenlp/reft/pavenv/models/basic_utils.py	26.31%	56 Missing ⚠️
paddlenlp/reft/pareft/reft_trainer.py	32.92%	55 Missing ⚠️
paddlenlp/reft/pareft/interventions.py	25.86%	43 Missing ⚠️
paddlenlp/reft/pareft/reft_model.py	25.71%	26 Missing ⚠️
... and 6 more

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #8723      +/-   ##
===========================================
- Coverage    55.44%   54.77%   -0.67%     
===========================================
  Files          631      645      +14     
  Lines        98542   100066    +1524     
===========================================
+ Hits         54632    54812     +180     
- Misses       43910    45254    +1344

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

lugimzzz · 2024-07-19T06:49:06Z

llm/utils/argument.py

+    share_weights: bool = field(default=True, metadata={"help": "Flag indicating whether to share weights."})
+    greedy_decoding: bool = field(default=True, metadata={"help": "Flag indicating whether to use greedy decoding."})
+    temperature: float = field(default=None, metadata={"help": "Temperature parameter for decoding."})
+    num_hidden_layers: int = field(default=32)


REFT如果有比较多的参数建议另开一个REFTArgument，提交的pr建议筛选出必要的argument

lugimzzz · 2024-07-19T06:49:44Z

llm/utils/compute_metrics.py

+    return DataLoader(dataset, shuffle=shuffle, batch_size=batch_size, collate_fn=collate_fn)
+
+
+def compute_metrics_reft(


这个compute_metrics如果不是通用必要的建议删除

lugimzzz · 2024-07-19T06:50:24Z

llm/utils/data.py

+
+
+# paddle version label 错开
+class LoReftSupervisedDataset(ReftDataset):


新开这个dataset的原因是什么，原有的无法复用吗

reft方法输入会有一个新的intervention_locations字段，表示干预的token的位置，例如f5+l5表示干预输入的前5个tokne和后5个token

lugimzzz · 2024-07-19T06:51:04Z

paddlenlp/__init__.py

@@ -45,6 +45,7 @@
    peft,
    prompt,
    quantization,
+    reft,


reft建议放在peft目录下

lugimzzz · 2024-07-19T06:52:14Z

paddlenlp/reft/pareft/layers.py

+        print("n,m", n, m)
+
+        # weight_attr = paddle.ParamAttr(initializer=paddle.nn.initializer.Orthogonal())
+        # linear = paddle.nn.Linear(10, 15, weight_attr=weight_attr)


去掉无关的comment还有在pr里描述一下哪些是reft方法，哪些是你的方法

lugimzzz · 2024-07-22T06:56:19Z

paddlenlp/reft/pareft/dataset.py

+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+IGNORE_INDEX = -100


为什么reft需要单独的dataset？

reft方法输入会有一个新的intervention_locations字段，表示干预的token的位置，例如f5+l5表示干预输入的前5个tokne和后5个token

lugimzzz · 2024-07-22T06:57:23Z

paddlenlp/reft/pareft/reft_trainer.py

+from dataclasses import dataclass
+from typing import Dict, Sequence
+
+import paddle


reft为什么需要单独trainer？

paddlenlp/reft/pavenv/models/__init__.py

…to reft-branch

Add reft method

5d7c99a

paddle-bot bot added the contributor label Jul 7, 2024

paddle-bot bot assigned wj-Mcat Jul 7, 2024

lugimzzz reviewed Jul 19, 2024

View reviewed changes

TranscenderNing added 2 commits July 19, 2024 15:06

删除无关模型文件

07d8cc7

删除日志文件

2975154

lugimzzz reviewed Jul 22, 2024

View reviewed changes

sijunhe added the Beijing Innovation Consortium label Jul 28, 2024

TranscenderNing and others added 5 commits July 28, 2024 17:16

remove redundant code

f2579ce

delete unsing files

0fc747f

Merge branch 'develop' into reft-branch

76c7ef1

Merge branch 'reft-branch' of github.com:TranscenderNing/PaddleNLP in…

3e6d0da

…to reft-branch

delete unusing files

fdfb8c5

TranscenderNing closed this Jul 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add reft method #8723

Add reft method #8723

TranscenderNing commented Jul 7, 2024 •

edited

Loading

paddle-bot bot commented Jul 7, 2024

CLAassistant commented Jul 7, 2024 •

edited

Loading

codecov bot commented Jul 7, 2024 •

edited

Loading

lugimzzz Jul 19, 2024

TranscenderNing Jul 27, 2024

lugimzzz Jul 19, 2024

lugimzzz Jul 19, 2024

TranscenderNing Jul 27, 2024

lugimzzz Jul 19, 2024

TranscenderNing Jul 23, 2024

TranscenderNing Jul 27, 2024

lugimzzz Jul 19, 2024

TranscenderNing Jul 27, 2024

lugimzzz Jul 22, 2024

TranscenderNing Jul 27, 2024

lugimzzz Jul 22, 2024

		return DataLoader(dataset, shuffle=shuffle, batch_size=batch_size, collate_fn=collate_fn)


		def compute_metrics_reft(



		# paddle version label 错开
		class LoReftSupervisedDataset(ReftDataset):

Add reft method #8723

Add reft method #8723

Conversation

TranscenderNing commented Jul 7, 2024 • edited Loading

PR types

PR changes

Description

paddle-bot bot commented Jul 7, 2024

CLAassistant commented Jul 7, 2024 • edited Loading

codecov bot commented Jul 7, 2024 • edited Loading

Codecov Report

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

TranscenderNing commented Jul 7, 2024 •

edited

Loading

CLAassistant commented Jul 7, 2024 •

edited

Loading

codecov bot commented Jul 7, 2024 •

edited

Loading