Files
T
2025-08-23 15:55:07 +08:00

12 lines
518 B
Markdown
Raw Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
一种Adapter-tuning的实现方式,只提供的思路,具体可以视情况稍微修改。
这里补充一些模型层数:
GPT-2 Small12个GPT2Block,约有1.17亿个参数。
GPT-2 Medium24个GPT2Block,约有3.48亿个参数。
GPT-2 Large36个GPT2Block,约有7.55亿个参数。
GPT-2 XL (也称为Extra Large)48个GPT2Block,约有15.54亿个参数。
RoBERTa Base12个RobertaLayer,总共约有1.25亿个参数。
RoBERTa Large24个RobertaLayer,总共约有3.55亿个参数。