JZTXT
  • 首页
  • Ai
  • Java
  • Python
  • Android
  • Mysql
  • JavaScript
  • Html
  • CSS

llama-factory fine-tuning-3 (conception and technologies explanation)

发布时间 2023-11-29 15:45:50作者: Daze_Lu

train method

supervised fine-tuning

 

Reward Modeling

 

PPO training

 

DPO training

 

full-parameter

 

partial-parameter

 

LoRA

 

QLoRA

 

command parameter

fp16

 

gradient_accumulation_steps

 

lr_scheduler_type

 

lora_target

 

overwrite_cache

 

stage

 

    本栏目推荐文章
  • [cpp]: concept --<template>
  • The Evolution of Smart Car Technology: A Glimpse into the Future of Mobility
  • 信息与通信技术(ICT,information and communications technology)
  • 大语言模型优化方法简介:Prompt、RAG、Fine-tuning
  • 南阳 南阳科技职业学院 外文名Nanyang Vocational College of science and technology
  • 浙江科技大学(Zhejiang University of Science and Technology)
  • llama-factory fine-tuning 4 (mixtral fine-tuning)
  • llama-factory fine-tuning 3
  • 生物信息学和生物医学技术国际会议(ICBBT---International Conference on Bioinformatics and Biomedical Technology (ICBBT
  • P5048 [Ynoi2019 模拟赛] Yuno loves sqrt technology III
版权声明:本网站为非赢利性站点,本网站所有内容均来源于互联网相关站点自动搜索采集信息,相关链接已经注明来源。
联系我们