中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid

Browse/search results: 4 items in total, showing items 1-4

Large Language Models are Parallel Multilingual Learners  Journal article  OAI harvested
arXiv, 2024
Authors:
Mu, Yongyu;  Feng, Peinan;  Cao, Zhiquan;  Wu, Yuzhang;  Li, Bei
Views/Downloads: 25/0  |  Submitted: 2024/04/15
RoVRM: A Robust Visual Reward Model Optimized via Auxiliary Textual Preference Data  Journal article  OAI harvested
arXiv, 2024, Pages: 14
Authors:
Chenglong Wang;  Yang Gan;  Yifu Huo;  Yongyu Mu;  Murun Yang
Views/Downloads: 12/0  |  Submitted: 2024/09/23
Hybrid Alignment Training for Large Language Models  Journal article  OAI harvested
arXiv, 2024
Authors:
Chenglong Wang;  Hang Zhou;  Kaiyan Chang;  Bei Li;  Yongyu Mu
Views/Downloads: 15/0  |  Submitted: 2024/07/23
LRHP: Learning Representations for Human Preferences via Preference Pairs  Journal article  OAI harvested
arXiv, 2024
Authors:
Chenglong Wang;  Yang Gan;  Yifu Huo;  Yongyu Mu;  Qiaozhi He
Views/Downloads: 10/0  |  Submitted: 2024/12/03