PubCard - Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm

(Preprint) Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm

Lin Bo ¹, Liang Pang 庞亮 ³, Gang Wang ⁴, Jun Xu 徐君 ², XiuQiang He 何秀强 ⁴, Ji-Rong Wen 文继荣 ²

¹ School of Information, Renmin University of China, Beijing, China
² Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China
³ Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
⁴ Huawei Noah’s Ark Lab, Hong Kong, China

arXiv, 2021-08-12

https://arxiv.org/abs/2108.05652

Abstract

Recently, pre-trained language models such as BERT have been applied to document ranking for information retrieval. These methods usually first pre-train a general language model on a large unlabeled corpus and then conduct ranking-specific fine-tuning on expert-labeled relevance datasets. Though preliminary successes have been observed in a variety of IR tasks, considerable room remains for improvement.

Ideally, an IR system would model relevance from a user-system dualism: the user's view and the system's view. The user's view judges relevance based on the activities of “real users”, while the system's view focuses on relevance signals from the system side, e.g., from experts or algorithms. Inspired by the user-system relevance views and the success of pre-trained language models, in this paper we propose a novel ranking framework called Pre-Rank that takes both the user's view and the system's view into consideration under the pre-training and fine-tuning paradigm. Specifically, to model the user's view of relevance, Pre-Rank pre-trains the initial query-document representations on large-scale user activity data such as click logs. To model the system's view of relevance, Pre-Rank further fine-tunes the model on expert-labeled relevance data. More importantly, the pre-trained representations are fine-tuned together with handcrafted learning-to-rank features under a wide and deep network architecture. In this way, Pre-Rank models relevance by incorporating knowledge and signals from both real search users and IR experts.

To verify the effectiveness of Pre-Rank, we implemented it in two ways, using BERT and SetRank respectively as the underlying ranking model. Experimental results on three publicly available benchmarks show that, in both implementations, Pre-Rank outperforms its underlying ranking model and achieves state-of-the-art performance. The results demonstrate the effectiveness of Pre-Rank in combining the user and system views of relevance.
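
As a rough illustration of the wide and deep combination described in the abstract, the sketch below scores a query-document pair from two inputs: a "deep" vector standing in for the pre-trained query-document representation and a "wide" vector standing in for handcrafted learning-to-rank features. It is a minimal sketch under stated assumptions, not the authors' implementation. The function names, the single linear scoring layer, and all numeric values are hypothetical; the abstract does not specify the layers, the features, or the training procedure.

```python
def wide_deep_score(deep_repr, wide_feats, w_deep, w_wide, bias=0.0):
    """Linear score over the concatenation [deep_repr ; wide_feats].

    A linear layer on a concatenated vector equals the sum of two dot
    products, one per part. This layer is an assumption for illustration.
    """
    deep_part = sum(w * x for w, x in zip(w_deep, deep_repr))
    wide_part = sum(w * x for w, x in zip(w_wide, wide_feats))
    return deep_part + wide_part + bias


def rank_documents(candidates, w_deep, w_wide):
    """Return document ids sorted by descending wide-and-deep score."""
    scored = [
        (wide_deep_score(c["deep"], c["wide"], w_deep, w_wide), c["doc_id"])
        for c in candidates
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc_id for _, doc_id in scored]


# Hypothetical toy data: "deep" would come from a model pre-trained on
# click logs, "wide" from handcrafted features such as a BM25 score.
candidates = [
    {"doc_id": "d1", "deep": [0.2, 0.1], "wide": [1.0]},
    {"doc_id": "d2", "deep": [0.9, 0.4], "wide": [0.0]},
    {"doc_id": "d3", "deep": [0.1, 0.0], "wide": [3.0]},
]
w_deep = [1.0, 1.0]  # hypothetical weights; in practice learned in fine-tuning
w_wide = [0.5]

ranking = rank_documents(candidates, w_deep, w_wide)
print(ranking)
```

In this toy setup the handcrafted feature lifts d3 above d2 even though d2 has the stronger deep representation, which is the kind of complementary signal the two views are meant to provide.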

