(Preprint) Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm

Lin Bo ¹, Liang Pang 庞亮 ³, Gang Wang ⁴, Jun Xu 徐君 ², XiuQiang He 何秀强 ⁴, Ji-Rong Wen 文继荣 ²

¹ School of Information, Renmin University of China, Beijing, China
² Gaoling School of Artificial Intelligence, Renmin University of China, Beijing, China
³ Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
⁴ Huawei Noah’s Ark Lab, Hong Kong, China

arXiv, 2021-08-12

https://arxiv.org/abs/2108.05652

Abstract

Recently, pre-trained language models such as BERT have been applied to document ranking for information retrieval. These methods usually first pre-train a general language model on a large unlabeled corpus and then conduct ranking-specific fine-tuning on expert-labeled relevance datasets. Though preliminary successes have been observed in a variety of IR tasks, much room remains for further improvement.

Ideally, an IR system would model relevance from a user-system dualism: the user's view and the system's view. The user's view judges relevance based on the activities of “real users”, while the system's view focuses on relevance signals from the system side, e.g., from experts or algorithms. Inspired by these two views of relevance and the success of pre-trained language models, in this paper we propose a novel ranking framework called Pre-Rank that takes both the user's view and the system's view into consideration under the pre-training and fine-tuning paradigm. Specifically, to model the user's view of relevance, Pre-Rank pre-trains the initial query-document representations on large-scale user activity data such as click logs. To model the system's view of relevance, Pre-Rank further fine-tunes the model on expert-labeled relevance data. More importantly, the pre-trained representations are fine-tuned together with handcrafted learning-to-rank features under a wide and deep network architecture. In this way, Pre-Rank models relevance by incorporating knowledge and signals from both real search users and IR experts.

To verify the effectiveness of Pre-Rank, we present two implementations that use BERT and SetRank, respectively, as the underlying ranking model. Experimental results on three publicly available benchmarks show that, in both implementations, Pre-Rank outperforms the underlying ranking model and achieves state-of-the-art performance. The results demonstrate the effectiveness of Pre-Rank in combining the user and system views of relevance.
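
The abstract describes combining pre-trained query-document representations with handcrafted learning-to-rank features under a wide and deep architecture. The following is only a minimal illustrative sketch of that general idea, not the authors' implementation: the function names, the single hidden layer, the ReLU activation, and the additive combination of the two parts are all assumptions made here for illustration, and only the scoring (forward) step is shown, not pre-training or fine-tuning.

```python
from typing import List, Sequence


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    """Inner product of two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vector lengths differ")
    return sum(x * y for x, y in zip(a, b))


def wide_deep_score(
    rep: Sequence[float],               # pre-trained query-document representation (assumed given)
    ltr_features: Sequence[float],      # handcrafted learning-to-rank features
    hidden_w: Sequence[Sequence[float]],  # hypothetical deep-part hidden layer weights
    hidden_b: Sequence[float],          # hypothetical deep-part hidden layer biases
    deep_out_w: Sequence[float],        # hypothetical deep-part output weights
    wide_w: Sequence[float],            # hypothetical wide-part linear weights
    bias: float,
) -> float:
    """Relevance score as the sum of a deep part and a wide part."""
    # Deep part: one ReLU hidden layer over the representation.
    hidden = [max(0.0, dot(w, rep) + b) for w, b in zip(hidden_w, hidden_b)]
    deep_logit = dot(deep_out_w, hidden)
    # Wide part: a linear model over the handcrafted features.
    wide_logit = dot(wide_w, ltr_features)
    return deep_logit + wide_logit + bias


def rank(scores: Sequence[float]) -> List[int]:
    """Return document indices ordered from highest to lowest score."""
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
```

In a fine-tuning setting, the weights of both parts and the representation itself would be updated jointly on the expert-labeled data; here they are passed in as fixed inputs to keep the sketch self-contained.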
