Year
Month

(Preprint) Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm
Lin Bo ¹, Liang Pang 庞亮 ³, Gang Wang ⁴, Jun Xu 徐君 ², XiuQiang He 何秀强 ⁴, Ji-Rong Wen 文继荣 ²
¹ School of Information, Renmin University of China, Beijing, China
中国 北京 中国人民大学信息学院
² Gaoling School of Artificial Intelligence, Renmin University of China, , Beijing, China
中国 北京 中国人民大学高瓴人工智能学院
³ Institute of Computing Technology, Chinese Academy of Sciences
中国 北京 中国科学院计算技术研究所
⁴ Huawei Noah’s Ark Lab
中国 香港 华为诺亚方舟实验室
arXiv , 2021-08-12
Abstract

Recently, pre-trained language models such as BERT have been applied to document ranking for information retrieval. These methods usually first pre-train a general language model on an unlabeled large corpus and then conduct ranking-specific fine-tuning on expert-labeled relevance datasets. Though reliminary successes have been observed in a variety of IR tasks, a lot of room still remains for further improvement.

Ideally, an IR system would model relevance from a user-system dualism: the user's view and the system's view. User's view judges the relevance based on the activities of “real users” while the system's view focuses on the relevance signals from the system side, e.g., from the experts or algorithms, etc. Inspired by the user-system relevance views and the success of pre-trained language models, in this paper we propose a novel ranking framework called Pre-Rank that takes both user's view and system's view into consideration, under the pre-training and fine-tuning paradigm. Specifically, to model the user's view of relevance, Pre-Rank pre-trains the initial query-document representations based on a large-scale user activities data such as the click log. To model the system's view of relevance, Pre-Rank further fine-tunes the model on expert-labeled relevance data. More importantly, the pre-trained representations, are fine-tuned together with handcrafted learning-to-rank features under a wide and deep network architecture. In this way, Pre-Rank can model the relevance by incorporating the relevant knowledge and signals from both real search users and the IR experts.

To verify the effectiveness of Pre-Rank, we showed two implementations by using BERT and SetRank as the underlying ranking model, respectively. Experimental results base on three publicly available benchmarks showed that in both of the implementations, Pre-Rank can respectively outperform the underlying ranking models and achieved state-ofthe-art performances. The results demonstrate the effectiveness of Pre-Rank in combining the user-system views of relevance.
Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm_1
Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm_2
Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm_3
Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm_4
  • Fast-zoom and high-resolution sparse compound-eye camera based on dual-end collaborative optimization
  • Yi Zheng, Hao-Ran Zhang, Xiao-Wei Li, You-Ran Zhao, Zhao-Song Li, Ye-Hao Hou, Chao Liu, Qiong-Hua Wang
  • Opto-Electronic Advances
  • 2025-06-19
  • Cascaded metasurfaces for adaptive aberration correction
  • Lei Zhang, Tie Jun Cui
  • Opto-Electronic Advances
  • 2025-05-27
  • Embedded solar adaptive optics telescope: achieving compact integration for high-efficiency solar observations
  • Naiting Gu, Hao Chen, Ao Tang, Xinlong Fan, Carlos Quintero Noda, Yawei Xiao, Libo Zhong, Xiaosong Wu, Zhenyu Zhang, Yanrong Yang, Zao Yi, Xiaohu Wu, Linhai Huang, Changhui Rao
  • Opto-Electronic Advances
  • 2025-05-27
  • Spectrally extended line field optical coherence tomography angiography
  • Si Chen, Kan Lin, Xi Chen, Yukun Wang, Chen Hsin Sun, Jia Qu, Xin Ge, Xiaokun Wang, Linbo Liu
  • Opto-Electronic Advances
  • 2025-05-27
  • Wearable photonic smart wristband for cardiorespiratory function assessment and biometric identification
  • Wenbo Li, Yukun Long, Yingyin Yan, Kun Xiao, Zhuo Wang, Di Zheng, Arnaldo Leal-Junior, Santosh Kumar, Beatriz Ortega, Carlos Marques, Xiaoli Li, Rui Min
  • Opto-Electronic Advances
  • 2025-05-27
  • Integrated photonic polarizers with 2D reduced graphene oxide
  • Junkai Hu, Jiayang Wu, Di Jin, Wenbo Liu, Yuning Zhang, Yunyi Yang, Linnan Jia, Yijun Wang, Duan Huang, Baohua Jia, David J. Moss
  • Opto-Electronic Science
  • 2025-05-22
  • Tip-enhanced Raman scattering of glucose molecules
  • Zhonglin Xie, Chao Meng, Donghua Yue, Lei Xu, Ting Mei, Wending Zhang
  • Opto-Electronic Science
  • 2025-05-22
  • Structural color: an emerging nanophotonic strategy for multicolor and functionalized applications
  • Wenhao Wang, Long Wang, Qianqian Fu, Wang Zhang, Liuying Wang, Gu Liu, Youju Huang, Jie Huang, Haoyuan Zhang, Fuqiang Guo, Xiaohu Wu
  • Opto-Electronic Science
  • 2025-04-25
  • Reconfigurable origami chiral response for holographic imaging and information encryption
  • Zhibiao Zhu, Yongfeng Li, Jiafu Wang, Ze Qin, Lixin Jiang, Yang Chen, Shaobo Qu
  • Opto-Electronic Science
  • 2025-04-25
  • Single-layer, cascaded and broadband-heat-dissipation metasurface for multi-wavelength lasers and infrared camouflage
  • Xingdong Feng, Tianqi Zhang, Xuejun Liu, Fan Zhang, Jianjun Wang, Hong Bao, Shan Jiang, YongAn Huang
  • Opto-Electronic Advances
  • 2025-04-02
  • Phase reconstruction via metasurface-integrated quantum analog operation
  • Qiuying Li, Minggui Liang, Shuoqing Liu, Jiawei Liu, Shizhen Chen, Shuangchun Wen, Hailu Luo
  • Opto-Electronic Advances
  • 2025-04-02
  • Full-dimensional complex coherence properties tomography for multi-cipher information security
  • Yonglei Liu, Siting Dai, Yimeng Zhu, Yahong Chen, Peipei Peng, Yangjian Cai, Fei Wang
  • Opto-Electronic Advances
  • 2025-03-31



  • Recursive Multi-Tensor Contraction for XEB Verification of Quantum Circuits        Differential STBC-SM Scheme for Uplink Multi-user Massive MIMO Communications: System Design and Performance Analysis
    About
    |
    Contact
    |
    Copyright © PubCard