Year
Month
(Preprint) Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm
Lin Bo ¹, Liang Pang 庞亮 ³, Gang Wang ⁴, Jun Xu 徐君 ², XiuQiang He 何秀强 ⁴, Ji-Rong Wen 文继荣 ²
¹ School of Information, Renmin University of China, Beijing, China
中国 北京 中国人民大学信息学院
² Gaoling School of Artificial Intelligence, Renmin University of China, , Beijing, China
中国 北京 中国人民大学高瓴人工智能学院
³ Institute of Computing Technology, Chinese Academy of Sciences
中国 北京 中国科学院计算技术研究所
⁴ Huawei Noah’s Ark Lab
中国 香港 华为诺亚方舟实验室
arXiv, 2021-08-12
Abstract

Recently, pre-trained language models such as BERT have been applied to document ranking for information retrieval. These methods usually first pre-train a general language model on an unlabeled large corpus and then conduct ranking-specific fine-tuning on expert-labeled relevance datasets. Though reliminary successes have been observed in a variety of IR tasks, a lot of room still remains for further improvement.

Ideally, an IR system would model relevance from a user-system dualism: the user's view and the system's view. User's view judges the relevance based on the activities of “real users” while the system's view focuses on the relevance signals from the system side, e.g., from the experts or algorithms, etc. Inspired by the user-system relevance views and the success of pre-trained language models, in this paper we propose a novel ranking framework called Pre-Rank that takes both user's view and system's view into consideration, under the pre-training and fine-tuning paradigm. Specifically, to model the user's view of relevance, Pre-Rank pre-trains the initial query-document representations based on a large-scale user activities data such as the click log. To model the system's view of relevance, Pre-Rank further fine-tunes the model on expert-labeled relevance data. More importantly, the pre-trained representations, are fine-tuned together with handcrafted learning-to-rank features under a wide and deep network architecture. In this way, Pre-Rank can model the relevance by incorporating the relevant knowledge and signals from both real search users and the IR experts.

To verify the effectiveness of Pre-Rank, we showed two implementations by using BERT and SetRank as the underlying ranking model, respectively. Experimental results base on three publicly available benchmarks showed that in both of the implementations, Pre-Rank can respectively outperform the underlying ranking models and achieved state-ofthe-art performances. The results demonstrate the effectiveness of Pre-Rank in combining the user-system views of relevance.
Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm_1
Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm_2
Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm_3
Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm_4
  • Interpretable low-dose CT enhancement via multi-Gaussian cluster variance reduction
  • Xiaofeng Zhang, Yilan Zhu, Yongsheng Huang, Jielong Yang, Zhili Wang, Kai Zhang, Si Chen, Linbo Liu, Xin Ge
  • Opto-Electronic Science
  • 2026-03-25
  • Polygonal generalized perfect spatiotemporal optical vortices
  • Shuoshuo Zhang, Zhangyu Zhou, Qianyi Wei, Zhongsheng Man, Changjun Min, Wending Zhang, Yuquan Zhang, Ting Mei, Xiaocong Yuan
  • Opto-Electronic Science
  • 2026-03-25
  • Perovskite nanocrystals in glass for high efficiency and ultra-high resolution dynamic holographic multicolor display
  • Chao Ruan, Xinkuo Li, Ke Sun, Jianrong Qiu, Dezhi Tan
  • Opto-Electronic Advances
  • 2026-03-25
  • Pixelated BIC metasurfaces for terahertz integrated sensing and imaging
  • Zhanqiang Xue, Guizhen Xu, Junliang Chen, Junxing Fan, Hongyang Xing, Ye Zhou, Longqing Cong
  • Opto-Electronic Advances
  • 2026-03-25
  • Overcoming challenges in InP-based quantum dots: from nucleation mechanisms to high-performance quantum dot light-emitting diodes
  • Yangyang Bian, Qian Li, Fei Chen, Chunhe Yang, Huaibin Shen, Aiwei Tang
  • Opto-Electronic Advances
  • 2026-03-25
  • Emerging landscape of photonic bound states in the continuum for next-generation metadevices
  • Thi Thu Ha Do, Ronghui Lin, Daniil A. Shilkin, Zhiyi Yuan, Cuong Dang, Arseniy I. Kuznetsov, Jinghua Teng, Son Tung Ha
  • Opto-Electronic Advances
  • 2026-03-25
  • A 4096-element 3D-integrated Si-SiN optical phased array for high-power coherent LiDAR
  • Han Wang, Weimin Xie, Xin Yan, Jiaqi Li, Youxi Lu, Ping Jiang, Feng Li, Kai Jin, Xu Yang, Jiali Jiang, Keran Deng, Weishuai Chen, Jing Luo, Li Jin, Junbo Feng, Kai Wei
  • Opto-Electronic Technology
  • 2026-03-20
  • Multi-scale attention residual deep convolutional dealiasing network-assisted unambiguous ultra-long baseline high-precision microwave photonic angle of arrival estimation
  • Xianglin Chen, Yin Li, Shiru Song, Yalin Yao, He Cui, Xuan Li, Zhe Guo, Yinlong Tan, Taolin Liu, Tian Jiang
  • Opto-Electronic Technology
  • 2026-03-20
  • Dual quasi-BIC resonances synergized laser cooling in halide perovskite metasurface
  • Ying Che, Peng Lu, Yang Li, Junhao Zeng, Mengxia Hu, Fei Qin, Tianyue Zhang Xiangping Li
  • Opto-Electronic Technology
  • 2026-03-20
  • High-speed and large-capacity visible light communication for 6G: advances and perspectives
  • Nan Chi, Zhilan Lu, Fujie Li, Haoyu Zhang, Yunkai Wang, Xinyi Liu, Zhiwu Chen, Zhe Feng, Zhuoran Hu, Zhixue He, Ziwei Li, Chao Shen, Junwen Zhang
  • Opto-Electronic Technology
  • 2026-03-20
  • Multi-dimensional photodetection: from material intrinsic properties and metasurface engineering to silicon photonic integration
  • Wenqi Liu, Zilan Tang, Qingzhao Hua, Liang Liu, Xiaoxia Wang, Anlian Pan
  • Opto-Electronic Technology
  • 2026-03-20
  • Holotomography-driven learning unlocks in-silico staining of single cells in flow cytometry by avoiding fluorescence co-registration
  • Daniele Pirone, Giusy Giugliano, Michela Schiavo, Annalaura Montella, Martina Mugnano, Vincenza Cerbone, Maddalena Raia, Giulia Scalia Ivana Kurelac, Diego Luis Medina, Lisa Miccio Mario Capasso, Achille Iolascon, Pasquale Memmolo, Pietro Ferraro
  • Opto-Electronic Science
  • 2026-02-25



  • Recursive Multi-Tensor Contraction for XEB Verification of Quantum Circuits                                Differential STBC-SM Scheme for Uplink Multi-user Massive MIMO Communications: System Design and Performance Analysis
    About
    |
    Contact
    |
    Copyright © PubCard