Year
Month
(Preprint) Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm
Lin Bo ¹, Liang Pang 庞亮 ³, Gang Wang ⁴, Jun Xu 徐君 ², XiuQiang He 何秀强 ⁴, Ji-Rong Wen 文继荣 ²
¹ School of Information, Renmin University of China, Beijing, China
中国 北京 中国人民大学信息学院
² Gaoling School of Artificial Intelligence, Renmin University of China, , Beijing, China
中国 北京 中国人民大学高瓴人工智能学院
³ Institute of Computing Technology, Chinese Academy of Sciences
中国 北京 中国科学院计算技术研究所
⁴ Huawei Noah’s Ark Lab
中国 香港 华为诺亚方舟实验室
arXiv, 2021-08-12
Abstract

Recently, pre-trained language models such as BERT have been applied to document ranking for information retrieval. These methods usually first pre-train a general language model on an unlabeled large corpus and then conduct ranking-specific fine-tuning on expert-labeled relevance datasets. Though reliminary successes have been observed in a variety of IR tasks, a lot of room still remains for further improvement.

Ideally, an IR system would model relevance from a user-system dualism: the user's view and the system's view. User's view judges the relevance based on the activities of “real users” while the system's view focuses on the relevance signals from the system side, e.g., from the experts or algorithms, etc. Inspired by the user-system relevance views and the success of pre-trained language models, in this paper we propose a novel ranking framework called Pre-Rank that takes both user's view and system's view into consideration, under the pre-training and fine-tuning paradigm. Specifically, to model the user's view of relevance, Pre-Rank pre-trains the initial query-document representations based on a large-scale user activities data such as the click log. To model the system's view of relevance, Pre-Rank further fine-tunes the model on expert-labeled relevance data. More importantly, the pre-trained representations, are fine-tuned together with handcrafted learning-to-rank features under a wide and deep network architecture. In this way, Pre-Rank can model the relevance by incorporating the relevant knowledge and signals from both real search users and the IR experts.

To verify the effectiveness of Pre-Rank, we showed two implementations by using BERT and SetRank as the underlying ranking model, respectively. Experimental results base on three publicly available benchmarks showed that in both of the implementations, Pre-Rank can respectively outperform the underlying ranking models and achieved state-ofthe-art performances. The results demonstrate the effectiveness of Pre-Rank in combining the user-system views of relevance.
Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm_1
Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm_2
Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm_3
Modeling Relevance Ranking under the Pre-training and Fine-tuning Paradigm_4
  • Broadband ultrasound generator over fiber-optic tip for in vivo emotional stress modulation
  • Jiapu Li, Xinghua Liu, Zhuohua Xiao, Shengjiang Yang, Zhanfei Li, Xin Gui, Meng Shen, He Jiang, Xuelei Fu, Yiming Wang, Song Gong, Tuan Guo, Zhengying Li
  • Opto-Electronic Science
  • 2025-07-25
  • Non-volatile reconfigurable planar lightwave circuit splitter enabled by laser-directed Sb2S3 phase transitions
  • Shixin Gao, Tun Cao, Haonan Ren, Jingzhe Pang, Ran Chen, Yang Ren, Zhenqing Zhao, Xiaoming Chen, Dongming Guo
  • Opto-Electronic Technology
  • 2025-07-18
  • Progress in metalenses: from single to array
  • Chang Peng, Jin Yao, Din Ping Tsai
  • Opto-Electronic Technology
  • 2025-07-18
  • 30 years of nanoimprint: development, momentum and prospects
  • Wei-Kuan Lin, L. Jay Guo
  • Opto-Electronic Technology
  • 2025-07-18
  • Review for wireless communication technology based on digital encoding metasurfaces
  • Haojie Zhan, Manna Gu, Ying Tian, Huizhen Feng, Mingmin Zhu, Haomiao Zhou, Yongxing Jin, Ying Tang, Chenxia Li, Bo Fang, Zhi Hong, Xufeng Jing, Le Wang
  • Opto-Electronic Advances
  • 2025-07-17
  • Coulomb attraction driven spontaneous molecule-hotspot paring enables universal, fast, and large-scale uniform single-molecule Raman spectroscopy
  • Lihong Hong, Haiyao Yang, Jianzhi Zhang, Zihan Gao, Zhi-Yuan Li
  • Opto-Electronic Advances
  • 2025-07-17
  • Multiphoton intravital microscopy in small animals of long-term mitochondrial dynamics based on super‐resolution radial fluctuations
  • Saeed Bohlooli Darian, Jeongmin Oh, Bjorn Paulson, Minju Cho, Globinna Kim, Eunyoung Tak, Inki Kim, Chan-Gi Pack, Jung-Man Namgoong, In-Jeoung Baek, Jun Ki Kim
  • Opto-Electronic Advances
  • 2025-07-17
  • Research progress on generating perfect vortex beams based on metasurfaces
  • Xiujuan Liu, Manna Gu, Ying Tian, Mingfeng Zheng, Bo Fang, Zhi Hong, Chee Leong Tan, Xufeng Jing
  • Opto-Electronic Science
  • 2025-07-09
  • Non-volatile tunable multispectral compatible infrared camouflage based on the infrared radiation characteristics of Rosaceae plants
  • Xin Li, Xinye Liao, Junxiang Zeng, Zao Yi, Xin He, Jiagui Wu, Huan Chen, Zhaojian Zhang, Yang Yu, Zhengfu Zhang, Sha Huang, Junbo Yang
  • Opto-Electronic Advances
  • 2025-07-09
  • Spectro-polarimetric detection enabled by multidimensional metasurface with quasi-bound states in the continuum
  • Haoyang He, Fangxing Lai, Yan Zhang, Xue Zhang, Chenyi Tian, Xin Li, Yongtian Wang, Shumin Xiao, Lingling Huang
  • Opto-Electronic Advances
  • 2025-06-30
  • Emerging low-dimensional perovskite resistive switching memristors: from fundamentals to devices
  • Shuanglong Wang, Hong Lian, Haifeng Ling, Hao Wu, Tianxiao Xiao, Yijia Huang, Peter Müller-Buschbaum
  • Opto-Electronic Advances
  • 2025-06-27
  • CW laser damage of ceramics induced by air filament
  • Chuan Guo, Kai Li, Zelin Liu, Yuyang Chen, Junyang Xu, Zhou Li, Wenda Cui, Changqing Song, Cong Wang, Xianshi Jia, Ji'an Duan, Kai Han
  • Opto-Electronic Advances
  • 2025-06-27



  • Recursive Multi-Tensor Contraction for XEB Verification of Quantum Circuits                                Differential STBC-SM Scheme for Uplink Multi-user Massive MIMO Communications: System Design and Performance Analysis
    About
    |
    Contact
    |
    Copyright © PubCard