Home » Teams » MLIA » Benjamin Piwowarski
  • Benjamin Piwowarski

  • Chargé de Recherche
  • Équipe: MLIA
  • Email: Benjamin@piwowarski.fr
  • Telephone:0144277260
  • Addresse: Pyramide – Tour 55, 4 place Jussieu, 75005 Paris
  • Site web: https://www.piwowarski.fr

Publications

  • Mathias Vast, Basile van Cooten, Laure Soulier, Benjamin Piwowarski. Understanding Matching Mechanisms in Cross-Encoders. Workshop on Explainability in Information Retrieval, Jul 2025, Padova, Italy, Italy. ⟨hal-05122957⟩
  • Yuxuan Zong, Benjamin Piwowarski. Towards Lossless Token Pruning in Late-Interaction Retrieval Models. The 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, Jul 2025, Padoue, Italy. 2025, 979-8-4007-1592-1/2025/07. ⟨10.1145/3726302.3730100⟩. ⟨hal-05037885⟩
  • Arthur Satouf, Gabriel Ben Zenou, Benjamin Piwowarski, Habiboulaye Amadou-Boubacar, Pablo Piantanida. Rational Retrieval Acts: Leveraging Pragmatic Reasoning to Improve Sparse Retrieval. 2025. ⟨hal-05074220⟩
  • Mathias Vast, Basile van Cooten, Laure Soulier, Benjamin Piwowarski. Comprendre la Nature des Signaux de Correspondance dans les Modèles Neuronaux pour la RI. Conférence en Recherche d’Information et Applications, Association francophone de Recherche d’Information et Applications, Jun 2025, Marseille, France. ⟨hal-05122843⟩
  • Folco Bertini Baldassini, Mustafa Shukor, Matthieu Cord, Laure Soulier, Benjamin Piwowarski. What Makes Multimodal In-Context Learning Work?. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Jun 2024, Seattle, United States. pp.1539-1550, ⟨10.1109/CVPRW63382.2024.00161⟩. ⟨hal-04791285⟩
  • Laura Nguyen, Benjamin Piwowarski, Julio Laborde, Gilles Moyse. Learning Reading Order via Document Layout with Layout2Pos. Linking Theory and Practice of Digital Libraries, Sep 2024, Ljubbljana, Slovenia. pp.3-19, ⟨10.1007/978-3-031-72437-4_1⟩. ⟨hal-04718874⟩
  • João Maria Janeiro, Benjamin Piwowarski, Patrick Gallinari, Loïc Barrault. MEXMA: Token-level objectives improve sentence representations. 2024. ⟨hal-04788199⟩
  • Mathias Vast, Basile van Cooten, Laure Soulier, Benjamin Piwowarski. Which Neurons Matter in IR? Applying Integrated Gradients-based Methods to Understand Cross-Encoders. ICTIR '24: The 2024 ACM SIGIR International Conference on the Theory of Information Retrieval, Jul 2024, Washington DC, United States. pp.133-143, ⟨10.1145/3664190.3672528⟩. ⟨hal-04668348⟩
  • Thibault Formal, Carlos Lassance, Benjamin Piwowarski, Stéphane Clinchant. Towards Effective and Efficient Sparse Neural Information Retrieval. ACM Transactions on Information Systems, 2024, 42 (5), pp.1-46. ⟨10.1145/3634912⟩. ⟨hal-04787990⟩
  • Folco Bertini Baldassini, Mustafa Shukor, Matthieu Cord, Laure Soulier, Benjamin Piwowarski. What Makes Multimodal In-Context Learning Work?. 2024. ⟨hal-04788197⟩
  • Yuxuan Zong, Benjamin Piwowarski. Structured representation for Information Retrieval. COnférence en Recherche d'Informations et Applications, Apr 2024, La Rochelle, France. ⟨10.24348/coria.2024.abstract_24⟩. ⟨hal-04788243⟩
  • Mathias Vast, Yuxuan Zong, Benjamin Piwowarski, Laure Soulier. Simple Domain Adaptation for Sparse Retrievers. Advances in Information Retrieval, 14610, Springer Nature Switzerland, pp.403-412, 2024, Lecture Notes in Computer Science, ⟨10.1007/978-3-031-56063-7_32⟩. ⟨hal-04517668⟩
  • Raphaël Mouravieff, Benjamin Piwowarski, Sylvain Lamprier. Training Table Question Answering via SQL Query Decomposition. 2024. ⟨hal-04788185⟩
  • Raphaël Mouravieff, Benjamin Piwowarski, Sylvain Lamprier. Learning Relational Decomposition of Queries for Question Answering from Tables. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024, Bangkok, Thailand. pp.10471--10485. ⟨hal-04677411⟩
  • Guglielmo Faggioli, Thibault Formal, Simon Lupart, Stefano Marchesin, Stephane Clinchant, et al.. Towards Query Performance Prediction for Neural Information Retrieval: Challenges and Opportunities. ICTIR '23: The 2023 ACM SIGIR International Conference on the Theory of Information Retrieval, Jul 2023, Taipei Taiwan, Taiwan. pp.51-63, ⟨10.1145/3578337.3605142⟩. ⟨hal-04290247⟩
  • Laura Nguyen, Thomas Scialom, Benjamin Piwowarski, Jacopo Staiano. LoRaLay: A Multilingual and Multimodal Dataset for Long Range and Layout-Aware Summarization. The 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023), May 2023, Dubrovnik, Croatia. ⟨hal-03992015⟩
  • Laura Nguyen, Thomas Scialom, Benjamin Piwowarski, Jacopo Staiano. LoRaLay: A Multilingual and Multimodal Dataset for Long Range and Layout-Aware Summarization. Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, May 2023, Dubrovnik, Croatia. pp.636-651, ⟨10.18653/v1/2023.eacl-main.46⟩. ⟨hal-04287851⟩
  • Nam Le Hai, Thomas Gerald, Thibault Formal, Jian-Yun Nie, Benjamin Piwowarski, et al.. CoSPLADE: Contextualizing SPLADE for Conversational Information Retrieval. 45th European Conference on Information Retrieval (ECIR 2023), Apr 2023, Dublin, Ireland. pp.537-552, ⟨10.1007/978-3-031-28244-7_34⟩. ⟨hal-04168526⟩
  • Guglielmo Faggioli, Thibault Formal, Stefano Marchesin, Stéphane Clinchant, Nicola Ferro, et al.. Query Performance Prediction for Neural IR: Are We There Yet?. European Conference on Information Retrieval, Apr 2023, Dublin (Ireland), Ireland. pp.232-248, ⟨10.1007/978-3-031-28244-7_15⟩. ⟨hal-04290230⟩
  • Yuxuan Zong, Benjamin Piwowarski. XPMIR: Une bibliothèque modulaire pour l'apprentissage d'ordonnancement et les expériences de RI neuronale. 18e Conférence en Recherche d'Information et Applications -- 16e Rencontres Jeunes Chercheurs en RI -- 30e Conférence sur le Traitement Automatique des Langues Naturelles -- 25e Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues, 2023, Paris, France. pp.222-233. ⟨hal-04131554⟩
  • Nam Le Hai, Thomas Gerald, Thibault Formal, Jian-Yun Nie, Benjamin Piwowarksi, et al.. CoSPLADE : Adaptation d'un Modèle Neuronal Basé sur des Représentations Parcimonieuses pour la Recherche d'Information Conversationnelle. CORIA-TALN 2023 18e Conférence en Recherche d'Information et Applications (CORIA), Jun 2023, Paris, France. pp.207-212. ⟨hal-04131548⟩
  • Agnès Mustar, Sylvain Lamprier, Benjamin Piwowarski. IRnator: A Framework for Discovering Users Needs from Sets of Suggestions. ICTIR '22: The 2022 ACM SIGIR International Conference on the Theory of Information Retrieval, Aug 2022, Madrid Spain, France. pp.138-143, ⟨10.1145/3539813.3545152⟩. ⟨hal-03923274⟩
  • Thibault Formal, Carlos Lassance, Benjamin Piwowarski, Stéphane Clinchant. From Distillation to Hard Negative Sampling. SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Jul 2022, Madrid, Spain. pp.2353-2359, ⟨10.1145/3477495.3531857⟩. ⟨hal-03736109⟩
  • Antoine Chaffin, Thomas Scialom, Sylvain Lamprier, Jacopo Staiano, Benjamin Piwowarski, et al.. Which Discriminator for Cooperative Text Generation?. SIGIR 2022 - 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Jul 2022, Madrid, Spain. pp.2360-2365, ⟨10.1145/3477495.3531858⟩. ⟨hal-03718429⟩
  • Thibault Formal, Benjamin Piwowarski, Stéphane Clinchant. Match Your Words! A Study of Lexical Matching in Neural Information Retrieval. European Conference on Information Retrieval, Apr 2022, Stavenger, Norway. pp.120-127, ⟨10.1007/978-3-030-99739-7_14⟩. ⟨hal-03736112⟩
  • Benjamin Piwowarski, Florian Boudin, Gaël Dias, Jean-Pierre Chevallet, Jose G. Moreno, et al.. Actes de la 3ème journée : Technologies du langage humain et accès interactif à l'information (avril 2022). 3ème journée : Technologies du langage humain et accès interactif à l'information (2022), pp.1-36, 2022. ⟨hal-03946132⟩
  • Sylvain Lamprier, Thomas Scialom, Antoine Chaffin, Vincent Claveau, Ewa Kijak, et al.. Generative Cooperative Networks for Natural Language Generation. ICML 2022 - 39th International Conference on Machine Learning, Jul 2022, Baltimore, MD, United States. pp.11891--11905. ⟨hal-03736116⟩
  • Agnès Mustar, Sylvain Lamprier, Benjamin Piwowarski. On the Study of Transformers for Query Suggestion. ACM Transactions on Information Systems, 2022, 40 (1), pp.18. ⟨10.1145/3470562⟩. ⟨hal-03541893⟩
  • Antoine Chaffin, Thomas Scialom, Sylvain Lamprier, Jacopo Staiano, Benjamin Piwowarski, et al.. Choisir le bon co-équipier pour la génération coopérative de texte. TALN 2022 - 29e conférence sur le Traitement Automatique des Langues Naturelles, Jun 2022, Avignon, France. pp.12-26. ⟨hal-03701506⟩
  • Laura Nguyen, Thomas Scialom, Jacopo Staiano, Benjamin Piwowarski. Skim-Attention: Learning to Focus via Document Layout. Findings of the 2021 Conference on Empirical Methods in Natural Language Processing (Findings of EMNLP 2021), Nov 2021, Punta Cana, Dominican Republic. ⟨hal-03333889⟩
  • Thomas Scialom, Paul-Alexis Dray, Patrick Gallinari, Sylvain Lamprier, Benjamin Piwowarski, et al.. QuestEval: Summarization Asks for Fact-based Evaluation. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, Nov 2021, Punta Cana (en ligne), Dominican Republic. pp.6594-6604, ⟨10.18653/v1/2021.emnlp-main.529⟩. ⟨hal-03541895⟩