Home » Équipes » MLIA » Publications

Publications

  • Sadaf Abdul Rauf, François Yvon. Translating scientific abstracts in the bio-medical domain with structure-aware models. Computer Speech and Language, 2024, 87, pp.101623. ⟨10.1016/j.csl.2024.101623⟩. ⟨hal-04476788⟩
  • Rachel Bawden, Ziqian Peng, Maud Bénard, Eric Villemonte de La Clergerie, Raphaël Esamotunu, et al.. Translate your Own: a Post-Editing Experiment in the NLP domain. The 25th Annual Conference of the European Association for Machine Translation, European Association for Machine Translation, Jun 2024, Sheffield, United Kingdom. ⟨hal-04573922⟩
  • Amir Hossein Kargaran, François Yvon, Hinrich Schütze. GlotScript: A Resource and Tool for Low Resource Writing System Identification. Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), ELRA Language Resources Association (ELRA); International Committee on Computational Linguistics (ICCL), May 2024, Torino, Italy. ⟨hal-04587980⟩
  • Emanuele Dalsasso, Clément Rambour, Nicolas Trouvé, Nicolas Thome. MERLIN-Seg: self-supervised despeckling for label-efficient semantic segmentation. Computer Vision and Image Understanding, 2024, 241, ⟨10.1016/j.cviu.2024.103940⟩. ⟨hal-04163624v2⟩
  • Manuel Faysse, Patrick Fernandes, Nuno Guerreiro, Antonio Loison, Duarte Alves, et al.. CroissantLLM: A Truly Bilingual French-English Language Model. 2024. ⟨hal-04574908⟩
  • Mathias Vast, Yuxuan Zong, Benjamin Piwowarski, Laure Soulier. Simple Domain Adaptation for Sparse Retrievers. Advances in Information Retrieval, 14610, Springer Nature Switzerland, pp.403-412, 2024, Lecture Notes in Computer Science, ⟨10.1007/978-3-031-56063-7_32⟩. ⟨hal-04517668⟩
  • Vaynee Sungeelee, Antoine Loriette, Olivier Sigaud, Baptiste Caramiaux. Interactive curriculum learning increases and homogenizes motor smoothness. Scientific Reports, 2024, ⟨10.1038/s41598-024-53253-3⟩. ⟨hal-04529557⟩
  • Rachel Bawden, Hatim Bourfoune, Bertrand Cabot, Nathan Cassereau, Pierre Cornette, et al.. Les modèles Bloom pour le traitement automatique de la langue française. 2024. ⟨hal-04435371⟩
  • Noémie Jacquet, Vincent Guigue, Cristina Manfredotti, Fatiha Saïs, Stéphane Dervaux, et al.. Modélisation du caractère séquentiel des repas pour améliorer la performance d'un système de recommandation alimentaire. Extraction et Gestion des Connaissances (EGC 2024), Jan 2024, Dijon, France. ⟨hal-04440140⟩
  • Rémy Sun, Clément Masson, Gilles Hénaff, Nicolas Thome, Matthieu Cord. Semantic augmentation by mixing contents for semi-supervised learning. Pattern Recognition, 2024, 145, pp.109909. ⟨10.1016/j.patcog.2023.109909⟩. ⟨hal-04385089⟩
  • François Yvon. La traduction multilingue : analyse d'une prouesse technologique. Mediazioni. Rivista online du studi interdisciplinari su lingue e culture, 2023, 39, pp.A17-A34. ⟨10.6092/issn.1974-4382/18785⟩. ⟨hal-04365112⟩
  • Léo Grinsztajn, Myung Jun Kim, Edouard Oyallon, Gaël Varoquaux. Vectorizing string entries for data processing on tables: when are larger language models better?. 2023. ⟨hal-04345931⟩
  • Skander Karkar, Ibrahim Ayed, Emmanuel de Bézenac, Patrick Gallinari. Module-wise Training of Neural Networks via the Minimizing Movement Scheme. Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023), Dec 2023, New Orleans (Louisiana), United States. ⟨hal-04223364⟩
  • Adel Nabli, Eugene Belilovsky, Edouard Oyallon. $\textbf{A}^2\textbf{CiD}^2$: Accelerating Asynchronous Communication in Decentralized Deep Learning. Thirty-seventh Conference on Neural Information Processing Systems, Dec 2023, New Orleans, United States. ⟨hal-04124318v2⟩
  • Shu Okabe, François Yvon. Towards Multilingual Interlinear Morphological Glossing. 2023 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Dec 2023, Singapore, Singapore. pp.5958-5971, ⟨10.18653/v1/2023.findings-emnlp.396⟩. ⟨hal-04357157⟩
  • Gwen Legate, Nicolas Bernier, Lucas Caccia, Edouard Oyallon, Eugene Belilovsky. Guiding The Last Layer in Federated Learning with Pre-Trained Models. Neurips, In press. ⟨hal-04262471⟩
  • Maxime Bouthors, Josep Crego, François Yvon. Towards Example-Based NMT with Multi-Levenshtein Transformers. Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Dec 2023, Singapour, Singapore. pp.1830-1846. ⟨hal-04332427⟩
  • Edouard Oyallon. Contributions to Local, Asynchronous and Decentralized Learning, and to Geometric Deep Learning. Artificial Intelligence [cs.AI]. Sorbonne Université, 2023. ⟨tel-04334118⟩
  • Amir Hossein Kargaran, Ayyoob Imani, François Yvon, Hinrich Schütze. GlotLID: Language Identification for Low-Resource Languages. Findings of the Association for Computational Linguistics: EMNLP 2023, Association for Computational Linguistics, Dec 2023, Singapore, Singapore. pp.6155-6218. ⟨hal-04332442⟩
  • Alban Petit, Caio Corro, François Yvon. Structural generalization in COGS: Supertagging is (almost) all you need. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Dec 2023, Singapour, Singapore. pp.1089-1101, ⟨10.18653/v1/2023.emnlp-main.69⟩. ⟨hal-04382463⟩
  • Skander Karkar, Patrick Gallinari, Alain Rakotomamonjy. Adversarial Sample Detection Through Neural Network Transport Dynamics. European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2023), Sep 2023, Torino, Italy. ⟨hal-04120861⟩
  • Maya Sahraoui, Youcef Sklab, Marc Pignal, Régine Vignes Lebbe, Vincent Guigue. Leveraging Multimodality for Biodiversity Data: Exploring joint representations of species descriptions and specimen images using CLIP. TDWG, Oct 2023, Tasmania, Australia, Australia. ⟨10.3897/biss.7.112666⟩. ⟨hal-04287622⟩
  • Guglielmo Faggioli, Thibault Formal, Simon Lupart, Stefano Marchesin, Stephane Clinchant, et al.. Towards Query Performance Prediction for Neural Information Retrieval: Challenges and Opportunities. ICTIR '23: The 2023 ACM SIGIR International Conference on the Theory of Information Retrieval, Jul 2023, Taipei Taiwan, Taiwan. pp.51-63, ⟨10.1145/3578337.3605142⟩. ⟨hal-04290247⟩
  • Louis Fournier, Adeetya Patel, Michael Eickenberg, Edouard Oyallon, Eugene Belilovsky. Preventing Dimensional Collapse in Contrastive Local Learning with Subsampling. ICML 2023 Workshop on Localized Learning (LLW), Jul 2023, Honolulu (Hawaii), USA, United States. ⟨hal-04156218⟩
  • Adel Nabli, Edouard Oyallon. DADAO: Decoupled Accelerated Decentralized Asynchronous Optimization. International Conference on Machine Learning, Jul 2023, Honolulu, United States. ⟨hal-03737694v3⟩
  • Ayyoob Imani, Peiqin Lin, Amir Hossein Kargaran, Silvia Severini, Masoud Jalili Sabet, et al.. Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages. 61th Annual Meeting of the Association for Computational Linguistics, ACL, Jul 2023, Toronto, Canada. ⟨hal-04163023⟩
  • Louis Fournier, Stéphane Rivaud, Eugene Belilovsky, Michael Eickenberg, Edouard Oyallon. Can Forward Gradient Match Backpropagation?. Fortieth International Conference on Machine Learning, Jul 2023, Honolulu (Hawaii), USA, United States. ⟨hal-04119829⟩
  • Marc Lafon, Elias Ramzi, Clément Rambour, Nicolas Thome. Hybrid Energy Based Model in the Feature Space for Out-of-Distribution Detection. International Conference on Machine Learning, Jul 2023, Honololu, Hawaii, United States. ⟨hal-04112184v2⟩
  • Laura Nguyen, Thomas Scialom, Benjamin Piwowarski, Jacopo Staiano. LoRaLay: A Multilingual and Multimodal Dataset for Long Range and Layout-Aware Summarization. The 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023), May 2023, Dubrovnik, Croatia. ⟨hal-03992015⟩
  • Yuan Yin, Matthieu Kirchmeyer, Jean-Yves Franceschi, Alain Rakotomamonjy, Patrick Gallinari. Continuous PDE Dynamics Forecasting with Implicit Neural Representations. The Eleventh International Conference on Learning Representations, May 2023, Kigali, Rwanda. . ⟨hal-04081163⟩
  • Yuan Yin, Matthieu Kirchmeyer, Jean-Yves Franceschi, Alain Rakotomamonjy, Patrick Gallinari. Continuous PDE Dynamics Forecasting with Implicit Neural Representations. The Eleventh International Conference on Learning Representations, International Conference on Representation Learning, May 2023, Kigali, Rwanda. ⟨hal-03792179v2⟩
  • Léon Migus, Julien Salomon, Patrick Gallinari. Stability of implicit neural networks for long-term forecasting in dynamical systems. ICLR 2023 Workshop on Physics for Machine Learning, May 2023, Kigali, Rwanda. ⟨hal-04132587⟩
  • Thomas Gerald, Hadi Zaatiti, Hatem Hajri, Nicolas Baskiotis, Olivier Schwander. A hyperbolic approach for learning communities on graphs. Data Mining and Knowledge Discovery, 2023, 37, pp.1090-1124. ⟨10.1007/s10618-022-00902-8⟩. ⟨hal-04022426⟩
  • Steeven Janny, Aurélien Beneteau, Madiha Nadri, Julie Digne, Nicolas Thome, et al.. EAGLE: Large-Scale Learning of Turbulent Fluid Dynamics with Mesh Transformers. International Conference on Learning Representation, May 2023, Kigali, Rwanda. ⟨hal-03992436v2⟩
  • Guillaume Couairon, Jakob Verbeek, Holger Schwenk, Matthieu Cord. DiffEdit: Diffusion-based Semantic Image Editing with Mask Guidance. ICLR 2023 (Eleventh International Conference on Learning Representations ), ICLR, May 2023, Kigali, Rwanda. ⟨hal-03957480⟩
  • Laura Nguyen, Thomas Scialom, Benjamin Piwowarski, Jacopo Staiano. LoRaLay: A Multilingual and Multimodal Dataset for Long Range and Layout-Aware Summarization. Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, May 2023, Dubrovnik, Croatia. pp.636-651, ⟨10.18653/v1/2023.eacl-main.46⟩. ⟨hal-04287851⟩
  • Nicolas Thome, Christian Wolf. Histoire des réseaux de neurones et du deep learning en traitement des signaux et des images. 2023. ⟨hal-04058482⟩
  • Nam Le Hai, Thomas Gerald, Thibault Formal, Jian-Yun Nie, Benjamin Piwowarski, et al.. CoSPLADE: Contextualizing SPLADE for Conversational Information Retrieval. 45th European Conference on Information Retrieval (ECIR 2023), Apr 2023, Dublin, Ireland. pp.537-552, ⟨10.1007/978-3-031-28244-7_34⟩. ⟨hal-04168526⟩
  • Guglielmo Faggioli, Thibault Formal, Stefano Marchesin, Stéphane Clinchant, Nicola Ferro, et al.. Query Performance Prediction for Neural IR: Are We There Yet?. European Conference on Information Retrieval, Apr 2023, Dublin (Ireland), Ireland. pp.232-248, ⟨10.1007/978-3-031-28244-7_15⟩. ⟨hal-04290230⟩
  • Song Duong, Alberto Lumbreras, Mike Gartrell, Patrick Gallinari. Learning from Multiple Sources for Data-to-Text and Text-to-Data. International Conference on Artificial Intelligence and Statistics (AISTATS), Apr 2023, Valencia, Spain. ⟨hal-04002656⟩
  • Loïc Themyr, Clément Rambour, Nicolas Thome, Toby Collins, Alexandre Hostettler. Full Contextual Attention for Multi-resolution Transformers in Semantic Segmentation. 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Jan 2023, Waikoloa, United States. pp.3223-3232, ⟨10.1109/WACV56688.2023.00324⟩. ⟨hal-03901666⟩
  • Tristan Luiggi, Vincent Guigue, Laure Soulier, Siwar Jendoubi, Aurelien Baelde. Dynamic Named Entity Recognition. 38th ACM/SIGAPP Symposium on Applied Computing, Mar 2023, Tallinn, Estonia. pp.890-897, ⟨10.1145/3555776.3577603⟩. ⟨hal-04284318⟩
  • Nam Le Hai, Thomas Gerald, Thibault Formal, Jian-Yun Nie, Benjamin Piwowarksi, et al.. CoSPLADE : Adaptation d'un Modèle Neuronal Basé sur des Représentations Parcimonieuses pour la Recherche d'Information Conversationnelle. 18e Conférence en Recherche d'Information et Applications -- 16e Rencontres Jeunes Chercheurs en RI -- 30e Conférence sur le Traitement Automatique des Langues Naturelles -- 25e Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues, 2023, Paris, France. pp.207-212. ⟨hal-04131548⟩
  • Louis Falissard, Vincent Guigue, Laure Soulier. Apprentissage de sous-espaces de préfixes. 18e Conférence en Recherche d'Information et Applications -- 16e Rencontres Jeunes Chercheurs en RI -- 30e Conférence sur le Traitement Automatique des Langues Naturelles -- 25e Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues, Jun 2023, Paris, France. pp.59-73. ⟨hal-04131562⟩
  • Edouard Yvinec, Arnaud Dapogny, Matthieu Cord, Kevin Bailly. SPIQ: Data-Free Per-Channel Static Input Quantization. IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2023), Jan 2023, Waikoloa (Hawaii), United States. ⟨hal-03953685⟩
  • Noémie Jacquet, Cristina Manfredotti, Vincent Guigue, Fatiha Saïs, Paolo Viappiani. An EXplainable RecommandER SYStem for the Nutrition Domain, combining Knowledge Graphs and Machine Learning. 2023. ⟨hal-04439592⟩
  • Philippe Langlais, François Yvon. For a common European framework for evaluating AI- based translation technologies. Rachele Raus. How artificial intelligence can further European multilingualism Strategic recommendations for European decision-makers, Università di Torino - Artificial Intelligence for European Integration; Ledizioni, pp.93-96, 2023, 9791256000142. ⟨hal-04392444⟩
  • Etienne Le Naour, Ghislain Agoua, Nicolas Baskiotis, Vincent Guigue. Représentation Interprétable pour la Classification de Séries Temporelles. Conférence d'Apprentissage Automatique (CAp), 2023, Strasbourg, France. ⟨hal-04439603⟩
  • Louis Falissard, Vincent Guigue, Laure Soulier. Improving generalization in large language models by learning prefix subspaces. 61st Annual Meeting of the Association for Computational Linguistics (2023), Association for Computational Linguistics, Jul 2023, Toronto, Canada. ⟨hal-04286331⟩
  • Jingang Qu, Thibault Faney, Jean-Charles de Hemptinne, Soleiman Yousef, Patrick Gallinari. PTFlash : A vectorized and parallel deep learning framework for two-phase flash calculation. Fuel, 2023, 331, Part 1, pp.125603. ⟨10.1016/j.fuel.2022.125603⟩. ⟨hal-03659647v3⟩