-
François Yvon
- Team: MLIA
- Office: H13
- Email: prénom point nom arobase sorbonne-universite point fr
- Website: https://fyvo.github.io
- Bio: François Yvon is a CNRS senior researcher working primarily on natural language processing. He is currently affiliated to the "Machine Learning and Deep Learning for Intelligent Access" group at ISIR. His recent work focuses on machine translation using neural and probabilistic methods - and more generally on multilingual machine language processing. Previously, F. Yvon was genral director of LIMSI/CNRS in Orsay and Professor of Computer Science at Université Paris-Sud and Télécom Paris, and also, briefly, Visiting Scientist at IBM's T.J Watson research center (NY).
Publications
- Nicolas Dahan, Rachel Bawden, François Yvon. Survey of Automatic Metrics for Evaluating Machine Translation at the Document Level. Inria Paris, Sorbonne Université; Sorbonne Universite; Inria Paris. 2024. ⟨hal-04798759⟩
- Amir Hossein Kargaran, François Yvon, Hinrich Schütze. MaskLID: Code-Switching Language Identification through Iterative Masking. 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), Association for Computational Linguistics, Aug 2024, Bangkok, Thailand. pp.459-469. ⟨hal-04670790⟩
- Sadaf Abdul Rauf, François Yvon. Translating scientific abstracts in the bio-medical domain with structure-aware models. Computer Speech and Language, 2024, 87, pp.101623. ⟨10.1016/j.csl.2024.101623⟩. ⟨hal-04476788⟩
- Rachel Bawden, Hatim Bourfoune, Bertrand Cabot, Nathan Cassereau, Pierre Cornette, et al.. Evaluer BLOOM en français. Atelier sur l'évaluation des modèles génératifs (LLM) et challenge d'extraction d'information few-shot, Institut des sciences informatiques et de leurs interactions - CNRS Sciences informatiques [INS2I-CNRS], Jul 2024, Toulouse, France. ⟨hal-04678039⟩
- Rachel Bawden, Ziqian Peng, Maud Bénard, Eric Villemonte de La Clergerie, Raphaël Esamotunu, et al.. Translate your Own: a Post-Editing Experiment in the NLP domain. The 25th Annual Conference of the European Association for Machine Translation, European Association for Machine Translation, Jun 2024, Sheffield, United Kingdom. ⟨hal-04573922⟩
- Maxime Bouthors, Josep Crego, François Yvon. Retrieving Examples from Memory for Retrieval Augmented Neural Machine Translation: A Systematic Comparison. 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024), Association for Computational Linguistics, Jun 2024, Mexico, Mexico. pp.3022-3039, ⟨10.18653/v1/2024.findings-naacl.190⟩. ⟨hal-04670614⟩
- Ziqian Peng, Rachel Bawden, François Yvon. Handling Very Long Contexts in Neural Machine Translation: a Survey. Livrable D3-2.1, Projet ANR MaTOS. 2024, pp.50. ⟨hal-04652584v2⟩
- Amir Hossein Kargaran, François Yvon, Hinrich Schütze. GlotScript: A Resource and Tool for Low Resource Writing System Identification. Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), ELRA Language Resources Association (ELRA); International Committee on Computational Linguistics (ICCL), May 2024, Torino, Italy. ⟨hal-04587980⟩
- Manuel Faysse, Patrick Fernandes, Nuno Guerreiro, Antonio Loison, Duarte Alves, et al.. CroissantLLM: A Truly Bilingual French-English Language Model. 2024. ⟨hal-04574908⟩
- Rachel Bawden, Hatim Bourfoune, Bertrand Cabot, Nathan Cassereau, Pierre Cornette, et al.. Les modèles Bloom pour le traitement automatique de la langue française. 2024. ⟨hal-04435371⟩
- Ziqian Peng, Rachel Bawden, François Yvon. À propos des difficultés de traduire automatiquement de longs documents. 35èmes Journées d'Études sur la Parole (JEP 2024) 31ème Conférence sur le Traitement Automatique des Langues Naturelles (TALN 2024) 26ème Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RECITAL 2024), Jul 2024, Toulouse, France. pp.2-21. ⟨hal-04623006⟩
- Paul Lerner, François Yvon. Vers la traduction automatique des néologismes scientifiques. 35èmes Journées d'Études sur la Parole (JEP 2024) 31ème Conférence sur le Traitement Automatique des Langues Naturelles (TALN 2024) 26ème Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RECITAL 2024), Jul 2024, Toulouse, France. pp.245-261. ⟨hal-04623021⟩
- Bérengère Podvin, L. Soucasse, F. Yvon. Analysis of Rayleigh-Bénard convection using latent Dirichlet allocation. Physical Review Fluids, 2024, 9 (6), pp.063502. ⟨10.1103/PhysRevFluids.9.063502⟩. ⟨hal-04729077⟩
- Maxime Bouthors, Josep Crego, François Yvon. Optimiser le choix des exemples pour la traduction automatique augmentée par des mémoires de traduction. 35èmes Journées d'Études sur la Parole (JEP 2024) 31ème Conférence sur le Traitement Automatique des Langues Naturelles (TALN 2024) 26ème Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RECITAL 2024), Jul 2024, Toulouse, France. pp.582-604. ⟨hal-04623042⟩
- François Yvon. La traduction multilingue : analyse d'une prouesse technologique. Mediazioni. Rivista online du studi interdisciplinari su lingue e culture, 2023, 39, pp.A17-A34. ⟨10.6092/issn.1974-4382/18785⟩. ⟨hal-04365112⟩
- Shu Okabe, François Yvon. Towards Multilingual Interlinear Morphological Glossing. 2023 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Dec 2023, Singapore, Singapore. pp.5958-5971, ⟨10.18653/v1/2023.findings-emnlp.396⟩. ⟨hal-04357157⟩
- Maxime Bouthors, Josep Crego, François Yvon. Towards Example-Based NMT with Multi-Levenshtein Transformers. Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Dec 2023, Singapour, Singapore. pp.1830-1846. ⟨hal-04332427⟩
- Amir Hossein Kargaran, Ayyoob Imani, François Yvon, Hinrich Schütze. GlotLID: Language Identification for Low-Resource Languages. Findings of the Association for Computational Linguistics: EMNLP 2023, Association for Computational Linguistics, Dec 2023, Singapore, Singapore. pp.6155-6218. ⟨hal-04332442⟩
- Alban Petit, Caio Corro, François Yvon. Structural generalization in COGS: Supertagging is (almost) all you need. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Dec 2023, Singapour, Singapore. pp.1089-1101, ⟨10.18653/v1/2023.emnlp-main.69⟩. ⟨hal-04382463⟩
- Shu Okabe, François Yvon. LISN @ SIGMORPHON 2023 Shared Task on Interlinear Glossing. The 20th SIGMORPHON workshop on Computational Morphology, Phonology, and Phonetics, Association for computational linguistics, Jul 2023, Toronto, Canada. ⟨10.18653/v1/2023.sigmorphon-1.21⟩. ⟨hal-04186388⟩
- Josep Crego, Jitao Xu, François Yvon. BiSync: A Bilingual Editor for Synchronized Monolingual Texts. The 61st Annual Meeting of the Association for Computational Linguistics, ACL, Jul 2023, Toronto, Canada. pp.369-376. ⟨hal-04163029⟩
- Dávid Javorský, Ondřej Bojar, François Yvon. Assessing Word Importance Using Models Trained for Semantic Tasks. 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), ACL, Jul 2023, Toronto, Canada. pp.8846-8856. ⟨hal-04163044⟩
- Ayyoob Imani, Peiqin Lin, Amir Hossein Kargaran, Silvia Severini, Masoud Jalili Sabet, et al.. Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages. 61th Annual Meeting of the Association for Computational Linguistics, ACL, Jul 2023, Toronto, Canada. ⟨hal-04163023⟩
- Shu Okabe, François Yvon. Joint Word and Morpheme Segmentation with Bayesian Non-Parametric Models. 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023), Association for Computational Linguistics, May 2023, Dubrovnik, Croatia. pp.628-642. ⟨hal-04086368⟩
- Jitao Xu, Josep Crego, François Yvon. Integrating Translation Memories into Non-Autoregressive Machine Translation. EACL 2023, May 2023, Dubrovnik, Croatia. ⟨hal-03995339⟩
- Rachel Bawden, François Yvon. Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM. EAMT 2023 - 24th Annual Conference of the European Association for Machine Translation, Jun 2023, Tampere, Finland. ⟨10.48550/ARXIV.2303.01911⟩. ⟨hal-04015863v2⟩
- Gilles Adda, Ioana Vasilescu, François Yvon. Language Report French. Georg Rehm; Andy Way. European Language Equality. A Strategic Agenda for Digital Language Equality, Springer International Publishing, pp.139-142, 2023, Cognitive Technologies, 978-3-031-28818-0. ⟨10.1007/978-3-031-28819-7_16⟩. ⟨hal-04121465⟩
- Shu Okabe, François Yvon. Production automatique de gloses interlinéaires à travers un modèle probabiliste exploitant des alignements. 18e Conférence en Recherche d'Information et Applications -- 16e Rencontres Jeunes Chercheurs en RI -- 30e Conférence sur le Traitement Automatique des Langues Naturelles -- 25e Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues, 2023, Paris, France. pp.262-274. ⟨hal-04130176⟩
- François Yvon. Transformers in Natural Language Processing. Mohamed Chetouani; Virginia Dignum; Paul Lukowicz; Carles Sierra. Human-Centered Artificial Intelligence. Advanced Lectures, 13500, Springer International Publishing, pp.81-105, 2023, Lecture Notes in Computer Science, 978-3-031-24348-6. ⟨10.1007/978-3-031-24349-3_6⟩. ⟨hal-04224531⟩
- Philippe Langlais, François Yvon. For a common European framework for evaluating AI- based translation technologies. Rachele Raus. How artificial intelligence can further European multilingualism Strategic recommendations for European decision-makers, Università di Torino - Artificial Intelligence for European Integration; Ledizioni, pp.93-96, 2023, 9791256000142. ⟨hal-04392444⟩
- Maud Bénard, Alexandra Mestivier, Natalie Kubler, Lichao Zhu, Rachel Bawden, et al.. MaTOS: Traduction automatique pour la science ouverte. 18e Conférence en Recherche d'Information et Applications -- 16e Rencontres Jeunes Chercheurs en RI -- 30e Conférence sur le Traitement Automatique des Langues Naturelles -- 25e Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues, Jun 2023, Paris, France. pp.8-15. ⟨hal-04131594⟩
- François Yvon. Evaluer, diagnostiquer et analyser la traduction automatique neuronale. FORUM. Revue internationale d’interprétation et de traduction / International Journal of Interpretation and Translation , 2022, 20 (2), pp.315-332. ⟨10.1075/forum.00023.yvo⟩. ⟨hal-03975750⟩
- Guillaume Wisniewski, Lichao Zhu, Nicolas Ballier, François Yvon. Analyzing Gender Translation Errors to Identify Information Flows between the Encoder and Decoder of a NMT System. BlackboxNLP 2022, Dec 2022, Abu Dhabi, United Arab Emirates. ⟨hal-03912438⟩
- Ayyoob Imani, Silvia Severini, Masoud Jalili Sabet, François Yvon, Hinrich Schütze. Graph-Based Multilingual Label Propagation for Low-Resource Part-of-Speech Tagging. Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Dec 2022, Abu Dhabi, United Arab Emirates. ⟨hal-03832874⟩
- Jitao Xu, Josep Crego, François Yvon. Bilingual Synchronization: Restoring Translational Relationships with Editing Operations. The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022), Dec 2022, Abou Dabi, United Arab Emirates. ⟨hal-03827010⟩
- François Buet, François Yvon. Sous-titrage automatique : étude de stratégies d'adaptation aux genres télévisuels. Revue TAL : traitement automatique des langues, 2022, Varia, 63 (1), pp.11-35. ⟨hal-03890594⟩
- Guillaume Wisniewski, Lichao Zhu, Nicolas Ballier, François Yvon. Biais de genre dans un système de traduction automatique neuronale : une étude des mécanismes de transfert cross-langue. Revue TAL : traitement automatique des langues, 2022, 63 (1), pp.37-61. ⟨hal-03890622⟩
- Minh Quang Pham, Josep Crego, François Yvon. Latent Group Dropout for Multilingual and Multidomain Machine Translation. Findings of the ACL: NAACL 2022, Association for Computational Linguistics, Jul 2022, Seattle, United States. ⟨hal-03720395⟩
- Alina Karakanta, François Buet, Mauro Cettolo, François Yvon. Evaluating Subtitle Segmentation for End-to-end Generation Systems. 13th Language Resources and Evaluation Conference (LREC), ELDA, Jun 2022, Marseille, France. ⟨hal-03783891⟩
- Minh-Quang Pham, Josep Crego, François Yvon. Multi-Domain Adaptation in Neural Machine Translation with Dynamic Sampling Strategies. Conference of the European Association for Machine Translation, European Association for Machine Translation, Jun 2022, Ghent, Belgium. ⟨hal-03686763⟩
- Jitao Xu, François Buet, Josep Crego, Elise Bertin-Lemée, François Yvon. Joint Generation of Captions and Subtitles with Dual Decoding. 19th International Conference on Spoken Language Translation (IWSLT 2022), May 2022, Dublin, Ireland. ⟨hal-03666567⟩
- Shu Okabe, Laurent Besacier, François Yvon. Weakly Supervised Word Segmentation for Computational Language Documentation. Annual meeting of the Association for Computational Linguistics, Association for Computational Linguistics, May 2022, Dublin, Ireland. ⟨hal-03679416⟩
- Ayyoob Imani, Lütfi Şenel, Masoud Sabet, François Yvon, Hinrich Schütze. Graph Neural Networks for Multiparallel Word Alignment. Findings of the ACL: ACL 2022, The Association for Computational Linguistics (ACL), May 2022, Dublin, Ireland. ⟨hal-03679419⟩
- Gilles Adda, Annelies Braffort, Ioana Vasilescu, François Yvon, Jean-François Nominé. État de l'art des technologies linguistiques pour la langue française. [Rapport de recherche] CNRS - LISN. 2022. ⟨hal-03637784⟩
- François Yvon. Le modèle Transformer: un « couteau suisse » pour le traitement automatique des langues. Techniques de l'Ingénieur, 2022, pp.IN195 v1. ⟨10.51257/a-v1-in195⟩. ⟨hal-03619077v2⟩
-
Joseph J Mariani, François Yvon. European Language Equality : Ressources et technologies pour le traitement de la langue française. Forum « Innovation, Technologies et Plurilinguisme » organisé par la Délégation Générale à la Langue Française et aux Langues de France (DGLFLF) à l’occasion de la Présidence Française de l’Union Européenne, Feb 2022, Lille, France. ⟨hal-04415400⟩
[ HTTP ]
- Gilles Adda, Annelies Braffort, Ioana Vasilescu, François Yvon. European Language Equality - Report on the French Language. [Research Report] CNRS - LISN. 2022. ⟨hal-03637776⟩
- Nicolas Devatine, Caio Corro, François Yvon. Ré-ordonnancement via programmation dynamique pour l'adaptation cross-lingue d'un analyseur en dépendances. 29ème Conférence sur le Traitement Automatique des Langues Naturelles (TALN 2022), Association pour le Traitement Automatique des Langues (ATALA); Laboratoire d’Informatique et Systèmes (LIS); Laboratoire Informatique d’Avignon (LIA), Jun 2022, Avignon, France. pp.183-197. ⟨hal-03701508⟩
- Lichao Zhu, Guillaume Wisniewski, Nicolas Ballier, François Yvon. Flux d'informations dans les systèmes encodeur-décodeur. Application à l'explication des biais de genre dans les systèmes de traduction automatique.. Traitement Automatique des Langues Naturelles, Jun 2022, Avignon, France. pp.10-18. ⟨hal-03701474⟩
- Shu Okabe, François Yvon. Vers la génération automatique de gloses pour la documentation automatique des langues. Journées Jointes des Groupements de Recherche Linguistique Informatique, Formelle et de Terrain (LIFT) et Traitement Automatique des Langues (TAL), Nov 2022, Marseille, France. pp.198-203. ⟨hal-03846843⟩