Mario Sanz-Guerrero

Hi, I’m Mario 👋! I am a PhD student in Natural Language Processing at Johannes Gutenberg University Mainz, supervised by Katharina von der Wense in the NALA lab. Previously, I completed a BSc in Computer Science and an MSc in Artificial Intelligence. I have also worked as an AI engineer in the healthcare industry.

📚 Research Interests

I’m continually impressed by how large language models, trained on the seemingly “simple” task of next‑word prediction, exhibit surprising emergent capabilities far beyond their original design.

  • Language Modeling & Emergent Abilities 🤖

    How and why do large language models acquire surprisingly complex skills?

  • LLM Calibration 📊

    Techniques for aligning a model’s confidence with the correctness of its answers.

  • Biomedical NLP 💊

    Applying LLMs to assist with drug discovery, clinical note analysis, and literature mining.

News

Sep. 2025

📄 Two of our papers, “Mind the Gap: A Closer Look at Tokenization for Multiple-Choice Question Answering with LLMs” and “Molecular String Representation Preferences in Pretrained LLMs”, were accepted to EMNLP 2025 (Main)! See you in Suzhou, China 🇨🇳!

Aug. 2025

📄 Our paper, “Reducing leads, enhancing wearable practicality: A comparative study of 3-lead vs. 12-lead ECG classification”, was accepted to the journal Medical Engineering & Physics!

Jul. 2025

I’ll be attending ACL 2025. See you in Vienna, Austria 🇦🇹!

Mar. 2025

📄 Our paper, “Corrective In-Context Learning: Evaluating Self-Correction in Large Language Models”, was accepted to the 6th Workshop on Insights from Negative Results in NLP @ NAACL 2025! See you in Albuquerque, New Mexico 🇺🇸!

📄 Our paper, “Asking Again and Again: Exploring LLM Robustness to Repeated Questions”, is now available on arXiv!

Feb. 2025

📄 Our paper, “Credit Risk Meets Large Language Models: Building a Risk Indicator from Loan Descriptions in P2P Lending”, was accepted to the journal Inteligencia Artificial!

Selected Publications

  1. Inteligencia Artificial
    Credit Risk Meets Large Language Models: Building a Risk Indicator from Loan Descriptions in P2P Lending
    Mario Sanz-Guerrero and Javier Arroyo
    Inteligencia Artificial, Mar 2025
  2. NAACL’25 Workshop
    Corrective In-Context Learning: Evaluating Self-Correction in Large Language Models
    Mario Sanz-Guerrero and Katharina von der Wense
    In The Sixth Workshop on Insights from Negative Results in NLP, May 2025
  3. EMNLP’25
    Mind the Gap: A Closer Look at Tokenization for Multiple-Choice Question Answering with LLMs
    Mario Sanz-Guerrero, Minh Duc Bui, and Katharina von der Wense
    Nov 2025