2023 – October:

Heydar Soudani, Evangelos Kanoulas, Faegheh Hasibi, “Data Augmentation for Conversational AI”, Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, CIKM 2023, pp. 5220–5223, DOI: link, URL: publication page, Open Access: Yes.

2023 – December:

Jirui Qi, Raquel Fernández, Arianna Bisazza, “Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models”, EMNLP 2023 Main, vol. long/658, pp. 10650–10666, DOI: link, URL: publication page, Open Access: Yes.

Suzan Verberne, “Service-Chatbots voor het Nederlands: De Onderzoeksagenda van het LESSEN Project”, DIXIT magazine, pp. 32–33, URL: publication page, Open Access: Yes.

Xinyi Chen, Raquel Fernández, Sandro Pezzelle, “The BLA Benchmark: Investigating Basic Language Abilities of Pre-Trained Multimodal Models”, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pp. 5817–5830, DOI: link, URL: publication page, Open Access: Yes.

2024 – February:

Mert Yazan, Frederik Situmeang, Ruilin Xiao, “Rethinking Conversation Styles of Chatbots from the Customer Perspective: Relationships between Conversation Styles of Chatbots, Chatbot Acceptance, and Perceived Tie Strength and Perceived Risk”, International Journal of Human–Computer Interaction, vol. 41(2), pp. 1342–1363, DOI: link, URL: publication page, Open Access: No.

2024 – May:

Mohanna Hoveyda, Arjen de Vries, Maarten de Rijke, Faegheh Hasibi, “Real World Conversational Entity Linking Requires More Than Zero-Shots”, Findings of the Association for Computational Linguistics ACL 2024, DOI: link, URL: publication page, Open Access: Yes.

Heydar Soudani, Roxana Petcu, Evangelos Kanoulas, Faegheh Hasibi, “Data Augmentation for Conversational AI”, Proceedings of the ACM Web Conference 2024 (WWW ’24), DOI: link, URL: publication page, Open Access: Yes.

2024 – June:

Andreas Paraskeva, Joao Pedro Reis, Suzan Verberne, Jan N. van Rijn, “Resource-constrained Neural Architecture Search on Language Models: A Case Study”, WANT@ICML2024 workshop, URL: publication page, Open Access: Yes.

2024 – July:

Mert Yazan, Frederik Situmeang, Suzan Verberne, “The Impact of Quantization on Retrieval-Augmented Generation: An Analysis of Small LLMs”, IR-RAG@SIGIR’24, vol. 3784, pp. 77–81, DOI: link, URL: publication page, Open Access: Yes.

2024 – August:

Nonkes, N., Agaronian, S., Kanoulas, E., Petcu, R., “Leveraging graph structures to detect hallucinations in large language models”, Proceedings of the TextGraphs-17 Workshop, ACL 2024, pp. 93–104, DOI: link, URL: publication page, Open Access: Yes.

2024 – October:

Vera Neplenbroek, Arianna Bisazza, Raquel Fernández, “MBBQ: A Dataset for Cross-Lingual Comparison of Stereotypes in Generative LLMs”, Conference on Language Modeling (COLM), DOI: link, URL: publication page, Open Access: Yes.

2024 – November:

Yumeng Wang, “Towards Intent-Driven Transparency in Conversational Search Systems”, ECIR 2025; Doctoral Consortium, URL: publication page, Open Access: Yes.

Tianyu Liu*, Jirui Qi*, Paul He, Arianna Bisazza, Mrinmaya Sachan, Ryan Cotterell, “Likelihood as a Performance Gauge for Retrieval-Augmented Generation”, NAACL 2025 Main, URL: publication page, Open Access: Yes.

2024 – December:

Jirui Qi*, Gabriele Sarti*, Raquel Fernández, Arianna Bisazza, “Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation”, EMNLP 2024 Main, DOI: link, URL: publication page, Open Access: Yes.

Xinyi Chen, Baohao Liao, Jirui Qi, Panagiotis Eustratiadis, Christof Monz, Arianna Bisazza, Maarten de Rijke, “The SIFo Benchmark: Investigating the Sequential Instruction Following Ability of Large Language Models”, Findings of the 2024 Conference on Empirical Methods in Natural Language Processing, DOI: link, URL: publication page, Open Access: Yes.

I-Fan Lin, Faegheh Hasibi, Suzan Verberne, “Generate then Refine: Data Augmentation for Zero-shot Intent Detection”, Findings of the Association for Computational Linguistics: EMNLP 2024, DOI: link, URL: publication page, Open Access: Yes.

Heydar Soudani, Evangelos Kanoulas, Faegheh Hasibi, “Fine Tuning vs. Retrieval Augmented Generation for Less Popular Knowledge”, Proceedings of International ACM SIGIR Conference on Information Retrieval in the Asia Pacific, DOI: link, URL: publication page, Open Access: Yes.

2025 – March:

I-Fan Lin, Faegheh Hasibi, Suzan Verberne, “SPILL: Zero-shot Intent Clustering with Large Language Models”, Findings of the Association for Computational Linguistics ACL 2025.

2025 – April:

Mert Yazan, Suzan Verberne, Frederik Situmeang, “Improving RAG for Personalization with Author Features and Contrastive Examples”, ECIR 2025, vol. 15574, pp. 408–416, DOI: link, URL: publication page, Open Access: Yes.

Mohanna Hoveyda, Harrie Oosterhuis, Arjen P. de Vries, Maarten de Rijke, Faegheh Hasibi, “Adaptive Orchestration of Modular Generative Information Access Systems”, SIGIR 2025; Perspective Track, DOI: link, URL: publication page, Open Access: Yes.

Heydar Soudani, “Enhancing Knowledge Injection in Large Language Models for Efficient and Trustworthy Responses”, SIGIR 2025; Doctoral Consortium, DOI: link, URL: publication page, Open Access: Yes.

Andreas Paraskeva, Max Johannes van Duijn, Maarten de Rijke, Suzan Verberne, Jan Nicolaas van Rijn, “Data Efficient Pre-training for Language Models: An Empirical Study of Compute Efficiency and Linguistic Competence”, ICLR 2025 Workshop on Navigating and Addressing Data Problems for Foundation Models, DOI: link, URL: publication page.

Arian Askari, Roxana Petcu, Chuan Meng, Mohammad Aliannejadi, Amin Abolghasemi, Evangelos Kanoulas, Suzan Verberne, “SOLID: Self-seeding and Multi-intent Self-instructing LLMs for Generating Intent-aware Information-Seeking Dialogs”, Findings of the Association for Computational Linguistics: NAACL 2025 (https://aclanthology.org/2025.findings-naacl.357/), DOI: link, URL: publication page, Open Access: Yes.

Li, Y., Eustratiadis, P., Kanoulas, E., “Reproducing HotFlip for Corpus Poisoning Attacks in Dense Retrieval”, ECIR 2025, pp. 95–111, DOI: link, URL: publication page, Open Access: Yes.

2025 – May:

Yumeng Wang, Xiuying Chen, Suzan Verberne, “QUIDS: Query Intent Description for Exploratory Search via Dual Space Modeling”, EMNLP 2025 Main, DOI: link, URL: publication page.

Heydar Soudani, Evangelos Kanoulas, Faegheh Hasibi, “Why Uncertainty Estimation Methods Fall Short in RAG: An Axiomatic Analysis”, Findings of the Association for Computational Linguistics ACL 2025, DOI: link, URL: publication page.

Kudzai Sauka, Yitong Wang, Frederik Situmeang, “Anthropomorphism and transparency interplay on consumer behaviour in generative AI-driven marketing communication”, Journal of Consumer Marketing, vol. 42(4), DOI: link, URL: publication page, Open Access: Yes.

2025 – July:

Jakub Podolak, Leon Perić, Mina Janićijević, Roxana Petcu, “Beyond Reproducibility: Advancing Zero-shot LLM Reranking Efficiency with Setwise Insertion”, SIGIR 2025, URL: publication page.

Oliver Savolainen, Dur e Najaf Amjad, Roxana Petcu, “Interpreting Multilingual and Document-Length Sensitive Relevance Computations in Neural Retrieval Models through Axiomatic Causal Interventions”, SIGIR 2025, URL: publication page.

Cile van Marken, Roxana Petcu, “Reproducing and Extending Causal Insights Into Term Frequency Computation in Neural Rankers”, SIGIR-AP 2025, pp. 189–198, DOI: link, URL: publication page, Open Access: Yes.

Yongkang Li, Panagiotis Eustratiadis, Simon Lupart, Evangelos Kanoulas, “Unsupervised Corpus Poisoning Attacks in Continuous Space for Dense Retrieval”, SIGIR 2025, pp. 2452–2462, DOI: link, URL: publication page, Open Access: Yes.

Antonios Tragoudaras, Theofanis Aslanidis, Emmanouil Georgios Lionis, Marina Orozco González, Panagiotis Eustratiadis, “Information Leakage of Sentence Embeddings via Generative Embedding Inversion Attacks”, SIGIR 2025, pp. 3234–3243, DOI: link, URL: publication page, Open Access: Yes.

2025 – August:

Vera Neplenbroek, Arianna Bisazza, Raquel Fernández, “Cross-Lingual Transfer of Debiasing and Detoxification in Multilingual LLMs: An Extensive Investigation”, Findings of the Association for Computational Linguistics ACL 2025, pp. 2805–2830, DOI: link, URL: publication page, Open Access: Yes.

Ali Satvaty, Anna Visman, Dan Seidel, Suzan Verberne, Fatih Turkmen, “Memorization is Language-Sensitive: Analyzing Memorization and Inference Risks of LLMs in a Multilingual Setting”, L2M2 Workshop at ACL.

2025 – October:

Kudzai Sauka, Youssef Saou, Frederik Situmeang, Monika Kackovic, “A Nexus of Explainability and Anthropomorphism in AI-Chatbots”, Communications in Computer and Information Science, vol. 2576, pp. 115–138, DOI: link, URL: publication page, Open Access: Yes.

2025 – November:

Jirui Qi, Raquel Fernández, Arianna Bisazza, “On the Consistency of Multilingual Context Utilization in Retrieval-Augmented Generation”, Workshop on Multilingual Representation Learning (MRL) at EMNLP 2025, DOI: link, URL: publication page, Open Access: Yes.

Vera Neplenbroek, Arianna Bisazza, Raquel Fernández, “Reading Between the Prompts: How Stereotypes Shape LLM’s Implicit Personalization”, EMNLP 2025 Main, pp. 20378–20411, DOI: link, URL: publication page, Open Access: Yes.

Jirui Qi†, Shan Chen†, Zidi Xiong, Raquel Fernández, Danielle S Bitterman‡, Arianna Bisazza‡, “When Models Reason in Your Language: Controlling Thinking Language Comes at the Cost of Accuracy”, EMNLP 2025 Findings, DOI: link, URL: publication page.

Xinyi Chen, Yifei Yuan, Jiaang Li, Serge Belongie, Maarten de Rijke, Anders Søgaard, “What if Othello-Playing Language Models Could See?”, EMNLP 2025, URL: publication page.

Roxana Petcu, Samarth Bhargav, Maarten de Rijke, Evangelos Kanoulas, “A Comprehensive Taxonomy of Negation for NLP and Neural Retrievers”, EMNLP 2025, pp. 15511–15533, DOI: link, URL: publication page, Open Access: Yes.

Kanoulas, E., Eustratiadis, P., Sanderson, M., Callan, J., “Overview of the TREC 2025 Million Large Language Models Track”, NIST TREC.

Kudzai Sauka, Gianluigi Bardelloni, Jigsa Bulto, Frederik BI Situmeang, “Building Hierarchy-Aware Knowledge Graphs: Ontology-Grounded Triple Extraction with LLMs”, ISWC, vol. 4085, URL: publication page, Open Access: Yes.

2025 – December:

Georgios Sidiropoulos, Samarth Bhargav, Panagiotis Eustratiadis, Evangelos Kanoulas, “Multivariate dense retrieval: A reproducibility study under a memory-limited setup”, TMLR, URL: publication page.