AI Researcher - Software Engineer
I am Hassan, currently working as an AI Researcher at the German Research Center for Artificial Intelligence (DFKI) in Berlin. I hold a Master’s degree in Computer Science from Saarland University, where I graduated with distinction. During my Master's studies, I gained industry experience at prominent organizations, including Bosch Center for AI (BCAI) in Germany, Amazon EU in Luxembourg, and the Max Planck Institute for Informatics, where I served as a Research Assistant.
My research focuses on Large Language Models (LLMs) and Generative AI. I specialize in developing cutting-edge LLM-based applications, from research prototypes to applied demonstrations. One of my key projects involved creating a chatbot for graduate students, designed to enhance their understanding of university courses. This work resulted in two published papers. In addition, I have contributed to improving the user experience of AI-based phone assistants by leveraging the capabilities of LLMs.
My Master's thesis, completed in collaboration with the NLP and Semantic Reasoning group at Bosch Center for AI, focused on Cross-Domain Neural Entity Linking. I investigated a Transformer-based model to help facilitate domain extension by identifying the best data for fine-tuning across different knowledge bases.
I am passionate about researching AI models, experimenting with their trade-offs, and applying the most suitable ones to real-world use cases that can impact millions of users and fellow researchers. Outside of work, I enjoy graphic design, fitness, philosophy, and exploring new cultures through travel and group activities.
Updates
2024
[09-2024] Presented my publication titled "Scalable Mentoring Support with a Large Language Model Chatbot" at the ECTEL'24 conference and won the 2nd Best Demo Paper award.
[09-2024] Presented my publication titled "Generative KI zur Lernenbegleitung in den Bildungswissenschaften: Implementierung eines LLM-basierten Chatbots im Lehramtsstudium" at the DELFI'24 conference.
[07-2024] Presented my publication titled "Using Large Language Models for Adaptive Dialogue Management in Digital Telephone Assistants" at the HAAPIE workshop of the UMAP'24 conference.
[07-2024] Attended the UMAP'24 (User Modeling, Adaptation and Personalization) conference in Cagliari, Italy, including several keynotes and workshops.
[05-2024] Our demo paper at DFKI, "Scalable Mentoring Support with a Large Language Model Chatbot", was accepted for publication at the ECTEL'24 conference in September.
[05-2024] My work on chatbot design at DFKI, "Generative KI zur Lernenbegleitung in den Bildungswissenschaften: Implementierung eines LLM-basierten Chatbots im Lehramtsstudium", was accepted for publication at the DELFI'24 conference in September.
[04-2024] My work at DFKI on adapting dialogue utterances to the user's context, "Using Large Language Models for Adaptive Dialogue Management in Digital Telephone Assistants", was accepted for publication at the HAAPIE workshop of the UMAP'24 conference.
[03-2024] Presented recent advancements in chatbot design for university-level courses during my work at DFKI, in collaboration with our partners from the University of Leipzig.
Interests
Natural Language Processing (NLP)
Large Language Models (LLMs)
Conversational AI & Chatbot Design
Retrieval Augmented Generation
Neural Information Extraction
Education
Saarland Informatics Campus, Saarland University (Saarbrücken, Germany)
Master of Science in Computer Science, 2022
GPA: 1.40 (German scale; 1.00 is the highest grade)
Faculty of Engineering, Alexandria University (Alexandria, Egypt)
Bachelor of Science in Computer and Communication Engineering, 2018
GPA: 3.96/4.00
Awards
ECTEL: Second Best Demo Paper, 2024 (Krems, Austria)
Alexandria University: First Class Honors Degree, 2018 (Alexandria, Egypt)
Skills
Fields: Machine Learning, Deep Learning, Natural Language Processing (NLP), Natural Language Understanding, Large-Scale Language Modeling, Generative AI, Conversational AI, Dialogue Systems, Neural Machine Translation, Information Extraction, Data Analysis, Data Visualization, Distributed Computing.
Technologies and Libraries: LangChain, LangGraph, LangSmith, LlamaIndex, HuggingFace, Transformers, Scikit-Learn, Keras, PyTorch, TensorFlow, Pandas, NumPy, Matplotlib, Leaflet, Docker, Kubernetes, Airflow, Git, Jira.
Programming and Databases: Python, R, Java, C/C++, Java Spring, Angular.js, SQL, MongoDB, Redshift, DynamoDB.
Languages: Native Arabic, Fluent English, Intermediate German.
Publications
[09-2024] Hassan Soliman, Miloš Kravčík, Alexander Tobias Neumann, Yue Yin, Norbert Pengel and Maike Haag. 2024. Scalable Mentoring Support with a Large Language Model Chatbot. Technology Enhanced Learning for Inclusive and Equitable Quality Education (ECTEL), September 16–20, 2024, Krems, Austria, 6 pages.
https://doi.org/10.1007/978-3-031-72312-4_37
[09-2024] Hassan Soliman, Miloš Kravčík, Alexander Tobias Neumann, Yue Yin, Norbert Pengel, Maike Haag and Heinz-Werner Wollersheim. 2024. Generative KI zur Lernenbegleitung in den Bildungswissenschaften: Implementierung eines LLM-basierten Chatbots im Lehramtsstudium. 22. Fachtagung Bildungstechnologien (DELFI), September 9-11, 2024, Fulda, Germany, 7 pages.
https://doi.org/10.18420/delfi2024_15
[07-2024] Hassan Soliman, Miloš Kravčík, Nagasandeepa Basvoju, and Patrick Jähnichen. 2024. Using Large Language Models for Adaptive Dialogue Management in Digital Telephone Assistants. In Adjunct Proceedings of the 32nd ACM Conference on User Modeling, Adaptation and Personalization (UMAP Adjunct ’24), July 1–4, 2024, Cagliari, Italy. ACM, New York, NY, USA, 12 pages.
https://doi.org/10.1145/3631700.3664902
[05-2022] Hassan Soliman, Heike Adel, Mohamed H. Gad-Elrab, Dragan Milchevski, and Jannik Strötgen. 2022. A Study on Entity Linking Across Domains: Which Data is Best for Fine-Tuning?. In Proceedings of the 7th Workshop on Representation Learning for NLP, ACL, 184–190, Dublin, Ireland. https://aclanthology.org/2022.repl4nlp-1.19
Preprints
[01-2021] Effective General-Domain Data Inclusion for Machine Translation by Vanilla Transformers.
https://arxiv.org/abs/2209.14073
Built and trained a Transformer from scratch on the WMT'13 German-English translation task.
Utilized a general-domain dataset of IWSLT'16 TED talks to improve the Transformer model's performance, achieving a 25.8 BLEU score.
[08-2019] Offensive Language Detection & Classification on Twitter.
https://arxiv.org/abs/2209.14091
Trained an SVM classifier to detect offensive tweets from Twitter, after iterative feature and model experiments.
Achieved a binary accuracy of 74% in classifying offensive tweets, the highest score among all participating teams.
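A minimal sketch of the kind of baseline this describes: TF-IDF features feeding a linear SVM. The toy tweets and labels below are purely illustrative, not the shared-task data or the actual feature set.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Tiny illustrative dataset (hypothetical examples, not the competition data)
tweets = ["you are awesome", "have a great day",
          "you are terrible", "awful horrible person"]
labels = ["not_offensive", "not_offensive", "offensive", "offensive"]

# TF-IDF unigrams/bigrams feeding a linear SVM, a common text-classification baseline
model = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)), LinearSVC())
model.fit(tweets, labels)
pred = model.predict(["awful terrible person"])[0]
```

In practice the iterative experiments would sweep n-gram ranges, class weights, and the SVM regularization parameter on a held-out split.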
[06-2019] Data Augmentation using Feature Generation for Volumetric Medical Images.
https://arxiv.org/abs/2209.14097
Proposed using U-Net and ACGAN as a learning framework for feature generation on medical images of two complex types of brain tumors.
Deployed a classifier pipeline to test & validate the quality of the generated features.
Experience
German Research Center for AI (DFKI)
AI Researcher
Jan 2023 - Present
Led two projects in the Educational Technology lab, managing technical implementation and supervising two students.
Developed a chatbot for a graduate-level course that answered student queries with 87% accuracy. One of the two papers published on the project was nominated for the Best Demo Award at ECTEL 2024.
Applied advanced Retrieval-Augmented Generation (RAG) techniques, including Hybrid Ensemble Search and Reranking Mechanism, to enhance chatbot interactions and improve the retrieval of course materials.
Supported mentoring-style conversations by leveraging flexible agentic workflows with LangGraph, utilizing multiple small open-source models hosted on Azure, and using databases to track and monitor usage.
Implemented a sub-module for adaptive dialogue systems, customizing responses based on user emotional state and demographics, and benchmarking performance using OpenAI LLMs and open-source models.
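Hybrid ensemble search typically merges a keyword-based ranking with a vector-based ranking before reranking. A minimal sketch of one common fusion strategy, reciprocal rank fusion; the function name and toy document IDs are illustrative, not the project's NDA-covered code:

```python
from collections import defaultdict

def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists (e.g., keyword hits and vector hits)
    into a single ranking. Each input list holds doc IDs, best first."""
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking):
            # Documents near the top of any list accumulate the most score
            scores[doc_id] += 1.0 / (k + rank + 1)
    return sorted(scores, key=scores.get, reverse=True)

# Toy example: the two retrievers disagree on ordering
keyword_hits = ["doc_a", "doc_b", "doc_c"]
vector_hits = ["doc_b", "doc_d", "doc_a"]
fused = reciprocal_rank_fusion([keyword_hits, vector_hits])
```

A cross-encoder reranker would then rescore only the top few fused candidates, which keeps the expensive model off the long tail.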
Berlin, Germany
Bosch Center for Artificial Intelligence (BCAI)
Applied Scientist Intern
May 2022 - Aug 2022
Contributed to the NLP & Semantic Reasoning group, applying findings from my master's thesis on Neural Entity Linking to a high-impact industrial project using real data at Bosch.
Refactored, tested, and documented production-level code for machine learning models, ensuring scalability and efficiency for real-world deployment, leveraging the in-house GPU cluster for model fine-tuning.
Trained and evaluated machine learning models on a large-scale domain-specific dataset, achieving 77% end-to-end recall for top-3 entity predictions, outperforming existing models.
Renningen, Germany
Bosch Center for Artificial Intelligence (BCAI)
Master's Thesis
Student
Jun 2021 - Jan 2022
Joined the NLP & Semantic Reasoning group and worked on a unified system for linking named entities to general-domain (Wikipedia) and domain-specific knowledge bases (KBs), using context-aware embeddings (BERT) to learn a joint vector space. A preprint of the thesis is available on arXiv: https://arxiv.org/abs/2210.15616.
Optimized a state-of-the-art model for cross-domain applications, supporting domain extension and identifying optimal data sources for fine-tuning, and improved GPU memory utilization for efficient embedding calculations.
Achieved a 9% increase in Average Precision for the top-1 entity and a 20% gain in Mean Average Precision (MAP) for top-10 entity linking across four domain-specific KBs, resulting in a workshop publication at ACL 2022.
Renningen, Germany
Max Planck Institute for Informatics (MPII)
Research Assistant
Nov 2020 - May 2021
Developed a model prototype within the Database & Information Systems group to identify diverse peer groups for entities, contributing to advanced set expansion techniques.
Implemented a baseline model for entity set expansion, leveraging Wikipedia lists as a knowledge source to enhance the model’s accuracy and comprehensiveness in the expanded sets.
Optimized the algorithm’s performance by achieving a 3x faster runtime using efficient sparse matrix multiplication techniques, significantly improving computational efficiency.
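The 3x speedup came from exploiting sparsity. A sketch of the idea using SciPy (toy dimensions and a random membership-style matrix stand in for the actual entity data): multiplying in a compressed sparse format skips the zero entries that dominate such matrices, while producing the same result as the dense product.

```python
import numpy as np
from scipy import sparse

rng = np.random.default_rng(0)
# Toy stand-in for an entity-membership matrix: mostly zeros,
# like entity-to-Wikipedia-list membership indicators
dense = rng.random((200, 300))
dense[dense < 0.99] = 0.0                 # keep only ~1% of entries as non-zero

csr = sparse.csr_matrix(dense)            # compressed storage of the non-zeros
overlap_sparse = (csr @ csr.T).toarray()  # pairwise co-membership scores
overlap_dense = dense @ dense.T           # identical result, computed densely
```

On real data, the denser the matrix, the smaller the gain; the win here depends on the membership matrix being overwhelmingly zero.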
Saarbrücken, Germany
Amazon
Software Development Engineer Intern
Aug 2019 - Feb 2020
Maintained a web-based simulation tool for the Fulfillment Acceleration team using the AWS cloud platform, working as a full-stack software engineer.
Enhanced delivery-speed simulations for Prime customers, contributing to a successful report on fulfillment operations and improving delivery efficiency.
Collaborated as a system administrator in an Agile environment, managing server infrastructure and providing technical support for team tools.
Luxembourg, Luxembourg
Theses
Cross-Domain Neural Entity Linking
Worked on a unified system for linking named entities to both general-domain (Wikipedia) and domain-specific knowledge bases (KBs).
Improved semantic search using context-aware embeddings (BERT) to learn a joint vector space for KBs from different domains.
Achieved a 20% gain in Mean Average Precision (MAP) for top-10 entity linking across four domain-specific KBs.
Co-authored an invention report based on thesis work, leading to a US patent and receiving an Incentive-Prämie.
A pre-print of this thesis is available on arXiv: https://arxiv.org/abs/2210.15616.
Masters'22
Egyptian Car License Plate Information Detection
Implemented an application that extracts license information from car images in Egypt, covering the different product life-cycle stages.
Collected datasets of various kinds of Egyptian car plates under various conditions, and applied different data transformation (ETL) techniques.
Fine-tuned pre-trained CNN models for object detection, localization, semantic segmentation, and OCR of the plate letters & numbers.
Bachelors'18
Projects
Scalable Mentoring Support with a LLM Chatbot
Designed and implemented an LLM-based agent chatbot to provide scalable educational support and timely feedback to students of education sciences, demonstrating the significant potential of generative AI in education.
Utilized advanced Retrieval-Augmented Generation (RAG) techniques, e.g., Hybrid Ensemble Search and a Reranking Mechanism, enabling the chatbot to retrieve and analyze course materials effectively.
The code is subject to a Non-Disclosure Agreement.
Using LLMs for Adaptive Dialogue Management
Adapted user-directed utterances with LLMs based on user parameters such as gender, age, and sentiment, aiming to optimize user satisfaction in conversational AI systems, with a focus on patient-practice interactions in healthcare.
Evaluated different LLMs and open-source tools for utterance adaptation in terms of speed, cost-effectiveness, and quality of the generated text, judged by adaptation relevancy and adequacy.
The code is subject to a Non-Disclosure Agreement.
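Since the code is under NDA, here is only a hypothetical sketch of the general pattern: build an adaptation prompt from the user's parameters and hand it to an LLM. The template wording and function name are illustrative assumptions, not the project's actual prompts.

```python
def build_adaptation_prompt(utterance, age, gender, sentiment):
    """Illustrative prompt template: ask an LLM to rephrase a system
    utterance for the caller's context (wording is hypothetical)."""
    return (
        "Rewrite the following phone-assistant utterance for a "
        f"{age}-year-old {gender} caller whose current sentiment is {sentiment}. "
        "Keep the meaning identical and the tone appropriate.\n"
        f"Utterance: {utterance}"
    )

prompt = build_adaptation_prompt(
    "Your appointment is confirmed for Monday at 9 am.",
    age=72, gender="female", sentiment="frustrated",
)
# `prompt` would then be sent to the chosen LLM for the rewritten utterance
```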
Information Extraction Pipeline in Medical Text
Extracted symptoms from German-language medical text (prescriptions) based on a set of doctor-predefined symptoms and their synonyms.
Additionally extracted other symptoms using a comprehensive symptom ontology provided by the German Ministry of Health.
The code is subject to a Non-Disclosure Agreement.
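The synonym-matching step can be sketched as a dictionary lookup that maps surface variants back to canonical symptoms. The two entries below are made-up examples, not the doctors' actual list or the ministry ontology:

```python
# Hypothetical canonical symptoms mapped to their surface variants
symptom_synonyms = {
    "Kopfschmerzen": ["Kopfschmerz", "Cephalgie", "Kopfweh"],
    "Fieber": ["erhöhte Temperatur", "febril"],
}

def extract_symptoms(text, synonyms):
    """Return the canonical symptoms whose name or any synonym
    occurs in the text (simple case-insensitive substring match)."""
    text_lower = text.lower()
    found = set()
    for canonical, variants in synonyms.items():
        for term in [canonical] + variants:
            if term.lower() in text_lower:
                found.add(canonical)
    return found

found = extract_symptoms(
    "Patient klagt über Kopfweh und erhöhte Temperatur.", symptom_synonyms
)
```

A production pipeline would replace the substring test with proper tokenization and lemmatization, since German compounds and inflections defeat naive matching.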
Better Diet to fight COVID-19
Analyzed food-consumption data from all countries to investigate the relationship between a country's food culture and its COVID-19 recovery rate, using data analytics.
Implemented three Random Forest models, benchmarked the results to report the best one, and visualized the findings using the Seaborn library.
The code is published on: https://github.com/HassanMahmoudd/COVID-19-Diet
SmolLM: Implementing, Fine-Tuning, and Aligning a LLM for Grammatical Error Correction
Implemented the SmolLM-135M (by HuggingFace) language model architecture, including Rotary Positional Embeddings, a KV Cache, Grouped-Query Attention, RMS Normalization, and SwiGLU Activation.
Fine-tuned the model on the Grammatical Error Correction (GEC) task using the Grammarly CoEdIT dataset.
Applied RLAIF through Direct Preference Optimization (DPO) to align model outputs with desired corrections.
Created a Colab notebook to guide users through implementation, fine-tuning, and evaluation processes.
Achieved significant improvements in grammatical error correction accuracy, reaching a BLEU score of ∼0.48.
Leveraged Python libraries such as PyTorch, Transformers, Datasets, and TRL to build and train the model effectively.
The code is published on: https://github.com/HassanMahmoudd/SmolLM_RL.
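Two of the listed components are compact enough to sketch in NumPy; this is a simplified illustration of the math, not the repository's PyTorch code, and the toy shapes are arbitrary:

```python
import numpy as np

def rms_norm(x, weight, eps=1e-6):
    # RMSNorm: rescale by the root mean square of the features
    # (no mean centering, unlike LayerNorm)
    rms = np.sqrt(np.mean(x ** 2, axis=-1, keepdims=True) + eps)
    return x / rms * weight

def swiglu_ffn(x, w_gate, w_up, w_down):
    # SwiGLU feed-forward block: a SiLU-gated linear unit,
    # as used in LLaMA-style transformer MLPs
    silu = lambda z: z / (1.0 + np.exp(-z))
    return (silu(x @ w_gate) * (x @ w_up)) @ w_down

rng = np.random.default_rng(1)
hidden, inner = 8, 16                     # toy dimensions for illustration
x = rng.standard_normal((2, hidden))
normed = rms_norm(x, np.ones(hidden))     # per-row RMS becomes ~1
out = swiglu_ffn(normed,
                 rng.standard_normal((hidden, inner)),
                 rng.standard_normal((hidden, inner)),
                 rng.standard_normal((inner, hidden)))
```

In the real model the `weight` vector and the three projection matrices are learned parameters, and the block sits inside each transformer layer with a residual connection.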
LinguaLexMatch: Enhanced Document Language Detection
Developed and evaluated three language detection models, including an Embedding-Based approach, a TF-IDF-based Multinomial Naive Bayes model, and a fine-tuned Transformer-Based methodology.
Implemented an embedding-based approach using the intfloat/multilingual-e5-large-instruct model by generating a representative embedding for each language and classifying documents based on cosine similarity.
Benchmarked models on the papluca/language-identification dataset, achieving 99.81% accuracy with the embedding-based model.
Analyzed performance metrics such as Accuracy, F1 Scores, and Confusion Matrices across 20 different languages.
Developed a Colab notebook for replicable implementation and evaluation of different language detection models.
Utilized Python libraries including Datasets, Transformers, and Scikit-learn for model development and evaluation.
The code is published on: https://github.com/HassanMahmoudd/LinguaLex_Match.
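The embedding-based approach reduces to nearest-centroid classification under cosine similarity. A self-contained sketch with tiny made-up vectors in place of the multilingual-e5 embeddings the project actually uses:

```python
import numpy as np

def classify_by_centroid(doc_vec, centroids):
    """Assign the language whose representative embedding has the highest
    cosine similarity to the document embedding. Vectors here are toy
    3-d stand-ins for real multilingual sentence embeddings."""
    def cos(a, b):
        return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))
    return max(centroids, key=lambda lang: cos(doc_vec, centroids[lang]))

# Hypothetical per-language centroid embeddings
centroids = {
    "en": np.array([0.9, 0.1, 0.0]),
    "de": np.array([0.1, 0.9, 0.2]),
}
doc = np.array([0.8, 0.2, 0.1])   # embedding of the document to classify
lang = classify_by_centroid(doc, centroids)
```

In the full system each centroid is the mean of embeddings of training documents in that language, which is what makes a single cosine comparison per language sufficient at inference time.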