Jakob Sturm

M. Sc. (he/him) (external PhD student with BMW)


E-Mail: jakob.sturm(AT)bmw.de
Address: TUM - Fakultät für Informatik, Boltzmannstr. 3, 85748 Garching


Research interest

The focus of my research is on Retrieval-Augmented Generation (RAG) as an approach for domain adaptation. While pretrained large language models (LLMs) offer great potential, they need to be adapted to handle domain-specific tasks and incorporate relevant knowledge. RAG addresses this need, but it is not yet a perfect solution. I aim to contribute to the identification and resolution of existing challenges.

Current research topics include:

  • Evaluation methods for RAG, both end-to-end and component-wise
  • Domain adaptation strategies for the retrieval component of RAG
  • Comparison and integration of RAG with fine-tuning approaches
  • Identification and analysis of key pain points in RAG systems


Supervision of Theses

If you are looking for a thesis or guided research topic and are motivated to work on a project related to my research interests, don't hesitate to contact me via email! Please attach your CV, Transcript of Records, and a short (< 400 words) introduction about yourself and your motivation. As my capacity is very limited, please understand that I cannot offer a thesis to every interested student. I will nevertheless try to answer every enquiry; if you haven't heard from me, I am still deciding whether I can offer you a topic. Please be aware that I consider it fundamental for a master's thesis to be grounded in the current state of the art. This means I do not offer exclusively practical master's theses; every thesis requires a solid grounding in the literature. To balance research autonomy with close collaboration on the project, I usually offer a weekly meeting.

Important: Currently, I am focused on kick-starting the listed open projects, so I am not available to supervise any additional topics. I look forward to new applications for the listed topics!

(Last updated on 03.11.2025)

(M = master's thesis; B = bachelor's thesis; P = student project) (TUM = internal TUM thesis; BMW = external thesis, linked to a thesis position at BMW)

Open Topics

  • (M) (TUM) Assessing the potential of hybrid retrieval methods for domain-specific items, with a focus on large text elements
  • (M) (TUM) Comparing RAG and fine-tuning with respect to output quality and costs
  • (M) (BMW) Exploring the potential of combining Agents, Reasoning, RAG and domain adaptation for a Quality Management Process
  • (M) (TUM) Training of a reasoning embedding model
  • (M) (TUM) A public NLP data set for the German automotive industry

Ongoing Theses

Finished Theses

  • (2025) (M) Learning From Structure: Enhancing Semantic Information Retrieval Using Expert Knowledge (old title: S3EK: boosting task-Specific Semantic Search with Expert Knowledge data)
  • (2025) (M) Retrieval Augmented Generation with Knowledge Graphs (old title: Reasoning with Large Language Models over Knowledge Graphs: Explainable Knowledge Access in Automotive Knowledge Management Systems)
  • (2025) (P) Exploration of Reranking, relevance feedback and keyword identification from search histories for IR and RAG
  • (2025) (B) Assessing the role of generator and retriever within a RAG pipeline on the Question Answering task with the help of NLP metrics (Benchmarking LLMs)
  • (2025) (M) Advanced Methods for Finding Related Tickets Based on Semantic Search (Continuous pretraining and fine-tuning for IR; hybrid retrieval)
  • (2024) (M) Evaluation of Retrieval augmented Generation architectures
  • (2024) (P) Boosting Quality Control in the Automotive Industry using LLMs and Contrastive Learning (Blackbox finetuning for Embedding Models)