Teaching

Winter Term

Computer Graphics (infCG-02a)

Abstract: Computer Graphics enables the generation of visualizations that can be used in a variety of domains and applications, such as movies and games, manufacturing, architecture, and in research. Due to recently introduced concepts, such as the Metaverse and Digital Twins, 3D computer graphics has gained a considerable amount of momentum. Key goals of Computer Graphics are to develop algorithms for rendering images, for modeling the 3D geometry of objects and scenes, and for generating simulations and animations.

More Info | MDB

Summer Term

Generative AI (infGAI-01a)

Abstract: This lecture is dedicated to developing an understanding of generative AI models with a focus on visual computing. The lecture will introduce the diverse applications of generative AI, ranging from generating new data samples and using generative architectures for data augmentation to highlighting how these technologies are revolutionizing multiple industries. By the end of the lecture, attendees will gain an understanding of generative AI, equipped with the knowledge to critically assess its applications. This lecture is designed to cater to a wide range of students, from BA and MSc to students of other courses.

More Info | MDB

Winter Term

Time Series Predictions with Sequential Neural Networks (infBSemZsnN-01a)

Abstract: This seminar addresses the learning of models for the analysis and prediction of time series. Time series are temporally ordered data points that are collected, for example, hourly, weekly, or annually. The analysis and prediction of time series are of great interest in a variety of fields (e.g., economics, remote sensing, robotics, meteorology). Time series models aim to enable the prediction of trends, which allow statements to be made about the future development of observations.

More Info | MDB

Winter/Summer Term

Master Project - Computer Graphics (infMPCG-01a)

Abstract: This master project provides the opportunity to apply and advance the theoretical knowledge of computer graphics to a concrete practical problem. At the start of the semester students will pick concrete projects, for which they work together through the semester in small teams to achieve the goals.

More Info | MDB

Winter/Summer Term

Master Project - GenerativeAI (infMPGAI-01a)

More Info | MDB

Below you can find a selection of open thesis projects that the VCAI group is offering. Please reach out to Prof. Sören Pirk if you are interested in one of the topics. You can also propose your own projct.

Automatic Robot-driven Dollhouse Generation

[Type: Master Thesis, Focus: Robotics, Machine Learning, Computer Vision]

This project aims to develop a system for automatic 3D indoor scene reconstruction using a quadruped robot equipped with onboard cameras. The robot will either autonomously or manually explore indoor environments and build compact 3D models, similar to "dollhouse views," using modern techniques such as Gaussian Splatting or Neural Radiance Fields. Key challenges include robust SLAM, trajectory planning, and real-time scene representation from egocentric visual data. The resulting system enables applications in inspection, digital twins, or VR/AR without requiring external scanning hardware.

Gesture-based Robot Control

[Type: Bachelor or Master Thesis, Focus: Robotics, Machine Learning, Computer Vision]

This project explores intuitive human-robot interaction through vision-based gesture recognition. A camera-equipped robot will be trained to recognize a set of predefined hand/body gestures in real time and map them to locomotion or manipulation commands. The system will use neural networks or transformer-based models for gesture detection, trained on either public datasets or custom recordings. The goal is to enable hands-free control for tasks like guiding, commanding, or stopping the robot, particularly useful in assistive or social robotics contexts.

VLA: Training and Deployment on Legged Robot

[Type: Bachelor or Master Thesis, Focus: Robotics, Machine Learning, Computer Vision]

This project investigates the training and deployment of a Vision Language Architecture (VLA) for quadruped robots navigating unstructured environments. The system integrates vision-based perception (e.g. depth, segmentation) with learned locomotion policies to enable agile and terrain-aware movement. The project includes collecting training data, training the perception and control modules, and validating the full stack on real robot hardware. Key goals include robustness to occlusion, generalization across environments, and tight integration of visual input with proprioceptive control.

Proprioceptive Touch Localization for Legged Robots Using Data-Driven Methods

[Type: Bachelor or Master Thesis, Focus: Robotics, Machine Learning]

This project aims to develop a data-driven whole-body touch sensing system for a quadruped robot using only proprioceptive signals, without requiring additional sensors. Inspired by the UniTac approach, the system will be trained on real-world data collected by manually inducing contact at known body locations and recording joint torques and positions. A neural network will then be trained to predict the location of contact from this proprioceptive input, enabling accurate touch localization and enhancing physical human-robot interaction without modifying the robot’s hardware.

Neural Marching Cubes

[Type: Bachelor or Master Thesis, Focus: Machine Learning]

The objective of this project is to investigate the potential of combining implicit neural representations with traditional surface extraction techniques, specifically through the use of Neural Marching Cubes, for high-fidelity 3D shape reconstruction. Neural Marching Cubes leverages a neural network to learn a continuous signed distance function (SDF) or occupancy field that defines a 3D surface implicitly in space. In this project, we explore how a neural network can be trained to approximate the underlying geometry of a 3D object from sparse or noisy data such as point clouds or multi-view images. Once trained, this continuous representation allows for the application of the classic Marching Cubes algorithm on a dense sampling of the learned field, producing a mesh that accurately captures the shape's fine-grained details.This approach bridges deep learning with classical computer graphics, enabling high-resolution 3D surface reconstruction with improved generalization and topological consistency. Ultimately, the project aims to demonstrate the effectiveness of neural fields in 3D modeling workflows, and how their integration with structured extraction techniques can advance the state-of-the-art in 3D reconstruction and generation.

Image Generation Using GPT-Based Model and VQ-VAE

[Type: Bachelor Thesis, Focus: Machine Learning]

The objective of this project is to investigate the potential of combining a transformer-based model, such as GPT, with a Vector Quantized Variational Autoencoder (VQ-VAE) for the purpose of image generation. VQ-VAE is a type of generative model that effectively compresses images into discrete latent representations, making them suitable for sequence modeling.In this project, we explore how the discrete latent codes produced by a trained VQ-VAE can be treated similarly to tokens in a language model. By using a GPT-style transformer to model the distribution over these latent codes, we aim to generate new images in a sequential, token-by-token manner. This approach leverages the powerful generative capabilities of transformers in the domain of visual data, allowing for the creation of coherent and high-quality images. Ultimately, the project aims to provide insights into the synergy between VQ-VAE and transformer-based models, demonstrating how they can be integrated to push the boundaries of generative image modeling.

Monocular Fire Reconstruction

[Type: Master Thesis, Focus: Computer Vision, Machine Learning]

Traditional reconstructions of fire are based on a set of inward-facing cameras. The large baselines enable precise triangulation of the individual flames. This, however, comes with the main disadvantage of cost and intense calibration. For this reason, this project aims to explore the possibility of significantly reducing the amount of required cameras by introducing powerful pre-trained methods, such as monocular depth, point tracking, and diffusion-based novel-view-synthesis. While approaching the target of using only a single camera, photometric losses as well the generalization capabilities of the reconstructed model will be evaluated.

Prediction of Bone Healing Processes using Sparse Longitudinal Data

[Type: Master Thesis, Focus: Computer Vision, Medical, Machine Learning]

The aim of this project is to investigate whether bone healing processes can be predicted on the basis of sparse longitudinal data, both in terms of overall extent and temporal resolution. To this end, a dummy CT data set must first be created, on which the various AI models will then be trained and tested.

Bone Strength Analysis (FemoraLyze)

[Type: Bachelor or Master Thesis, Focus: Computer Vision, Medical, Machine Learning]

FemoraLyze is a modular framework for a fully-automated proximal femur analysis. Based on different segmentation masks, the framework is able to collect metrics such as cortical thickness in different bone areas, volumes, angles and distances as well as bone structure parameters. These are relevant in medical prognosis, diagnosis and therapy. In order to further increase the benefits for medical professionals, this framework is to be expanded with additional modules (e.g., finite element analysis) and modalities (e.g., MRI or clinical CT). Runtime optimization and a GUI would also be desirable.

AI-based Implant Modeling

[Type: Bachelor or Master Thesis, Focus: Computer Vision, Medical, Machine Learning]

The Institute for Digital Implant Research (IDIR) is investigating how digital twins can be used to improve the development of patient-specific implants. This includes the sub-projects aneurysms, bone nails and heart valves. They all have the same approach in common: We aim to create a digital twin from real data (e.g., CT, MRI) of the patient. The optimum implant for the patient must now be determined/developed by simulation, which then only needs to be manufactured and implanted in the final step. In this context, there are always exciting questions that could be dealt with in the course of a Bachelor's or Master's thesis. If you are interested, please get in touch and we will see if we can find a suitable topic.

Learning based reverse engineering of 3D file formats

[Type: Bachelor or Master Thesis, Focus: Machine Learning]

This project aims to develop a neural-network architecture that receives a binary 3D file of unknown format as input and outputs a file format specification. To train this network, a method to create synthetic data should be developed. On one hand, arbitrary file format specifications can be synthesized and on the other hand 3D files can either be created procedurally or by using a dataset of 3D objects which are then augmented and converted to the previously synthesized file format specifications.

Generating Evacuation Tours through Deep Reinforcement Learning

[Type: Master Thesis, Focus: Computer Graphics, Machine Learning]

Digital twins of indoor scenes are today more and more available in several forms such as 3D models, games and through building information modelling tools. In this project we want to leverage 3D building models for evacuation safety where evacuation tours are automatically generated for the scope of training people on evacuation paths in case ofemergency. The computed evacuation tours are used to guide and train people in case of emergency evacuation. Such tours could also be enhanced with inclusive features (e.g., an automated generation of tours that include a vocal description of what is seen, so that could be used to help people with visual impairments learning about evacuation paths). This project will focus on developing a methodology for automating the generation of trajectories of evacuation paths based on deep reinforcement learning. An agent is trained to learn how to automatically control a camera to generate an evacuation path that can be used for safety purposes.

Modeling and Rendering of Underwater Environments

[Type: Bachelor or Master Thesis, Focus: Computer Graphics]

This project aims to create photorealistic underwater scenes that can be explored with a robotic agent. The core focus is on developing sophisticated computer graphics techniques to accurately simulate underwater optical properties like light refraction, scattering, and caustics, ensuring detailed and lifelike visuals. The project also involves programming a robotic agent to move through these scenes, interact with underwater terrain, and engage with marine flora and fauna, emulating real-life underwater exploration.

Uncertainty Visualization for AI-based Segmentations of Medical CT Volumes

[Type: Bachelor or Master Thesis, Focus: Computer Vision, Medical, Machine Learning]

Various organs of the human patient can be segmented from CT scans and the corresponding greyscale axial views. AI models already achieve a high accuracy in the segmentation, but still need to be corrected. Instead of a binary classification mask as output, a heatmap may display the certainty of the segmentation and improve the understandability of the model.

Effect of the Patient's Rotation Angle on Tissue Deformation in CT Volumes

[Type: Bachelor or Master Thesis, Focus: Computer Vision, Medical]

While medical CT scans are generally taken in a supine position (lying on the back), during the actual surgery, patients are often positioned on their side. This leads to a deformation of internal organs so that their position does not match the CT scan anymore. The elasticity of various organ types may be estimated from tissue characteristics and used to predict absolute movement under the influence of gravity.

Semantic Segmentation of Outdoor Scenes

[Type: Master Thesis, Focus: Computer Vision]

This project aims to develop a deep learning model for semantic segmentation in outdoor environments, with a specific focus on identifying and classifying various types of vegetation such as trees, grass, shrubs, and other plant species. The student will work with datasets of outdoor images and leverage convolutional neural networks (CNNs) or other machine learning models to differentiate vegetation from non-vegetation areas. The outcome of the project will be a tool that can accurately map vegetation in outdoor scenes, potentially useful for applications like environmental monitoring, agriculture, and urban planning.

Randomized Look-Up Data Structures for Spatial Data

[Type: Bachelor or Master Thesis, Focus: Computer Graphics]

The goal of this project is to investigate how randomization can be used to engineer efficient data structures for queries on spatial data. Initially, a literature research should be conducted to compile an overview of what the state of the art is. Based on this, data structures that have only been proposed theoretically should be implemented or entirely new data structures should be developed and implemented. Experiments should then be conducted comparing the performance to that of conventional data structures (e. g. Kd-trees, octrees). Optionally, these newly developed data structures could make use of learned biases to make random decisions which might yield further performance improvements.

Ongoing

Summer 2025 (SS2025)

Geometric Modeling of Harbor Scenes and Path Planning for Boats (Bachelor Thesis)

Supervisors: Prof. Dr. Sören Pirk, M.Sc. Nikolas Schwarz

Follow Me: Vision-Based Person Following for Autonomous Robot Navigation (Bachelor Thesis)

Supervisors: Supervisors: Prof. Dr. Sören Pirk, B.Sc. Anton Wagner

Object Search and Localization: Autonomous Detection of Target Items in Indoor Environments (Bachelor Thesis)

Supervisors: Supervisors: Prof. Dr. Sören Pirk, B.Sc. Anton Wagner

Energy-based Procedural Modeling of Botanical Trees (Bachelor Thesis)

Supervisors: Supervisors: Prof. Dr. Sören Pirk, M.Sc. Nikolas Schwarz

Advanced Procedural Modeling of Leaves for Botanical Trees (Bachelor Thesis)

Supervisors: Supervisors: Prof. Dr. Sören Pirk, M.Sc. Nikolas Schwarz

Comparison of CT Segmentation Algorithms (Bachelor Thesis)

Supervisors: Supervisors: Prof. Dr. Kevin Köser, M.Sc. Silja Janßen, M. Sc. Marten Finck

Wildfire Evacuation Planning (Master Thesis)

Supervisors: Supervisors: Prof. Dr. Athanasios Vafeidis, Prof. Dr. Sören Pirk, M.Sc. Helge Wrede

Textual Inversion for 3D Diffusion (Master Thesis)

Supervisors: Supervisors: Prof. Dr. Sören Pirk, Ashwath Shetty

Interactive Visualization for Contour Placement in Intercranial Aneurysms (Master Thesis)

Supervisors: Prof. Dr. Sören Pirk, Prof. Dr. Sylvia Saalfeld

Completed

Fall 2024 (WS2024/2025)

Advanced Visual Augmentation of Conversations with Language-based Rasoning (Master Thesis)

Supervisors: Prof. Dr. Sören Pirk

Optical Music Recognition with AI (Bachelor Thesis)

Supervisors: Prof. Dr. Sören Pirk, M. Sc. Nikolas Schwarz

Automated Generation of Synthetic Data for Document Understanding Tasks (Master Thesis)

Supervisors: Prof. Dr. Sören Pirk, M. Sc. Marten Finck, M. Sc. Helge Wrede

Improving Document Image Quality Through Noise Removal: A UNet-Based Approach (Bachelor Thesis)

Supervisors: Prof. Dr. Sören Pirk, M. Sc. Marten Finck

Generierung von synthetischen Röntgenbildern der Hand mit Diffusionsmodellen und LoRA (Bachelor Thesis)

Supervisors: Prof. Dr. Sören Pirk, M. Sc. Marten Finck

Classification of binary 3D-Files (Bachelor Thesis)

Supervisors: Prof. Dr. Sören Pirk, M. Sc. Nikolas Schwarz

Conversation Amplification through Language-based Reasoning and Generative Visual Content Creation (Master Thesis)

Supervisors: Prof. Dr. Sören Pirk, Prof. Dr. Hans-Christian Jetter

Summer 2024 (SS2024)

GAN-based Generation of Terrain / Erosion (Bachelor Thesis)
Supervisors: Prof. Dr. Sören Pirk, M. Sc. Nikolas Schwarz, M. Sc. Sarker Miraz Mahfuz

Influence of Animal Behavior on Predator Prey Systems (Bachelor Thesis)
Supervisors: Prof. Dr. Sören Pirk, M. Sc. Nikolas Schwarz, M. Sc. Sarker Miraz Mahfuz

Analysis of a Evolutional Reinforcement Learning Predator Prey Simulation (Bachelor Thesis)
Supervisors: Prof. Dr. Sören Pirk, M. Sc. Sarker Miraz Mahfuz, M. Sc. Helge Wrede

Automating Espresso Grinder Adjustments Using Deep Learning Techniques (Bachelor Thesis)
Supervisors: Prof. Dr. Sören Pirk, M. Sc. Nikolas Schwarz, M. Sc. Marten Finck

Training Neural Networks to Play Chess Like Games (Bachelor Thesis)
Supervisors: Prof. Dr. Sören Pirk, M. Sc. Helge Wrede, M. Sc. Sarker Miraz Mahfuz

Guiding Stable Diffusion with a 3D Editor (Bachelor Thesis)
Supervisors: Prof. Dr. Sören Pirk, M. Sc. Nikolas Schwarz

A Multi-task Downstream Task for Biomarker Identification Using Medical Chest X-ray Images (Bachelor Thesis)
Supervisors: Prof. Dr. Sören Pirk, M. Sc. Marten Finck

Anomaliedetektion mittels Machine Learning auf Geodaten für das Asset Tracking (Bachelor Thesis)

Supervisors: Prof. Dr. Sören Pirk, Dr. Lars Schwabe

Testing, Monitoring and Analysis of AI-based Chat Bots (Master Thesis)

Supervisors: Prof. Dr. Sören Pirk, Jan Monica

Fall 2023 (WS2023/2024)

Predicting Gaussians: 3D Scene Reconstruction using Transformers (Bachelor Thesis)

Supervisors: Prof. Dr. Sören Pirk, M. Sc. Helge Wrede