groegman's portfolio

Engineer with a vibe

Logo

Senior Engineering Data Expert

AI / Data Engineer

over a decade of experience in automotive develoment/manufacturing with OEMs & suppliers

M.Sc in Engineering, RWTH Aachen

Certified Data Scientist, DATACAMP.com

Certified Scrum Master, SCRUM.org

Skills

AI / ML

Agentic AI Chroma FAISS Jupyter LLMs Langchain Langflow NLP Ollama OpenAI API Python RAG Scikit-Learn spaCy

DATA SCIENCE

Dashboards ETL Jupyter Pandas Plotly PowerBI Python SQL Streamlit

DEVELOPMENT

Apache Cursor Django Docker Git Linux NGINX Pandas PowerAutomate Python Supervisord VSCode

MECHANICAL ENGINEERING

CAD Change-Management Concept Design Diecast Joiningtechnologies Production Sheetmetal Stakeholder-Management Supplier Management

Portfolio Projects

MMA Predictor — From Data to Deployment

MMA Predictor — From Data to Deployment

Designed and delivered an end‑to‑end machine learning pipeline for UFC fight outcome prediction — including data scraping from UFCSTATS.com, feature engineering, model benchmarking, and deployment as a live web app.

Docker ETL Git Jupyter Pandas Python Scikit-Learn Streamlit

Factory AI Assistant — RAG‑powered troubleshooting prototype for industrial equipment

Factory AI Assistant — RAG‑powered troubleshooting prototype for industrial equipment

Designed and delivered a retrieval‑augmented prototype that combines a simulated machine park with indexed technical documentation to provide grounded troubleshooting and maintenance guidance — including document ingestion & indexing, embedding generation, chroma vector store, similarity search, LLM grounding with few‑shot prompting, session memory and a Streamlit chat UI.

Agentic AI Chroma LLMs Langchain OpenAI API Production Python RAG Streamlit

Lego YouTube Reviews

Lego YouTube Reviews

Are Lego Youtube Reviews biased?

I analyzed over 1,000 videos using LLMs to detect sentiment and bias patterns.

The result: interactive dashboards powered by NLP and Langchain, built with Streamlit and Pandas.

Dashboards ETL Langchain NLP Ollama Pandas Python Streamlit

Interactive Portfolio Page

Interactive Portfolio Page

A Dockerized Django web app showcasing projects and skills.

It integrates multiple Streamlit dashboards via Supervisord and reverse-proxies through NGINX – all running in a single container.

Django Docker Git NGINX Python Supervisord

Exploratory Data Analysis – YouTube Review Bias

Exploratory Data Analysis – YouTube Review Bias

Using a custom Lego dataset, I trained a model to predict whether a YouTube video is sponsored based on sentiment and metadata of the reviewed Lego set.

The project combines scikit-learn classification with feature importance analysis to identify what makes a review trustworthy. Results are presented in a Jupyter Notebook and visualized through interactive dashboards.

Jupyter Pandas Plotly Python Scikit-Learn
UNDER DEVELOPMENT

AI Engineering Agent

AI Engineering Agent

Langchain-powered assistant for engineers, built on synthetic product data from a fictional body-in-white development project.

It enables natural language queries over structured data like part metadata, supplier history, and development stages – with SQL-backed responses and dynamic graphs.

Agentic AI LLMs OpenAI API Plotly SQL
UNDER DEVELOPMENT

Staffing Assistant – Powered by RAG Setup

Staffing Assistant – Powered by RAG Setup

An AI-powered assistant designed to intelligently match engineers to new projects based on synthetic employee data. Leveraging a Retrieval-Augmented Generation (RAG) architecture, the system combines FAISS vector search with SQL-backed metadata to align skills, experience, and project requirements. Users interact via natural language, and the agent responds with structured assignments, timelines, and team compositions – all visualized through dynamic graphs.

Agentic AI FAISS LLMs Langchain NLP OpenAI API Python RAG SQL
UNDER DEVELOPMENT