KIVEDU

Mar 1, 2023 - Feb 28, 2025

My Role: Technical Project Lead

Technologies

PythonJavaScriptFastAPIPyTorchlangchainHuggingfaceDockerSupabaseNuxt.jsTailwind CSSVue.jsDaisyUIGraphile WorkerPostgreSQLtransformersvllm

Project Overview

KIVEDU is a research project funded by the Ministry of Economy, Labor, and Tourism of Baden-Württemberg, developed in collaboration with XPACE GmbH, Law & Audrey UG, and the University of Pforzheim. The project addresses a growing issue in e-commerce: identifying and enforcing consumer rights violations. Using AI-powered tools, KIVEDU automates tasks like detecting legal infractions, securely archiving evidence, and presenting actionable insights for consumer protection agencies.

As the technical project lead, I was responsible for designing the system architecture, leading a team of four developers, and ensuring the platform met both technical and regulatory requirements. One key aspect of the project was building a solution flexible enough to run in the cloud during development but fully on-premises for production, including hosting self-managed LLMs.

My Role

System Design: Designed the platform architecture to be scalable, event-driven, and modular, supporting both cloud and fully on-premises deployments. This included configuring infrastructure for hosting a self-managed open-weight LLM during testing.
Team Leadership: Led a team of four developers, overseeing implementation, assigning tasks, and ensuring deadlines were met.
Web-Based Platform Development: Built the platform using modern web technologies like Nuxt.js and Supabase, with a focus on user experience and efficient workflows.
AI-Powered Analysis: Integrated LLMs to automatically identify violations of cease-and-desist orders in German product descriptions, achieving high accuracy in legal infraction detection.
Semantic Web Scraping: Developed tools for crawling and scraping e-commerce platforms to extract and analyze product data.
Immutable Archival: Implemented a secure archival system for storing evidence, including full-page screenshots and static exports, ensuring tamper-proof data storage for legal purposes.
Event-Driven Workflow: Used event-driven processing to enable scalable and asynchronous operations, utilizing Azure services during development and transitioning to on-premises infrastructure for production.
LLM Research: Conducted research into optimizing LLMs for consumer rights enforcement, leading to three published papers presented at international conferences.

Technical Project Description

KIVEDU is built as a distributed microservices system with asynchronous event-driven processing. It consists of multiple components:

Backend: Built with Supabase (PostgreSQL), with triggers that create jobs for processing new entities.
Worker Infrastructure: Uses Graphile Worker to handle processing queues and execute various asynchronous tasks.
Crawling & Scraping: Uses Crawlee and Playwright to extract data from e-commerce stores.
AI Models:
- Text Extraction & Information Retrieval: OCR and structured data extraction from uploaded PDFs of cease-and-desist declarations.
- Violation Detection: Uses LLMs for classifying whether an e-commerce website violates consumer rights.
- Title Generation: AI-generated semantic document titles for user-friendly overviews.
Frontend: Built with Nuxt.js and Vue.js, styled with Tailwind CSS and DaisyUI, providing a seamless UI for users.
Deployment & Infrastructure:
- Development in the cloud using Azure.
- Production on-premises for privacy-sensitive environments, including self-hosted LLMs using vLLM.

KIVEDU Process

Challenges

Scalability: The system needed to support large-scale web crawling and scraping without breaking under load.
AI Model Accuracy: Optimizing violation detection models required extensive evaluation, prompt tuning, and fine-tuning on domain-specific datasets.
Data Privacy & Security: The system was designed for full on-prem deployment due to regulatory constraints, requiring specialized infrastructure setups.
Archival Integrity: Ensuring that archived evidence was immutable and could be presented in legal contexts without risk of tampering.
User Experience: Designing a UI that made complex legal workflows accessible to non-technical users.

Achievements

High-Accuracy Violation Detection: Developed AI models capable of detecting violations with ~90% accuracy.
Scalable Event-Driven System: Designed a backend architecture that scales seamlessly as workloads increase.
On-Premises Deployment Success: Successfully transitioned from cloud-based development to fully on-premises production infrastructure.
Research Publications: Published three research papers on AI for consumer rights enforcement, presented at international conferences.
Stakeholder Engagement: Presented the system to consumer protection agencies and industry professionals, generating significant interest.

Current Status

The KIVEDU platform is currently in the final stages of testing, transitioning fully to on-prem infrastructure for production deployment. The project has been well received by stakeholders, and ongoing refinements are being made to further improve AI accuracy and user workflows.