Overview
Built DAISI (Decathlon AI Supplier Informations), a production enterprise agent on Google Chat for supplier-process questions. It combines agentic business tools, RAG, Gemini, Vertex AI Vector Search, Cloud Run, Cloud SQL, MLflow/Databricks, Model Armor, DLP for sensitive-information protection, OpenTelemetry, and Terraform.
Outcomes: 13,000 hours/year saved and a successful load test with 1000 concurrent users.
The application is currently running in production, serving real users at Decathlon with automated infrastructure management via Terraform and comprehensive observability through MLflow and OpenTelemetry.
Business Impact
| Metric | Value |
|---|---|
| Time Saved | 13,000 hours/year for the suppliers team |
| Load test | Successful test with 1000 concurrent users |
| Availability | 24/7 instant answers vs. waiting for human response |
| Coverage | Grounded on internal procedures and accounting guides |
| Escalation | Automatic redirection to accountants when needed |
Technical Architecture
AI/ML Stack
- LangGraph - Agent orchestration framework with stateful conversation flows
- LangChain - LLM application framework for chains and prompts
- Gemini via Vertex AI - Foundation model for text generation
- Vertex AI Vector Search - High-performance semantic search for RAG
- FAISS - Local vector similarity search
- Model Armor - GCP security templates for anti-prompt injection and content filtering
- DLP - Protection for sensitive information
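To make the retrieval side of this stack concrete, here is a minimal, self-contained sketch of the semantic-search step behind RAG, using hand-made toy embeddings and cosine similarity. In DAISI the embeddings would come from a Vertex AI embedding model and the index would live in Vertex AI Vector Search (or FAISS locally); every name and vector below is illustrative, not taken from the codebase.

```python
import math

# Toy in-memory knowledge base: (doc_id, embedding) pairs with
# hand-made 3-d vectors standing in for real embedding vectors.
KNOWLEDGE_BASE = [
    ("invoice-status", [0.9, 0.1, 0.0]),
    ("supplier-onboarding", [0.1, 0.9, 0.1]),
    ("accounting-contacts", [0.0, 0.2, 0.9]),
]

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

def retrieve(query_embedding, top_k=1):
    """Return the top_k document ids most similar to the query embedding."""
    ranked = sorted(
        KNOWLEDGE_BASE,
        key=lambda doc: cosine(query_embedding, doc[1]),
        reverse=True,
    )
    return [doc_id for doc_id, _ in ranked[:top_k]]
```

A query embedding close to the "invoice" direction, e.g. `retrieve([0.8, 0.2, 0.0])`, ranks `invoice-status` first; the retrieved passages are then fed to Gemini as grounding context.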
Backend Infrastructure
- FastAPI - High-performance async API framework
- Cloud Run - Serverless container deployment with autoscaling
- Cloud SQL PostgreSQL - Managed database for conversation persistence
- Google Cloud Storage - Object storage for knowledge base and configurations
- Uvicorn - ASGI server for FastAPI
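Uvicorn serves FastAPI over the ASGI protocol. As a rough illustration of what that contract looks like underneath, here is a minimal raw ASGI application; FastAPI generates a far richer one, and `health_app` is a hypothetical name, not part of the DAISI codebase.

```python
# Minimal raw ASGI application of the kind Uvicorn can serve directly.
# A framework like FastAPI layers routing, validation, and async I/O
# on top of exactly this (scope, receive, send) interface.
async def health_app(scope, receive, send):
    assert scope["type"] == "http"  # this toy app only handles HTTP
    await send({
        "type": "http.response.start",
        "status": 200,
        "headers": [(b"content-type", b"application/json")],
    })
    await send({"type": "http.response.body", "body": b'{"status": "ok"}'})
```

Pointing Uvicorn at this callable (`uvicorn module:health_app`) would answer every request with a 200 JSON body, which is roughly what a health-check endpoint on Cloud Run does.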
Observability & MLOps
- MLflow 3.7+ - Experiment tracking with LangGraph autologging
- Databricks - MLflow and Delta refresh workflows
- LiteLLM - Unified LLM API gateway
- OpenTelemetry - Distributed tracing with OTLP/gRPC export
- Cloud Logging - Centralized log aggregation
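As a sketch of the tracing idea, the snippet below records named spans with durations using only the standard library. The real service uses the OpenTelemetry SDK with OTLP/gRPC export rather than this toy recorder, and the span names shown are hypothetical.

```python
import time
from contextlib import contextmanager

# Toy span recorder: each traced operation appends a named span with
# its duration. OpenTelemetry does this (plus context propagation and
# export) through its SDK; this only illustrates the nesting concept.
SPANS: list[dict] = []

@contextmanager
def span(name: str):
    start = time.perf_counter()
    try:
        yield
    finally:
        SPANS.append({"name": name, "duration_s": time.perf_counter() - start})

# Nested spans: the retrieval span closes (and is recorded) before the
# enclosing agent span, mirroring a parent/child trace tree.
with span("agent.invoke"):
    with span("retrieval.vector_search"):
        pass
```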
Scheduled Jobs (Cloud Run Jobs)
The application includes automated maintenance jobs orchestrated by Cloud Scheduler:
| Job | Schedule | Purpose |
|---|---|---|
| TTL Cleanup | Daily 03:00 | GDPR compliance - deletes checkpoints and MLflow traces older than 12 months |
| Trace Evaluation | Daily 04:00 | LLM-as-Judge quality scoring with custom scorers (relevance, language consistency, conciseness) |
| Data Sync | On config update | Syncs knowledge base from Google Sheets to GCS |
Infrastructure as Code
- Terraform - Complete GCP infrastructure management with modular design:
  - cloud-run - Main service deployment
  - cloud-run-job - Batch job definitions
  - cloud-scheduler - Scheduled triggers
  - iam - Service accounts and role bindings
  - model-armor - Security template configuration
  - vertex-ai-vector-search - Vector index management
  - storage - GCS bucket configuration
DevOps & CI/CD
- Docker - Containerized deployments
- GitHub Actions - Automated CI/CD pipelines
- SonarCloud - Code quality and security analysis
- Pre-commit hooks - Automated code checks
- Ruff - Python linting and formatting
- Mypy - Static type checking
- Pytest - Async test framework with coverage
Project Management
- Jira - Sprint planning and issue tracking
- Confluence - Technical documentation with auto-sync from /docs
Architecture Diagrams
The three diagrams below were rebuilt from the live architecture reference in the DAISI repository. They make the runtime architecture much easier to read than a plain markdown block.
System Architecture
This view shows the full production path: Google Chat as the user channel, Cloud Run / FastAPI as the runtime boundary, Model Armor on ingress and egress, LangGraph + Gemini for orchestration, DLP for sensitive-information protection, and the split between retrieval, persistence, enterprise APIs, and observability.
Module Dependencies
This slice focuses on the codebase itself: a thin app.py entrypoint, an API layer that brokers requests, the LangGraph agent and tool layer in the middle, and shared foundations for retrieval, state, telemetry, and utilities.
Request Flow
This sequence makes the runtime behavior explicit: verify the Google Chat event, reject duplicates early, acknowledge with a processing card, run the agent loop with retrieval and checkpoint state, then return a grounded answer while logging traces asynchronously. Reference: DAISI architecture README
Core Features
Intelligent Question Answering
RAG-powered responses grounded in operational and accounting practical guides ("fiches pratiques").
Contextual Disambiguation
Multi-turn conversations with follow-up questions to clarify ambiguous requests.
Conversation Memory
PostgreSQL-backed checkpointer for persistent conversation state with IAM authentication.
System Integrations
- IAM - User context (Cost Center, job title, department)
- Accountant Lookup - Automatic assignment to the right contact
- Invoice Status - Real-time invoice status queries
- Purchasing Info - Indirect purchasing information
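Conceptually, each integration above is exposed to the agent as a callable tool. The sketch below shows a simplified registry pattern with a hypothetical `accountant_lookup` tool and made-up contact data; the production agent binds tools through LangGraph/LangChain rather than this toy decorator.

```python
# Simplified tool registry illustrating how enterprise integrations
# (accountant lookup, invoice status, ...) can be exposed to an agent.
TOOLS: dict[str, callable] = {}

def tool(name):
    """Decorator registering a function under a tool name."""
    def register(fn):
        TOOLS[name] = fn
        return fn
    return register

@tool("accountant_lookup")
def accountant_lookup(cost_center: str) -> str:
    # Hypothetical mapping; real data comes from an enterprise API.
    contacts = {"CC-100": "alice@decathlon.com"}
    return contacts.get(cost_center, "accounting-helpdesk@decathlon.com")

def call_tool(name: str, **kwargs):
    """Dispatch a tool call by name, as an agent loop would."""
    return TOOLS[name](**kwargs)
```

The agent decides which tool to call from the conversation; unknown cost centers fall back to a generic helpdesk contact, mirroring the automatic escalation to accountants.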
Security & Compliance
- Model Armor templates for prompt injection protection
- DLP to protect sensitive information
- Prohibited topics detection (salary, HR issues)
- GDPR-compliant data retention with automated TTL cleanup
- Access restricted to @decathlon.com domain
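Prohibited-topic detection can be as simple as a keyword screen in front of the model. The sketch below is a minimal illustration with a hypothetical keyword list; the deployed guardrails rely on Model Armor templates and DLP, not this regex.

```python
import re

# Minimal keyword screen for prohibited topics (salary, HR issues).
# The keyword list is illustrative only.
PROHIBITED = re.compile(r"\b(salary|salaries|payroll|hr complaint)\b", re.IGNORECASE)

def is_prohibited(message: str) -> bool:
    """True when the message touches a prohibited topic."""
    return PROHIBITED.search(message) is not None
```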
Technologies Summary
| Category | Technologies |
|---|---|
| AI/ML | LangGraph, LangChain, Gemini, Vertex AI Vector Search, FAISS, Model Armor, DLP |
| Backend | Python 3.11, FastAPI, Uvicorn, Pydantic |
| Database | Cloud SQL PostgreSQL, langgraph-checkpoint-postgres, psycopg3 |
| Cloud | GCP, Cloud Run, Cloud Run Jobs, Cloud Scheduler, GCS |
| Observability | MLflow, Databricks, LiteLLM, OpenTelemetry |
| Infrastructure | Terraform, Docker |
| CI/CD | GitHub Actions, SonarCloud, Pre-commit |
| Quality | Ruff, Mypy, Pytest, pytest-asyncio |
