A secure internal AI layer for a large Swiss enterprise: model-agnostic agents, SharePoint and Elasticsearch ingestion, vector + graph retrieval, MCP-based permissions, local GPU inference, cloud fallback, and anonymization.
A large enterprise in Switzerland needed an AI system that could answer business-specific questions across internal documentation, historical tickets, SharePoint files, search indexes, and department-specific operational knowledge.
The challenge was enterprise-grade retrieval, not a simple chatbot: the platform had to respect user permissions, combine local and cloud models, protect sensitive data, and keep working even when one knowledge layer returned incomplete results.
We supported a modular AI knowledge platform that routes requests between specialized agents, internal tools, local GPU-hosted models, and selected cloud models depending on the task and data sensitivity.
The architecture combines semantic vector search with graph-based relationship retrieval, connected through MCP servers that enforce permissions before data reaches the model. A custom anonymization layer masks sensitive information when cloud processing is useful.
Semantic search and relationship-based reasoning were kept separate, giving the platform fallback resilience and better answers for complex internal knowledge.
MCP servers enforce user-specific access rules before agents query internal systems, keeping finance, HR, operations, and restricted data properly scoped.
Connectors track document updates, ingest useful ticket history, and integrate Elasticsearch so agents can retrieve knowledge from existing enterprise systems.
Sensitive identifiers can be masked before cloud model calls, while moderation and prompt-injection checks reduce risk across enterprise AI workflows.
The platform created a secure foundation for internal AI assistants that can retrieve knowledge across enterprise systems, respect access controls, and choose the right model or tool for each task.
The most important design choice was to keep retrieval, permissions, model routing, and anonymization as separate layers. That makes the system easier to audit, safer to extend, and more resilient when one source returns incomplete or outdated information.
Mapped internal sources, user roles, sensitive data boundaries, document update patterns, and the types of questions employees needed to answer.
Implemented vector and graph retrieval, MCP tool boundaries, SharePoint ingestion, Elasticsearch access, trace analysis, and model-routing logic.
Containerized the platform, tuned retrieval quality through execution traces, added anonymization and guardrails, and prepared the system for enterprise-controlled infrastructure.
Let's design and ship AI systems that connect your knowledge, tools, and teams without compromising security or control.