Proposal: Secure, Compliant, Zero-Latency Logging for Model Gateway
Depends on the outcome of this discussion: https://gitlab.com/gitlab-org/modelops/applied-ml/code-suggestions/ai-assist/-/issues/357#note_1759867401
Problems to solve
- We want to log prompts and outputs in the model-gateway (ai-gateway) FastAPI webapp
- We don't want to log secrets, keys, PII and other sensitive data
- We don't want to introduce latency when redacting sensitive data from logs
- Destination: the production Kibana instance
Related Issues and MRs
- https://gitlab.com/gitlab-org/modelops/applied-ml/code-suggestions/ai-assist/-/issues/357
- !519 (closed; not to be merged)
Proposal
- Use FastAPI's Background Tasks to write logs after the response has been returned, so the latency impact is zero to negligible (see the sketch after this list)
  - https://fastapi.tiangolo.com/tutorial/background-tasks/
- Use middleware to inject the logger background task on all routes: https://fastapi.tiangolo.com/advanced/middleware/
  - Are there any exceptions, i.e. routes we do not want logged?
- Implement the logger background task, which sanitizes and redacts secrets, PII, etc. from logs before publishing to Kibana
- Try out major Python libraries and measure their efficacy:
  - Compile a significant test dataset to verify the solution
  - Seek advice from the legal/compliance teams to verify the solution
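A minimal sketch of the middleware-plus-background-task approach, using Starlette's `BaseHTTPMiddleware` and `BackgroundTask` (which FastAPI builds on); `redact_and_log` and the fields it logs are placeholders, not the actual ai-gateway logger:

```python
from fastapi import FastAPI, Request
from starlette.background import BackgroundTask
from starlette.middleware.base import BaseHTTPMiddleware

app = FastAPI()


def redact_and_log(method: str, path: str, status_code: int) -> None:
    # Placeholder: the real task would sanitize prompt/output payloads with
    # the chosen redaction library before handing them to the structured logger.
    print(f"{method} {path} -> {status_code}")


class AccessLogMiddleware(BaseHTTPMiddleware):
    async def dispatch(self, request: Request, call_next):
        response = await call_next(request)
        # Attach the logging work as a background task: Starlette runs it only
        # after the response has been sent, so the client sees no added latency.
        response.background = BackgroundTask(
            redact_and_log, request.method, request.url.path, response.status_code
        )
        return response


app.add_middleware(AccessLogMiddleware)
```

Two caveats the spike should validate: capturing the actual prompt/output bodies needs care because request and response bodies are streams inside middleware, and assigning `response.background` overwrites any background task a route handler has already set, so a helper that chains tasks may be needed.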
Guiding Principles
- Leverage the ecosystem
  - Use mechanisms offered by FastAPI (ai-gateway is a FastAPI app, after all) such as middleware, background tasks, etc.
  - Use redaction libraries available in Python (candidate libraries are sketched after this list)
  - Redacting secrets and PII is a solved problem in Python
  - Apart from handling rare edge cases, there is very little need to reinvent the wheel
- Transparent tests and results
  - Write comprehensive, near-real-world tests
  - Publish for internal approvals (legal, compliance, etc.)
  - Publish externally for trust and transparency
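For illustration, two commonly used Python redaction libraries and their out-of-the-box APIs; which (if either) we adopt should follow from the efficacy tests in the next steps:

```python
# Candidate 1: scrubadub, regex/heuristic-based PII scrubbing.
import scrubadub

print(scrubadub.clean("Contact me at jane.doe@example.com"))
# -> "Contact me at {{EMAIL}}"

# Candidate 2: Microsoft Presidio, NER-based detection plus anonymization.
from presidio_analyzer import AnalyzerEngine
from presidio_anonymizer import AnonymizerEngine

analyzer = AnalyzerEngine()
anonymizer = AnonymizerEngine()

text = "My name is Jane and my phone number is 212-555-5555"
results = analyzer.analyze(text=text, language="en")
print(anonymizer.anonymize(text=text, analyzer_results=results).text)
```

Neither library targets secrets and API keys specifically, so one of them would likely need to be combined with a secrets-pattern scanner (e.g. the regex corpus behind detect-secrets or GitLab's own secret detection rules).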
Where and what to patch and test
- FastAPI middleware access logger (a redaction-processor sketch for the structured logger follows this list)
  - https://gitlab.com/gitlab-org/modelops/applied-ml/code-suggestions/ai-assist/-/blob/main/ai_gateway/api/middleware.py?ref_type=heads#L153
  - https://gitlab.com/gitlab-org/modelops/applied-ml/code-suggestions/ai-assist/-/blob/main/ai_gateway/structured_logging.py?ref_type=heads#L45
  - https://gitlab.com/gitlab-org/modelops/applied-ml/code-suggestions/ai-assist/-/blob/main/ai_gateway/app.py?ref_type=heads#L39
- API V2 Snowplow events tracker
- API V3 structured logging
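If the structured logger is structlog-based (as structured_logging.py suggests), one natural patch point is a custom processor that scrubs every event before it reaches the renderer. A minimal sketch, with purely illustrative regex patterns (the real set would come from the library chosen above):

```python
import re

import structlog

# Illustrative patterns only; the production set would come from the chosen
# redaction library plus a secrets-pattern corpus.
_PATTERNS = [
    (re.compile(r"glpat-[0-9a-zA-Z_\-]{20}"), "[GITLAB_PAT]"),
    (re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"), "[EMAIL]"),
]


def redact_event(logger, method_name, event_dict):
    """structlog processor: scrub string values before the renderer sees them."""
    for key, value in event_dict.items():
        if isinstance(value, str):
            for pattern, replacement in _PATTERNS:
                value = pattern.sub(replacement, value)
            event_dict[key] = value
    return event_dict


structlog.configure(
    processors=[
        redact_event,
        structlog.processors.JSONRenderer(),
    ]
)

structlog.get_logger().info("completion", prompt="reach me at jane@example.com")
# -> {"prompt": "reach me at [EMAIL]", "event": "completion"}
```

Note this processor runs synchronously in the logging call, so the heavy lifting (NER models, etc.) should still happen inside the background task; the processor is a cheap last line of defense.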
Next steps
- Compile a comprehensive dataset of near-real-world test data
- Create redactors using various libraries and combinations of libraries
- Test, review the results, and identify the best redactors (see the benchmark sketch after this list)
- Introduce the logger as a background task on handpicked endpoints/routes
- Test for latency impact
- Introduce the logger as a background task on all endpoints using middleware
- Test, measure latency, release
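A minimal sketch of the evaluation harness these steps imply, measuring both efficacy (exact match against a labeled dataset) and per-call latency; the dataset, `naive_redactor`, and the scoring criterion are all placeholders:

```python
import re
import time
from typing import Callable

# Placeholder labeled dataset: (raw input, expected redacted output) pairs.
# The real one would be the comprehensive near-real-world corpus above.
DATASET = [
    ("my token is glpat-aaaaaaaaaaaaaaaaaaaa", "my token is [GITLAB_PAT]"),
    ("email jane.doe@example.com for access", "email [EMAIL] for access"),
]


def naive_redactor(text: str) -> str:
    # Placeholder redactor; real candidates would wrap scrubadub, Presidio, etc.
    text = re.sub(r"glpat-[0-9a-zA-Z_\-]{20}", "[GITLAB_PAT]", text)
    return re.sub(r"[\w.+-]+@[\w-]+\.[\w.-]+", "[EMAIL]", text)


def evaluate(redactor: Callable[[str], str], dataset) -> tuple[float, float]:
    """Return (accuracy, mean seconds per call) for one candidate redactor."""
    hits = 0
    start = time.perf_counter()
    for raw, expected in dataset:
        hits += redactor(raw) == expected
    elapsed = time.perf_counter() - start
    return hits / len(dataset), elapsed / len(dataset)


accuracy, latency = evaluate(naive_redactor, DATASET)
print(f"accuracy={accuracy:.2%}, mean latency={latency * 1e6:.1f} us/call")
```

Exact match is the crudest possible metric; per-entity precision/recall would give a fairer picture of each redactor, but the harness shape stays the same.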