Added information about guardrails by cableman · Pull Request #6 · os2ai/documentation

cableman · 2026-05-11T08:15:21Z

Added new doc about the message trim guardrail
Added support for mermaid diagrams in this docs site

hypesystem · 2026-06-03T08:55:42Z

Added a note related to this here os2ai/helm-deployments#28 (comment)

I'll review this with my feedback in mind.

lilosti · 2026-06-03T09:54:41Z

@SigneA-hm this should be reviewed and merged

lasseborly · 2026-06-08T08:03:02Z

I'll withdraw from this review and let's @hypesystem take the wheel.

hypesystem

Nice, I like the documentation overall, it gives me a good idea of the general approach.

I've added some notes, mostly where I think some more abstract/broader context is warranted, so readers are more likely to understand not just what we are doing, but why 😄

hypesystem · 2026-06-08T10:27:18Z

+    - `_repair_tool_call_pairings` — strip orphan `role: tool` messages and orphan `tool_calls` entries that the trimmer
+      may have created.
+    - (Optional, opt-in via `pop_trailing_tool_messages`) pop trailing `role: tool` messages and re-run the repair, then
+      append `"Please continue"` if the new terminus is an assistant message.


"Please continue" feels like it could skew the output, especially if the language in the context window otherwise isn't English?

hypesystem · 2026-06-08T10:29:07Z

+
+The message trimming guardrail can be configured in the litellm
+values [file](https://github.com/os2ai/helm-deployments/blob/develop/applications/litellm/litellm-values.yaml#L108)
+configuration file in the helm chart.


A note on why we need message trimming would make sense after this paragraph, just very briefly. E.g. what happens if oversized message histories are not trimmed, and how does the guardrail avoid it? In a sentence of two.

Added comment about this to the doc

hypesystem · 2026-06-08T10:32:52Z

+### Why the trailing-tool pop is opt-in
+
+The "normal" agent-loop shape ends on a `role: tool` message:
+
+```mermaid
+flowchart LR
+    U[User] --> A["Assistant{tool_calls}"]
+    A --> T["Tool{result}"]
+    T --> C([model is asked to continue here])
+```
+
+Most providers (OpenAI, Anthropic, Google, Mistral via the official APIs) __accept__ this shape — that's how tool
+calling works. Popping the tool message and substituting `"Please continue"` deprives the model of the result it was
+supposed to reason from, so the default is __off__.


I don't understand this section. Specifically Popping the tool message and substituting "Please continue" deprives the model of the result it was supposed to reason from, so the default is __off__. doesn't really tell me what happens in the cases where the setting is enabled vs disabled, and what exactly the default behavior is.

What is the effect of depriving the model of the result it was supposed to reason from? (Am I understanding it correctly that this refers to "the result of the tool call", and if so, could we call it that?)

Tried to explain it better

hypesystem · 2026-06-08T10:37:59Z

+
+## How Message Trimming works
+
+`async_pre_call_hook` runs on every chat completion request. The flow:


Before the step-by-step flow, I think a sentence stating what the pros and cons of our approach is would make sense.

E.g. "Sending a too large message to the model can be fatal for the entire conversation, so we take a conservative approach in estimating a safe completion budget for the message" and then explain what a safe completion budget is, why we calculate it as is? I think that would give a lot of good context for evaluating the approach.

Intro section added

hypesystem

As discussed in today's meeting, here's a suggested text explaining the context between Open WebUI and LiteLLM and how guardrails are attached.

cableman · 2026-06-16T07:36:36Z

Tried to answer the questions as good as I can.

cableman requested a review from SigneA-hm May 11, 2026 08:15

cableman force-pushed the feature/guardrails branch from 9a5d7bb to 0ee7642 Compare May 11, 2026 08:18

lilosti requested a review from lasseborly May 11, 2026 08:23

hypesystem self-requested a review June 3, 2026 08:56

lasseborly removed their request for review June 8, 2026 08:02

lilosti mentioned this pull request Jun 8, 2026

Updated message trim to handle message better and support tool calling. os2ai/helm-deployments#28

Open

hypesystem reviewed Jun 8, 2026

View reviewed changes

hypesystem reviewed Jun 10, 2026

View reviewed changes

Comment thread technical/guardrails.md

cableman force-pushed the feature/guardrails branch from 0ee7642 to d10dc26 Compare June 16, 2026 06:27

Added information about guardrails

ea8c468

cableman force-pushed the feature/guardrails branch from d10dc26 to ea8c468 Compare June 16, 2026 06:29

Added intro text to guardrails

ad9bb8d

cableman force-pushed the feature/guardrails branch from 0df92e6 to ad9bb8d Compare June 16, 2026 06:33

cableman added 4 commits June 16, 2026 08:55

Added link to config section

2dafa00

Added short guardtail backgrund

3c0b1c6

Added more context about pop_trailing_tool_messages

69c17d9

Added intro seciton to "how messeage trim works"

6661f98

cableman requested a review from hypesystem June 16, 2026 07:35


		## How Message Trimming works

		`async_pre_call_hook` runs on every chat completion request. The flow:

Conversation

cableman commented May 11, 2026

Uh oh!

hypesystem commented Jun 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lilosti commented Jun 3, 2026

Uh oh!

lasseborly commented Jun 8, 2026

Uh oh!

hypesystem left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

hypesystem Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

hypesystem Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

cableman Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

hypesystem Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

cableman Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

hypesystem Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

cableman Jun 16, 2026

Choose a reason for hiding this comment

Uh oh!

hypesystem left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

cableman commented Jun 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

hypesystem commented Jun 3, 2026 •

edited

Loading