llm-eval-iras

Evaluations

Configure the model routing rules and the test cases, then run them. The router is deterministic and free; running a case calls the routed model and grades the answer against your keywords. Edits stay in your browser.

Model routing rules

Rules are checked in order: the first rule with at least one keyword that appears in the query wins; if no rule matches, the fallback is used. This is the deterministic router the assistant uses.

Fallback
routes to GPT-4o mini (factual-lookup)
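
The first-match-wins behavior described above can be sketched in a few lines. This is a minimal illustration, not the app's actual code: the `Rule` type, the `route` function, the example keywords, and the `model-a` / `model-b` names are all hypothetical, and case-insensitive substring matching is an assumption. Only the GPT-4o mini fallback comes from this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """One routing rule (hypothetical shape): any keyword hit selects the model."""
    keywords: tuple[str, ...]
    model: str


def route(query: str, rules: list[Rule], fallback: str) -> str:
    """Return the model of the first rule with a keyword found in the query.

    Matching is a case-insensitive substring test (an assumption).
    If no rule matches, the fallback model is returned.
    """
    lowered = query.lower()
    for rule in rules:  # order matters: the first matching rule wins
        if any(keyword.lower() in lowered for keyword in rule.keywords):
            return rule.model
    return fallback


# Hypothetical rules for illustration only.
RULES = [
    Rule(keywords=("summarize", "tl;dr"), model="model-a"),
    Rule(keywords=("calculate", "compute"), model="model-b"),
]
FALLBACK = "GPT-4o mini"

if __name__ == "__main__":
    for q in ("Please summarize this memo", "Compute the total", "Hello there"):
        print(q, "->", route(q, RULES, FALLBACK))
```

Because the function has no randomness and makes no model call, the same query and rule list always produce the same route, which is what makes the router cheap to test on its own.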

Test cases
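
Running a case grades the model's answer against your keywords. The page does not say how the grade is computed, so the sketch below assumes the simplest scheme: a case passes only if every expected keyword appears in the answer, compared case-insensitively. The `grade` function and its result fields are hypothetical names.

```python
def grade(answer: str, expected_keywords: list[str]) -> dict:
    """Check an answer against expected keywords (assumed all-must-match scheme).

    Returns which keywords were found, which were missing, and whether
    the case passed (no keyword missing).
    """
    text = answer.lower()
    hits = [kw for kw in expected_keywords if kw.lower() in text]
    missing = [kw for kw in expected_keywords if kw.lower() not in text]
    return {"passed": not missing, "hits": hits, "missing": missing}


if __name__ == "__main__":
    print(grade("Paris is the capital of France.", ["paris", "france"]))
    print(grade("It is Paris.", ["paris", "france"]))
```

A stricter or looser grader (for example, passing when a minimum fraction of keywords match) would only change how `passed` is derived from `hits` and `missing`.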

Results

Edit the rules and cases above, then click Run to populate the stats.