Evaluations

llm-eval-iras

Configure the model routing rules and the test cases, then run them. The router is deterministic and free; running a case calls the routed model and grades the answer against your keywords. Edits stay in your browser.

Model routing rules

First rule whose any keyword appears in the query wins, otherwise the fallback. This is the deterministic router the assistant uses.

Fallback

routes to GPT-4o mini factual-lookup

Test cases

Results

Edit the rules and cases above, then click Run to populate the stats.