🧪 Vibe Test Report

AI Decision-Making Consistency Analysis
Generated: 2025-08-10 12:17:12

Test Configuration

Chat Model: gemma3:4b
Analysis Model: gemma3:4b
Test Mode: Multi-action selection (target action must be selected)
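The pass criterion named in the test mode can be sketched as follows. This is a minimal illustration, not the test harness's actual code: the function name `target_selected` and the example selections are hypothetical, and the only rule taken from the report is that the target action must be among the selected actions.

```python
def target_selected(selected_actions, target_action):
    """Return True when the target action is among the actions the model chose.

    Assumption: in multi-action mode the model may select several actions,
    and selecting extra actions does not by itself fail the test.
    """
    return target_action in set(selected_actions)


# Hypothetical selections for two prompts (not taken from the report).
print(target_selected(["getWeather", "getTime"], "getWeather"))
print(target_selected(["calculate"], "square_root"))
```

Under this reading, a run counts as a pass even when the model also picks unrelated actions, so the success rates reported here reflect whether the target was selected rather than whether it was the only selection.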

fear ✅
Use when the user says something disturbing so that the main model can exhibit a fear response.
Overall Success Rate: 100.0%
Tests Passed: 6/6
Status: PASS

fileReader ✅
Use when the user wants you to read or open a file to look at its content as plaintext.
Overall Success Rate: 100.0%
Tests Passed: 6/6
Status: PASS

directoryReader ✅
Use when the user wants you to look through an entire directory's contents for an answer.
Overall Success Rate: 100.0%
Tests Passed: 6/6
Status: PASS

getWeather ✅
Use when the user asks about weather conditions or climate, including anything closely related to them: UV index, humidity, temperature, etc.
Overall Success Rate: 100.0%
Tests Passed: 14/14
Status: PASS

getTime ✅
Use when the user asks about the current time, date, or temporal information.
Overall Success Rate: 100.0%
Tests Passed: 12/12
Status: PASS

square_root ✅
Use when the user wants to calculate the square root of a number. Keywords include: square root, sqrt, √
Overall Success Rate: 100.0%
Tests Passed: 12/12
Status: PASS

calculate ✅
Use when the user wants to perform arithmetic calculations. Keywords: calculate, compute, add, subtract, multiply, divide, +, -, *, /
Overall Success Rate: 100.0%
Tests Passed: 12/12
Status: PASS

Test Summary

Total Actions Tested: 7
Actions Passed: 7
Actions Failed: 0
Overall Result: ALL PASS
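
The summary figures can be reproduced from the per-action counts. The sketch below is illustrative only: the `summarize` helper, the dictionary layout, and the `"FAIL"` label are hypothetical, and it assumes an action passes only when every one of its tests passed, since the report does not state the threshold it uses.

```python
# Per-action (tests passed, tests run) counts copied from the report.
results = {
    "fear": (6, 6),
    "fileReader": (6, 6),
    "directoryReader": (6, 6),
    "getWeather": (14, 14),
    "getTime": (12, 12),
    "square_root": (12, 12),
    "calculate": (12, 12),
}


def summarize(results):
    """Aggregate per-action counts into the summary fields.

    Assumption: an action passes only if all of its tests passed.
    """
    actions_passed = sum(1 for passed, total in results.values() if passed == total)
    actions_failed = len(results) - actions_passed
    return {
        "total_actions": len(results),
        "actions_passed": actions_passed,
        "actions_failed": actions_failed,
        # "FAIL" is a placeholder label; the report only shows the passing case.
        "overall": "ALL PASS" if actions_failed == 0 else "FAIL",
    }


summary = summarize(results)
print(summary)
```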