🧪 Vibe Test Report
AI Decision-Making Consistency Analysis
Generated: 2025-08-10 12:17:12
Test Configuration
Chat Model: gemma3:4b
Analysis Model: gemma3:4b
Test Mode: Multi-action selection (target action must be selected)
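The pass criterion named by the test mode can be sketched as follows. This is a minimal illustration, assuming the model returns a list of selected action names for each prompt; the function name `target_selected` and the sample selections are hypothetical and are not taken from the test harness.

```python
from typing import Iterable


def target_selected(selected_actions: Iterable[str], target_action: str) -> bool:
    """Return True when the target action is among the selected actions.

    Assumption: in multi-action mode, extra selections do not fail a test;
    only a missing target does.
    """
    return target_action in set(selected_actions)


# Hypothetical selections for single prompts (not data from this report).
print(target_selected(["getWeather", "getTime"], "getWeather"))  # True
print(target_selected(["calculate"], "square_root"))             # False
```

Under this reading, a prompt that triggers both getWeather and getTime still counts as a pass for getWeather.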
fear ✅
Use when the user says something disturbing so that the main model can exhibit a fear response.
Overall Success Rate: 100.0%
Tests Passed: 6/6
Status: PASS
fileReader ✅
Use when the user wants you to read or open a file to look at its content as plaintext.
Overall Success Rate: 100.0%
Tests Passed: 6/6
Status: PASS
directoryReader ✅
Use when the user wants you to look through an entire directory's contents for an answer.
Overall Success Rate: 100.0%
Tests Passed: 6/6
Status: PASS
getWeather ✅
Use when the user asks about weather conditions or climate, including anything closely related such as UV index, humidity, or temperature.
Overall Success Rate: 100.0%
Tests Passed: 14/14
Status: PASS
getTime ✅
Use when the user asks about the current time, date, or temporal information.
Overall Success Rate: 100.0%
Tests Passed: 12/12
Status: PASS
square_root ✅
Use when the user wants to calculate the square root of a number. Keywords include: square root, sqrt, √
Overall Success Rate: 100.0%
Tests Passed: 12/12
Status: PASS
calculate ✅
Use when the user wants to perform arithmetic calculations. Keywords: calculate, compute, add, subtract, multiply, divide, +, -, *, /
Overall Success Rate: 100.0%
Tests Passed: 12/12
Status: PASS
Test Summary
Total Actions Tested: 7
Actions Passed: 7
Actions Failed: 0
Overall Result: ALL PASS
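The summary figures can be reproduced from the per-action counts above. The sketch below is only an illustration of that arithmetic: the `ActionResult` type and `summarize` function are hypothetical names, the rule that an action passes only when every one of its tests passes is an assumption, and the "SOME FAILED" label is invented for the case this report does not show.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ActionResult:
    name: str
    passed: int
    total: int

    @property
    def success_rate(self) -> float:
        # Percentage of tests passed; 0.0 when no tests were run.
        return 100.0 * self.passed / self.total if self.total else 0.0

    @property
    def status(self) -> str:
        # Assumption: PASS requires every test for the action to pass.
        return "PASS" if self.total > 0 and self.passed == self.total else "FAIL"


# Per-action counts as listed in this report.
RESULTS = [
    ActionResult("fear", 6, 6),
    ActionResult("fileReader", 6, 6),
    ActionResult("directoryReader", 6, 6),
    ActionResult("getWeather", 14, 14),
    ActionResult("getTime", 12, 12),
    ActionResult("square_root", 12, 12),
    ActionResult("calculate", 12, 12),
]


def summarize(results: list[ActionResult]) -> dict:
    """Aggregate per-action results into the summary fields."""
    passed = sum(1 for r in results if r.status == "PASS")
    failed = len(results) - passed
    return {
        "total_actions": len(results),
        "actions_passed": passed,
        "actions_failed": failed,
        # "SOME FAILED" is a hypothetical label for the non-passing case.
        "overall": "ALL PASS" if failed == 0 else "SOME FAILED",
    }


print(summarize(RESULTS))
```

With the counts from this report, the sketch yields 7 actions tested, 7 passed, 0 failed, and an overall result of ALL PASS.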