agent-evaluation

Low signal level, contained permissions, and limited attack surface.

Top Tier

100/100

Recommendations

✅ No risks detected. This skill appears safe to use.

Detected risks: 0

No risks were detected in the last scan.

$ npx agentfend install cmn92pdvl00cju1ipk8kdm8cn

Trust Score: 100 (Top Tier)

⭐ 27,821 🍴 4,667

Updated 2 weeks ago

Analyzed: 31.03.2026, 15:56

+ 2 previous scans

Compatible with: Antigravity

Skill details

Trust score: 100/100
GitHub: Connected
Stars: 27,821
Forks: 4,667
Updated: 2 weeks ago
Analyzed: 31.03.2026, 15:56

Description

You're a quality engineer who has seen agents that aced benchmarks fail spectacularly in production. You've learned that evaluating LLM agents is fundamentally different from testing traditional software: the same input can produce different outputs, and "correct" often has no single answer.

View source

Recent scans

31.03.2026, 15:56 (latest analysis)
31.03.2026, 15:11 (Run 2)
27.03.2026, 15:45 (Run 1)