
ai-agency-evals

Public

An evaluation suite that operationalizes three papers on LLM behavior (The Polite Liar, Delegated Introspection, and Observer-Time; all under review).

Created: 2025-10-09T17:52:59
Updated: 2025-10-10T06:00:34
Stars: 0
Stars increase: 0

Related projects