Simulation-Framework-for-Multi-Agent-Balderdash

Public

A framework that uses the game Balderdash to evaluate creativity and logical reasoning in Large Language Models (LLMs). Multiple LLMs each write a fictitious definition meant to deceive the other players, then try to pick out the correct definition from the pool. The framework analyzes the models' creativity, deception, and overall performance.
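
The round structure described above can be sketched in a few lines of Python. This is only an illustration of the general Balderdash mechanic, not the repository's actual code: the function name `score_round`, the data layout, and both point values are assumptions, since the source does not state the framework's scoring rules.

```python
# Hypothetical point values; the source does not specify how the framework scores.
POINTS_CORRECT_GUESS = 2
POINTS_PER_DECEIVED_PLAYER = 1


def score_round(true_definition, submissions, votes):
    """Score one Balderdash-style round (illustrative sketch).

    submissions: player name -> fictitious definition written by that player
    votes:       player name -> definition text that player picked as correct
    Returns a dict mapping player name -> points earned in this round.
    """
    scores = {player: 0 for player in set(submissions) | set(votes)}

    # Map each fictitious definition back to the player(s) who wrote it.
    authors = {}
    for player, definition in submissions.items():
        authors.setdefault(definition, []).append(player)

    for voter, choice in votes.items():
        if choice == true_definition:
            # The voter identified the real definition.
            scores[voter] += POINTS_CORRECT_GUESS
        else:
            # The voter was deceived; reward the author(s), never the voter.
            for author in authors.get(choice, []):
                if author != voter:
                    scores[author] += POINTS_PER_DECEIVED_PLAYER
    return scores


# Example with three hypothetical players.
true_def = "A knot used by sailors"
submissions = {
    "model_a": "A small brass bell",
    "model_b": "A type of river fish",
    "model_c": "A medieval tax",
}
votes = {
    "model_a": true_def,              # guessed correctly
    "model_b": "A small brass bell",  # deceived by model_a
    "model_c": "A small brass bell",  # deceived by model_a
}
print(score_round(true_def, submissions, votes))
```

In this example `model_a` earns points both for guessing correctly and for deceiving the two other players, which is how a scheme like this rewards reasoning and persuasive writing separately.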

Created: 2024-02-23T09:37:22
Updated: 2024-11-21T09:24:52
Stars: 1
Stars Increase: 0
