RULER
A benchmark for evaluating long-context language models.
RULER Visits Over Time
Monthly Visits: 23,904,807
Bounce Rate: 43.33%
Pages per Visit: 5.8
Visit Duration: 00:04:51