Home
Information

AI Dataset Collection

Large-scale datasets and benchmarks for training, evaluating, and testing models to measure

Tools

Intelligent Document Recognition

Comprehensive Text Extraction and Document Processing Solutions for Users

AI Tutorial

SeeAct

Public

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Creat2023-12-22T02:22:11
Update2025-03-26T19:27:20
https://osu-nlp-group.github.io/SeeAct/
788
Stars
0
Stars Increase