- Published on
The paper introduces Agent S, an innovative open agentic framework designed to enable autonomous interaction with computers through a Graphical User Interface (GUI). This framework aims to revolutionize human-computer interaction by automating complex, multi-step tasks, addressing three key challenges: acquiring domain-specific knowledge, planning over long task horizons, and handling dynamic, non-uniform interfaces.