Ben Bogin
Page not found
Perhaps you were looking for one of these?
Latest
SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories
AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Leveraging Code to Improve In-context Learning for Semantic Parsing
Answering Questions by Meta-Reasoning over Multiple Chains of Thought
Diverse Demonstrations Improve In-context Compositional Generalization
Unobserved Local Structures Make Compositional Generalization Hard
COVR: A Test-Bed for Visually Grounded Compositional Generalization with Real Images
Text-to-SQL in the Wild: A Naturally-Occurring Dataset Based on Stack Exchange Data
An autonomous debating system