[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Paper collection on building and evaluating language model agents via executable language grounding
[ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation"
[ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Scaling Computer-Use Grounding via UI Decomposition and Synthesis