Datasets Activity Stream About Order by Relevance Name Ascending Name Descending Last Modified Go 3 datasets found Groups: Natural Language Processing Formats: JSON Filter Results APIBank APIBank is a comprehensive benchmark for tool-augmented LLMs, focusing on API calling, retrieving, and planning abilities. Dataset JSON APIBench APIBench is a comprehensive benchmark for tool-augmented LLMs, focusing on API calling, retrieving, and planning abilities. Dataset JSON GTA: A Benchmark for General Tool Agents GTA is a benchmark for General Tool Agents, featuring three main aspects: real user queries, real deployed tools, and real multimodal inputs. Dataset JSON