BrowserUse_and_ComputerUse_.../api/mind2web_benchmark.json
Максим Туревич 365ab8dd79 tests add for api
2026-04-22 03:46:17 +03:00

[
{
"index": 2,
"original_task_id": "mind2web_2_1776817552",
"api_task_id": "2334bb039ee04dc6b23b5b0240314064",
"task_description": "Find hard side Carry-on Luggage in black color in target",
"reference_length": 9,
"status": "succeeded",
"queue_time_sec": 0.02,
"execution_time_sec": 74.38,
"total_time_sec": 74.38,
"result": {
"task_id": "2334bb039ee04dc6b23b5b0240314064",
"status": "succeeded",
"success": true,
"execution_time": 72.45919299125671,
"result": "Found hard side carry-on luggage in black at Target:\n\n1. **Signature Hardside Carry On Spinner Suitcase Matte Black** - Open Story™\n - Price: $128.00\n - Rating: 4.3/5 (71 ratings)\n - Features: TSA Locks, Water-Resistant, Polycarbonate\n\n2. **Hardside Carry On Spinner Suitcase Black** - Open Story™\n - Price: $90.00\n - Rating: 4.3/5 (500 ratings)\n - Features: Telescoping Handle, 8 Wheels\n\n3. **SWISSGEAR Energie Hardside Carry On Spinner Suitcase** - Black\n - Price: $86.39 (on sale)\n - Rating: 4.2/5 (76 ratings)\n - Features: Structured Polycarbonate, 8 Spinner Wheels\n\nAll three products match your requirements for hard side carry-on luggage in black color.",
"error": null,
"raw_response": {
"success": true,
"result": "Found hard side carry-on luggage in black at Target:\n\n1. **Signature Hardside Carry On Spinner Suitcase Matte Black** - Open Story™\n - Price: $128.00\n - Rating: 4.3/5 (71 ratings)\n - Features: TSA Locks, Water-Resistant, Polycarbonate\n\n2. **Hardside Carry On Spinner Suitcase Black** - Open Story™\n - Price: $90.00\n - Rating: 4.3/5 (500 ratings)\n - Features: Telescoping Handle, 8 Wheels\n\n3. **SWISSGEAR Energie Hardside Carry On Spinner Suitcase** - Black\n - Price: $86.39 (on sale)\n - Rating: 4.2/5 (76 ratings)\n - Features: Structured Polycarbonate, 8 Spinner Wheels\n\nAll three products match your requirements for hard side carry-on luggage in black color.",
"browser_view": "https://face-veteran-investigate-daniel.trycloudflare.com"
}
},
"timestamp": "2026-04-22T03:27:07.331142"
},
{
"index": 3,
"original_task_id": "mind2web_3_1776817627",
"api_task_id": "fb9ca5f23cfe427d80c31747f61260d6",
"task_description": "Show me the coming soon AMC Artisan Films in amctheatres",
"reference_length": 6,
"status": "succeeded",
"queue_time_sec": 0.01,
"execution_time_sec": 58.33,
"total_time_sec": 58.33,
"result": {
"task_id": "fb9ca5f23cfe427d80c31747f61260d6",
"status": "succeeded",
"success": true,
"execution_time": 56.48335075378418,
"result": "Here are the AMC Artisan Films coming soon:\n\n1. **MICHAEL** - April 24\n - A front-row seat to the life and legacy of one of the most influential artists\n\n2. **The AI Doc: Or How I Became an Apocaloptimist**\n - Release date not specified\n\n3. **Lorne** - April 17, 2026\n - From Academy Award-winning filmmaker Morgan Neville\n - Behind-the-scenes glimpse at comedy empire builder\n - Rating: R | Runtime: 1 hr 41 min\n\n4. **Hamlet** - April 10, 2026\n - Rating: NR | Runtime: 1 hr 54 min\n\n5. **Mother Mary** - April 17, 2026\n - Starring Anne Hathaway, Michaela Coel, Hunter Schafer, FKA Twigs\n - Directed by David Lowery | Rating: R | Runtime: 1 hr 50 min",
"error": null,
"raw_response": {
"success": true,
"result": "Here are the AMC Artisan Films coming soon:\n\n1. **MICHAEL** - April 24\n - A front-row seat to the life and legacy of one of the most influential artists\n\n2. **The AI Doc: Or How I Became an Apocaloptimist**\n - Release date not specified\n\n3. **Lorne** - April 17, 2026\n - From Academy Award-winning filmmaker Morgan Neville\n - Behind-the-scenes glimpse at comedy empire builder\n - Rating: R | Runtime: 1 hr 41 min\n\n4. **Hamlet** - April 10, 2026\n - Rating: NR | Runtime: 1 hr 54 min\n\n5. **Mother Mary** - April 17, 2026\n - Starring Anne Hathaway, Michaela Coel, Hunter Schafer, FKA Twigs\n - Directed by David Lowery | Rating: R | Runtime: 1 hr 50 min",
"browser_view": "https://face-veteran-investigate-daniel.trycloudflare.com"
}
},
"timestamp": "2026-04-22T03:28:05.665642"
},
{
"index": 4,
"original_task_id": "mind2web_4_1776817685",
"api_task_id": "e3db75a7582e4fde856d8e9b746417a2",
"task_description": "Go to the page with help in choosing a solar energy product for homes in tesla",
"reference_length": 5,
"status": "succeeded",
"queue_time_sec": 0.0,
"execution_time_sec": 295.44,
"total_time_sec": 295.44,
"result": {
"task_id": "e3db75a7582e4fde856d8e9b746417a2",
"status": "succeeded",
"success": true,
"execution_time": 293.71661829948425,
"result": "Successfully found help content for choosing solar energy products via Tesla Assist chat. The response includes: (1) Design & Aesthetics comparison between Solar Panels and Solar Roof, (2) Durability information for both products, (3) Roof Eligibility table showing which roof types work with each product (Asphalt/Composition Shingles, Concrete Tile, Flat Roof, Standing Seam Metal, Other), (4) Key Benefits including savings & independence info, (5) Next steps with Order Now and Schedule Consultation links.",
"error": null,
"raw_response": {
"success": true,
"result": "Successfully found help content for choosing solar energy products via Tesla Assist chat. The response includes: (1) Design & Aesthetics comparison between Solar Panels and Solar Roof, (2) Durability information for both products, (3) Roof Eligibility table showing which roof types work with each product (Asphalt/Composition Shingles, Concrete Tile, Flat Roof, Standing Seam Metal, Other), (4) Key Benefits including savings & independence info, (5) Next steps with Order Now and Schedule Consultation links.",
"browser_view": "https://face-veteran-investigate-daniel.trycloudflare.com"
}
},
"timestamp": "2026-04-22T03:33:01.112179"
}
]