This repository was created in 2026 by Dubchak Alexandr at the Moscow Aviation Institute for learning the Quality Evaluation system.

Quality_evaluation

This repository was created in 2026 by Al-Tahir Roman and Dubchak Alexandr at the Moscow Aviation Institute for learning the Quality Evaluation system.

The following systems will be tested using both existing and custom datasets:

- Browser_Use/Computer_Use system
- Media_Skill system
- B2B_assistant system
- LLM models
...