This repository was created by Dubchak Alexandr in Moscow Aviation Institute for successful learning of Quality Evaluation system in 2026.
Find a file
Aleksandr Dubchak 1dd92ab887 45 tests
2026-04-23 01:21:29 +03:00
.idea mind2web 2026-04-23 00:04:11 +03:00
BrowserUse_and_ComputerUse_skills@e2a1b93f60 mind2web 2026-04-23 00:04:11 +03:00
Mind2Web 45 tests 2026-04-23 01:21:29 +03:00
stuff mind2web 2026-04-23 00:04:11 +03:00
.DS_Store mind2web 2026-04-23 00:04:11 +03:00
compass_artifact_wf_855dfbf6_5817_4527_a1dd_950cd6522656_text_markdown.md mind2web 2026-04-23 00:04:11 +03:00
README.md Update README.md 2026-03-25 12:28:25 +00:00
results_small.jsonl mind2web 2026-04-23 00:04:11 +03:00

Quality_evaluation

This repository was created by Al-Tahir Roman and Dubchak Alexandr in Moscow Aviation Institute for successful learning of Quality Evaluation system in 2026.

There are some systems that will be tested using both existing and custom datasets:

- Browser_Use/Computer_Use system
- Media_Skill system
- B2B_assistant system
- LLM-models
...