BrowserUse_and_ComputerUse_.../hermes_code/skills/mlops/models/DESCRIPTION.md

description
Specific model architectures and tools — computer vision (CLIP, SAM, Stable Diffusion), speech (Whisper), audio generation (AudioCraft), and multimodal models (LLaVA).