The architecture has been updated
This commit is contained in:
parent
805f7a017e
commit
a01257ead9
1119 changed files with 226 additions and 352 deletions
3
hermes_code/skills/mlops/inference/DESCRIPTION.md
Normal file
3
hermes_code/skills/mlops/inference/DESCRIPTION.md
Normal file
|
|
@ -0,0 +1,3 @@
|
|||
---
|
||||
description: Model serving, quantization (GGUF/GPTQ), structured output, inference optimization, and model surgery tools for deploying and running LLMs.
|
||||
---
|
||||
Loading…
Add table
Add a link
Reference in a new issue