feat(ocr-and-documents): add OCR and document extraction skills

- Introduced new skills for extracting text from PDFs, scanned documents, and images using OCR and document parsing tools.
- Added detailed documentation for usage and installation of `pymupdf` and `marker-pdf` for local extraction.
- Implemented scripts for text extraction with both lightweight and high-quality options, including support for various document formats.
- Updated web extraction functionality to handle PDF URLs directly, enhancing usability for academic papers and documents.
2026-02-26 23:06:08 -08:00
commit 19abbfff96
parent 21cf339a85
5 changed files with 322 additions and 1 deletion
--- a/skills/ocr-and-documents/DESCRIPTION.md
+++ b/skills/ocr-and-documents/DESCRIPTION.md
@@ -0,0 +1,3 @@
+---
+description: Skills for extracting text from PDFs, scanned documents, images, and other file formats using OCR and document parsing tools.
+---