pdf-page-extract

Name: pdf-page-extract
Rating: 5 (1 review)
Author: bg-szy

Extracts rich data from PDF pages, including text spans with metadata, rendered PNG images, and page mapping. Creates persistent artifacts for downstream processing.

1 star

Updated 13 days ago

How to add

/plugin marketplace add bg-szy/TOP-SKILLS

The exact command may vary by repository. Check the README on GitHub.

#pdf

PDF Page Extract Skill

Purpose

This skill extracts all necessary data from PDF pages to enable accurate AI-driven HTML generation. It produces three critical artifacts:

Rich extraction data - Text spans with font metadata (sizes, styles, positions)
Rendered PNG image - Visual reference for AI to understand page layout
Page mapping - Authoritative mapping of PDF indices to book pages
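As a rough illustration of what the first and third artifacts might look like in Python, here is a minimal sketch. It is not the skill's actual code: the truncated description does not name its extraction library or its data schema. The sketch assumes the page has already been read into a dictionary shaped like the output of PyMuPDF's `page.get_text("dict")` (blocks, then lines, then spans with `text`, `font`, `size`, `flags`, and `bbox` keys). The names `Span`, `flatten_spans`, `build_page_mapping`, and `first_book_page_index` are hypothetical, as is the assumption that book pages are numbered consecutively after the front matter.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class Span:
    """One text span with font metadata (hypothetical schema)."""
    text: str
    font: str
    size: float
    bold: bool
    italic: bool
    bbox: tuple  # (x0, y0, x1, y1) in page coordinates


def flatten_spans(page_dict: dict[str, Any]) -> list[Span]:
    """Flatten a PyMuPDF-style page dict into a list of Span records."""
    spans: list[Span] = []
    for block in page_dict.get("blocks", []):
        # Image blocks carry no "lines" key, so they are skipped here.
        for line in block.get("lines", []):
            for s in line.get("spans", []):
                flags = s.get("flags", 0)
                spans.append(Span(
                    text=s["text"],
                    font=s["font"],
                    size=s["size"],
                    bold=bool(flags & 16),   # PyMuPDF span flag bit 4: bold
                    italic=bool(flags & 2),  # PyMuPDF span flag bit 1: italic
                    bbox=tuple(s["bbox"]),
                ))
    return spans


def build_page_mapping(page_count: int,
                       first_book_page_index: int) -> dict[int, Optional[int]]:
    """Map 0-based PDF indices to printed book pages.

    Assumption: pages before first_book_page_index are unnumbered front
    matter (mapped to None) and numbering then runs consecutively from 1.
    """
    return {
        i: (i - first_book_page_index + 1) if i >= first_book_page_index else None
        for i in range(page_count)
    }


# Hand-built sample standing in for real extraction output.
sample_page = {
    "blocks": [
        {"type": 0, "lines": [{"spans": [
            {"text": "Chapter 1", "font": "Times-Bold", "size": 18.0,
             "flags": 16, "bbox": (72.0, 72.0, 200.0, 92.0)},
        ]}]},
        {"type": 1},  # image block, no text lines
    ]
}
spans = flatten_spans(sample_page)
mapping = build_page_mapping(page_count=4, first_book_page_index=2)
```

For the second artifact, the rendered PNG, PyMuPDF offers `page.get_pixmap(dpi=...)` followed by `pix.save(...)`. Whether the skill renders pages this way is not stated in the excerpt.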

This is the deterministic, Python-based foundation for the entire pipeline. All e

[Description truncated. See the full README on GitHub.]
