[![Listada na Skillteca](https://www.skillteca.com.br/api/badge/import-web-markdown-with-gather/svg)](https://www.skillteca.com.br/skills/import-web-markdown-with-gather?utm_source=badge&utm_medium=readme&utm_campaign=badge)

Import Web Markdown With Gather

Purpose

Use gather as the default local tool for converting a URL into readable markdown.

Recommended Defaults

Run gather with these settings unless the user asks otherwise:

gather --metadata-yaml --inline-links --no-paragraph-links "<url>"

Rationale:

--metadata-yaml: Adds title/date/source in front matter for downstream indexing.
--inline-links: Keeps links close to text for RAG/chunk readability.
--no-paragraph-links: Avoids repeated reference blocks after each paragraph.

Required Workflow

Validate input:
- Accept only http:// or https:// URLs.
- If input is not a URL, ask for one.

Run gather:

Primary command:

gather --metadata-yaml --inline-links --no-paragraph-links "<url>"

On failure, retry with fallback mode:

First fallback:

gather --metadata-yaml --inline-links --no-paragraph-links \
  --no-readability "<url>"

If the page still fails and raw HTML is available, pass HTML directly:

printf "%s" "$HTML" | gather --html --stdin --metadata-yaml \
  --inline-links --no-paragraph-links

Return markdown text as the main result.

Output Contract

When successful, return:

url: original URL
title: extracted title when available
markdown: full markdown body
used_fallback: true if --no-readability or --html path was used

Safety And Limits

Do not execute JavaScript from pages.
Do not follow login-only pages automatically.
Preserve the original URL in output metadata.
If output is empty or too short, report a partial extraction warning.

Examples

Basic import:

gather --metadata-yaml --inline-links --no-paragraph-links "https://example.com/article"

Fallback when readability extraction fails:

gather --metadata-yaml --inline-links --no-paragraph-links --no-readability "https://example.com/article"

Optional Variants

Add title only:
```
gather --title-only "<url>"
```

Plain body without source/title injection:

gather --no-include-source --no-include-title "<url>"

import-web-markdown-with-gather

How to add

Drop this on your repo README

Related skills

doc-coauthoring

algorithmic-art

seo-aeo-blog-writer

wordpress-centric-high-seo-optimized-blogwriting-skill

Get new Escrita e Conteúdo skills every Monday