jina-reader

Web content extraction via Jina AI Reader API. Three modes: read (URL to markdown), search (web search + full content), ground (fact-checking). Extracts clean content without exposing server IP.

Skill file

Preview skill file↓↑
---
name: jina-reader
description: "Web content extraction via Jina AI Reader API. Three modes: read (URL to markdown), search (web search + full content), ground (fact-checking). Extracts clean content without exposing server IP."
homepage: https://jina.ai/reader
metadata: {"clawdbot":{"emoji":"πŸ“–","requires":{"bins":["curl","jq"]},"primaryEnv":"JINA_API_KEY"}}
---

# Jina Reader

Extract clean web content via Jina AI β€” without exposing your server IP.

## Read a URL

```bash
{baseDir}/scripts/reader.sh "https://example.com/article"
```

## Search the web (top 5 results with full content)

```bash
{baseDir}/scripts/reader.sh --mode search "latest AI news 2025"
```

## Fact-check a statement

```bash
{baseDir}/scripts/reader.sh --mode ground "OpenAI was founded in 2015"
```

## Options

| Flag | Description | Default |
|------|-------------|---------|
| `--mode` | `read`, `search`, `ground` | `read` |
| `--selector` | CSS selector to extract specific region | β€” |
| `--wait` | CSS selector to wait for before extraction | β€” |
| `--remove` | CSS selectors to remove (comma-separated) | β€” |
| `--proxy` | Country code for geo-proxy (`br`, `us`, etc.) | β€” |
| `--nocache` | Force fresh content (skip cache) | off |
| `--format` | `markdown`, `html`, `text`, `screenshot` | `markdown` |
| `--json` | Raw JSON output | off |

## Examples

```bash
# Extract article content
{baseDir}/scripts/reader.sh "https://blog.example.com/post"

# Extract specific section via CSS selector
{baseDir}/scripts/reader.sh --selector "article.main" "https://example.com"

# Remove nav and ads before extraction
{baseDir}/scripts/reader.sh --remove "nav,footer,.ads" "https://example.com"

# Search with JSON output
{baseDir}/scripts/reader.sh --mode search --json "AI enterprise trends"

# Read via Brazil proxy
{baseDir}/scripts/reader.sh --proxy br "https://example.com.br"

# Fact-check a claim
{baseDir}/scripts/reader.sh --mode ground "Tesla is the most valuable car company"
```

## API Key

```bash
export JINA_API_KEY="jina_..."
```

Free tier: 10M tokens (no signup needed). Get key at https://jina.ai/reader/

## Pricing

- **Read:** ~$0.005/page (standard) | 3x for ReaderLM-v2
- **Search:** 10K tokens fixed + variable per result
- **Ground:** ~300K tokens/request (~30s latency)

## Why Jina Reader?

- **IP protection** β€” requests route through Jina's infra, not your server
- **Clean markdown** β€” readability extraction + optional ReaderLM-v2
- **Dynamic content** β€” headless Chrome renders JavaScript
- **Structured extraction** β€” JSON schema support for data extraction

Source

Creator's repository Β· sundial-org/awesome-openclaw-skills

View on GitHub β†—

Security

Security checks in progress
Results will appear here once audits complete
What this skill can do
Reads your filesConnects to the internetRuns code on your machine
Checked by 3 independent security firms
Does it try to trick the AI?Not yet checkedPending Β· Gen Agent Trust Hub
Does it sneak in hidden code?Not yet checkedPending Β· Socket
Does it have known bugs?Not yet checkedPending Β· Snyk