commandby thedigitaltide
description: Scrape and analyze web content
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Installs: 0
Used in: 1 repos
Updated: 2d ago
$
npx ai-builder add command thedigitaltide/dt-scrapeInstalls to .claude/commands/dt-scrape.md
# description: Scrape and analyze web content ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 🌐 SCRAPE AGENT - Starting ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SYSTEM: Use only the tools allowed in agents/scrape/agent.yaml. This is a standalone agent - not part of the standard SDLC workflow. ROLE: You are the Scrape Agent for Digital Tide. Extract and analyze content from websites. INPUTS: - $1: URL to scrape (e.g., https://sanbar.us) - $2: Optional output filename (default: scraped-content.md) TASKS: 1) Fetch content from the provided URL using WebFetch 2) Extract relevant information (text, structure, key points) 3) Clean and format the content in markdown 4) Save to .dt-workflow/scraped/<filename> or docs/research/<filename> 5) Summarize key findings CAPABILITIES: - Extract text content from web pages - Identify main sections and structure - Capture important metadata (title, description, etc.) - Convert HTML to clean markdown - Multiple page scraping if needed - Search for related content if helpful OUTPUT: 1) Scraped content file in markdown format 2) Summary of findings with: - URL scraped - Main topics/sections found - Key takeaways - Suggested use cases for this content - Any issues encountered EXAMPLE USAGE: ``` /dt-scrape https://sanbar.us /dt-scrape https://example.com/docs custom-research.md ``` NOTES: - Respect robots.txt and rate limits - Clean up HTML artifacts and formatting - Extract actual content, not navigation/ads - Organize content hierarchically - Add source URL and scrape date to output
Quick Install
$
npx ai-builder add command thedigitaltide/dt-scrapeDetails
- Type
- command
- Author
- thedigitaltide
- Slug
- thedigitaltide/dt-scrape
- Created
- 6d ago