Extract and break down content from web documents, PDFs, images, and URLs into structured markdown notes stored locally and synced to Obsidian. Use when the...
Convert documents, PDFs, images, and web content into structured markdown notes saved to workspace and synced to Obsidian.
For URLs/PDFs:
Use web_fetch to extract content
For images:
Use image tool to analyze and extract text
For already-analyzed content:
User may paste content directly or you've already extracted it
Convert raw content into organized markdown:
Sections to create:
Formatting guidelines:
Create the markdown note with proper frontmatter and save to workspace:
# Prepare frontmatter
date = "2026-02-10"
tags = ["research", "bloodwork", "nmr"] # Auto-assigned based on content
title = "NMR Lipid Panel Reference Ranges"
# Build full markdown content
content = f"""---
date: {date}
tags:
- {tag1}
- {tag2}
- {tag3}
source: {original_url_or_source}
type: image-breaker-note
---
# {title}
## Overview
[Brief description of what this document is]
## Key Points
- Point 1
- Point 2
- Point 3
## [Main Section]
[Detailed content with subsections]
## Reference
- **Source:** [URL or document name]
- **Extracted:** {date}
"""
# Save to workspace
output_dir = "research/image-breaker-notes" # Default
# or user-specified: "research/bloodwork", "content/references", etc.
# Write file
filepath = f"{output_dir}/{date}-{slugified-title}.md"
write(filepath, content)
# Sync to Obsidian (using obsidian-sync skill)
exec: python3 skills/obsidian-sync/scripts/sync_to_obsidian.py {filepath} /Users/biohacker/Desktop/Connections ImageBreaker
Auto-assign 3 most relevant tags based on content:
Common tags:
research - Academic papers, studies, referencesbloodwork - Lab results, biomarkers, panelsnmr - NMR lipid panels specificallycholesterol - Cholesterol and lipid-relatedpeptides - BPC-157, TB-500, etc.supplements - Vitamins, minerals, compoundsprotocols - Treatment/optimization protocolsfounders - Business/entrepreneur health contentlongevity - Anti-aging, healthspanperformance - Cognitive/physical optimizationtraining - Exercise, workoutstoku - Nattokinase, Toku Flow relatedPrioritize specific tags over generic ones.
Default: research/image-breaker-notes/
Content-specific alternatives:
research/papers/ or research/protocols/research/bloodwork/content/references/research/training/projects/business-docs/Choose the most appropriate directory based on content type.
User provides Labcorp NMR document URL:
web_fetchbloodwork, nmr, researchresearch/image-breaker-notes/2026-02-10-nmr-lipid-panel-reference.mdImageBreaker/2026-02-10-nmr-lipid-panel-reference.mdAfter completing the workflow:
✅ **Document broken down and saved**
📝 **Title:** [Note Title]
📂 **Location:** research/image-breaker-notes/2026-02-10-note-title.md
🔗 **Obsidian:** ImageBreaker/2026-02-10-note-title.md
🏷️ **Tags:** tag1, tag2, tag3
**Sections created:**
- Overview
- Key Points
- [Main sections listed]
- Reference
The note is now in your Obsidian vault for tagging and cross-referencing.
Obsidian Sync: Automatically called after note creation
Paper Fetcher: If user provides DOI, use paper-fetcher first, then break down the PDF
Research Automation: Can batch-process multiple documents from research runs
ZIP package — ready to use