Azure Green Product Scraping
URL Structure
Azure Green uses two URL patterns:
/departments/{id}/- Category navigation (nested hierarchy)/products/{id}/- Actual product listings
Important: Use /products/ URLs for scraping actual items.
Working Product IDs
Candles
| ID | Description | Count |
|---|---|---|
| 111 | Ritual candles | ~20 |
| 112 | Household candles | ~20 |
| 113 | Pillar candles | ~17 |
| 114 | Figure candles (cat, knob, etc) | ~20 |
| 143 | Chime candles | ~20 |
| 262 | Votive candles | ~20 |
| 268 | Crystal Journey pillars | ~20 |
Incense
| ID | Description | Count |
|---|---|---|
| 35 | Burners & holders | ~20 |
| 36 | Cone incense | ~19 |
| 37 | Powder incense | ~20 |
| 549 | Palo Santo & smudge | ~20 |
| 609 | Brass burners | ~20 |
| 379 | Charcoal discs | ~14 |
| 608 | Wood burners | ~27 |
| 1052 | Backflow burners | ~33 |
| 1074 | Waterfall incense | ~34 |
Crystals & Stones
| ID | Description | Count |
|---|---|---|
| 193 | Raw/bulk stones | ~42 |
| 394 | Tumbled stones | ~40 |
Other Categories
| ID | Description | Count |
|---|---|---|
| 2 | Tarot decks | ~20 |
| 250 | Best sellers | ~20 |
Scraper Usage
# Scrape a specific category
npx tsx scripts/scrape-dept.ts \
"https://www.azuregreen.net/Candles/products/111/" \
--output ./data/candles.json \
--check-images
# Options:
# --output <file> Output JSON file
# --limit <n> Max products to scrape
# --check-images Verify image URLs existImage URL Pattern
Standard pattern: https://www.azuregreen.net/images/{SKU}.jpg
Some products have missing images. Current workarounds:
- Check alternate patterns (lowercase, underscores)
- Use category placeholder images
- Source manually
Import Pipeline
# 1. Scrape products
npx tsx scripts/scrape-dept.ts <url> --output ./data/<category>.json
# 2. Combine all scraped data
node -e "..." # (see all-scraped.json generation script)
# 3. Import to database
npx tsx scripts/import-azure-green.ts --input ./data/all-scraped.json
# 4. Auto-tag with correspondences
npx tsx scripts/auto-tag-products.tsCurrent Stats (2026-01-31)
- Total unique products scraped: 444
- With images: 444 (100%)
- Categories covered: Candles, Incense, Tarot, Crystals, Amulets, Athames, Scrying, Voodoo, Salts, Statues
Known Issues
-
Misclassification - Some products tagged wrong (e.g., βGoddess athameβ β Statue)
- Fix: Review auto-tagging logic or manual corrections
-
Salt/Tool overlap - Some salts tagged as tools
- 13 products in Tools category have missing images (not on AzureGreen CDN)
-
Pagination - Scraper handles basic pagination but may miss some pages
- Workaround: Manually verify counts against site