Introduction
Overview of Alchemy Lab documentation, features and usage
Welcome to the Alchemy Lab documentation.
Alchemy Lab is a web content extraction tool under Alchemy Lab, converting web pages into clean Markdown format, designed for AI applications and data processing scenarios.
Core Features
- Web Scraping - Extract content from single or multiple URLs, with support for dynamically rendered pages
- Multi-page Crawling - Recursively discover and scrape entire websites from a starting URL
- PDF Export - Export web pages to PDF format
- AI Enhancement - Optional AI optimization to automatically clean and format extracted content
Technical Highlights
| Feature | Description |
|---|---|
| Browser Engine | Puppeteer for SPA dynamic rendering |
| Smart Engine Selection | Automatically detects page type and selects optimal scraping strategy |
| High Performance | Supports concurrent scraping with built-in caching |
| Open Source | MIT licensed, free to deploy and modify |
Quick Start
# Clone the project
git clone https://cnb.cool/ai-alchemy-factory/alchemy-lite.git
cd alchemy-lite
# Install dependencies
pnpm install
# Start the backend
pnpm dev
# Start the frontend (in another terminal)
cd ui && pnpm dev
Visit http://localhost:9002 to access the web interface.
Next Steps
- Alchemy Page - Learn how to use the core scraping interface
- Multi-page Crawling - Learn how to crawl entire websites
- API Reference - View the complete API documentation
- Deployment Guide - Deploy Alchemy Lab to production