ArchiveBox is a self-hosted web archiving solution that lets you preserve content from websites in multiple formats like HTML, PDF, screenshots, and more. It supports importing from browser history, bookmarks, RSS feeds, and other sources.
ArchiveBox is an open source self-hosted web archiving platform that preserves web content in multiple redundant formats. It allows you to save copies of websites, social media, and other online content while maintaining complete control over your archived data.
Comprehensive Archiving:
Flexible Input Sources:
Privacy-Focused Design:
Advanced Capabilities:
ArchiveBox is ideal for:
ArchiveBox can be deployed via Docker or pip:
Docker:
docker run -v $PWD:/data archivebox/archivebox init
docker run -v $PWD:/data archivebox/archivebox add 'https://example.com'
Python/pip:
pip install archivebox
archivebox init
archivebox add 'https://example.com'
The platform provides extensive documentation and an active community to help users get started with web archiving.
Similar projects based on shared tags