Splash is a lightweight, scriptable headless browser designed specifically for web scraping at scale that enables you to render, interact with and extract data from modern Javascript heavy websites.
Easily customisable for faster rendering and complex user behaviour simulation.
Our data powers over 2,000 companies every day
Render pages at any scale
Scrape JavaScript generated content from web pages at scale and take screenshots of the page for specific applications including data quality assurance.
Lightweight browser
Speed up your crawlers with a dedicated node for rendering JavaScript that’s separate from your crawler.
Simulate user behaviour
Simulate user behavior with custom scripts via Lua scripts when behaviour is too complex to replicate in your crawler.
Faster rendering
Process multiple pages in parallel, turn OFF images or use Adblock Plus rules to make rendering faster.
Easy integration
Splash integrates with Scrapy, Scrapy Cloud and Crawlera to make adding a headless browser to you web scraping stack a seamless experience.
An option to suit all requirements
Splash is both an open source project and a Scrapinghub product.
Open source Splash
The Splash headless browser is an open source project created and maintained by Scrapinghub. Check out the Splash documentation to learn more.
Hosted Splash
Scrapinghub offers a hosted version of Splash using a SaaS model that enables you to access a dedicated battle-tested Splash instance via an HTTP API.
Pricing
We offer hosted Splash instances in 3 sizes and an enterprise solution