Summary
StormCrawler is a popular, mature open-source web crawler written in Java that is both lightweight and scalable.
1
It is extensible, modular and versatile.
1
The Elastic App Search web crawler can visit a webpage when a URL is provided and extract content for ingestion into an App Search engine.
2
The crawler then follows the links it finds on that page. This process is called content discovery, and each discovered link is crawled in the same way.
2
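The content discovery loop described above can be illustrated with a minimal sketch. This is not the implementation used by the App Search crawler or by StormCrawler. It is a generic breadth-first crawl written under stated assumptions: the `crawl` function, the `LinkExtractor` class, the `fetch` callable, and the in-memory `SITE` pages are all hypothetical names introduced for illustration, and no real network requests are made.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seed_url, fetch, max_pages=100):
    """Breadth-first content discovery starting from seed_url.

    `fetch` is a hypothetical callable mapping a URL to its HTML string,
    or to None when the page is unavailable.
    Returns a dict mapping each crawled URL to its HTML.
    """
    seen = {seed_url}          # URLs already queued, to avoid revisiting
    queue = deque([seed_url])  # URLs waiting to be visited
    documents = {}             # extracted content, keyed by URL

    while queue and len(documents) < max_pages:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue  # skip pages that cannot be fetched
        documents[url] = html

        # Discover links on this page and queue the ones not seen before.
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            absolute = urljoin(url, href)
            if absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)

    return documents


# Hypothetical in-memory "site" standing in for real HTTP responses.
SITE = {
    "https://example.com/": '<a href="/a">A</a> <a href="/b">B</a>',
    "https://example.com/a": '<a href="/">home</a> <a href="/c">C</a>',
    "https://example.com/b": "<p>no links</p>",
    "https://example.com/c": '<a href="/missing">gone</a>',
}

pages = crawl("https://example.com/", SITE.get)
print(sorted(pages))
```

Starting from a single seed URL, the sketch reaches every page that is linked directly or indirectly, while the `seen` set prevents the cycle between the home page and `/a` from causing repeat visits. A production crawler would additionally need politeness delays, robots.txt handling, and domain restrictions, none of which are shown here.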
According to