Vessel is a powerful web crawling framework for automated web crawling and data mining that is powered by the Ferrum web driver. Both tools are written in pure Ruby and are open source projects .
A search framework, or web crawler, is used to collect information from web pages. For example, to later enter the information into a database or analyze the structure of the site. Similar tools are gambling database russia used by search engines, SEO services, scoring systems and other programs that rely on data from open sources.
Algorithm of work
Let's look at how the framework works using an example:
To install the framework, simply add the gem "vessel" to your Gemfile.
Register the crawler class.
Create a software agent spider.rb, where you will need to register the Spider class, derived from Vessel::Cargo.
Next, set the data collection parameters and parsing callback functions. If you do not specify a method, Vessel::Cargo will raise NotImplementedError by default.
Vessel is a powerful data collection tool
-
- Posts: 863
- Joined: Tue Dec 24, 2024 3:20 am