Configure the Scraping Pipeline
Learn to configure the broadway pipeline and use the handle_message function.
We'll cover the following...
We'll cover the following...
Pipeline configuration
We’ll use the processors of Broadway
to refactor the logic that checks each website. For this, we have to define :processors
in start_link/1
, and use handle_message/3
:
def start_link(_args) do
options = [
name: ScrapingPipeline,
producer: [
module: {PageProducer, []},
transformer: {ScrapingPipeline, :transform, []}
],
processors: [
default: [max_demand: 1, concurrency: 2]
]
]
Broadway.start_link(__MODULE__, options)
end
def handle_message(_processor, message, _context) do
if Scraper.online?(message.data) do
# To do...
else
...