top of page

Intro to Scatter-Gather Integration Pattern

GitHub repository with the Mule Project can be found at the end of the post.



In this blog post we will be gathering the scattered thoughts by routing them into a singular direction. You would have probably guessed it by now, we will be discussing an Enterprise Integration Pattern for Message Routing called the Scatter-Gather. We will first try to understand the concept of Scatter-Gather and then demonstrate its ability with the help of MuleSoft ESB connector for the same. The goal of this post is to follow a very basic and simplistic approach, before diving further into the concept. If you are someone who doesn’t like a lot of text, the depicted pictures will whisper the secret, in parallel.




What is Scatter-Gather?


As described in Enterprise Integration Patterns, Scatter-Gather is a Message routing pattern which broadcasts messages to multiple recipients and aggregates the response back in a single message. In simple words, the messages are executed in parallel (scatter) and response from each execution is bundled (gathered) as one single message.


Below picture depicts the same regarding Scatter-Gather:


request scatter-gather process 1 process 2 process 3 aggregate response process 1 + plus process 2 + plus process 3
Fig 1.1: Request is routed to three different processes in scatter-gather, Output response will be the aggregated result of all 3 processes.

Note: Response of individual processes are independent of each other.



When to use Scatter-Gather?


Depends on what you are trying to achieve, if it’s a Fork-Joined model (imagine a fork) or Multicast scenario, where the messages are required to be communicated simultaneously, Scatter-Gather is a better choice, in case of “Fire-Forget” model use asynchronous processing blocks, as there is no standard requirement to gather all the response in given time. If performance is a parameter (most certainly), Scatter- Gather pattern helps to immensely cut down the processing duration and improves the performance.



How does Scatter-Gather work?


As already explained in the above section, Scatter-Gather will execute the messages in Parallel a.k.a. concurrent processing, meaning sending the messages to desired routes at the same time through a parallel thread pool. While a particular message is being processed, the remaining messages should wait for the completion of the message. In other words, the Scatter-Gather processing will be completed only when all the messages are fully processed.


request scatter-gather process 1 process 2 process 3 aggregate response process 1 + plus process 2 + plus process 3
Fig 1.2: Scatter-Gather process 1 is complete (Green), Other two processes are work in-progress (Yellow). The response is not yet captured (Red).

request scatter-gather process 1 process 2 process 3 aggregate response process 1 + plus process 2 + plus process 3
Fig 1.3: Scatter-Gather process 1 and 2 are complete (Green), Whereas process 3 is work in-progress (Yellow). The response is not yet captured (Red).

request scatter-gather process 1 process 2 process 3 aggregate response process 1 + plus process 2 + plus process 3
Fig 1.4: Scatter-Gather process 1, 2, and 3 are complete (All Green. Yay!). The response is captured as a bundle of all three responses (Green).


Quiz 1:

What is the Total time taken for the completion of the Scatter-Gather process below?


request scatter-gather process 1 20 seconds process 2 14 seconds process 3 10 seconds total time? aggregate response process 1 + plus process 2 + plus process 3


Why use the Scatter-Gather Integration Pattern?


It may turn out to be a simple decision on why to use the Scatter-Gather pattern when compared to sequential processing, although the responses retrieved can be the same in both cases. Sequential processing should be considered when there is an interdependency of the processes up on their responses, meaning response from process 1 is required for process 2 to complete.


In cases where the process responses are independent of each other, parallel processing can be efficiently executed. If using sequential processing, in case of one message/process failure the complete processing will come to halt. This may be a valid business scenario to end the process, whereas in case of scatter-gather the responses will include or exclude the error cases (scenario based) therefore avoiding a complete meltdown.



Quiz 2:

Scatter-Gather halts the processing of the messages if any one of the routing processes fail.

Answer: True or False.


We started with scattered thoughts about the pattern and now we have reached a state of composed thoughts, which were pretty scattered to begin with. Next, we will attempt to demonstrate our understanding through a MuleSoft (Mule 4) usage of Scatter-Gather Component. I won’t go deep into the specifics of Mule 4 Scatter-Gather Router as it has been explained very well in MuleSoft documentation.



Mule 4 Scatter-Gather Connector Example


In the below example, a mule Scatter-Gather router invokes the three flows concurrently and aggregates the response as a new payload. A point to note is Scatter-Gather block should have at least two routes to process. The combined response is captured through Mule Transform Message component, where-in we flatten the payload to an array object. The output resultant payload of Scatter-Gather is an Object of Object.


scatter gather test http listener scatter gather flow one route flow 1 flow two route flow 2 flow three route flow 3 transform message

scatter gather test http listener scatter gather flow one flow two flow three transform message result payload of scatter gather dw 2.0 output application json flatten payload payload


Scatter-Gather Output Payload:



The payload size is 3 after the Scatter-Gather processing is complete.


payload size 3 in logger


Result of Scatter-Gather:


The flow has been tested using POSTMAN.


debug payload

The debug process shows that the Scatter-Gather output payload has been aggregated (remember, we have flattened the payload to make it an array object).


aggregated response from postman interface is array of 3 objects

The example shown above is a very simple case of implementing and understanding Scatter-Gather Pattern. In the next post, we shall add more twists with different service call routings and discuss the topic of error handling in case Mule 4 Scatter-Gather. Until then, try to find your reflections of the scattered thoughts.



Oh wait! The quiz? Yes!


The answer to the first quiz is 20 seconds and the second is False; Scatter-Gather doesn’t halt the processes abruptly with an effective error handling in place, indeed it will gather all the route information including the route which has errored and aggregates the results as output. In order to retrieve only successful route payload and ignore errors from Scatter-Gather processing, this can be efficiently handled using Mule 4 error handling process. We shall discuss the Scatter-Gather error handling cases in our next post.


I hope you enjoyed the post. Please subscribe to ProstDev for more exciting topics.


Hasta luego, amigos!



References



GitHub repository






4,523 views2 comments

1 Comment


Alex Martinez
Alex Martinez
Jun 23, 2020

Awesome post, Vishal! Loved the puns :-)

Like
bottom of page