Mastering Laravel job batching Pt.2: Job batching and queue

Batches are just a bunch of jobs

This article is 2nd of the 4-parts series: Mastering Laravel batches

In this chapter, we'll have a deeper look on how batches are managed in queues.
In order to have a better view on this topic, we'll use Redis queues and Telescope. In fact, we already use Redis but the combination of the two tools will make clear one important fact about batches.

Install Telescope

Telescope is a monitoring tool which allow to get valuable information about a running Laravel application. More info at: https://laravel.com/docs/11.x/telescope

Installation is straight forward, just follow these steps:

# Install dependency
sail composer require laravel/telescope

# Publish assets and migration
sail artisan telescope:install

# Create required tables
sail artisan migrate

You should be able to navigate to : http://localhost/telescope

Use redis queues (if not already)

If it's not already the case, check your .env file and ensure that the queue connection environment variable is set to redis:

QUEUE_CONNECTION=redis

Restart the queue:

sail artisan queue:restart

If you're not familiar with queues, note that they are long running process, then if you want code updates to be taken in account, you have to restart the queues with the above command.

Run a batch and inspect what happens

Let's review our ExampleJob and Batcher command in order to keep thing without tweaks or errors.
First, the job:

# app/Jobs/ExampleJob.php

// ...
class ExampleJob implements ShouldQueue
{
    use Batchable, Dispatchable, InteractsWithQueue, Queueable, SerializesModels;

    /**
     * Create a new job instance.
     */
    public function __construct(public int $number)
    {
    }

    /**
     * Execute the job.
     */
    public function handle(): void
    {
        // If batch is cancelled, stop execution
        if ($this->batch()->cancelled()) {
            Log::info(sprintf('%s [%d] CANCELLED: Do nothing', class_basename($this), $this->number));
            return;
        }

        // throw_if($this->number == 3, new Exception('I fail because 3.'));
        Log::info(sprintf('%s [%d] RAN', class_basename($this), $this->number));
    }
}

And our command:

#

//...
class Batcher extends Command
{
    /**
     * The name and signature of the console command.
     *
     * @var string
     */
    protected $signature = 'batcher
                            {--cancel= : Cancel batch with the given id}';

    /**
     * The console command description.
     *
     * @var string
     */
    protected $description = 'Batcher';

    /**
     * Execute the console command.
     */
    public function handle()
    {
        if ($this->option('cancel')) {
            $this->cancelBatch($this->option('cancel'));
        } else {
            $this->runExampleBatch();
        }
    }

    /**
     * Cancel the batch with the given id
     *
     * @param string $batch_id
     * @return void
     */
    private function cancelBatch(string $batch_id): void
    {
        // Find the batch for the given id
        $batch = Bus::findBatch($batch_id);
        // Cancel the batch
        $batch->cancel();
    }

    /**
     * Run a batch of ExampleJobs
     *
     * @return void
     */
    private function runExampleBatch(): void
    {
        Bus::batch(
            Arr::map(
                range(1, 5),
                fn ($number) => new ExampleJob($number)
            )
        )
            ->before(fn (Batch $batch) => Log::info(sprintf('Batch [%s] created.', $batch->id)))
            ->catch(function (Batch $batch, Throwable $e) {
                Log::error(sprintf('Batch [%s] failed with error [%s].', $batch->id, $e->getMessage()));
            })
            ->then(fn (Batch $batch) => Log::info(sprintf('Batch [%s] ended.', $batch->id)))
            ->progress(fn (Batch $batch) =>
            Log::info(sprintf(
                'Batch [%s] progress : %d/%d [%d%%]',
                $batch->id,
                $batch->processedJobs(),
                $batch->totalJobs,
                $batch->progress()
            )))
            ->finally(fn (Batch $batch) => Log::info(sprintf('Batch [%s] finally ended.', $batch->id)))
            ->allowFailures()
            ->dispatch();
    }
}

Now that all configuration is done and file are in order, we can restart the queue:

sail artisan queue:restart 
sail artisan queue:work

And run our command:

sail artisan batcher

Now if we navigate to the Batches tab on the telescope page, we should see our batch:

If your batch is in status Pending, it is due to the fact that the queue is not running. you can run it using one of the following command:

sail artisan queue:work

# OR
sail artisan queue:listen

For our usage, they will behave in a similar way.
You can also check the jobs in the Jobs tab, checking that all 5 jobs ran.

But our attention will be on the Redis tab:

Here we can see what happened in the redis queue:

  • The eval -- Push the job onto the queue lines tell us when a job in pushed to the queue

  • The zrem queues:default:reserved... lines inform about when a job is popped from the queue for execution.

We can notice one very important thing about batches: when a batch is dispatched, all its jobs are pushed to the queue then they are executed.

What makes this point important will be clear with the next test: what happens if we dispatch 2 batches consecutively? Let's find out! First, we create a new job which is basically a copy of ExampleJob, we just change its name:

# app/Jobs/OtherJob.php

// ...
class OtherJob implements ShouldQueue
{
    use Batchable, Dispatchable, InteractsWithQueue, Queueable, SerializesModels;

    /**
     * Create a new job instance.
     */
    public function __construct(public int $number)
    {
    }

    /**
     * Execute the job.
     */
    public function handle(): void
    {
        // If batch is cancelled, stop execution
        if ($this->batch()->cancelled()) {
            Log::info(sprintf('%s [%d] CANCELLED: Do nothing', class_basename($this), $this->number));
            return;
        }

        // throw_if($this->number == 3, new Exception('I fail because 3.'));
        Log::info(sprintf('%s [%d] RAN', class_basename($this), $this->number));
    }
}

And we update our Batcher command to run a batch for each job type:

#

//...
class Batcher extends Command
{
    //...

    /**
     * Execute the console command.
     */
    public function handle()
    {
        if ($this->option('cancel')) {
            $this->cancelBatch($this->option('cancel'));
        } else {
            $this->dispatchBatch(ExampleJob::class);
            $this->dispatchBatch(OtherJob::class);
        }
    }

    // ...

    /**
     * Run a batch of jobs
     *
     * @return void
     */
    private function dispatchBatch(string $job_class_name): void
    {
        Bus::batch(
            Arr::map(
                range(1, 5),
                fn ($number) => new $job_class_name($number)
            )
        )
        //...
    }
}

Keep in mind that a batch is dispatched, meaning that it will run asynchronously.
We expect that both batch run at the same time or at least that their respective jobs runs alternatively.

Restart the queue to have updates taken in account:

sail artisan queue:restart
sail artisan queue:work

And run our command:

sail artisan batcher

And now check telescope:

We can see that all jobs are pushed to queue before being executed.
And if we check our logs, we see the following:

[...] local.INFO: Batch [9c253dac-2604-4cf0-acb0-d6d3a00775ad] created.  
[...] local.INFO: Batch [9c253dac-43d7-42b6-9d88-bf497e0ed2d9] created.  
[...] local.INFO: ExampleJob [1] RAN  
[...] local.INFO: Batch [9c253dac-2604-4cf0-acb0-d6d3a00775ad] progress : 1/5 [20%]  
[...] local.INFO: ExampleJob [2] RAN  
[...] local.INFO: Batch [9c253dac-2604-4cf0-acb0-d6d3a00775ad] progress : 2/5 [40%]  
[...] local.INFO: ExampleJob [3] RAN  
[...] local.INFO: Batch [9c253dac-2604-4cf0-acb0-d6d3a00775ad] progress : 3/5 [60%]  
[...] local.INFO: ExampleJob [4] RAN  
[...] local.INFO: Batch [9c253dac-2604-4cf0-acb0-d6d3a00775ad] progress : 4/5 [80%]  
[...] local.INFO: ExampleJob [5] RAN  
[...] local.INFO: Batch [9c253dac-2604-4cf0-acb0-d6d3a00775ad] progress : 5/5 [100%]  
[...] local.INFO: Batch [9c253dac-2604-4cf0-acb0-d6d3a00775ad] ended.  
[...] local.INFO: Batch [9c253dac-2604-4cf0-acb0-d6d3a00775ad] finally ended.  
[...] local.INFO: OtherJob [1] RAN  
[...] local.INFO: Batch [9c253dac-43d7-42b6-9d88-bf497e0ed2d9] progress : 1/5 [20%]  
[...] local.INFO: OtherJob [2] RAN  
[...] local.INFO: Batch [9c253dac-43d7-42b6-9d88-bf497e0ed2d9] progress : 2/5 [40%]  
[...] local.INFO: OtherJob [3] RAN  
[...] local.INFO: Batch [9c253dac-43d7-42b6-9d88-bf497e0ed2d9] progress : 3/5 [60%]  
[...] local.INFO: OtherJob [4] RAN  
[...] local.INFO: Batch [9c253dac-43d7-42b6-9d88-bf497e0ed2d9] progress : 4/5 [80%]  
[...] local.INFO: OtherJob [5] RAN  
[...] local.INFO: Batch [9c253dac-43d7-42b6-9d88-bf497e0ed2d9] progress : 5/5 [100%]  
[...] local.INFO: Batch [9c253dac-43d7-42b6-9d88-bf497e0ed2d9] ended.  
[...] local.INFO: Batch [9c253dac-43d7-42b6-9d88-bf497e0ed2d9] finally ended.

First batch runs completely before the second one can run its first job!

If we sum up, when a batch is dispatched, all its jobs are pushed to the queue. This means that a batch must run all its jobs before the queue can handle any other job.
Depending on your worker count, number of queues and number of jobs in batch, this can lead to serious delay in job executions...

What can we do to alleviate this?

Laravel chained jobs

What we're after is a way to tell the batch to treat the job one after another without pushing all the list to the queue.
One way to achieve this is to use chained jobs inside our batch. If instead of feeding the batch like this:

# app/Console/Commands/Batcher.php

class Batcher extends Command
{
  //...

    private function dispatchBatch(string $job_class_name): void
    {
        Bus::batch(
            Arr::map(
                range(1, 5),
                fn ($number) => new $job_class_name($number)
            )
        )
          //...
    }
}

we can wrap the jobs in array like this:

# app/Console/Commands/Batcher.php

class Batcher extends Command
{
  //...

    private function dispatchBatch(): void
    {
        Bus::batch(
          [
           Arr::map(
                range(1, 5),
                fn ($number) => new $job_class_name($number)
            )
          ]
        )
    }
}

Here, all jobs will run as a chain and because chained jobs are run (and pushed to queue) one after another, we should expect jobs of our different batches to be mixed.

Run our command:

sail artisan batcher

Now let's check the logs, they are as expected:

[...] local.INFO: Batch [9c260075-92a1-44d3-aba0-a3a62c3738bd] created.  
[...] local.INFO: Batch [9c260075-a927-49b2-81e4-6226eb13644b] created.  
[...] local.INFO: ExampleJob [1] RAN  
[...] local.INFO: Batch [9c260075-92a1-44d3-aba0-a3a62c3738bd] progress : 1/5 [20%]  
[...] local.INFO: OtherJob [1] RAN  
[...] local.INFO: Batch [9c260075-a927-49b2-81e4-6226eb13644b] progress : 1/5 [20%]  
[...] local.INFO: ExampleJob [2] RAN  
[...] local.INFO: Batch [9c260075-92a1-44d3-aba0-a3a62c3738bd] progress : 2/5 [40%]  
[...] local.INFO: OtherJob [2] RAN  
[...] local.INFO: Batch [9c260075-a927-49b2-81e4-6226eb13644b] progress : 2/5 [40%]  
[...] local.INFO: ExampleJob [3] RAN  
[...] local.INFO: Batch [9c260075-92a1-44d3-aba0-a3a62c3738bd] progress : 3/5 [60%]  
[...] local.INFO: OtherJob [3] RAN  
[...] local.INFO: Batch [9c260075-a927-49b2-81e4-6226eb13644b] progress : 3/5 [60%]  
[...] local.INFO: ExampleJob [4] RAN  
[...] local.INFO: Batch [9c260075-92a1-44d3-aba0-a3a62c3738bd] progress : 4/5 [80%]  
[...] local.INFO: OtherJob [4] RAN  
[...] local.INFO: Batch [9c260075-a927-49b2-81e4-6226eb13644b] progress : 4/5 [80%]  
[...] local.INFO: ExampleJob [5] RAN  
[...] local.INFO: Batch [9c260075-92a1-44d3-aba0-a3a62c3738bd] progress : 5/5 [100%]  
[...] local.INFO: Batch [9c260075-92a1-44d3-aba0-a3a62c3738bd] ended.  
[...] local.INFO: Batch [9c260075-92a1-44d3-aba0-a3a62c3738bd] finally ended.  
[...] local.INFO: OtherJob [5] RAN  
[...] local.INFO: Batch [9c260075-a927-49b2-81e4-6226eb13644b] progress : 5/5 [100%]  
[...] local.INFO: Batch [9c260075-a927-49b2-81e4-6226eb13644b] ended.  
[...] local.INFO: Batch [9c260075-a927-49b2-81e4-6226eb13644b] finally ended.

The drawback of this solution is in the chained jobs themselves.
If any job fails, the chain stops and pending jobs are not executed.
In fact, this is what they are designed for, running dependent jobs then it is nice that they stops when a job fails.

Let's update our ExampleJob to have it throw an Exception to examplify this:

# app/Jobs/ExampleJob.php

// ...
class ExampleJob implements ShouldQueue
{
    //...

    /**
     * Execute the job.
     */
    public function handle(): void
    {
        // ...

        throw_if($this->number == 3, new Exception('I fail because 3.'));
        Log::info(sprintf('%s [%d] RAN', class_basename($this), $this->number));
    }
}

Restart and run the queue:

sail artisan queue:restart && sail artisan queue:work

And run the command:

sail artisan batcher

And check the logs:

[...] local.INFO: Batch [9c2601fd-d351-4195-b1a1-4c1c0d6fc901] created.  
[...] local.INFO: Batch [9c2601fd-ea5d-4772-9eaf-792f34516eb8] created.  
[...] local.INFO: ExampleJob [1] RAN  
[...] local.INFO: Batch [9c2601fd-d351-4195-b1a1-4c1c0d6fc901] progress : 1/5 [20%]  
[...] local.INFO: OtherJob [1] RAN  
[...] local.INFO: Batch [9c2601fd-ea5d-4772-9eaf-792f34516eb8] progress : 1/5 [20%]  
[...] local.INFO: ExampleJob [2] RAN  
[...] local.INFO: Batch [9c2601fd-d351-4195-b1a1-4c1c0d6fc901] progress : 2/5 [40%]  
[...] local.INFO: OtherJob [2] RAN  
[...] local.INFO: Batch [9c2601fd-ea5d-4772-9eaf-792f34516eb8] progress : 2/5 [40%]  
[...] local.INFO: Batch [9c2601fd-d351-4195-b1a1-4c1c0d6fc901] progress : 2/5 [40%]  
[...] local.ERROR: Batch [9c2601fd-d351-4195-b1a1-4c1c0d6fc901] failed with error [I fail because 3.].  
[...] local.ERROR: I fail because 3. {"exception":"[object] (Exception(code: 0): I fail because 3. at /var/www/html/app/Jobs/ExampleJob.php:36)
...
[...] local.INFO: OtherJob [3] RAN  
[...] local.INFO: Batch [9c2601fd-ea5d-4772-9eaf-792f34516eb8] progress : 3/5 [60%]  
[...] local.INFO: OtherJob [4] RAN  
[...] local.INFO: Batch [9c2601fd-ea5d-4772-9eaf-792f34516eb8] progress : 4/5 [80%]  
[...] local.INFO: OtherJob [5] RAN  
[...] local.INFO: Batch [9c2601fd-ea5d-4772-9eaf-792f34516eb8] progress : 5/5 [100%]  
[...] local.INFO: Batch [9c2601fd-ea5d-4772-9eaf-792f34516eb8] ended.  
[...] local.INFO: Batch [9c2601fd-ea5d-4772-9eaf-792f34516eb8] finally ended.

Because when a job fails in a chain the chain stops, the 2 last job weren't ran at all, meaning that the batch didn't end in any case. Even finally callback is of no help here as not all the jobs are ran at least once.
Another point to keep in mind is that batch progression is based on the number of jobs in the batch, being added atomically or many at a times with the help of a chain.

Regarding our issue, we still need to find a solution!

Laravel "progressive" batches

In fact, the neatest solution lies in a feature offered by batch itself.
It is possible to use what we could called "progressive batch": start a batch with only one job and add another job to the batch when the previous one has finished. Laravel documentation says:

Sometimes it may be useful to add additional jobs to a batch from within a batched job. This pattern can be useful when you need to batch thousands of jobs which may take too long to dispatch during a web request.

We can update this statement to make our own one:

Sometimes it may be useful to add additional jobs to a batch from within a batched job. This pattern can be useful when you need to batch thousands of jobs without blocking a queue with this one batch and still allow other batches to run.

Let's see if this works! First, we need to update our ExampleJob to have it add a new job to the batch. This can be done using $this->batch()->add(...) like this:

# app/Jobs/ExampleJob.php

class ExampleJob implements ShouldQueue
{
    // ...

    public function handle(): void
    {
        //...

        // throw_if($this->number == 3, new Exception('I fail because 3.'));
        Log::info(sprintf('%s [%d] RAN', class_basename($this), $this->number));

        // If job is not the fifth one, add a job to the batch
        if ($this->number < 5) {
            $this->batch()->add([new static(++$this->number)]);
        }
    }
}

Do the same for our OtherJob.
Now, in our Batcher, we can dispatch the batch with only one job and the initial data:

# app/Console/Commands/Batcher.php

class Batcher extends Command
{
  // ...

    private function dispatchBatch(string $job_class_name): void
    {
        Bus::batch([new $job_class_name(1)])
            // ... 
    }
}

What will happen is the following:

  • Batcher will create the batch with only one job

  • The batch will push its only job to queue

  • Job will be popped from queue and executed

  • At the handle end, job will check if it is not the last of the batch (number < 5) :

    • if no: number is incremented and a new job is pushed to the queue

    • if yes: no new job is pushed and batch ends

Let's try to run the command. Restart the queue to have updates taken in account:

sail artisan queue:restart && sail artisan queue:work

And run our command:

sail artisan batcher

Now let's check the logs:

[...] local.INFO: Batch [9c260af9-4b69-456e-bb25-1fa1405dd835] created.  
[...] local.INFO: Batch [9c260af9-65bb-4e7f-9e79-732308aec38d] created.  
[...] local.INFO: ExampleJob [1] RAN  
[...] local.INFO: Batch [9c260af9-4b69-456e-bb25-1fa1405dd835] progress : 1/2 [50%]  
[...] local.INFO: OtherJob [1] RAN  
[...] local.INFO: Batch [9c260af9-65bb-4e7f-9e79-732308aec38d] progress : 1/2 [50%]  
[...] local.INFO: ExampleJob [2] RAN  
[...] local.INFO: Batch [9c260af9-4b69-456e-bb25-1fa1405dd835] progress : 2/3 [67%]  
[...] local.INFO: OtherJob [2] RAN  
[...] local.INFO: Batch [9c260af9-65bb-4e7f-9e79-732308aec38d] progress : 2/3 [67%]  
[...] local.INFO: ExampleJob [3] RAN  
[...] local.INFO: Batch [9c260af9-4b69-456e-bb25-1fa1405dd835] progress : 3/4 [75%]  
[...] local.INFO: OtherJob [3] RAN  
[...] local.INFO: Batch [9c260af9-65bb-4e7f-9e79-732308aec38d] progress : 3/4 [75%]  
[...] local.INFO: ExampleJob [4] RAN  
[...] local.INFO: Batch [9c260af9-4b69-456e-bb25-1fa1405dd835] progress : 4/5 [80%]  
[...] local.INFO: OtherJob [4] RAN  
[...] local.INFO: Batch [9c260af9-65bb-4e7f-9e79-732308aec38d] progress : 4/5 [80%]  
[...] local.INFO: ExampleJob [5] RAN  
[...] local.INFO: Batch [9c260af9-4b69-456e-bb25-1fa1405dd835] progress : 5/5 [100%]  
[...] local.INFO: Batch [9c260af9-4b69-456e-bb25-1fa1405dd835] ended.  
[...] local.INFO: Batch [9c260af9-4b69-456e-bb25-1fa1405dd835] finally ended.  
[...] local.INFO: OtherJob [5] RAN  
[...] local.INFO: Batch [9c260af9-65bb-4e7f-9e79-732308aec38d] progress : 5/5 [100%]  
[...] local.INFO: Batch [9c260af9-65bb-4e7f-9e79-732308aec38d] ended.  
[...] local.INFO: Batch [9c260af9-65bb-4e7f-9e79-732308aec38d] finally ended.

Jobs are intertwined and this is what we want.

The solution works, however there is several points that needs a particular attention:

  • batch progress is not reliable anymore: Because we add jobs one by one, totalJobs and pendingJobs does not reflect the reality of the work to do and progression is more a note on whether the batch ends than a real view what is done.

  • Progressive batch can leads to infinite loop: because they are a kind of while loop on the queue, we need to be very careful with the condition to fulfill to add a new job to the queue and the code creating the new job itself. To experiment this, just try to use $this->number++ instead of ++$this->number: increment will happen after job is created and you'll end up in a infinite loop

  • when complex condition are needed of when treating complex list, an external queue may be required in order to push further jobs to queue with the required arguments. For example, it may not be a good idea to have a list of 5000 emails as an argument, storing them in database or as a queue in Redis is preferable.

Why using Laravel batches anyway?

We covered all the features of the Laravel batches we can find in documentation. But as always with Laravel, there is more to find if you look at the framework code, which I encourage you to do. Reading Laravel code will make you write more "Laravelic" code by grasping the core philosophy and you will have access to many of those features that makes Laravel extensible.

Considering batches, what we learn?
Mostly, we learn that they require a particular attention when used and that there is drawbacks in using them in almost every situations.
Then, why bother using them anyway? Answer is simple, it is a beautiful abstraction and provides a clean interface for common problem. Yes, we need to code what can be seen as workaround, but it is nice to be able to just write this:

Bus::batch([new $job_class_name(1)])
    ])->before(function (Batch $batch) {
    // The batch has been created but no jobs have been added...
    })->progress(function (Batch $batch) {
        // A single job has completed successfully...
    })->then(function (Batch $batch) {
        // All jobs completed successfully...
    })->catch(function (Batch $batch, Throwable $e) {
        // First batch job failure detected...
    })->finally(function (Batch $batch) {
        // The batch has finished executing...
    })->dispatch();
    // Allow batch to fail without being cancelled
    ->allowFailures()
    ->dispatch();

And don't forget the ability to cancel a batch at any moment, to retry failed jobs.

High level is simple and clear and complexity can be hidden into more low level parts. Additionally, the responsibility are clearly expressed:

  • the job does only what it is supposed + launching other jobs (and this can done by a specific method)

  • one function to do work before batch starts

  • one function to do work when everything is done

  • etc.

Sometimes, it's not about the amount of code we write, buthow we can re-use it, read it and maintain it. And laravel batches are a good basis.

What's next

In the next chapter we'll build our own system on top on Laravel batches to fit a specific use case.
And we'll see how powerful batches are indeed.