11/18/2016

How to create asynchronous non-blocking queued jobs in WordPress

Hello!

One of the projects that we are currently working on is a WordPress plugin that integrates with the Toronto Real Estate Board to pull new listings from their systems and import them into WordPress as posts.

The mechanics required to connect to TREB are very basic. One might call TREB’s systems out of date. They put their listing data in a downloadable CSV and store all the listing images on an FTP site. This hasn’t changed for years and likely will not change in the near future without some serious technological overhaul.

Because of this restriction, connecting a modern content management system such as WordPress to this information means writing a WordPress plugin that pulls the CSV data, processes it and imports it as a post. The listing images can be retrieved via PHP cURL and stored in the media library. The problem with things like FTP is that you end up waiting a long time for the process to complete.

The problem with waiting in the context of a web server is that the server usually enforces strict execution timeouts (often 30 seconds, sometimes longer or shorter). Since this limit differs from system to system, it poses a problem when trying to retrieve an image over FTP from a notoriously slow server like TREB’s. It can take upwards of 15-20 seconds per file to retrieve!
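To put numbers on this, a rough back-of-the-envelope check helps. The sketch below is only an illustration: the function name treb_fits_in_time_limit is hypothetical, the 30 second limit is an assumed server setting, and the 20 seconds per file is the upper end of the range quoted above.

```php
<?php
/**
 * Rough wall-clock estimate: can $file_count sequential downloads finish
 * before a request timeout of $limit_seconds? A limit of 0 is treated as
 * "no limit", which is how PHP interprets max_execution_time = 0.
 */
function treb_fits_in_time_limit( $file_count, $seconds_per_file, $limit_seconds ) {
        if ( 0 === $limit_seconds ) {
                return true;
        }
        return ( $file_count * $seconds_per_file ) <= $limit_seconds;
}

// One slow image fits in a 30 second window; two already do not.
var_dump( treb_fits_in_time_limit( 1, 20, 30 ) ); // bool(true)
var_dump( treb_fits_in_time_limit( 2, 20, 30 ) ); // bool(false)
```

A listing with even a handful of photos would run past the limit long before the import finishes, which is why the downloads need to leave the web request.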

How would you deal with this type of bottleneck? Well, you would run it as a background process, ideally concurrently with the other processes. In PHP this is not easily possible.

What we found was a very interesting class that allows you to do just that: run asynchronous non-blocking tasks in the background.

PHP Functions that take too long to process will usually fail

Let's look at our PHP function that uses cURL to retrieve images via FTP:

function treb_get_images($remote_url, $remote_user, $remote_pass, $local_file) {
        // Open the local file that cURL will write the download into
        $fp = fopen($local_file, 'w');
        if (false === $fp) {
                trigger_error(sprintf('Could not open %s for writing', $local_file), E_USER_ERROR);
        }
        $ch = curl_init();
        curl_setopt_array( $ch, array(
                CURLOPT_URL => $remote_url,
                // Assumption: the FTP credentials belong here; the original version left them unused
                CURLOPT_USERPWD => $remote_user . ':' . $remote_pass,
                CURLOPT_HEADER => 0,
                CURLOPT_VERBOSE => 0,
                CURLOPT_CONNECTTIMEOUT => 140,
                CURLOPT_TIMEOUT => 300,
                CURLOPT_NOSIGNAL => 1,
                // Set cURL to write to disk (no CURLOPT_RETURNTRANSFER, which would conflict with this)
                CURLOPT_FILE => $fp
                )
        );
        // Execute download
        $response = curl_exec($ch);
        // Capture the error details, then release both handles before reporting
        $error_no = curl_errno($ch);
        $error_msg = curl_error($ch);
        curl_close($ch);
        fclose($fp);
        if (false === $response) {
                trigger_error(sprintf(
                        'Curl failed with error #%d: %s',
                        $error_no, $error_msg),
                        E_USER_ERROR);
        }
}

All the above function does is a simple curl_exec of a remote URL, requesting a remote file. Executing this function in a PHP web process will likely fail and generate a fatal error, something along the lines of “Maximum execution time of 30 seconds exceeded”. To get it to work, you could raise max_execution_time in your web environment's php.ini file to something abnormally high. This is not really recommended, so instead we wrap functions that take too long in the WP Background Processing class.

Import & Use the WordPress Background Process Class

Again, take a look at the GitHub project for WP Background Processing. What we want to do is create a wrapper function that will handle this particular function, along with any other functions that may take a long time to process.

You need to hook into WordPress’ init action and attach the process handler function:

add_action( 'init', 'process_handler' );

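function process_handler() {
        // Instantiate on every request so the background process can register its hooks
        $treb_import = new StdClass;
        $treb_import->treb_import_process = new Treb_Import_Process();

        // Only queue work when called with ?process=treb_images
        if ( isset( $_GET['process'] ) && 'treb_images' === $_GET['process'] ) {
                // Parse date, otherwise assign current date
                if ( ! empty( $_GET['date'] ) ) {
                        $date = explode( '-', sanitize_text_field( wp_unslash( $_GET['date'] ) ) );
                } else {
                        $date = explode( '-', date( 'd-m-Y' ) );
                }
                $treb_data = treb_get_csv( $date );
                $total = count( $treb_data );
                $loop_count = 0;
                foreach ( $treb_data as $item ) {
                        $loop_count++;
                        // Each queued item carries: total count, listing row, position in the batch
                        $item_array = array(
                                $total,
                                $item,
                                $loop_count
                        );
                        // Queue the import
                        $treb_import->treb_import_process->push_to_queue( $item_array );
                }
                // Save the queue and start processing, but only if something was queued
                if ( $loop_count > 0 ) {
                        $treb_import->treb_import_process->save()->dispatch();
                }
        }
}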

What's happening in the above function? The only things you need to worry about are the following lines of code:

$treb_import = new StdClass;
$treb_import->treb_import_process = new Treb_Import_Process();
$treb_import->treb_import_process->push_to_queue($item_array);
$treb_import->treb_import_process->save()->dispatch();

The first two lines initialize the class. The third line pushes an item onto the queue, in this case a row of listing data. The last line saves the queue and dispatches it.
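One detail worth tightening in the handler is the date query parameter, which is split with explode() without checking that it is a real date. A minimal sketch of a stricter parser follows, assuming the d-m-Y format used by the fallback in the handler. The helper name treb_parse_date is hypothetical and not part of our plugin.

```php
<?php
/**
 * Parse a "d-m-Y" string into array( day, month, year ).
 * Falls back to today's date when the input is missing or not a real date.
 * Hypothetical helper, for illustration only.
 */
function treb_parse_date( $raw ) {
        if ( is_string( $raw ) && preg_match( '/^(\d{1,2})-(\d{1,2})-(\d{4})$/', $raw, $m ) ) {
                // checkdate() expects month, day, year in that order
                if ( checkdate( (int) $m[2], (int) $m[1], (int) $m[3] ) ) {
                        return array( $m[1], $m[2], $m[3] );
                }
        }
        return explode( '-', date( 'd-m-Y' ) );
}

print_r( treb_parse_date( '18-11-2016' ) ); // day 18, month 11, year 2016
```

In process_handler() this could stand in for the if/else around explode(), for example $date = treb_parse_date( isset( $_GET['date'] ) ? $_GET['date'] : null );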

The referenced Treb_Import_Process class holds all the functions and tasks that take time, and these are run asynchronously in the background. Hopefully by now you can see the benefits of this. Large batches of jobs can be run safely in the background, and WP Background Processing will run the jobs in small batches until the whole job is complete. Because the work is split up this way, the job as a whole is not bound by server settings such as max_execution_time.
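For readers who have not seen the library, here is a minimal sketch of what a class like Treb_Import_Process can look like. It follows the pattern from the WP Background Processing README (extend WP_Background_Process, set $action, implement task()), but the action name, the body of task(), the 'mls' array key and the treb_import_item() helper are assumptions for illustration and not our actual plugin code. The small stand-in base class exists only so the snippet runs outside WordPress. It processes the queue inline, whereas the real library stores the batch and works through it in a separate, non-blocking request.

```php
<?php
// Stand-in so the sketch runs without WordPress. The real base class ships
// with the WP Background Processing library and works asynchronously.
if ( ! class_exists( 'WP_Background_Process' ) ) {
        abstract class WP_Background_Process {
                protected $action = 'background_process';
                protected $data   = array();

                public function push_to_queue( $data ) {
                        $this->data[] = $data;
                        return $this;
                }

                public function save() {
                        return $this; // The real library persists the batch here.
                }

                public function dispatch() {
                        // The real library fires a non-blocking request instead of looping inline.
                        foreach ( $this->data as $item ) {
                                $this->task( $item );
                        }
                        $this->data = array();
                        $this->complete();
                }

                abstract protected function task( $item );

                protected function complete() {}
        }
}

// Hypothetical helper: stands in for the slow work (image download, post insert).
function treb_import_item( $listing ) {
        return 'imported ' . $listing['mls'];
}

class Treb_Import_Process extends WP_Background_Process {
        // Unique name for this queue (assumed value).
        protected $action = 'treb_import_process';

        // Collected here only so the sketch has something to print.
        public $log = array();

        // Called once per queued item; $item is array( total, listing, position ).
        protected function task( $item ) {
                list( $total, $listing, $position ) = $item;
                $this->log[] = sprintf( '%d/%d %s', $position, $total, treb_import_item( $listing ) );
                return false; // Returning false tells the library the item is finished.
        }

        // Called once the queue is empty.
        protected function complete() {
                parent::complete();
                $this->log[] = 'done';
        }
}

$process = new Treb_Import_Process();
$process->push_to_queue( array( 2, array( 'mls' => 'A1' ), 1 ) );
$process->push_to_queue( array( 2, array( 'mls' => 'B2' ), 2 ) );
$process->save()->dispatch();
print_r( $process->log );
```

With the real library the log property would not be filled in the dispatching request, since task() runs later in the background. In practice you would write progress to the database or an error log instead.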

I hope you find this helpful! Eventually we will be releasing the TREB WordPress plugin to the WordPress community. For now, you can view our GitHub project to see the full code examples illustrated above.

Frequently Asked Questions

What is the purpose of the WordPress plugin discussed in the post?
Integration with TREB
The plugin is designed to integrate with the Toronto Real Estate Board (TREB) to pull new real estate listings and import them into WordPress as posts.

Why is it necessary to run tasks asynchronously in WordPress?
Avoiding execution time limits
Asynchronous processing is necessary to avoid execution time limits imposed by web servers. Tasks such as retrieving images from a slow FTP server can take longer than the allowed execution time, which can lead to failures.

What is the WP Background Processing class?
Handling background tasks
The WP Background Processing class is a library that allows you to run long-running tasks in the background without blocking the main execution flow. It helps manage queued jobs and processes them in smaller batches.

How does the cURL function for retrieving images work?
Retrieving images via FTP
The function initializes a cURL session to download images from a remote FTP server. It sets various options for connection timeout and file handling, but can fail if the download exceeds the maximum execution time.

What happens if a PHP function takes too long to execute?
Potential errors
If a PHP function exceeds the maximum execution time set in the server configuration, it will generate a fatal error, which can disrupt the entire process. This is particularly problematic for tasks that involve slow external resources.

How do you initiate the background process in WordPress?
Using the init hook
To initiate the background process, you need to hook into WordPress’ init action and call a function that sets up the background processing class and queues the tasks.

What is the significance of the push_to_queue method?
Queuing tasks for processing
The push_to_queue method is used to add tasks to the processing queue. Each item that needs to be processed is added to the queue, allowing the background process to handle them sequentially.

What do the save() and dispatch() methods do?
Finalizing the queue
The save() method stores the current state of the queue, while the dispatch() method triggers the processing of the queued tasks. Together, they ensure that the queued jobs are executed properly.

Can I adjust the execution time settings in PHP?
Not recommended for long tasks
While you can adjust the max_execution_time setting in the php.ini file, it is not recommended for long-running tasks. Using background processing is a more efficient and safer approach.

Where can I find the full code examples for the plugin?
Accessing the GitHub repository
The full code examples for the plugin can be found in the GitHub repository linked in the post. This repository contains all the relevant code and documentation for implementing the plugin.

What are the benefits of using the WP Background Processing class?
Efficient task management
Using the WP Background Processing class allows for efficient management of long-running tasks without blocking the main execution. It enables the processing of large batches of jobs independently of server execution limits.

Is this plugin available for public use?
Future release plans
The plugin is currently under development and will be released to the WordPress community in the future. Updates will be provided as the project progresses.