Cron

Learn what Cron is and how we can use it for scheduling.

A common requirement for model pipelines is running a task at a regular frequency, such as every day or every hour.

What is cron?

Cron is a utility that provides scheduling functionality for machines running the Linux operating system. You can set up a scheduled task using the crontab utility and assign a cron expression that defines how frequently to run the command. Cron jobs run directly on the machine where cron is utilized and can make use of the runtimes and libraries installed on the system.

There are a number of challenges with using cron in production-grade systems, but it’s a great way to get started with scheduling a small number of tasks and it’s good to learn the cron expression syntax that is used in many scheduling systems. The main issue with the cron utility is that it runs on a single machine, and does not natively integrate with tools such as version control. If your machine goes down, then you’ll need to recreate your environment and update your cron table on a new machine.

Get hands-on with 1200+ tech skills courses.