TimeGPT on Ray

Ray is an open-source unified compute framework that helps scale Python workloads for distributed computing. In this tutorial, you will learn how to distribute TimeGPT forecasting jobs on top of Ray.

This guide uses Fugue to easily run code across various distributed computing frameworks, including Ray.

Overview

Key Concepts

Below is an outline of what we’ll cover:

Installation
Load Your Data
Initialize Ray
Use TimeGPT on Ray
Shutdown Ray

1. Installation

Install Ray using Fugue. Fugue provides an easy-to-use interface for distributed computation. It lets you run Python code on several distributed computing frameworks, including Ray.

Fugue Ray Installation
pip install fugue[ray]

When executing on a distributed Ray cluster, ensure the nixtla library is installed on all workers.

2. Load Your Data

Load your dataset into a pandas DataFrame. This tutorial uses hourly electricity prices from various markets:

Load Dataset Example
import pandas as pd

df = pd.read_csv(
    'https://raw.githubusercontent.com/Nixtla/transfer-learning-time-series/main/datasets/electricity-short.csv',
    parse_dates=['ds'],
)
df.head()

Preview of the first few rows of data

3. Initialize Ray

Here, we’re spinning up a Ray cluster locally by creating a head node. You can scale this to multiple machines in a real cluster environment.

Ray Cluster Initialization
import ray
from ray.cluster_utils import Cluster

ray_cluster = Cluster(
    initialize_head=True,
    head_node_args={"num_cpus": 2}
)

ray.init(address=ray_cluster.address, ignore_reinit_error=True)

# Convert your DataFrame to Ray format:
ray_df = ray.data.from_pandas(df)
ray_df

Ray Initialization Logs

Log Output

4. Use TimeGPT on Ray

With Ray, you can run TimeGPT similar to a standard (non-distributed) local environment. Operations such as forecast still apply directly to Ray Dataset objects.

Instantiating NixtlaClient

Begin by creating a NixtlaClient. Replace my_api_key_provided_by_nixtla with your own API key.

NixtlaClient Initialization
from nixtla import NixtlaClient

nixtla_client = NixtlaClient(
  api_key='my_api_key_provided_by_nixtla'
)

If you prefer using an Azure AI endpoint, specify the base_url and api_key for Azure.

NixtlaClient Azure Setup
nixtla_client = NixtlaClient(
  base_url="your azure ai endpoint",
  api_key="your api_key"
)

Making a Forecast

TimeGPT Forecasting on Ray
%%capture
fcst_df = nixtla_client.forecast(ray_df, h=12)

Public API models supported include timegpt-1 (default) and timegpt-1-long-horizon.

Inspect the forecast results by converting to a pandas DataFrame:

Inspect Forecast Results
fcst_df.to_pandas().tail()

Cross-validation with TimeGPT

You can also perform cross-validation on Ray. The following sample code performs a cross-validation procedure using rolling windows:

TimeGPT Cross-validation on Ray
%%capture
cv_df = nixtla_client.cross_validation(
ray_df, 
h=12, 
freq='H', 
n_windows=5, 
step_size=2
)

After computation, convert cv_df to pandas to view the results:

Inspect Cross-validation Results
cv_df.to_pandas().tail()

Exogenous Variables

5. Shutdown Ray

Always shut down Ray after you finish your tasks to free up resources.

Shutdown Ray Example
ray.shutdown()

Congratulations! You’ve successfully used TimeGPT on Ray for distributed forecasting.

QUICK START

GETTING STARTED

CAPABILITIES

DEPLOYMENT

TUTORIALS

USE CASES

REFERENCE

About

Ray

TimeGPT on Ray

Overview

Ray

Fugue

TimeGPT

QUICK START

GETTING STARTED

CAPABILITIES

DEPLOYMENT

TUTORIALS

USE CASES

REFERENCE

About

​TimeGPT on Ray

​Overview

TimeGPT on Ray

Overview