Dask

Run TimeGPT in a distributed manner using Dask.

Dask is an open-source parallel computing library for Python. This guide explains how to use TimeGPT from Nixtla with Dask for distributed forecasting tasks.

Before proceeding, make sure you have an API key from Nixtla.

Highlights

• Simplify distributed computing with Fugue.
• Run TimeGPT at scale on a Dask cluster.
• Seamlessly convert pandas DataFrames to Dask.

Outline

Step 1: Installation

Install Fugue and Dask

Fugue provides an easy-to-use interface for distributed computing over frameworks like Dask.

You can install fugue with:

Install Fugue and Dask
pip install fugue[dask]

If running on a distributed Dask cluster, ensure the nixtla library is installed on all worker nodes.

Step 2: Load Your Data

You can start by loading data into a pandas DataFrame. In this example, we use hourly electricity prices from multiple markets:

Load Electricity Data
import pandas as pd

df = pd.read_csv(
    'https://raw.githubusercontent.com/Nixtla/transfer-learning-time-series/main/datasets/electricity-short.csv',
    parse_dates=['ds'],
)
df.head()

Example pandas DataFrame:

Step 3: Import Dask

Convert the pandas DataFrame into a Dask DataFrame for parallel processing.

Convert to Dask DataFrame
import dask.dataframe as dd

dask_df = dd.from_pandas(df, npartitions=2)
dask_df

When converting to a Dask DataFrame, you can specify the number of partitions based on your data size or system resources.

Step 4: Use TimeGPT on Dask

To use TimeGPT with Dask, provide a Dask DataFrame to Nixtla’s client methods instead of a pandas DataFrame.

Important Concept: NixtlaClient

Instantiate the NixtlaClient class to interact with Nixtla’s API.

Initialize NixtlaClient
from nixtla import NixtlaClient

nixtla_client = NixtlaClient(
    api_key='my_api_key_provided_by_nixtla'
)

Using an Azure AI endpoint

You can use any method from the NixtlaClient, such as forecast or cross_validation.

Forecast with TimeGPT and Dask
fcst_df = nixtla_client.forecast(dask_df, h=12)
fcst_df.compute().head()

Forecast with TimeGPT and Dask
fcst_df = nixtla_client.forecast(dask_df, h=12)
fcst_df.compute().head()

Cross-validation with TimeGPT and Dask
cv_df = nixtla_client.cross_validation(
    dask_df,
    h=12,
    n_windows=5,
    step_size=2
)
cv_df.compute().head()

Azure AI Models

When using an Azure AI endpoint, set model to "azureai":

Azure AI Model Usage
nixtla_client.forecast(..., model="azureai")

For the public API, two models are available:
• timegpt-1 (default)
• timegpt-1-long-horizon

See the Long Horizon Forecasting Tutorial for details on timegpt-1-long-horizon.

TimeGPT with Dask also supports exogenous variables. Refer to the Exogenous Variables Tutorial for details. Substitute pandas DataFrames with Dask DataFrames as needed.

QUICK START

GETTING STARTED

CAPABILITIES

DEPLOYMENT

TUTORIALS

USE CASES

REFERENCE

About

Highlights

Outline

Important Concept: NixtlaClient

QUICK START

GETTING STARTED

CAPABILITIES

DEPLOYMENT

TUTORIALS

USE CASES

REFERENCE

About

Highlights

​Outline

Important Concept: NixtlaClient

Outline