Iris with preprocessing: ship transforms with the model

In tutorial 01 the client sent a JSON array of features. In production those features usually need normalisation, scaling, or encoding before they hit the model. The naive approach is to do that work in the client, but then every consumer has to ship the same preprocessing logic and stay in sync with the model.

Edgeflow’s answer: bake the preprocessing into the deployment artifact itself, as a WASM pre-transform. The client keeps sending the same request; the inference server runs the transform inside the pipeline, just before the model. Hot-swap a new model with new normalisation parameters and the client never knows.

You will:

Train a LogisticRegression on z-scored iris features.
Attach a Normalize WASM pre-transform with the per-feature mean and std baked in.
Send the same JSON array as in tutorial 01 and get the right answer.

Prerequisites

Tutorial 01 working, or at least edgeflow up via docker compose.
Python 3.12+ and uv.

1. Bring up edgeflow

Same as tutorial 01. If the stack is already running, skip ahead.

curl -O https://raw.githubusercontent.com/jordandelbar/edgeflow/main/deploy/quickstart.yaml
docker compose -f quickstart.yaml up -d

2. Train with preprocessing baked in

curl -O https://raw.githubusercontent.com/jordandelbar/edgeflow/main/examples/02-iris-with-preprocessing/train.py
uv run train.py

The script computes per-feature mean and std on the training set, trains a LogisticRegression on z-scored features, and pushes the model along with an edgeflow.Normalize(mean=..., std=...) pre-transform. The relevant call:

edgeflow.log_model(
    model_bytes=sklearn_to_onnx(clf),
    preprocess=edgeflow.Pipeline(
        [
            edgeflow.Normalize(mean=mean, std=std),
        ]
    ),
    postprocess=edgeflow.Pipeline(
        [
            edgeflow.ClassifierOutput(labels=list(iris.target_names)),
        ]
    ),
)

Expected output:

feature mean: [5.84, 3.05, 3.74, 1.20]
feature std:  [0.83, 0.43, 1.77, 0.76]
training on z-scored features...
accuracy: 0.9667
pushing to edgeflow at http://localhost:5000...
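
The z-scoring step can be sketched in plain Python as follows. This is a minimal illustration, not the contents of train.py: the four sample rows, the helper names column_stats and z_score, and the use of the population standard deviation are all assumptions made for the example.

```python
from statistics import fmean, pstdev

# A few iris rows (sepal length, sepal width, petal length, petal width).
# Hypothetical stand-in for the training split used by train.py.
rows = [
    [5.1, 3.5, 1.4, 0.2],
    [4.9, 3.0, 1.4, 0.2],
    [7.0, 3.2, 4.7, 1.4],
    [6.3, 3.3, 6.0, 2.5],
]


def column_stats(data):
    """Return per-feature mean and population std (assumed convention)."""
    columns = list(zip(*data))
    mean = [fmean(col) for col in columns]
    std = [pstdev(col) for col in columns]
    return mean, std


def z_score(row, mean, std):
    """Scale one feature vector to zero mean and unit variance per feature."""
    return [(x - m) / s for x, m, s in zip(row, mean, std)]


mean, std = column_stats(rows)
scaled = [z_score(row, mean, std) for row in rows]
```

The model is fitted on scaled rather than on rows, and the same mean and std are what get handed to edgeflow.Normalize so the server can repeat the scaling at inference time.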

3. Send un-normalised features

Send the same JSON array as tutorial 01, with the raw (un-normalised) feature values:

curl -X POST http://localhost:8080/infer \
     -H 'Content-Type: application/json' \
     -d '[5.1, 3.5, 1.4, 0.2]'

The server runs your input through the WASM Normalize transform first, then through the ONNX model. You get back the same labelled prediction format as tutorial 01.
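
For consumers that call the endpoint from Python rather than curl, the same request can be sketched with the standard library alone. The URL and Content-Type header come from the curl command above; the helper names build_request and infer, the timeout, and the assumption that the response body is JSON are illustrative.

```python
import json
import urllib.request

# Endpoint taken from the curl command above.
INFER_URL = "http://localhost:8080/infer"


def build_request(features, url=INFER_URL):
    """Build the POST request: a bare JSON array of raw feature values."""
    body = json.dumps(features).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def infer(features, url=INFER_URL, timeout=5.0):
    """Send raw features and return the decoded response (assumed to be JSON)."""
    request = build_request(features, url)
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

With the stack running, infer([5.1, 3.5, 1.4, 0.2]) sends exactly what the curl command sends. Note that the client contains no mean, std, or scaling code.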

What just happened?

When you called log_model, edgeflow compiled the Normalize pre-transform into a WASM component and bundled it with the ONNX bytes into a single deployment artifact. The inference pod loaded that artifact, spun up a wasmtime runtime for the pre-transform, and now runs it on every request before the model sees a tensor.
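
Numerically, the pre-transform is assumed here to be a plain per-feature z-score, (x - mean) / std. The sketch below reproduces that arithmetic in Python using the rounded mean and std from the expected output in step 2. The function name normalize is illustrative, and the code stands in for the WASM component rather than reproducing it.

```python
# Rounded statistics as printed by train.py in the expected output.
MEAN = [5.84, 3.05, 3.74, 1.20]
STD = [0.83, 0.43, 1.77, 0.76]


def normalize(features, mean=MEAN, std=STD):
    """Per-feature z-score: the arithmetic assumed for the Normalize step."""
    if not (len(features) == len(mean) == len(std)):
        raise ValueError("features, mean and std must have the same length")
    return [(x - m) / s for x, m, s in zip(features, mean, std)]


raw = [5.1, 3.5, 1.4, 0.2]  # the request body from step 3
print([round(v, 2) for v in normalize(raw)])  # [-0.89, 1.05, -1.32, -1.32]
```

These scaled values, not the raw request, are what the ONNX model receives as its input tensor.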

The pod-to-WASM round trip is structurally cheap: roughly two memcpy operations per call, one to move the input in and one to move the output back out. The cost of the transform itself dominates. That cost is trivial for a 4-feature Normalize and more significant for image decoding (see YOLOv8 on edgeflow: image input, WASM pre/post).

Try this

Edit train.py to use a deliberately wrong mean (say, all zeros) and run it again. The new version is registered as v2, and the iris-inference target hot-swaps to it. The client keeps sending the same JSON, but the predictions go bad. Roll back by deploying v1 again: no client change, no downtime.
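
To see why the predictions go bad, compare what the model receives under the correct mean and under the zeroed one. The sketch assumes Normalize computes (x - mean) / std and reuses the rounded statistics from the expected output in step 2. The helper normalize is illustrative.

```python
STD = [0.83, 0.43, 1.77, 0.76]
GOOD_MEAN = [5.84, 3.05, 3.74, 1.20]
BAD_MEAN = [0.0, 0.0, 0.0, 0.0]  # the deliberately wrong v2 parameters


def normalize(features, mean, std):
    """Per-feature z-score (assumed behaviour of the pre-transform)."""
    return [(x - m) / s for x, m, s in zip(features, mean, std)]


raw = [5.1, 3.5, 1.4, 0.2]
good = normalize(raw, GOOD_MEAN, STD)
bad = normalize(raw, BAD_MEAN, STD)

print([round(v, 2) for v in good])  # [-0.89, 1.05, -1.32, -1.32]
print([round(v, 2) for v in bad])   # [6.14, 8.14, 0.79, 0.26]
```

With the zeroed mean, the first two features arrive at roughly 6 and 8 in z-score units, far outside the zero-centred range the model was trained on, even though the client request is byte-for-byte identical.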

Next steps

Adult income: JSON input with mixed feature types - swap the positional array for a JSON object with named fields plus categorical encodings.
YOLOv8 on edgeflow: image input, WASM pre/post - same pre/post-transform pattern, but with real image data and a 6 MB model.