features: Manually setting the validation set for multi-output task #1302
lizhuoq wants to merge 4 commits into microsoft:main from
Conversation
@microsoft-github-policy-service agree
Pull request overview
This pull request adds support for manually setting a validation set for multi-output tasks when using the "holdout" evaluation method. Previously, users could not manually specify a validation set for multi-output regression tasks. The new multioutput_train_size parameter allows users to concatenate training and validation data and specify where to split them.
Changes:
- Added multioutput_train_size parameter to the AutoML class for manual validation set specification
- Implemented _train_val_split method to split the concatenated training/validation data (a sketch of the split logic is shown below)
- Added a test case demonstrating the new functionality with MultiOutputRegressor
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 4 comments.
| File | Description |
|---|---|
| flaml/automl/automl.py | Added documentation and implementation for the multioutput_train_size parameter, including the split logic in the fit method |
| test/automl/test_regression.py | Added test_multioutput_train_size function to demonstrate usage of the new feature |
```python
def test_multioutput_train_size():
    import numpy as np
    from sklearn.datasets import make_regression
    from sklearn.model_selection import train_test_split
    from sklearn.multioutput import MultiOutputRegressor, RegressorChain

    from flaml import AutoML  # added so the snippet runs standalone

    # create regression data
    X, y = make_regression(n_targets=3)

    # split into train and test data
    X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.20, random_state=42)
    X_train, X_val, y_train, y_val = train_test_split(X_train, y_train, test_size=0.1, random_state=42)

    # train the model; per this PR, the first multioutput_train_size rows of the
    # concatenated data are used for training and the remaining rows for validation
    model = MultiOutputRegressor(
        AutoML(task="regression", time_budget=1, eval_method="holdout", multioutput_train_size=len(X_train))
    )
    model.fit(np.concatenate([X_train, X_val], axis=0), np.concatenate([y_train, y_val], axis=0))

    # predict
    print(model.predict(X_test))
```
The test function lacks assertions to verify the new multioutput_train_size feature works as expected. Consider adding assertions to validate that the model was trained successfully and that the validation split was performed correctly. For example, you could check that the model produces reasonable predictions or verify internal state that confirms the train/validation split occurred.
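As a sketch of what such assertions might look like (appended after the fit and predict calls in the test above; the value 3 follows from n_targets=3, and estimators_ is the attribute scikit-learn's MultiOutputRegressor exposes after fitting):

```python
# possible assertions for test_multioutput_train_size (illustrative only)
y_pred = model.predict(X_test)
assert y_pred.shape == (X_test.shape[0], 3)  # one prediction per test row and per target
assert np.all(np.isfinite(y_pred))           # predictions are finite numbers
assert len(model.estimators_) == 3           # one fitted AutoML estimator per target
```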
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Why are these changes needed?
Previously, for multi-output tasks where the eval_method is holdout, the validation set could not be set manually. This commit introduces a new feature that allows manually setting the validation set for multi-output tasks.
Related issue number
Checks