
[ENH] Add pretrain/fine-tune lifecycle to BaseModel v2#2220

Open
echo-xiao wants to merge 5 commits into sktime:main from echo-xiao:enh/basemodel-pretrain-hook

Conversation

@echo-xiao
Contributor

@echo-xiao echo-xiao commented Mar 20, 2026

Reference Issues/PRs

close #2105

What does this implement/fix? Explain your changes.

Implements the pretrain -> fine-tune -> predict lifecycle for BaseModel v2.

Changes:

_base_model_v2.py

  • pretrained_weights init param
  • fine_tune_strategy init param
  • pretrain(): pretrain on global/panel data
  • _pretrain(): hook for subclasses to override
  • _post_init_load_pretrained(): called by subclasses after the model is built
  • load_pretrained_weights(): load weights from a HuggingFace path
  • _freeze_backbone() / _unfreeze_backbone(): freeze utilities
  • docstring examples for the three modes (pretrain -> fine-tune, cold start, train from scratch)
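To make the intended control flow concrete, here is a minimal plain-Python sketch of the lifecycle (no torch/lightning; the class and method names mirror the PR, but all bodies are illustrative assumptions, not the actual implementation):

```python
class BaseModelSketch:
    """Illustrative stand-in for BaseModel v2 (not the real class)."""

    def __init__(self, pretrained_weights=None, fine_tune_strategy="freeze_backbone"):
        self.pretrained_weights = pretrained_weights
        self.fine_tune_strategy = fine_tune_strategy
        self.is_pretrained_ = False

    def pretrain(self, datamodule, trainer_kwargs=None):
        """Pretrain on global/panel data, set the flag, return self."""
        self._pretrain(datamodule, trainer_kwargs or {})
        self.is_pretrained_ = True
        return self

    def _pretrain(self, datamodule, trainer_kwargs):
        """Hook for subclasses to override; the base implementation is a no-op."""


class ToyModel(BaseModelSketch):
    def _pretrain(self, datamodule, trainer_kwargs):
        # subclass-specific pretraining work would go here
        self.seen_data = datamodule


model = ToyModel().pretrain("dm_large")
print(model.is_pretrained_)  # True
```

The key design point is that user code calls the public `pretrain()`, which owns the lifecycle state, while subclasses only implement `_pretrain()`.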

What should a reviewer concentrate their feedback on?

  1. design of the _pretrain() hook: the most important part
  2. fine_tune_strategy default value
  3. load_pretrained_weights(): is this pattern acceptable for all v2 subclasses?
  4. _post_init_load_pretrained() pattern: is this pattern acceptable for all v2 subclasses?
  5. whether all of these changes match "Pre-training, global learning, and fine-tuning API" enhancement-proposals#41

Did you add any tests for the change?

tests/test_models/test_basemodel_pretrain.py

  1. pretrain() sets and returns self
  2. _pretrain() subclass hook
  3. weights change after pretraining
  4. load_pretrained_weights() with different formats

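As a rough illustration of tests 1 and 3 (pretrain() returns self; weights change after pretraining), a hedged sketch using a plain-Python stub in place of a real network (all names hypothetical):

```python
class StubModel:
    """Hypothetical stub: 'weights' are a plain list of floats."""

    def __init__(self):
        self.weights = [0.0, 0.0]
        self.is_pretrained_ = False

    def pretrain(self, data):
        # stand-in for a real optimization step
        self.weights = [w + 1.0 for w in self.weights]
        self.is_pretrained_ = True
        return self


def test_weights_change_after_pretraining():
    model = StubModel()
    before = list(model.weights)
    result = model.pretrain(data=[1, 2, 3])
    assert result is model            # test 1: pretrain() returns self
    assert model.is_pretrained_       # pretrain() sets the flag
    assert model.weights != before    # test 3: weights changed


test_weights_change_after_pretraining()
```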
Any other comments?

NHiTS_v2 POC integration will be added as follow-up after #2186 is merged.

PR checklist

  • The PR title starts with either [ENH], [MNT], [DOC], or [BUG]. [BUG] - bugfix, [MNT] - CI, test framework, [ENH] - adding or improving code, [DOC] - writing or improving documentation or docstrings.
  • Added/modified tests
  • Used pre-commit hooks when committing to ensure that code is compliant with hooks. Install hooks with pre-commit install.

@echo-xiao echo-xiao changed the title [ENH] Add pretrain/fine-tune lifecycle to BaseModel v2 with NHiTS_v2 … [ENH] Add pretrain/fine-tune lifecycle to BaseModel v2 Mar 20, 2026
@echo-xiao echo-xiao force-pushed the enh/basemodel-pretrain-hook branch from af0c8e0 to e79e0b1 Compare March 20, 2026 23:13
@codecov

codecov bot commented Mar 21, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
⚠️ Please upload report for BASE (main@5600398). Learn more about missing BASE report.

Additional details and impacted files
@@           Coverage Diff           @@
##             main    #2220   +/-   ##
=======================================
  Coverage        ?   86.68%           
=======================================
  Files           ?      165           
  Lines           ?     9782           
  Branches        ?        0           
=======================================
  Hits            ?     8480           
  Misses          ?     1302           
  Partials        ?        0           
Flag     Coverage Δ
cpu      86.68% <100.00%> (?)
pytest   86.68% <100.00%> (?)

Flags with carried forward coverage won't be shown.


@echo-xiao
Contributor Author

echo-xiao commented Mar 26, 2026

Hi @phoeenniixx, the PR implements the BaseModel v2 core, and I'd love to get your feedback. Does this pretrain + on_fit_start() workflow look reasonable to you?

[diagram: Freeze Backbone Flow]

Member

@phoeenniixx phoeenniixx left a comment


Thanks! This looks good!
Sorry, I don't have a complete mental model yet of how pretraining should look... But I have added some doubts (some of which may feel redundant :))

  • How would inference mode look here? After freezing the backbone?
  • Also, if we load weights from hf, how should we fine-tune the model? I think there can be multiple strategies?
  • I think we should think about how to integrate this with the FMs; I am not sure the current BaseModel is a good place for this?
  • Should the pretrain etc methods be in Basepkg similar to the way fit, predict is?

FYI @agobbifbk, @PranavBhatP, @fkiraly what are your thoughts on this?

Also, please don't share the GSoC proposal here; I think you should follow the official channels and apply at the GSoC portal and the sktime form...

for param in self.parameters():
    param.requires_grad_(True)

def _post_init_load_pretrained(self) -> None:
Member


Where (and how) is this method used? I mean why not directly use load_pretrained_weights?

Contributor Author


When BaseModel.__init__ runs, self.model doesn't exist yet. _post_init_load_pretrained() is called after the model is fully built.
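A minimal sketch of that ordering constraint (plain Python; the dict-based "model" and attribute names are hypothetical stand-ins for a real network):

```python
class Base:
    def __init__(self, pretrained_weights=None):
        self.pretrained_weights = pretrained_weights
        # NOTE: self.model does not exist yet at this point

    def _post_init_load_pretrained(self):
        # safe to touch self.model: subclasses call this after building it
        if self.pretrained_weights is not None:
            self.model["loaded_from"] = self.pretrained_weights


class Subclass(Base):
    def __init__(self, pretrained_weights=None):
        super().__init__(pretrained_weights)
        self.model = {"layers": 3}          # network is built here
        self._post_init_load_pretrained()   # ...so loading is now safe


m = Subclass(pretrained_weights="weights.pt")
print(m.model["loaded_from"])  # weights.pt
```

Calling load_pretrained_weights() directly from Base.__init__ would fail, because self.model is only created in the subclass constructor.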

>>> model.pretrain(dm_large, trainer_kwargs={"max_epochs": 20}) # doctest: +SKIP
>>> from lightning import Trainer # doctest: +SKIP
>>> trainer = Trainer(max_epochs=5) # doctest: +SKIP
>>> trainer.fit(model, dm_target) # backbone frozen automatically # doctest: +SKIP
Member


Would it be better to use Basepkg here?

... pretrained_weights="hf://org/my-pretrained-model/weights.pt",
... )
>>> trainer = Trainer(max_epochs=5) # doctest: +SKIP
>>> trainer.fit(model, dm_target) # fine-tunes from pretrained # doctest: +SKIP
Member


Would it be better to use Basepkg here?

else:
    raise ValueError(f"Scheduler {self.lr_scheduler} not supported.")

def pretrain(
Member


Should it be in Basepkg?

Contributor Author


I see, you are right. pretrain() and _pretrain() should move to Basepkg to be consistent with how fit() and train() are constructed. BaseModel should only keep the lifecycle state and the freeze/weight-loading hooks.
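A rough sketch of that split (plain Python, all names hypothetical; the pkg-level object drives the lifecycle while the model keeps only state and hooks):

```python
class BaseModelSketch:
    """Model side: keeps only lifecycle state and hooks."""

    def __init__(self):
        self.is_pretrained_ = False
        self.frozen = False

    def _freeze_backbone(self):
        self.frozen = True

    def _pretrain(self, data):
        """Subclass hook; no-op in the base class."""


class BasePkgSketch:
    """Pkg side: drives the lifecycle, analogous to fit()/predict()."""

    def __init__(self, model):
        self.model = model

    def pretrain(self, data):
        self.model._pretrain(data)
        self.model.is_pretrained_ = True
        return self


pkg = BasePkgSketch(BaseModelSketch())
pkg.pretrain("dm_large")
print(pkg.model.is_pretrained_)  # True
```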

"""
target = getattr(self, "model", self)
for param in target.parameters():
    param.requires_grad_(False)
Member


What if I just want to freeze just a part of the model? Like first few layers?

Contributor Author


Subclasses can override _freeze_backbone() to freeze only part of the model.
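For example, a subclass could override the hook to freeze only the first few layers. A plain-Python sketch (all names hypothetical; a torch version would call `param.requires_grad_(False)` instead of setting the flag used here):

```python
class Param:
    """Stand-in for a torch parameter with a requires_grad flag."""

    def __init__(self):
        self.requires_grad = True


class PartialFreezeModel:
    def __init__(self, n_layers=4):
        # each "layer" is just a list of parameters
        self.layers = [[Param(), Param()] for _ in range(n_layers)]

    def _freeze_backbone(self, n_frozen=2):
        # override: freeze only the first n_frozen layers
        for layer in self.layers[:n_frozen]:
            for p in layer:
                p.requires_grad = False


model = PartialFreezeModel()
model._freeze_backbone(n_frozen=2)
frozen = [all(not p.requires_grad for p in layer) for layer in model.layers]
print(frozen)  # [True, True, False, False]
```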

@phoeenniixx phoeenniixx added the enhancement New feature or request label Mar 29, 2026
@echo-xiao
Contributor Author

Thanks @phoeenniixx for the feedback. On the questions below:

  1. How the inference mode would look here? After freezing the backbone?
    -- Inference works as normal: frozen parameters still participate in the forward pass; training updates only the unfrozen parts.

  2. Also, if we load weights from hf, how should we finetune the model? I think there can be multiple strats?
    -- Currently supports freeze_backbone and full. Other strategies can be added as follow-ups.

  3. I think we should think of how to integrate this with the FMs, I am not sure if the current BaseModel is a good place for this?
    -- This PR provides the base infrastructure (load_pretrained_weights, is_pretrained_, freeze_backbone) for FM integration to build on. Happy to align with the FM project contributors on interface design.

  4. Should the pretrain etc methods be in Basepkg similar to the way fit, predict is?
    -- Agreed
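On point 2, the two currently supported strategies could dispatch roughly like this (sketch; the strategy names come from the PR, but the dispatch function and Toy class are assumptions for illustration):

```python
def apply_fine_tune_strategy(model, strategy):
    """Hypothetical dispatch for the supported fine-tune strategies."""
    if strategy == "freeze_backbone":
        model._freeze_backbone()    # backbone frozen, head stays trainable
    elif strategy == "full":
        model._unfreeze_backbone()  # every parameter stays trainable
    else:
        raise ValueError(f"Unknown fine_tune_strategy: {strategy!r}")


class Toy:
    def __init__(self):
        self.backbone_trainable = True

    def _freeze_backbone(self):
        self.backbone_trainable = False

    def _unfreeze_backbone(self):
        self.backbone_trainable = True


t = Toy()
apply_fine_tune_strategy(t, "freeze_backbone")
print(t.backbone_trainable)  # False
```

New strategies (e.g. layer-wise unfreezing) would then only need another branch plus a model hook.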

@echo-xiao
Contributor Author

@phoeenniixx @fkiraly

I received the email saying my application was incomplete. I have just completed and submitted it. I kindly ask if you could reconsider my application given the forms are now complete.

I am a Data Scientist who has long relied on sktime in my professional work, and my proposal outlines my architectural vision for the project. Please give my proposal a chance if you could.


Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[ENH] Extend TFT v2 with pretrain/fine-tune lifecycle as proof-of-concept for #41

2 participants