
HM refactor #504


Merged: @radka-j merged 95 commits into main from hm_refactor on Jun 24, 2025
Conversation

@radka-j (Member) commented Jun 4, 2025

Closes #406

Overview

This PR:

  • adds two new classes in experimental based on HistoryMatching in main
    • HistoryMatching is a metric that takes in observations and returns implausibility for provided predictions
      • it can be instantiated without an emulator or a simulator
      • one can optionally pass in an emulator, in which case the class provides a method for making predictions with it
    • HistoryMatchingWorkflow captures the iterative process of using an emulator to choose what simulations to run and then updating that emulator with the newly simulated data
      • it expects the refactored Simulator and a GaussianProcessExact emulator as input
      • each time run() is called, it now executes just one "wave"; because the method stores state, the user can simply call run() repeatedly to run multiple waves
      • when NROY parameters are identified, simulator.param_bounds is updated to the NROY min/max; this way we can keep using the simulator's sample_inputs method without needing a separate sampling method (see the sketch after this list)
  • adds a notebook at experimental/exploratory/hm_refactor.ipynb that shows the two HM classes and how they integrate with the dashboard
  • updates the HM Dashboard to accept tensors as inputs
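
As a minimal, self-contained sketch of the ideas above, here is the standard implausibility metric and the NROY bounds update in plain PyTorch. This is illustrative only, not the classes' actual API; the threshold of 3 is the conventional choice, and all names in the example are invented for illustration.

```python
import torch

# Implausibility of a candidate input x, given an observation z with
# variance v_obs and an emulator prediction with mean m(x), variance v(x):
#     I(x) = |z - m(x)| / sqrt(v_obs + v(x))
def implausibility(obs_mean, obs_var, pred_mean, pred_var):
    return (obs_mean - pred_mean).abs() / torch.sqrt(obs_var + pred_var)

# Example: 4 candidate inputs in 2D, one output, observation 0.5 (variance 0.01).
x = torch.tensor([[0.1, 0.2], [0.8, 0.9], [0.4, 0.6], [0.9, 0.1]])
pred_mean = torch.tensor([[0.4], [1.5], [0.52], [0.9]])
pred_var = torch.full_like(pred_mean, 0.02)

I = implausibility(torch.tensor([0.5]), torch.tensor([0.01]), pred_mean, pred_var)
nroy = (I < 3.0).all(dim=1)  # "not ruled out yet" points

# Shrink the sampling box to the NROY min/max, so an existing sample_inputs
# method keeps working on the reduced region, as described above.
param_bounds = torch.stack([x[nroy].min(dim=0).values, x[nroy].max(dim=0).values])
```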

For reviewers:

  • The added experimental/exploratory/hm_refactor.ipynb notebook is a good place to start
  • It would be good to double-check the workflow implemented in HistoryMatchingWorkflow.run(). It is based on my understanding of what is described in this paper; it would be great to confirm that we all agree on it.


@codecov-commenter commented Jun 4, 2025

Codecov Report

Attention: Patch coverage is 87.24490% with 25 lines in your changes missing coverage. Please review.

Project coverage is 77.43%. Comparing base (7ae1d2a) to head (b41156f).
Report is 21 commits behind head on main.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| autoemulate/experimental/history_matching.py | 89.09% | 12 Missing ⚠️ |
| autoemulate/history_matching_dashboard.py | 0.00% | 11 Missing ⚠️ |
| ...experimental/test_experimental_history_matching.py | 97.26% | 2 Missing ⚠️ |
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #504      +/-   ##
==========================================
+ Coverage   76.52%   77.43%   +0.90%     
==========================================
  Files         113      117       +4     
  Lines        8270     8637     +367     
==========================================
+ Hits         6329     6688     +359     
- Misses       1941     1949       +8     


github-actions bot (Contributor) commented Jun 4, 2025

Coverage report


| File | Lines missing |
|---|---|
| autoemulate/history_matching_dashboard.py | 11, 38-39, 54-57, 63-68 |
| autoemulate/experimental/history_matching.py | 70, 101-102, 108-109, 156-157, 207, 304-305, 355, 380 |
| autoemulate/experimental/simulations/epidemic.py | |
| tests/experimental/test_experimental_history_matching.py | 23-24 |

This report was generated by python-coverage-comment-action

@radka-j requested a review from sgreenbury June 19, 2025 08:20
@sgreenbury (Collaborator) left a comment


I think the revised HistoryMatchingWorkflow with the new distinct methods and updated run() looks really great! The workflow looks like it reflects the paper to me (I had one small query regarding the update). I also had one suggestion regarding a potential additional method, but I think this is good to merge without that too.

@radka-j (Member, Author) commented Jun 20, 2025

@sgreenbury this should be ready now but it would be great if you could have one last look over it. Specifically:

  • I updated the run() method; as agreed, it now takes in max_tries as an input argument (see the sketch after this list).
  • I also made some additional commits to ensure device is handled consistently throughout. I'd appreciate it if you could have one more look over the whole HM code with just that in mind and see if I missed anything.
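
For concreteness, a hedged usage sketch of the updated method. The class and method names come from this thread, but the constructor keywords shown are assumptions, and simulator, emulator, and observations are presumed already constructed:

```python
from autoemulate.experimental.history_matching import HistoryMatchingWorkflow

# Constructor keywords are assumed; the actual signature may differ.
workflow = HistoryMatchingWorkflow(
    simulator=simulator,        # refactored Simulator
    emulator=emulator,          # GaussianProcessExact emulator
    observations=observations,
)

# Each run() call executes one wave and stores state on the instance, so
# repeated calls continue the process; max_tries caps how many times a
# wave retries sampling before giving up.
for _ in range(3):
    workflow.run(max_tries=10)
```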

@radka-j requested a review from sgreenbury June 20, 2025 16:38
@sgreenbury (Collaborator) commented:
> @sgreenbury this should be ready now but it would be great if you could have one last look over it. Specifically:
>
> * I updated the `run()` method; as agreed, it now takes in `max_tries` as an input argument.

Looks great!

> * I also made some additional commits to ensure device is handled consistently throughout. I'd appreciate it if you could have one more look over the whole HM code with just that in mind and see if I missed anything.

Good point and looks good! I opened #562 to explore adding a test for run(), plus one cpu() call and a dtype in the subclassed Epidemic simulator. I am not sure if we need Simulator to consider Device at the moment? More generally, I am wondering how we could test across devices more consistently across PRs (without adding lots of boilerplate/extra tests); this might be worth a new issue. It also possibly relates to potential integration with Lightning.

My only other minor thought was about the additional handling of the None case for the emulator in HistoryMatching: is the emulator needed in HistoryMatching at all, or could it be moved to HistoryMatchingWorkflow?

@radka-j (Member, Author) commented Jun 24, 2025

@sgreenbury

  • Thank you for adding the test!
  • I added an issue for adding device handling to Simulator, it's been on my mind since starting to work with TorchCor which is implemented in PyTorch (Add device option to Simulator #563).
  • As for the emulator, I originally didn't have that option in the main class but added it because it makes getting predictions from the emulator in the right format for scoring implausibility easier for the user. It's demonstrated in the notebook. Alternatively it could be a method of the emulator to get just the mean and the variance like this?
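
Since the snippet referenced by "like this?" above is not preserved, here is a purely hypothetical sketch of what such a method might look like; DummyGaussianEmulator and predict_mean_and_variance are invented names, not autoemulate API:

```python
import torch
from torch.distributions import Normal

class DummyGaussianEmulator:
    def predict(self, x: torch.Tensor) -> Normal:
        # stand-in for a fitted GP's predictive distribution
        return Normal(loc=x.sum(dim=-1), scale=torch.ones(x.shape[0]))

    def predict_mean_and_variance(self, x: torch.Tensor) -> tuple[torch.Tensor, torch.Tensor]:
        # unpack the predictive distribution into the (mean, variance)
        # pair that implausibility scoring consumes
        dist = self.predict(x)
        return dist.mean, dist.variance

emulator = DummyGaussianEmulator()
mean, var = emulator.predict_mean_and_variance(torch.rand(5, 2))
```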

@sgreenbury (Collaborator) commented:
> * I added an issue for adding device handling to Simulator, it's been on my mind since starting to work with TorchCor which is implemented in PyTorch ([Add device option to Simulator #563](https://github.com/alan-turing-institute/autoemulate/issues/563)).

Sounds good!

> * As for the emulator, I originally didn't have that option in the main class but added it because it makes getting predictions from the emulator in the right format for scoring implausibility easier for the user. It's demonstrated in the notebook. Alternatively it could be a method of the emulator to get just the mean and the variance like this?

I think adding this as a method on a probabilistic/Gaussian emulator sounds like a great option. Perhaps this would work well as an addition to #561, where there is a ProbabilisticEmulator or GaussianEmulator the method could be added to, so it is not on the base Emulator? It seems like it could be refactored out of the HistoryMatching class with that change in that PR.

From looking at the notebook, I had one other minor comment: in cell 8, should `x` be `x_new` here to match the predictions? And for cell 9, would it also make sense to make it `x_new` so that it is being used on data not part of the fit?

I think this looks really great and good to merge otherwise!

@radka-j merged commit 826483a into main Jun 24, 2025
4 checks passed
@radka-j deleted the hm_refactor branch June 25, 2025 14:39