New `iter_generate` for OPS 2.0 by dwhswenson · Pull Request #1042 · openpathsampling/openpathsampling

dwhswenson · 2021-08-02T00:54:48Z

Resolves #810.

One of the things I want to do in OPS 2.0 is simplify code in API-breaking ways, when the API isn't something that is widely used. One case of this is DynamicsEngine.iter_generate. The existing iter_generatehas a huge complexity rating (McCabe cyclomatic complexity of 46; CodeClimate "cognitive complexity" of 92). This comes because it tries to implement a bunch of user-facing options for error handling into one catch-all function, instead of letting the client code handle the errors. This is all the stuff about how to handle NaNs or max-length error (whether to restart/retry/etc.)

In path sampling, many of those options would be just wrong. A trajectory that generates a NaN is a rejected trajectory. Retrying will bias the sampling. In addition, when I tried to write tests for it, it became clear that some of that functionality isn't even implemented correctly.

So this PR rewrites iter_generate without the extra error handling functionality. That error handling should be done in the calling code. We raise errors that should make it clear to calling code what the problem is; special treatments in case of certain error types can be done by calling code (this is never something that should be done on a per-engine level, but something that should be done on a situational basis).

There are two places where engine writers can change the error handling:

engine.is_valid_snapshot(): Check whether the created snapshot is valid. Generally will just return a bool. If you want to use the default error behavior (raise an EngineError), this is all you need to change.
engine.snapshot_error_handler(): If you want to modify the error in some way, or if there are some errors that you can recover from, do that here. If you want an error to be raised, then return the error. If you can recover from the error and create a valid snapshot, then return the valid snapshot.

For most users, this makes no difference. It means that under the hood we're using a different function. The API break is the drop of engine-level support for error handling, and a small change in the call signature of iter_generate (removing the intervals parameter). I don't think anyone will miss these things.

Code for new iter_generate
Use new iter_generate for all tests
Add support for correcting snapshot
Unit tests for new iter_generate; full coverage
Update docs on writing engines
Remove options for NaN handling from engines
Remove docstrings about NaN handling from engines

There are still some problems with a few tests -- integration with EngineMover seems a bit off. I need to compare to the old behavior to see what has changed.

dwhswenson · 2021-08-02T01:07:56Z

There's a weird thing in the unit tests for the minus mover. I changed some behavior there, but I honestly don't understand why the previous behavior was occurring.

Essentially, the minus move consists of the replica exchange stage followed by an extension stage. There's a test for the case that the minus move fails because the extension hits max length, which uses this code. For some reason, the old test checked that the extension trajectory was of a length longer than the engine's maximum. I don't know why that would be. The test itself also modified the engine's error handling (setting 'on_max_length' to 'stop', not 'fail'), so I expect that was part of it. If anyone else has an idea as to why I did it that way, please let me know.

Relevant code is in tests for both minus mover and single replica minus mover:

openpathsampling/openpathsampling/tests/test_pathmover.py

Lines 1355 to 1358 in c67edf7

    
           assert_equal( 
        
               len(sub[-1][0].trials[0].trajectory), 
        
               len(traj_bad_extension)+self.dyn.n_frames_max-1 
        
           )

openpathsampling/openpathsampling/tests/test_pathmover.py

Lines 1517 to 1522 in c67edf7

    
           # first two work and the extension fails 
        
           # this only happens due to length 
        
           assert_equal( 
        
               len(sub[-1].trials[0].trajectory), 
        
               len(traj_bad_extension)+self.dyn.n_frames_max-1 
        
           )

codecov · 2021-08-02T04:21:01Z

Codecov Report

Merging #1042 (c67edf7) into dev-2.0 (d46e028) will increase coverage by 0.00%.
The diff coverage is 100.00%.

❗ Current head c67edf7 differs from pull request most recent head 1df96c0. Consider uploading reports for the commit 1df96c0 to get more accurate results

@@           Coverage Diff            @@
##           dev-2.0    #1042   +/-   ##
========================================
  Coverage    81.54%   81.54%           
========================================
  Files          140      140           
  Lines        15399    15400    +1     
========================================
+ Hits         12557    12558    +1     
  Misses        2842     2842

Impacted Files	Coverage Δ
openpathsampling/engines/openmm/engine.py	`70.00% <ø> (ø)`
openpathsampling/deprecations.py	`100.00% <100.00%> (ø)`
openpathsampling/netcdfplus/version.py	`100.00% <100.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update dd47ec0...1df96c0. Read the comment docs.

dwhswenson · 2021-08-02T04:54:21Z

Not sure why we're getting @codecov updates on a branch where the CI doesn't report coverage... I wouldn't trust the data.

dwhswenson added 3 commits August 1, 2021 01:26

prep work to cleanup iter_generate

0f430fd

add most of steps to use the new iter_generate

d6bbdcc

There are still some problems with a few tests -- integration with EngineMover seems a bit off. I need to compare to the old behavior to see what has changed.

fix up so tests pass

6aef582

dwhswenson added API Break 2.0 issues and PRs for the 2.0 release labels Aug 2, 2021

add utils

759b3e9

dwhswenson mentioned this pull request Aug 2, 2021

Update CI for OPS 2.0 #1043

Merged

add snapshot handling (no error); remove old opts

1df96c0

dwhswenson mentioned this pull request Jan 17, 2022

Trajectories should be deque, not lists (while generating) #1099

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New `iter_generate` for OPS 2.0#1042

New `iter_generate` for OPS 2.0#1042
dwhswenson wants to merge 5 commits intoopenpathsampling:dev-2.0from
dwhswenson:new_iter_generate

dwhswenson commented Aug 2, 2021 •

edited

Loading

Uh oh!

dwhswenson commented Aug 2, 2021

Uh oh!

codecov bot commented Aug 2, 2021 •

edited

Loading

Uh oh!

dwhswenson commented Aug 2, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

dwhswenson commented Aug 2, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dwhswenson commented Aug 2, 2021

Uh oh!

codecov bot commented Aug 2, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

dwhswenson commented Aug 2, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

dwhswenson commented Aug 2, 2021 •

edited

Loading

codecov bot commented Aug 2, 2021 •

edited

Loading