Minor fixes for handling of older SMA data #1399

kartographer · 2024-02-11T18:38:51Z

Fixes a few issues that became apparent after #1371 was merged.

Description

There are a couple of changes that have been made here:

Some fixtures in test_mir_parser that were redundant have been removed.
Reading in of MIR data has been reordered so that data-like attributes are read in last (which is helpful in testing/debugging)
A bug has been fixed in MirParser._make_v3_compliant where MJDs were not calculated correctly (forgot to convert UTC back to TT, which is the MIR standard).
Changed the way that Mir determines whether or not a data set should be flex-pol.
When reading in MIR datasets, the corresponding value for UVData.lst_array is now filled by set_lsts_from_time_array under most circumstances.

Motivation and Context

The change in flex-pol handling primarily affects reading in of older, pre-V3 MIR-formatted SMA data. Specifically, "gunnLO" was not consistently filled/used in pre-V3 data, whereas "fsky" has historically always been present, hence switching from the former to the latter.

The change in LST-handling arises from nuisance warnings about errors in the LSTs, which after #1356 have reduced somewhat in frequency for MIR data sets, but still regularly arise. I believe this is due to how these values are presently calculated, as a polled average rather than directly calculated at the integration mid-point, producing errors of up to 25 ms. Given that this is a long-standing feature, the reader now defaults to using pyuvdata-calculated LST values so long as they agree w/ the recorded values to within this precision limit. This has the advantage of providing "truer-to-observed" values while reducing the erroneous warnings. When the recorded values are not within the precision limit, they are plugged into lst_array so that a warning is appropriately raised (and the issue can be handled by the user accordingly).

Types of changes

Bug fix (non-breaking change which fixes an issue)

Checklist:

I have read the contribution guide.
My code follows the code style of this project.

Bug fix checklist:

My fix includes a new test that breaks as a result of the bug (if possible).
All new and existing tests pass.
I have updated the CHANGELOG.

…ad operation

codecov · 2024-02-11T18:40:14Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (ea458fb) 99.92% compared to head (f7db089) 99.92%.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #1399   +/-   ##
=======================================
  Coverage   99.92%   99.92%           
=======================================
  Files          37       37           
  Lines       20763    20766    +3     
=======================================
+ Hits        20747    20750    +3     
  Misses         16       16

Files	Coverage Δ
pyuvdata/uvdata/mir.py	`100.00% <100.00%> (ø)`
pyuvdata/uvdata/mir_parser.py	`100.00% <100.00%> (ø)`

Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update ea458fb...f7db089. Read the comment docs.

e-koch

@kartographer -- It wasn't clear to me from the warning messages how the lst offset warnings and what to check are getting communicated back to the user.

e-koch · 2024-02-12T20:21:12Z

pyuvdata/uvdata/mir.py

+            ):
+                if not np.allclose(val, mir_data.sp_data[item][data_mask]):
+                    warnings.warn(
+                        "Discrepancy in %s for win %i sb %i pol %i." % (item, *spdx)


Suggest adding some guidance in the warning message on whether this is a small discrepancy vs. something critical with the data.

Or would that be caught in the checks elsewhere?

I extended the error message here in aa4bd2f -- the one time I saw this pop up (which used to throw an AssertionError) it was because a single record was bad, but it can also get triggered if there's a more significant issue to be checked. In the end I figured probably best to just point the user to what values should be looked at...

e-koch · 2024-02-12T20:22:10Z

pyuvdata/uvdata/mir.py

+        if not np.allclose(lst_array, self.lst_array, rtol=0, atol=np.pi / 1728000.0):
+            # If this check fails, it means that there's something off w/ the lst values
+            # (to a larger degree than expected), and we'll pass them back to the user,
+            # who can inspect them directly and decide what to do.


This should get printed somewhere with suggestions on what to check.

Okay, I've added a warning message here in aa4bd2f -- it's a little on the long side, although it hopefully gives enough guidance to the user on what to do and when to worry (it looks like the polling rate wasn't always 20 Hz, so this still triggers for some older data sets, but still in most cases it's just a problem with the metadata, not actually the visibilities).

kartographer added 4 commits February 10, 2024 20:31

Improving robustness of pol-split detection in MIR datasets

221aa91

removing debugging print statements

a58c2f0

More minor code clean-up, moving vis read-in to the end of the Mir re…

9d5a352

…ad operation

Updating CHANGELOG

943b2b9

kartographer requested a review from e-koch February 11, 2024 18:41

bhazelton added UVData SMA Issues related to handling of SMA data labels Feb 12, 2024

e-koch requested changes Feb 12, 2024

View reviewed changes

kartographer added 2 commits February 12, 2024 23:36

Making requested changes following review

aa4bd2f

Updating test to filter new warning

f7db089

kartographer requested a review from e-koch February 13, 2024 12:08

e-koch approved these changes Feb 13, 2024

View reviewed changes

kartographer merged commit 7b5333f into main Feb 13, 2024
51 of 53 checks passed

kartographer deleted the sma_dev branch February 13, 2024 13:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Minor fixes for handling of older SMA data #1399

Minor fixes for handling of older SMA data #1399

kartographer commented Feb 11, 2024 •

edited

Loading

codecov bot commented Feb 11, 2024 •

edited

Loading

e-koch left a comment

e-koch Feb 12, 2024

kartographer Feb 13, 2024

e-koch Feb 12, 2024

kartographer Feb 13, 2024

Minor fixes for handling of older SMA data #1399

Minor fixes for handling of older SMA data #1399

Conversation

kartographer commented Feb 11, 2024 • edited Loading

Description

Motivation and Context

Types of changes

Checklist:

codecov bot commented Feb 11, 2024 • edited Loading

Codecov Report

e-koch left a comment

Choose a reason for hiding this comment

e-koch Feb 12, 2024

Choose a reason for hiding this comment

kartographer Feb 13, 2024

Choose a reason for hiding this comment

e-koch Feb 12, 2024

Choose a reason for hiding this comment

kartographer Feb 13, 2024

Choose a reason for hiding this comment

kartographer commented Feb 11, 2024 •

edited

Loading

codecov bot commented Feb 11, 2024 •

edited

Loading