Improve subprocess CRDS calls for easier debugging #140

mcara · 2020-06-23T18:51:28Z

Recently there were a couple of helpdesk issues (INC0155647 and INC0156501) when users reported errors with running CRDS commands using subprocess. One difficulty in understanding the cause of the errors was that suprocess in some notebooks is not set up to output errors. This PR modifies calls to subprocess.check_output()so that messages from CRDS can be read in the notebook.

pllim · 2020-06-23T19:18:23Z

notebooks/DrizzlePac/drizzle_wfpc2/drizzle_wfpc2.ipynb

-    "subprocess.check_output('crds bestrefs --files ua0605*_c0m.fits --sync-references=1 --update-bestrefs', shell=True, stderr=subprocess.DEVNULL)\n",
+    "stdout = subprocess.check_output(\n",
+    "    'crds bestrefs --files ua0605*_c0m.fits --sync-references=1 --update-bestrefs',\n",
+    "    shell=True,\n",


Is there a way to do this without shell=True?

Isn't CRDS written in Python? Can't we call its Python API directly?

CC: CRDS experts @eslavich @jaytmiller

@pllim May I ask: what is the issue with shell=True?

https://docs.python.org/3/library/subprocess.html#security-considerations

Just out of academic curiosity, is there benefit in sorting the glob results first in this usage?

I do not see why would it matter here.

Well, let's say if the bestref is usually named in a way that would appear last in listing, and the function do a simple linear search, then sorted(..., reverse=True) would make the search faster? Anyway, I have no idea how it works, so I am just thinking aloud. And of course, it is probably out of topic as far as this PR is concerned.

The order shouldn't matter -- the glob result is a list of datasets, and we have to make the best references determination for each of them.

Incidentally, I don't think shell=True is a security concern in this case. We wouldn't want to use it in web server code (for example) that interpolates user input into a command, but here the user is already in an environment where they can execute arbitrary shell commands so it's no more unsafe than a notebook in general.

It is not as much as a safety in this case, as to promote best practices. What if someone copy-paste what is here into server code? 😉

And given that it is possible to call Python API directly, there is really no reason to have shell=True at all in the first place.

p.s. Yeah, just ignore my comment about sorting. Sorry for the noise!

bhilbert4 · 2020-06-23T20:16:18Z

I don't know if it is relevant or useful here or not, but here's an example of using the CRDS API: https://github.com/spacetelescope/mirage/blob/master/mirage/reference_files/crds_tools.py#L141

mcara · 2020-06-23T20:18:32Z

I would say that I just want to improve one aspect of existing notebooks to address a specific issue. If there is desire to re-design the notebooks, that could be done too as a separate PR.

Improve subprocess CRDS calls for easier debugging

d97ab85

mcara requested a review from eteq as a code owner June 23, 2020 18:51

pllim reviewed Jun 23, 2020

View reviewed changes

mcara added 2 commits June 23, 2020 16:00

Shell=False

ba41308

shell=False by default

c1e88de

use crds API

f5c4a70

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve subprocess CRDS calls for easier debugging #140

Improve subprocess CRDS calls for easier debugging #140

mcara commented Jun 23, 2020

pllim Jun 23, 2020

pllim Jun 23, 2020

mcara Jun 23, 2020 •

edited

Loading

mcara Jun 23, 2020

pllim Jun 23, 2020

mcara Jun 23, 2020 •

edited

Loading

pllim Jun 23, 2020

eslavich Jun 23, 2020

pllim Jun 23, 2020

pllim Jun 23, 2020

bhilbert4 commented Jun 23, 2020

mcara commented Jun 23, 2020

Improve subprocess CRDS calls for easier debugging #140

Are you sure you want to change the base?

Improve subprocess CRDS calls for easier debugging #140

Conversation

mcara commented Jun 23, 2020

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mcara Jun 23, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mcara Jun 23, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bhilbert4 commented Jun 23, 2020

mcara commented Jun 23, 2020

mcara Jun 23, 2020 •

edited

Loading

mcara Jun 23, 2020 •

edited

Loading