Di 1112 exomiser ranking fix #11

kjwinfield · 2024-09-02T09:38:22Z

Fixes an issue where variants already shown on the GEL tiering page could be duplicated onto the Exomiser tiering page.

The previous version shows variants ranked 1, 2, and 3, although 2 and 3 are already on the GEL tiering page.

The new version shows variants ranked 1, 5, and 8, which is correct.

pep8speaks · 2024-09-02T09:38:28Z

Hello @kjwinfield! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

In the file resources/home/dnanexus/make_workbook.py:

Line 802:47: W605 invalid escape sequence '\_'

Comment last updated at 2024-09-16 14:53:21 UTC

Addy81

Reviewed 1 of 6 files at r1.
Reviewable status: 1 of 6 files reviewed, 3 unresolved discussions (waiting on @kjwinfield)

resources/home/dnanexus/get_variant_info.py line 349 at r1 (raw file):

        top = []

        ordered_list = dict(sorted(ranked.items(), key=lambda item: item[1]))

this is not a list if you are casting it back to a dict?

Could you also add some comments in this function. I'm struggling to visualise what the input dict is. Could you add an example in your docstring and some comments to explain your logic?
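To illustrate the naming point, here is a minimal sketch of what the quoted expression returns. The input shape is an assumption (hypothetical `{index: rank}` pairs); the real `ranked` dict is not shown in this thread.

```python
# Hypothetical input: DataFrame index -> Exomiser rank (assumed shape).
ranked = {0: 5, 1: 1, 2: 8, 3: 2}

# sorted() over .items() gives a list of (key, value) tuples ordered by value.
ordered_pairs = sorted(ranked.items(), key=lambda item: item[1])

# Wrapping that in dict() turns it back into a dict (insertion-ordered on
# Python 3.7+), so a name such as ordered_list is misleading.
ordered_dict = dict(ordered_pairs)

print(ordered_pairs)  # [(1, 1), (3, 2), (0, 5), (2, 8)]
print(ordered_dict)   # {1: 1, 3: 2, 0: 5, 2: 8}
```

An example like this in the docstring would make the expected input and the intermediate structure easier to picture.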

resources/home/dnanexus/make_workbook.py line 801 at r1 (raw file):

            # columns ending _x)
            d = {}
            for col in self.var_df.columns:

I think you can skip some of this by setting suffixes in your column so it only renames one, then you can drop the other if you don't want it.
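A minimal sketch of that idea, assuming the `_x` columns come from a pandas merge. The frames, the `variant` key and the `_dup` suffix are hypothetical and not taken from make_workbook.py.

```python
import pandas as pd

# Hypothetical frames that share a non-key column ("Priority").
left = pd.DataFrame({
    "variant": ["a", "b"],
    "Priority": ["Exomiser Rank 1.0", "Exomiser Rank 5.0"],
})
right = pd.DataFrame({
    "variant": ["a", "b"],
    "Priority": ["x", "y"],
    "gene": ["G1", "G2"],
})

# Keep the left-hand names as they are and tag only the right-hand duplicates.
merged = left.merge(right, on="variant", suffixes=("", "_dup"))

# Drop the tagged duplicates in one step instead of renaming columns in a loop.
merged = merged.drop(columns=[c for c in merged.columns if c.endswith("_dup")])

print(list(merged.columns))  # ['variant', 'Priority', 'gene']
```

With this approach only one side of the merge is ever renamed, so the loop over `self.var_df.columns` may not be needed.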

resources/home/dnanexus/make_workbook.py line 816 at r1 (raw file):

            ranks = ex_df['Priority'].to_dict()
            for k,v in ranks.items():
                ranks[k] = int(v.split(' ')[-1].split('.')[0])

why is this needed? is this {index: int(exomiser rank)} the ranks dict when first created or the intended dict?

kjwinfield

Reviewable status: 1 of 6 files reviewed, 3 unresolved discussions (waiting on @Addy81)

resources/home/dnanexus/get_variant_info.py line 349 at r1 (raw file):

Previously, Addy81 (Adriana) wrote…

this is not a list if you are casting it back to a dict?

Could you also add some comments in this function. I'm struggling to visualise what the input dict is. Could you add an example in your docstring and some comments to explain your logic?

Done.

resources/home/dnanexus/make_workbook.py line 801 at r1 (raw file):

Previously, Addy81 (Adriana) wrote…

I think you can skip some of this by setting suffixes in your column so it only renames one, then you can drop the other if you don't want it.

Done.

resources/home/dnanexus/make_workbook.py line 816 at r1 (raw file):

Previously, Addy81 (Adriana) wrote…

why is this needed? is this {index: int(exomiser rank)} the ranks dict when first created or the intended dict?

Done. Hopefully this explains it better. The current df has "Exomiser Rank 1.0" when we just want the integer to pass to get_top_3_ranked, which expects a dict of {index in df: integer denoting rank}.
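As a rough sketch of that conversion, using the same split logic as the quoted code: the helper name `rank_to_int` and the sample values are hypothetical.

```python
def rank_to_int(priority: str) -> int:
    """Convert a string such as 'Exomiser Rank 1.0' to the integer 1."""
    # Take the last space-separated token ('1.0'), then drop the decimal part.
    return int(priority.split(" ")[-1].split(".")[0])


# Hypothetical {index in df: Priority string} mapping.
priorities = {0: "Exomiser Rank 1.0", 1: "Exomiser Rank 5.0", 2: "Exomiser Rank 12.0"}

# Build a new dict rather than overwriting values while iterating.
ranks = {idx: rank_to_int(value) for idx, value in priorities.items()}

print(ranks)  # {0: 1, 1: 5, 2: 12}
```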

Addy81

Reviewed 1 of 6 files at r1.
Reviewable status: 2 of 6 files reviewed, 4 unresolved discussions (waiting on @kjwinfield)

resources/home/dnanexus/get_variant_info.py line 389 at r3 (raw file):

                else:
                    break
        return top

If not returning top, the docstring now feels like it doesn't match the function?

If you don't need gold, silver and bronze anymore, could this be done with a df and a sort/select of the values instead of all the if statements?
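A minimal sketch of the sort/select approach, under stated assumptions: the column names `variant` and `rank`, the sample rows and the `gel_variants` set are all hypothetical, and whether the real function excludes GEL variants this way is not confirmed by this thread.

```python
import pandas as pd

# Hypothetical Exomiser rows; "variant" and "rank" are assumed column names.
ex_df = pd.DataFrame({
    "variant": ["v1", "v2", "v3", "v5", "v8"],
    "rank": [1, 2, 3, 5, 8],
})

# Variants assumed to be already reported on the GEL tiering page.
gel_variants = {"v2", "v3"}

top = (
    ex_df[~ex_df["variant"].isin(gel_variants)]  # skip variants already shown
    .sort_values("rank")                         # best (lowest) rank first
    .head(3)                                     # keep the first three rows
)

print(top["rank"].tolist())  # [1, 5, 8]
```

Note that `head(3)` cuts off tied ranks arbitrarily; if ties should all be kept, `nsmallest(3, "rank", keep="all")` is one possible alternative.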

Addy81

Reviewable status: 2 of 6 files reviewed, 4 unresolved discussions (waiting on @kjwinfield)

resources/home/dnanexus/get_variant_info.py line 389 at r3 (raw file):

Previously, Addy81 (Adriana) wrote…

If not returning top, the docstring now feels like it doesn't match the function?

If you don't need gold, silver and bronze anymore, could this be done with a df and a sort/select of the values instead of all the if statements?

Sorry, that was meant to say "if only returning top".

kjwinfield

Reviewable status: 2 of 6 files reviewed, 4 unresolved discussions (waiting on @Addy81)

resources/home/dnanexus/get_variant_info.py line 389 at r3 (raw file):

Previously, Addy81 (Adriana) wrote…

Sorry, that was meant to say "if only returning top".

Done

jethror1

Reviewed 2 of 6 files at r1, all commit messages.
Reviewable status: 3 of 6 files reviewed, 5 unresolved discussions (waiting on @Addy81 and @kjwinfield)

resources/home/dnanexus/get_variant_info.py line 348 at r4 (raw file):

        df = df.copy()
        # First change "Exomiser Rank #" string to int
        df['Priority'] = df['Priority'].map(

I would generally avoid modifying your underlying data and then trying to restore its original state. Instead, just create a new column that you can drop at the end, something like:

df['priority_as_int'] = df['Priority'].map(
    lambda x: int(x.split(' ')[-1].split('.')[0])
)

...

df.drop(['priority_as_int'], axis=1, inplace=True)

Code quote:

        # First change "Exomiser Rank #" string to int
        df['Priority'] = df['Priority'].map(

jethror1

Reviewed 3 of 3 files at r5, all commit messages.
Reviewable status: all files reviewed, 5 unresolved discussions (waiting on @Addy81 and @kjwinfield)

kjwinfield added 4 commits August 30, 2024 15:44

fix exomiser duplicate variants

ee4d809

change python version

eab4be7

correct rquirements.txt

b6f53ed

remove prev version

52d5976

Addy81 requested changes Sep 2, 2024

feedback changes

84a126b

kjwinfield commented Sep 2, 2024

edit docstring

47cf985

kjwinfield requested a review from Addy81 September 3, 2024 08:09

Addy81 requested changes Sep 12, 2024

kjwinfield added 2 commits September 16, 2024 12:22

change to use df method

b76e613

fix broken test

f2f907a

kjwinfield commented Sep 16, 2024

kjwinfield requested a review from Addy81 September 16, 2024 12:32

jethror1 requested changes Sep 16, 2024

kjwinfield added 2 commits September 16, 2024 15:51

get rid of df copy and use new column instead

c36191c

remove unused import

7911d7c

kjwinfield requested a review from jethror1 September 16, 2024 14:55

jethror1 approved these changes Sep 16, 2024

jethror1 merged commit 2d23676 into main Sep 16, 2024
1 of 2 checks passed

kjwinfield commented Sep 2, 2024 • edited by woook
