add new arguments #96

cdonnay · 2023-09-07T23:20:32Z

Add a rank_cols argument to load_csv so that the user can specify which columns of the CSV correspond to rankings. This helps, for example, with the 2013 Minneapolis data file on MGGG/chicago, which has extraneous columns.

Also removed the quota parameter from the Election.get_threshold method, since quota should be an attribute of the Election.

Also, this is my first pull request; Moon has asked me to tag y'all going forward. Let me know if you don't wish to be tagged or if I made an error in my request. Thanks!
@jamesturk @drdeford @jgibson517 @ziglaser @jennjwang
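To make the rank_cols idea concrete, here is a minimal sketch using only the standard library. It is not VoteKit's load_csv: the helper name read_rankings, the sample data, and the column layout are all assumptions made up for illustration.

```python
import csv
import io


def read_rankings(csv_text, rank_cols):
    """Return one tuple of ranked choices per ballot row.

    Hypothetical helper for illustration only; not VoteKit's load_csv.
    rank_cols lists the zero-based indices of the ranking columns,
    in rank order.
    """
    reader = csv.reader(io.StringIO(csv_text))
    next(reader)  # skip the header row
    return [tuple(row[i] for i in rank_cols) for row in reader]


# Made-up data: the precinct and ward columns are extraneous.
SAMPLE = """precinct,rank1,rank2,rank3,ward
P-01,A,B,C,1
P-02,B,A,C,1
P-03,C,B,A,2
"""

ballots = read_rankings(SAMPLE, rank_cols=[1, 2, 3])
print(ballots)
```

Only columns 1 through 3 end up in each ballot, so the extraneous columns never need to be stripped from the file beforehand.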

jamesturk · 2023-09-07T23:28:44Z

Hi @cdonnay,

A couple of notes:

You'll want to add .idea to .gitignore and remove the .idea files; those are only relevant to you/PyCharm and shouldn't be part of this repo.
You're adding a required parameter, but it should probably be optional, since load_csv was already working without it. This is also why the tests are failing. (A new test should probably be added to exercise this optional parameter.)

I'll let someone else comment on the self.quota issue as I haven't kept up with the API in that level of detail.

cdonnay · 2023-09-08T17:46:13Z

Thanks, @jamesturk ! I added .idea to .gitignore, and gave the rank_cols a default argument.

I think it's worth discussing whether or not you actually want the rank_cols parameter to have a default argument. My argument for not including a default is that we don't want to force preprocessing of the csv file onto the user, and we want the user to think intentionally about loading in their data. Happy to hear other ideas about why we might want a default.

jennjwang · 2023-09-08T18:01:29Z

Hi @cdonnay, thanks for helping with VoteKit! I agree with James on making rank_cols an optional argument. I think the user should have an idea of what their data looks like before loading it, but it would be annoying for them to specify rank_cols every time if their CSV file is already clean, without extraneous columns. A default would save them this trouble.
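A rough sketch of the optional-argument behavior discussed here, assuming an empty list means "treat every column as a ranking". The helper pick_rank_cols is hypothetical and is not part of VoteKit; it only mirrors the `if rank_cols:` guard from the diff.

```python
def pick_rank_cols(row, rank_cols=[]):
    """Keep only the ranking columns of one already-parsed CSV row.

    Hypothetical helper, not VoteKit code. The empty-list default
    stands for "every column is a ranking". The default list is never
    mutated, so sharing it between calls is harmless here.
    """
    if rank_cols:
        return [row[i] for i in rank_cols]
    return list(row)


# Clean file: no argument needed, existing callers keep working.
clean = pick_rank_cols(["A", "B", "C"])

# File with extraneous first and last columns: name the ranking columns.
messy = pick_rank_cols(["P-01", "A", "B", "1"], rank_cols=[1, 2])

print(clean, messy)
```

With this shape, callers written before the parameter existed behave exactly as before, which is the backward-compatibility point raised in the review.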

cdonnay · 2023-09-13T16:08:25Z

Awesome, sounds good to me.

jgibson517 · 2023-09-13T20:38:28Z

src/votekit/elections/election_types.py

@@ -61,20 +61,18 @@ def __init__(
        self.transfer = transfer
        self.seats = seats
        self.tiebreak = tiebreak
-        self.threshold = self.get_threshold(quota)
+        self.quota = quota.lower()


👍 agree with this change
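For readers following along, a minimal sketch of what storing the quota as an attribute can look like. The class QuotaSketch is hypothetical and is not VoteKit's election class; computing the threshold inside __init__ is an assumption, and the formulas are the standard Droop and Hare quotas.

```python
class QuotaSketch:
    """Hypothetical stand-in for an election class; not VoteKit's API."""

    def __init__(self, num_ballots, seats, quota="droop"):
        self.num_ballots = num_ballots
        self.seats = seats
        # Lower-casing lets callers pass "Droop", "HARE", and so on.
        self.quota = quota.lower()
        # Assumption: the threshold is computed once, at construction.
        self.threshold = self.get_threshold()

    def get_threshold(self):
        """Read the quota type from the attribute, not from a parameter."""
        if self.quota == "droop":
            return self.num_ballots // (self.seats + 1) + 1
        if self.quota == "hare":
            return self.num_ballots // self.seats
        raise ValueError(f"unknown quota type: {self.quota!r}")


droop = QuotaSketch(num_ballots=100, seats=3, quota="Droop")
hare = QuotaSketch(num_ballots=100, seats=3, quota="HARE")
print(droop.threshold, hare.threshold)
```

Because get_threshold takes no quota argument, the quota type cannot drift out of sync with the value stored on the object.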

jgibson517 · 2023-09-13T20:39:39Z

src/votekit/cvr_loaders.py

@@ -55,6 +59,12 @@ def load_csv(
    if id_col is not None and not df.iloc[:, id_col].is_unique:
        raise DataError(f"Duplicate value(s) in column at index {id_col}")

+    if rank_cols:


jgibson517 · 2023-09-13T20:43:46Z

.gitignore

@@ -5,4 +5,4 @@ htmlcov/
 .DS_Store
 dist/
 .ipynb_checkpoints
-
+.idea


You may need to delete the .idea and inspectionProfiles files from the version of your repo on GitHub, but after that the .gitignore should handle them in the future!

Okay, I think I just deleted those files.

cdonnay added 4 commits September 8, 2023 10:29

fix None type issue

460e868

Default needs to be a list since that is the declared data type.

line too long

6a4e507

Fixing a line that was too long for test.

add lower to STV

de2b825

Use .lower() method to allow users to input quotas.

cdonnay mentioned this pull request Sep 13, 2023

add tutorial notebook #98

Merged

jgibson517 reviewed Sep 13, 2023

jgibson517 merged commit 2cd0800 into mggg:main Sep 21, 2023

cdonnay deleted the read_csv branch October 4, 2023 12:57
