Changes ids_get() Default Parameters for counterparts and start_date to Reduce Data Volume and Align with User Needs #50 #57

t-emery · 2024-12-06T20:40:41Z

Changed default parameters for start_date and counterparts for ids_get().

Improved function documentation & examples, partially addressing Enhancement: Add documentation to clarify country coverage of IDS + add related informative error handling #47.
changed date behavior. Initially despite changing start_date to 2000, it would still provide data from 1970 if end_date was empty (it worked as expected when end_date was specified). I fixed this, and created filtering logic so that debt service data with projections through 2031 would show up, but for data without projections it filtered out NA values after 2023 (last observed data)
Fixing this required using hard-coded dates. These can be updated every year (Perhaps we can document the "yearly data update checklist in the wiki? Another option would be to add a ids_list_years() and create some logic that pulled the years from there automatically. I figured we have more pressing issues, but might want to address that later.

This is my first pull request! Any feedback welcome.

This reverts commit aa48566. on suggestion of Christopher Smith, who is trying to help me fix my Git mistakes Reverting on suggestion of Christopher Smith who is helping me navigate my GitMistakes

- Changed example code for ids_bulk() to use `\dontrun` instead of `\dontest`. because of CMD Check error related to #51. - Updated function title and parameter descriptions for clarity. - Set new default values for `counterparts` and `start_date`. - changed date logic so it would work with new default start date (previously it would give from 1970 unless a end_date was also specified). - Added filtering logic to remove rows with NA values beyond the latest year of observed data, (2023) while allowing projections (2031). - Enhanced tests to validate new defaults and filtering behavior.

chriscarrollsmith · 2024-12-06T23:57:30Z

R/ids_get.R

+#'   Common options:
+#'   * "WLD" - World total (aggregated across all creditors)
+#'   * "all" - Retrieve data broken down by all creditors
+#'   * Individual creditors use numeric codes (e.g., "730" for China)


Important to be careful about the terms "numeric" and "text" since these have a special meaning in R. (I.e., are we supposed to provide 730 as a numeric variable but "907" as a string?)

Yes, we have to provide identifiers as string (I guess because there can also be leading 0s?). I also believe that we should just write about text codes even if they are numbers.

chriscarrollsmith · 2024-12-07T00:01:39Z

R/ids_get.R

+#'   data volume. For historical analysis, explicitly set to 1970.
+#'
+#' @param end_date A numeric value representing the ending year (default: NULL).
+#'   Must be >= 1970 and cannot be earlier than start_date. If NULL, returns


We actually could remove this constraint, replace any sub-1970 year with 1970, and just raise a warning flag to let the user know we did so.

chriscarrollsmith · 2024-12-07T00:06:41Z

R/ids_get.R

+#'     bondholders)
+#'   Cannot contain NA values.
+#'
+#' @param start_date A numeric value representing the starting year (default:


I do think that if you set the default start date to 2000, you risk misleading casual users into thinking no earlier data is available.

chriscarrollsmith · 2024-12-07T00:30:22Z

R/ids_get.R

  } else {
    "all"
  }
 }
+
+filter_post_actual_na <- function(data) {


I take it you're trying to drop NAs outside the coverage period, but preserve NAs within the coverage period? To avoid hardcoding and continually updating the start and end years, I would suggest:

Get the first and last years in the retrieved dataset with a non-NA value, and treat these as the boundaries of the data, then

Drop all NAs outside those boundaries.

…se_headers

chriscarrollsmith · 2024-12-07T01:00:57Z

I pushed a fix for a typo and a missing mock that were breaking some of the tests. Approved the PR but also added some comments above on the ids_get documentation that you should take a look at before merging. If you'd like me to tackle any additional adjustments, let me know.

codecov · 2024-12-07T01:04:46Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 96.31%. Comparing base (b6f1b9e) to head (6f86aa4).

Additional details and impacted files

@@             Coverage Diff             @@
##              main      #57      +/-   ##
===========================================
- Coverage   100.00%   96.31%   -3.69%     
===========================================
  Files           10       10              
  Lines          235      244       +9     
===========================================
  Hits           235      235              
- Misses           0        9       +9

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

chriscarrollsmith · 2024-12-07T13:50:02Z

Closes #50

christophscheuch

Looks good mostly! There is a test coverage gap now according to the codecov report: the multi-page request is not fired during testing because we don't pull enough data through ids_get() - need to include at least one test / example where we download a lot of data from the API.

christophscheuch · 2024-12-07T14:12:15Z

R/ids_get.R

-#'  or `FALSE`.
+#' @param geographies A character vector of geography identifiers representing
+#'   debtor countries and aggregates. Must use `geography_id` from
+#'   `ids_list_geographies()`:


If you use \link{ids_list_geographies} instead of ids_list_geographies(), then you get a hyperlink to the function in the docs and I believe this is desired :)

christophscheuch · 2024-12-07T14:13:52Z

R/ids_get.R

+#'   Common options:
+#'   * "WLD" - World total (aggregated across all creditors)
+#'   * "all" - Retrieve data broken down by all creditors
+#'   * Individual creditors use numeric codes (e.g., "730" for China)


Yes, we have to provide identifiers as string (I guess because there can also be leading 0s?). I also believe that we should just write about text codes even if they are numbers.

t-emery added 2 commits December 6, 2024 13:17

Revert "Revert "Update ids_get() defaults + documentation""

0d11db3

This reverts commit aa48566. on suggestion of Christopher Smith, who is trying to help me fix my Git mistakes Reverting on suggestion of Christopher Smith who is helping me navigate my GitMistakes

t-emery requested review from chriscarrollsmith and christophscheuch December 6, 2024 20:40

t-emery added documentation Improvements or additions to documentation enhancement New feature or request labels Dec 6, 2024

Merge branch 'main' into 50-enhancement-change-ids_get-default-pa

2f34b0e

chriscarrollsmith reviewed Dec 6, 2024

View reviewed changes

chriscarrollsmith reviewed Dec 7, 2024

View reviewed changes

Corrected typo in variable name and added missing mock for get_respon…

6f86aa4

…se_headers

chriscarrollsmith approved these changes Dec 7, 2024

View reviewed changes

chriscarrollsmith linked an issue Dec 7, 2024 that may be closed by this pull request

Enhancement: Change ids_get() Default Parameters for counterparts and start_date to Reduce Data Volume and Align with User Needs #50

Open

christophscheuch requested changes Dec 7, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Changes ids_get() Default Parameters for counterparts and start_date to Reduce Data Volume and Align with User Needs #50 #57

Changes ids_get() Default Parameters for counterparts and start_date to Reduce Data Volume and Align with User Needs #50 #57

t-emery commented Dec 6, 2024

chriscarrollsmith Dec 6, 2024

christophscheuch Dec 7, 2024

chriscarrollsmith Dec 7, 2024

chriscarrollsmith Dec 7, 2024

chriscarrollsmith Dec 7, 2024

chriscarrollsmith commented Dec 7, 2024

codecov bot commented Dec 7, 2024

chriscarrollsmith commented Dec 7, 2024

christophscheuch left a comment

christophscheuch Dec 7, 2024

christophscheuch Dec 7, 2024

Changes ids_get() Default Parameters for counterparts and start_date to Reduce Data Volume and Align with User Needs #50 #57

Are you sure you want to change the base?

Changes ids_get() Default Parameters for counterparts and start_date to Reduce Data Volume and Align with User Needs #50 #57

Conversation

t-emery commented Dec 6, 2024

This is my first pull request! Any feedback welcome.

chriscarrollsmith Dec 6, 2024

Choose a reason for hiding this comment

christophscheuch Dec 7, 2024

Choose a reason for hiding this comment

chriscarrollsmith Dec 7, 2024

Choose a reason for hiding this comment

chriscarrollsmith Dec 7, 2024

Choose a reason for hiding this comment

chriscarrollsmith Dec 7, 2024

Choose a reason for hiding this comment

chriscarrollsmith commented Dec 7, 2024

codecov bot commented Dec 7, 2024

Codecov Report

chriscarrollsmith commented Dec 7, 2024

christophscheuch left a comment

Choose a reason for hiding this comment

christophscheuch Dec 7, 2024

Choose a reason for hiding this comment

christophscheuch Dec 7, 2024

Choose a reason for hiding this comment