feat: use bhepop2 package for income assignment #243

leo-desbureaux-tellae · 2024-06-18T08:12:52Z

Introduction of the Bhepop2 package for income assignment

data.income.municipality

The distributions DataFrame returned by data.income.municipality is now tagged by attribute and modality
Attributes describes a property present on the the population agents. A modality is a value taken by this attribute.
In Eqasim, we use two attributes:

"size", with modalities in ["1_pers", "2_pers", "3_pers", "4_pers", "5_pers_or_more"]
"family_comp", with modalities in ["Single_man", "Single_wom", "Couple_without_child", "Couple_with_child", "Single_parent", "complex_hh"]

New columns of the returned DataFrame are ["commune_id", "q1", "q2", "q3", "q4", "q5", "q6", "q7", "q8", "q9", "attribute", "modality", "is_imputed", "is_missing", "reference_median"]
Global distributions (those that were returned in the previous version of municipality.py) are tagged with attribute and modality "all".

synthesis.population.income

New config option "income_assignation_method" (should it be "income_assignment_method" ?). This config allows choosing the method used to assign an income to population agents.
The former method is called via the config "uniform" (what should be the default config ?).
A new assignation method has been added, called via the config "bhepop2". This method uses the Bhepop2 package to match per attribute distributions instead of just matching the global one.

analysis.methods.income.compare_methods

A new analysis module has been added to compare the assignation methods. It can be run using the path analysis.methods.income.compare_methods.
This module generates plots comparing income distributions of each assignation method and the source distribution (here, Filosofi data).
This comparison is done per attribute. For instance, we compare the income distribution of individuals with attribute "family_comp" equal to "Single_parent" for the two methods, and see what method matches best the source distribution.

Another output of this module is a table measuring the distance of each method to the source distribution, here again per attribute. This allows a more measurable comparison between assignation methods.

describe attribute modalities when giving attribute selection

municipality stage now returns a DataFrame containing income deciles per attributes in addition to the usual global deciles. Two columns "attribute" and "modality" have been added to specify the related attribute and the value it takes (modality). Attribute and modality for global deciles are "all". Filter on "all" attribute and modality have been added where data.income.municipality were used.

merge municipality_attributes.py into municipality.py move compare_methods.py to analysis/methods/income/ created a utils.py module in synthesis/population/income/ to store common functions added test dataset and tests --------- Co-authored-by: leo-desbureaux-tellae <[email protected]>

sebhoerl · 2024-06-20T06:41:48Z

Hi! Thanks a lot, I'm at hEART this week and a bit busy the week after, but I'll look at it asap!

sebhoerl

Thanks, I made a couple of minor comments, but this looks very good :)

docs/population.md

documentation/info/collect.py

synthesis/population/income/uniform.py

synthesis/population/spatial/primary/candidates.py

sebhoerl · 2024-07-04T06:17:14Z

Thanks a lot, LGTM

leo-desbureaux-tellae · 2024-07-14T11:41:27Z

Issues from your first review are fixed ! Let me know if there is something more that needs changes.

leo-desbureaux-tellae · 2024-07-14T11:43:06Z

Resolved conflict in changelog

sebhoerl · 2024-07-15T06:41:42Z

Thanks, looks all good, you can merge!

Nitnelav · 2024-07-15T07:22:34Z

Perfect, thank you very much @leo-desbureaux-tellae, thanks @sebhoerl for the review !! ⛵ 🚀

leo-desbureaux-tellae and others added 23 commits June 17, 2024 10:56

feat: use bhepop2 for eqasim income assignment

7624cd4

todo: manage zip

cd85ecc

corrections from rebase (Filosofi is now read from .zip file)

20a3262

restore default config file

b39b836

add missing configuration of income_com_path

3ac39a7

add missing request of income_com_xlsx config

f6a7fcd

remove columns filters and rename from _read_filosofi_excel

ccf45dc

cleanup

b580c58

convert income to integer

622ecfc

remove pandas warning

f53a6cf

improve error catches and code layout

8a40665

include population size in plot title

e568839

remove MODALITIES constant

0dbc129

describe attribute modalities when giving attribute selection

wip use bhepop2

bd758a8

move income_uniform_sample and MAXIMUM_INCOME_FACTOR to a utils module

becece3

change MAXIMUM_INCOME_FACTOR back to original value

49c259a

renamed "eqasim method" into "uniform method"

e521cbc

move compare_methods.py to analysis/methods/income/

8dd39f5

renamed bhepop2_income.py module into bhepop2.py

62bb2ab

fix: conflicts

c4011b0

Merge branch 'develop' into feat/use_bhepop2

efef3c5

Nitnelav requested a review from sebhoerl June 18, 2024 08:48

Nitnelav and others added 2 commits June 18, 2024 14:55

try to make test_determinism work (#6)

4476d51

add income_assignation_method config documentation

9ca5acd

sebhoerl requested changes Jul 2, 2024

View reviewed changes

leo-desbureaux-tellae added 2 commits July 3, 2024 12:43

remove blank lines

b85ca29

remove debug print

88d4655

leo-desbureaux-tellae added 2 commits July 3, 2024 12:44

remove float casting

cb4581b

update docs

dd653b2

sebhoerl approved these changes Jul 4, 2024

View reviewed changes

renamed "modality" column in "value"

8c4cc30

Merge branch 'develop' into feat/use_bhepop2

173adc8

Nitnelav merged commit f74bd98 into eqasim-org:develop Jul 15, 2024
2 checks passed

leo-desbureaux-tellae deleted the feat/use_bhepop2 branch July 19, 2024 08:53

sebhoerl mentioned this pull request Jan 6, 2025

chore(develop): release 1.3.0 #294

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: use bhepop2 package for income assignment #243

feat: use bhepop2 package for income assignment #243

leo-desbureaux-tellae commented Jun 18, 2024 •

edited

Loading

sebhoerl commented Jun 20, 2024

sebhoerl left a comment

sebhoerl commented Jul 4, 2024

leo-desbureaux-tellae commented Jul 14, 2024

leo-desbureaux-tellae commented Jul 14, 2024

sebhoerl commented Jul 15, 2024

Nitnelav commented Jul 15, 2024

feat: use bhepop2 package for income assignment #243

feat: use bhepop2 package for income assignment #243

Conversation

leo-desbureaux-tellae commented Jun 18, 2024 • edited Loading

Introduction of the Bhepop2 package for income assignment

data.income.municipality

synthesis.population.income

analysis.methods.income.compare_methods

sebhoerl commented Jun 20, 2024

sebhoerl left a comment

Choose a reason for hiding this comment

sebhoerl commented Jul 4, 2024

leo-desbureaux-tellae commented Jul 14, 2024

leo-desbureaux-tellae commented Jul 14, 2024

sebhoerl commented Jul 15, 2024

Nitnelav commented Jul 15, 2024

leo-desbureaux-tellae commented Jun 18, 2024 •

edited

Loading