Stochasticity #349

daviesje · 2023-12-04T13:37:32Z

DRAFT PR FOR v4

This update focuses around two things:

The new sampler for sub-grid halo sources, meant to include stochasticity below the grid scale
reorganisation of the C code to be more modular, making things easier for future development

The state of the halo model currently:

We can produce halo populations which conform to a given conditional mass function, and which remain self-consistent across cosmic time. There is a drift from the given CMF at very high redshifts (~10% z=20) which is worse for small timesteps. work to improve the accuracy and speed of this method is ongoing.
There is an increased memory requirement. While halo catalogues down to 10^8 are of order ~2GB, feedback requires that halo mass catalogues are generated backward in time, while the stellar component is generated forward in time. This can be fixed with a proper purging routine implemented for the HaloField structs (in progress).
Halos larger than the cell size are generated with DexM, and there are some issues there. I find that trying to generate these catalogues on the low-density grid produces a factor of ~10 too few halos, and generating with Press-Schechter mass function results in a factor of ~3 too many. Only Sheth-Tormen on the highres grid is consistent, I am working to improve both the threading of this calculation and flexibility with resolution and HMF (while we expect the model to perform worse on low resolution I believe there are also some bugs here).
The only available CMF currently is EPS with rescaling to fit the collapsed fraction of another MF. I plan to include fits for a Sheth-Torment CMF, as well as rescaling options which have been used in the literature to create a CMF (both rescalings of the EPS CMF i.e tree augmentation methods, and rescalings of UMF to a CMF i.e analytic models).
I have placed test outputs in a single function at the end of Stochasticity.c which obviously needs to be converted to proper tests before the merge

State of the C modularisation

The spin temperature code has been the main focus of modularisation. there is still work to be done there but I believe it is much cleaner than before.
I have created a new file, interpolation.c, to hold interpolation routines and table functions. This will allow us to combine a few tables which currently hold the same values but have different generation functions. I have moved most of the Ts.c tables here and will continue to move tables over as I cleanup the other functions
modularisation of IonisationBox.c has not yet started

Biggest Issues

Going backward in time with the halo model makes the recursive generation very painful, everything runs fine with run_coeval or run_lightcone however calling Ionisation_box at z=6 with HALO_STOCHASTICITY on does not work well, it has been disabled until I find a way to do this properly
There is a new output object, XraySourceBox, due to the requirement of halos at multiple redshifts I have separated the source box generation (filtered SFRD) from the spin temperature calculation. I aim to move the non-halo case here as well, which will allow us to no-longer rely on linear growth when making these boxes, and use annular filtering for more self-consistency. However the implementation is very hacky at the moment, calling _compute() multiple times over a series of inputs, and setting the _computed_in_mem to False. There should be a better way to calculate a box with multiple input redshifts (In my opinion if we want to move some functions to Jax this is probably a good first candidate).
I have implemented minihalos in a way which is similar to the default case, where there are two source components with different stellar/spectral parameters based on halo mass. This should produce similar results but is conceptually strange, where each halo has both ACG and MCG components in different fractions based on the halo mass.
The implementation of minihalos also presents a memory issue, as the number of halos can increase dramatically at lower masses. There is a mean source box calculation I can alter to implement a sub-resolution model. I also want to look into chunking via Dask or similar, as the halo calculations are independent.

MISC

The photon conservation model has given us a lot of grief. Shifting the redshift of a halo-based source box has required a lot of messy implementation, and its use with recombination and minihalo flags is difficult. As a stopgap there is a very simple model which adjust the escape fraction parameters instead of the redshift and produces expected results, this simple model needs some cleanup, and the original photon conservation model needs some work to be useful with the new model.

…into stochasticity

Conflicts: src/py21cmfast/src/FindHaloes.c

…aloField

daviesje · 2024-03-11T10:09:52Z

When the halo sampler is switched off, results very closely resemble the master branch, I have noticed only < 10^-3 level differences when examining global quantities and individual cells. However there seem to be bigger differences in various power spectra. Attached are paired lightcone and global plots for three cases, default parameters (titled _KS_OLD[_MASTER]), setting USE_MASS_DEPENDENT_ZETA=True (titled _KS[_MASTER]) and USE_MINI_HALOS=TRUE (titled KS_MINI[MASTER]).

Most of the tests fail since the interpolation tables and integral structure has changed, causing floating point errors (some float -> double conversion has been done, and many interpolation tables have different binning). The failures without the spin temperature fluctuations have very small differences at the smallest scales in the power spectra.

tests.test_integration_features.py--test_power_spectra_lightcone[mdzeta].pdf

With the spin temperature fluctuations, global values appear identical but power spectra show significant differences at moderate and small scales. This difference is not something I've been able to track down looking at individual cells, but it seems to not affect the history very much.

tests.test_integration_features.py--test_power_spectra_lightcone[tsfluct].pdf

tests.test_integration_features.py--test_power_spectra_lightcone[mini_halos].pdf

Obviously, with USE_HALO_FIELD=True, the results are very different. I am currently performing some runs with just DexM on this branch to examine but these are very slow. I would expect differences from the new gridding structure and the different implementation of the halos in ComputeIonizedBox.

tests.test_integration_features.py--test_power_spectra_lightcone[halo_field].pdf

steven-murray · 2024-03-11T10:31:13Z

@daviesje this is looking great, thanks for the plots!

I think we need to bring in @andreimesinger here to weigh in on what kind of tolerance we should have for changes. The good thing is that the simplest cases (no spin temp, no halos) is working great. Happy to give that my stamp of approval.

Even with the spin temperature fluctuations, I think we're doing pretty well. The differences, as expected, are only evident in the fields dependent on Ts. The mini-halos example is a little concerning (~100% differences in some spectra), so I'd like @andreimesinger to weigh in there.

When using the halo field, we expect differences, and we see them. However, we should be careful to understand what kind of differences we might expect, and whether their magnitude is reasonable here. I'm seeing ~20% differences in the globally-averaged quantities which I don't think I expected, but perhaps I'm too naive.

daviesje · 2024-03-11T11:52:54Z

To give a little more context, the halo field differences will always be a bit of an apples-to-oranges comparison. In the master branch, the halo field is converted to collapsed fraction by dividing the halo mass in a cell by the cell mass, then either the constant zeta is applied (if USE_MASS_DEPENDENT_ZETA=False) or it is assumed all that mass is at 10^10 solar and the constant escape and stellar fractions are used (if USE_MASS_DEPENDENT_ZETA=True). The new model assigns stellar mass and sfr to the halos in accordance with the Park+19 model, with or without the lognormal scatter depending on parameters. For debugging I could create an option which uses the constant zeta with the halo boxes but I don't see this ever being useful.

Conflicts: src/py21cmfast/inputs.py src/py21cmfast/outputs.py src/py21cmfast/src/GenerateICs.c src/py21cmfast/src/IonisationBox.c src/py21cmfast/src/SpinTemperatureBox.c src/py21cmfast/src/ps.c src/py21cmfast/wrapper.py

daviesje and others added 30 commits December 15, 2021 11:34

WIP finished sampling and grid building

2cefc4c

WIP adding cdf sampler

c620d93

Merge branch 'stochasticity' of https://github.com/daviesje/21cmFAST …

4446fdb

…into stochasticity

WIP testing inversion

2969aa0

debug messages, OMP in FindHaloes and refactoring the test samplers

d1ebcda

fix omp variables

19ccc63

fix omp bug in halo catalogues

c3e73a1

allow mass binning in HMF samples

ea7406a

add box sampling & euler conversion

1f262f4

add HMF option

5dc62d4

WIP threading DexM

da7e03b

fix bug where optimize excluded all small halos

d741199

thread the main halo loop, 1 percent error up to 4 threads

f9b7b4a

Merge branch 'dexm_changes' into stochasticity

41b94ff

Conflicts: src/py21cmfast/src/FindHaloes.c

thread the halolist sampling

1fd2b62

modularise properties

286f2a1

output HaloField from stochastic sampling, add properties to PerturbH…

3faf7d4

…aloField

produce proper output structures and ionised boxes

2ed1441

working halo catalogue updator

111f1c3

add halo box output

1983193

improve inverse table generation & fix threading

bfb1bbc

fix high delta mass sampling

4721a55

add halo lightcones

7fb71c0

minor plotting fixes

8b39825

use halo boxes instead of catalogs for input

d936e3d

quick duty cycle

16a2dbe

comments, input flags and leave duty cycle for later

59dc01b

duty cycle mean and unfinished ps_ratio

7547717

WIP spintemp separation

e9013db

producing (reasonable) Ts boxes

49a2e2b

daviesje added 9 commits March 4, 2024 15:27

fix rng seed migration

6b30f97

set rng seeding for consistency with master

f789247

fix ts global tables, add tests and test plots

0c4100b

cleanup more old todo's

6ed9a6b

fix a bunch of mass limit issues

2a68536

cleanup more old todos

96a5166

cleanup photoncons options

5ccd746

add photoncons files

efc28a3

force QAG integration on Nion_General

9f6ff2b

change the HMF error to a warning in perturb_halo_list

0404bb8

require lowres density for sampler

b2ee7fe

daviesje added 13 commits March 11, 2024 15:03

fix some dexm only issues

f00e586

Merge branch 'master' into stochasticity

4a7281b

Conflicts: src/py21cmfast/inputs.py src/py21cmfast/outputs.py src/py21cmfast/src/GenerateICs.c src/py21cmfast/src/IonisationBox.c src/py21cmfast/src/SpinTemperatureBox.c src/py21cmfast/src/ps.c src/py21cmfast/wrapper.py

fix halobox broadcasts

b7ed2d2

clarification, cleanup and fixing FAST_FCOLL_TABLES

b7eca14

fix silly warning error

59fceae

cleanup xraysource logging & fix broadcast bug

4561c5b

various minor fixes

48ca369

place delta limits in evaluation functions, fix test samples

cd38937

more consistent table limits & function calls

f64e489

change delta limit, force QAG over GL at high delta

37fc87f

ffcoll now includes the split approximation from Munoz+22

0693e97

set critical density limits at the correct mass

b11074a

some cleanup & fix first lc save with filename

811ee23

daviesje marked this pull request as ready for review March 27, 2024 11:17

more clarification in sampler notebook

194df45

steven-murray merged commit b727e9a into 21cmfast:v4-prep Mar 27, 2024
2 of 3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stochasticity #349

Stochasticity #349

daviesje commented Dec 4, 2023 •

edited

Loading

daviesje commented Mar 11, 2024

steven-murray commented Mar 11, 2024

daviesje commented Mar 11, 2024

Stochasticity #349

Stochasticity #349

Conversation

daviesje commented Dec 4, 2023 • edited Loading

DRAFT PR FOR v4

The state of the halo model currently:

State of the C modularisation

Biggest Issues

MISC

daviesje commented Mar 11, 2024

steven-murray commented Mar 11, 2024

daviesje commented Mar 11, 2024

daviesje commented Dec 4, 2023 •

edited

Loading