-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ability to simulate simple 3D structure in Hi-C/3C reads #39
Comments
First port of call, the easiest structural detail to approximate is the single primary fold of a circular chromosome, leading to a second anti-diagonal intensity profile in a heatmap. I would suggest this can be represented as a reflection of the primary distribution with a lower intensity. |
Are we treating 3D structure in a secondary or tertiary sense? That is: low observational frequency driven by inaccessible regions due to tight folding, or lets say increased obs freq because of physical proximity due to consistent 3D structural elements (folds)? |
First port of call sounds great. |
Prototyping generators. Initially, I made the mistake of thinking that the anti-diagonal was product of interacting flattened circular chromosomes. Visually, pull a rubber band tight, and opposing strands come into close proximity. In pursuing that, I cobbled together a multi-modal variation of our empirical distribution, where the second smaller maxima was at the halfway point and followed the same geometrical relationship we've imposed. Well, that turns out to make parallel lines of density in the heatmap (parallel to the primary y=x), with a simple phase shifts of +L/2, -L/2. Sorta, duh! Clearly that won't creating orthogonal traces in the density, where a rotation is necessary, a simple transformation (y=L-y). So, along comes the correct treatment. If you combine this with the above (because well, I had it coded) you get something that looks like this. There is no need to keep the multi-modal distribution, but its a nice patch-work quilt. or with the regular distribution One thing that is bothering me is the apparent aliasing within these heatmaps. I am hopeful that this is an artifact on the plotter. |
Nice one, looks warm & cozy. |
nice. so the off-diagonal blocks with heat are presumably the result of closely related strains that have had mismapped read pairs? i see a cluster of 3 there, and the other 5 must have one pair that are close as well... |
Taking the terminology from Tung et al, I have implemented a model of replicon CID regions. These chromosomal interaction domains are what attributed in producing the block density in the heatmap. Stay tuned for a pic via read-mapping. |
Critique against the figure in the paper
|
this looks really neat. i don't have a good sense in how much value there is in perfecting it beyond this point -- maybe need to see some use cases for guidance. |
I think I will close this issue for now. |
A new method for generating CID regions has been implemented. Rather than random intervals, this approach assumes folding regions (interacting domains) are better modelled as a nested (hierarchical) set of intervals. I am making a note here and posting an example contact map. This map is by generated coordinates only, not a full simulation with actual read-mapping. In implementing this method, there is a new issue #70 to fine-tune it. This is regarding the sizes of intervals, the probabilities assigned to them. This figure depicts E.coli chr1, fake chr2 and S.aures chr1. Therefore there are also inter-replicon contacts between chr1 and chr2 of E.coli. |
We have discussed various ways of doing this. Some options we covered were:
Challenges involved were/are:
The text was updated successfully, but these errors were encountered: