You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The memory consumption in the predictRFStat is too big because all trees are evaluated for all of the points
2 changes are needed:
running in chunks -to keep memory in limit
parameters chunkSize or maxMemory
optimization of the CPU - statistics should also be calculated in parallel
Currently only trees are calculated in parallel
There are 2 implementations of the predictRFStat, predictRFStatNew . This was a quick hack after scikit changed some implementation. The hack was not documented and remained in the code.
As the transition happened 6 month ago, maybe we can make New version default and remove the obsolete version...
The text was updated successfully, but these errors were encountered:
To save memory the predictRFStat should be optionally split to the chunk
If we consider that to be a function responsibility of the function - new parameter maxRows to be added
If maxRow is set then the function will be called in chunks with maxRows and all data will be merged at the end.
Memory consumption - nRowsxnTrees which can be significantly smaller than current nEntries x nTrees
The memory consumption in the predictRFStat is too big because all trees are evaluated for all of the points
2 changes are needed:
chunkSize or maxMemory
There are 2 implementations of the predictRFStat, predictRFStatNew . This was a quick hack after scikit changed some implementation. The hack was not documented and remained in the code.
As the transition happened 6 month ago, maybe we can make New version default and remove the obsolete version...
The text was updated successfully, but these errors were encountered: