Skip to content

miranov25/RootInteractive

Repository files navigation

RootInteractive

Code for the interactive sttistical aggregation and visualisation of multidimensional data in ROOT or native Python formats (Panda, numpy).

Support for ROOT data structures:

  • TTree and TTreeFormula, aliases ...
  • TFormula, or any static Root/AliRoot functions.
  • RDataFrame <-> awkward - work in progress

Root and PyRoot (AliRoot/O2) data structures could be used as input data sources. However, the code also works with pandas only, without the need to have the ROOT package installed. Internally, these data structures are converted into the Bokeh CDS (ColumnDataSource) or in our RootInteractive CDS for the NDimensional histograms, projections and aggregated information.

RootInteractive content:

  • Interactive, easily configurable visualisation of non-binned and binned data.
  • Interactive n-dimensional histogramming/projection and derived aggregated information extraction
  • Client/server application Jupyter, Bokeh
  • Standalone client application - (Bokeh Standalone Dashboard)
  • Lossy and lossless data compression (server --> client)
  • ROOT and RDataFrame tools and interfaces

Interactive visualization, histogramming and data aggregation in N-dimensions on client

The figure array declaration is used as an argument in bokehDrawSA to create an array of figures/graphs/scatter plots/ Unbined or binned (Ndimension histogram and derived statistics/projection) bokeh data sources and derived variables and aggregated statistics can be used for drawing.

The declarative programming used in bokehDrawSA is a type of coding where developers express the computational logic without having to programme the control flow of each process. This can help simplify coding, as developers only need to describe what they want the programme to achieve, rather than explicitly prescribing the steps or commands required to achieve the desired result.

The interactive visualization is declared in the 6 arrays as ine the example bellow

bokehDrawSA.fromArray(df, None, figureArray, widgetParams, layout=figureLayoutDesc, tooltips=tooltips, parameterArray=parameterArray,
                          widgetLayout=widgetLayoutDesc, sizing_mode="scale_width", nPointRender=300,
                           aliasArray=aliasArray, histogramArray=histoArray,arrayCompression=arrayCompression)

figureArrray - figure parameterization

  • see READMEfigure
  • Defining scatter/histogram/derived figures using input data source
  • Example declaration of the figure from data source with columns ABCD
    figureArray = [
    [['A'], ['A*A-C*C'], {"size": 2, "colorZvar": "A", "errY": "errY", "errX":"0.01"}],
    [['A'], ['C+A', 'C-A', 'A/A']],
    [['B'], ['C+B', 'C-B'], { "colorZvar": "colorZ", "errY": "errY", "rescaleColorMapper": True}],
    [['D'], ['(A+B+C)*D'], {"colorZvar": "colorZ", "size": 10, "errY": "errY"} ],
    [['D'], ['D*10'], {"errY": "errY"}],
    {"size":"size", "legend_options": {"label_text_font_size": "legendFontSize"}}
    ]

histogramAray - interactive histogramming parameterization and examples

  • Defining interactive ND histogramsand derived statistics, updated based on the user selection, resp. by parametriz
  • see READMEhistogram
  • Example of creating a 3D histogram showing mean, sum and standard in the projection with colour code in the second dimension
    histoArray = [
            {"name": "histoABC", "variables": ["(A+C)/2", "B", "C"], "nbins": [8, 10, 12], "weights": "D", "axis": [0], "sum_range": [[.25, .75]]},
        ]
    figureArray = [
            [['bin_center_1'], ['mean']],
            [['bin_center_1'], ['sum_0']],
            [['bin_center_1'], ['std']],
            {"source": "histoABC_0", "colorZvar": "bin_center_2", "size": 7}
    ]

aliasArray alias/client side function parameterization

  • see READMEaliase
  • javascrript function with which you can define derived variables on the client. Used e.g. to parameterise the selection, histogram weights, efficiencies
  • newly created variables can be used in histogramArray, figureAray, aliasArray
  • Dependency trees to ensure consistency of aliases and the correct order of evaluation of derived variables and use in visualisation.
  • Example declaration:
        aliasArray = [
            # They can also be used as selection (boolen)  used e.g. for histogram weights
            {
                "name": "C_accepted",
                "variables": ["C"],
                "parameters": ["C_cut"],
                "func": "return C < C_cut"
            },
            # User-defined JS columns can also be created in histograms by specifying the context (CDS) parameter
            {
                "name": "efficiency_A",
                "variables": ["entries", "entries_C_cut"],
                "func": "return entries_C_cut / entries",
                "context": "histoA"
            },
            # Shorthand notation - only for scalar functions
            ("effC", "entries_C_cut / bin_count", "histoAC"),
        ]

widgetLayout - layout of the figures

  • READMElayout
  • Layout declared by and dictionary(tabs)/array of figure IDs (index or name ID)
  • Properties per row/simple layout/tab layout can be specified. More local properties have priority.
  • Example declaration:
    layout = {
        "A": [
            [0, 1, 2, {'commonX': 1, 'y_visible': 1, 'x_visible':1, 'plot_height': 300}],
            {'plot_height': 100, 'sizing_mode': 'scale_width', 'y_visible' : 2}
            ],
        "B": [
            [3, 4, {'commonX': 1, 'y_visible': 3, 'x_visible':1, 'plot_height': 100}],
            {'plot_height': 100, 'sizing_mode': 'scale_width', 'y_visible' : 2}
            ]
    }

layout - layout of the widgets

  • see READMElayoutWidget
  • Layout declared by and dictionary(tabs)/array of figure IDs (index or name ID)
  • Properties per row/simple layout/tab layout can be specified. More local properties have priority.
  • Example declaration:
    • simple layout
       widgetLayoutKine=[
           ["dca0","tgl","qPt","ncl"],
           ["dEdxtot","dEdxmax","mTime0"], 
           ["hasA","Run","IR","isMC"], 
           {'sizing_mode': 'scale_width'}
       ]
    • composed layout:
      widgetLayoutDesc={
          "Select":widgetLayoutKine,
          "Histograms":[["nbinsX","nbinsY", "varX","yAxisTransform"], {'sizing_mode': 'scale_width'}],
          "Legend": figureParameters['legend']['widgetLayout'],
          "Markers":["markerSize"]
      }

arrayCompresion -

Machine learning part - work in progrees

  • Wrappers for decision trees and Neural Net
  • Provides interface for the reducible, irreducible errors, proability density function
  • Local linear forest, resp. local kernel regression

RootInteractive Information

Tutorials

ALICE ROOTIntteractive tutorial

Sevearal ALICE use case (detector calibration, QA/QC)

Galery material in the ALICE agenda () and document server