diff --git a/docs/understand/programming_interface.rst b/docs/understand/programming_interface.rst
index 2374ec381d..ff6e333611 100644
--- a/docs/understand/programming_interface.rst
+++ b/docs/understand/programming_interface.rst
@@ -6,10 +6,24 @@ Programming interface
 *******************************************************************************
 
-The programming interface document will focus on the HIP runtime API. The runtime API provides C and C++ functions for event, stream, device and memory managements, etc. The HIP runtime on AMD platform uses the Common Language Runtimes (CLR), while on NVIDIA platform HIP runtime is only a thin layer over the CUDA runtime.
+This document focuses on the HIP runtime API. The runtime API provides C and
+C++ functions for event, stream, device and memory management, among other
+things. On the AMD platform, the HIP runtime uses the Common Language Runtime
+(CLR), while on the NVIDIA platform it is only a thin layer over the CUDA
+runtime.
 
-- **CLR** contains source codes for AMD's compute languages runtimes: ``HIP`` and ``OpenCL™``. CLR includes the implementation of ``HIP`` language on the AMD platform `hipamd `_ and the Radeon Open Compute Common Language Runtime (rocclr). rocclr is a virtual device interface, that HIP runtime interact with different backends such as ROCr on Linux or PAL on Windows. (CLR also include the implementation of `OpenCL `_, while it's interact with ROCr and PAL)
-- **CUDA runtime** is built over the CUDA driver API (lower-level C API). For further information about the CUDA driver and runtime API, check the :doc:`hip:how-to/hip_porting_driver_api`. On non-AMD platform, HIP runtime determines, if CUDA is available and can be used. If available, HIP_PLATFORM is set to ``nvidia`` and underneath CUDA path is used.
+- **CLR** contains the source code for AMD's compute language runtimes:
+  ``HIP`` and ``OpenCL™``. CLR includes the implementation of the ``HIP``
+  language on the AMD platform (`hipamd `_) and the Radeon Open Compute
+  Common Language Runtime (ROCclr). ROCclr is a virtual device interface
+  through which the HIP runtime interacts with different backends, such as
+  ROCr on Linux or PAL on Windows. CLR also includes the implementation of
+  `OpenCL `_, which interacts with ROCr and PAL.
+- The **CUDA runtime** is built on top of the CUDA driver API, a lower-level
+  C API. For further information about the CUDA driver and runtime APIs, check
+  the :doc:`hip:how-to/hip_porting_driver_api`. On non-AMD platforms, the HIP
+  runtime determines whether CUDA is available and can be used. If it is
+  available, ``HIP_PLATFORM`` is set to ``nvidia`` and the underlying CUDA
+  path is used.
 
 .. I am not sure we should share this.
 The different runtimes interactions are represented on the following figure.
@@ -19,7 +33,15 @@
 HIP compilers
 =============
 
-The HIP runtime API and HIP C++ extensions are available with HIP compilers. On AMD platform ROCm currently provides two compiler interfaces: ``hipcc`` and ``amdclang++``. The ``hipcc`` command-line interface aims to provide a more familiar user interface to users who are experienced in CUDA but relatively new to the ROCm/HIP development environment. On the other hand, ``amdclang++`` provides a user interface identical to the clang++ compiler. (For further details, check `llvm `_). On NVIDIA platform ``hipcc`` invoke the locally installed ``NVCC`` compiler, while on AMD platform it's invoke ``amdclang++``.
+The HIP runtime API and HIP C++ extensions are available with HIP compilers.
+On the AMD platform, ROCm currently provides two compiler interfaces: ``hipcc``
+and ``amdclang++``. The ``hipcc`` command-line interface aims to provide a
+more familiar user interface to users who are experienced in CUDA but
+relatively new to the ROCm/HIP development environment. ``amdclang++``, on the
+other hand, provides a user interface identical to the clang++ compiler (for
+further details, check `llvm `_). On the NVIDIA platform, ``hipcc`` invokes
+the locally installed ``NVCC`` compiler, while on the AMD platform it invokes
+``amdclang++``.
 
 .. Need to update the link later.
 For AMD compiler options, check the `GPU compiler option page `_.
 
@@ -27,16 +49,60 @@ For AMD compiler options, check the `GPU compiler option page `_.
-* Modify the host code by replacing the <<<...>>> syntax introduced in Kernels by the necessary CUDA runtime function calls to load and launch each compiled kernel from the PTX code and/or cubin object.
+* Modify the host code by replacing the ``<<<...>>>`` syntax introduced in
+  kernels with the necessary CUDA runtime function calls to load and launch
+  each compiled kernel from the PTX code and/or cubin object.
 
-``NVCC`` and ``amdclang++`` target different architectures and use different code object formats: ``NVCC`` is ``cubin`` or ``ptx`` files, while the ``amdclang++`` path is the ``hsaco`` format.
+``NVCC`` and ``amdclang++`` target different architectures and use different
+code object formats: ``NVCC`` uses ``cubin`` or ``ptx`` files, while
+``amdclang++`` uses the ``hsaco`` format.
 
 For example of compiling from command line, check the :ref:` SAXPY tutorial compiling `.
 
+.. _hip_runtime_api_understand:
+
 HIP runtime API
-===============
\ No newline at end of file
+===============
+
+.. _driver_api_understand:
+
+Driver API
+==========
+
+The driver API offers developers low-level control over GPU operations,
+enabling them to manage GPU resources, load and launch kernels, and handle
+memory explicitly. This API is more flexible and powerful than the runtime
+API, but it requires a deeper understanding of the GPU architecture and more
+detailed management.
+
+One significant advantage of the driver API is its ability to dynamically load
+and manage code objects, which is particularly useful for applications that
+need to generate or modify kernels at runtime. This flexibility allows for
+more sophisticated and adaptable GPU programming.
+
+Memory management with the driver API involves explicit allocation,
+deallocation, and data transfer operations. This level of control can lead to
+optimized performance for specific applications, as developers can fine-tune
+memory usage. However, it also demands careful handling to avoid memory leaks
+and ensure efficient memory utilization.
+
+Unlike the runtime API, the driver API does not automatically handle tasks
+such as context creation and kernel loading. While the runtime API is more
+convenient and easier to use for most applications, the driver API provides
+greater control and can be more efficient for complex or performance-critical
+applications.
+
+Using the driver API can result in longer development times because it
+requires more detailed code and explicit management. However, the actual
+runtime performance can be similar to, or even better than, that of the
+runtime API, depending on how well the application is optimized.
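+
+As a rough illustration of this trade-off, the following sketch loads a
+pre-built code object at runtime, looks up a kernel by name, and launches it
+through the module interface. The file name ``vector_add.hsaco`` and kernel
+name ``vector_add`` are placeholders, and error checking is omitted for
+brevity.
+
+.. code-block:: cpp
+
+    #include <hip/hip_runtime.h>
+
+    int main() {
+        // Load a pre-compiled code object (for example, one produced with
+        // hipcc --genco) instead of linking the kernel into the binary.
+        hipModule_t module;
+        hipModuleLoad(&module, "vector_add.hsaco");
+
+        // Look up the kernel by name inside the loaded module.
+        hipFunction_t kernel;
+        hipModuleGetFunction(&kernel, module, "vector_add");
+
+        // Explicitly allocate device memory for the kernel argument.
+        size_t n = 1024;
+        float* d_x = nullptr;
+        hipMalloc(reinterpret_cast<void**>(&d_x), n * sizeof(float));
+
+        // Pack the arguments and launch manually through the module interface.
+        void* args[] = {&d_x, &n};
+        hipModuleLaunchKernel(kernel,
+                              4, 1, 1,    // grid dimensions
+                              256, 1, 1,  // block dimensions
+                              0, nullptr, // shared memory bytes, stream
+                              args, nullptr);
+        hipDeviceSynchronize();
+
+        hipFree(d_x);
+        hipModuleUnload(module);
+        return 0;
+    }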
+
+While AMD HIP does not have a direct equivalent to CUDA's driver API, it
+supports driver API functionality such as managing contexts, modules, memory,
+and driver entry point access. These features are detailed in
+:ref:`porting_driver_api` and described in :ref:`driver_api_reference`.
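+
+The memory facet of this support can be sketched as follows: allocation goes
+through the usual HIP allocation call, while the transfers use the
+driver-style ``hipMemcpyHtoD`` and ``hipMemcpyDtoH`` functions with a
+``hipDeviceptr_t`` handle. This is only a minimal illustration; error checking
+and any kernel work on the buffer are omitted.
+
+.. code-block:: cpp
+
+    #include <hip/hip_runtime.h>
+    #include <vector>
+
+    int main() {
+        std::vector<float> h_x(256, 1.0f);
+        const size_t bytes = h_x.size() * sizeof(float);
+
+        // Explicit allocation of device memory.
+        void* d_mem = nullptr;
+        hipMalloc(&d_mem, bytes);
+        hipDeviceptr_t d_x = reinterpret_cast<hipDeviceptr_t>(d_mem);
+
+        // Driver-style explicit host-to-device and device-to-host copies.
+        hipMemcpyHtoD(d_x, h_x.data(), bytes);
+        hipMemcpyDtoH(h_x.data(), d_x, bytes);
+
+        // Explicit deallocation: with manual management, forgetting this
+        // call is what produces the memory leaks mentioned earlier.
+        hipFree(d_mem);
+        return 0;
+    }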