[Refactor] Decouple Flint index monitor from Flint Spark API layer #435

dai-chen · 2024-07-18T17:31:07Z

Is your feature request related to a problem?

Currently, FlintSparkIndexMonitor is placed within the Flint Spark integration module and is started automatically by the Flint refresh and recover API. This causes the following problems:

Monitoring logic is coupled with the Flint Spark API.
The Flint Spark application code has to bypass Flint SQL and call the awaitMonitor API directly: https://github.com/opensearch-project/opensearch-spark/blob/main/spark-sql-application/src/main/scala/org/apache/spark/sql/JobOperator.scala#L96
Index monitor is unable to reuse error handling code in application: Store error message for streaming job execution in Flint metadata log #433 (comment)

What solution would you like?

I propose moving the index monitor to the Flint Spark application layer. The application code should control the start and stop of the index monitor. This would:

Decouple the monitoring logic from the Flint Spark API.
Allow for more flexible and explicit control of index monitoring within the application code.

Key considerations include evaluating if the benefits are worth the effort and any associated risks, as well as determining how the application code should decide to start monitoring after the Flint index creation statement completes.

What alternatives have you considered?

Keeping the FlintSparkIndexMonitor in the integration module but refactoring the awaitMonitor call mechanism to reduce complexity.
Creating a separate monitoring service that can be invoked independently by the Flint Spark application and Flint SQL.

Do you have any additional context?

N/A

The text was updated successfully, but these errors were encountered:

dai-chen added the maintenance Code refactoring label Jul 18, 2024

github-actions bot added the untriaged label Jul 18, 2024

dai-chen mentioned this issue Jul 18, 2024

Store error message for streaming job execution in Flint metadata log #433

Merged

dai-chen removed the untriaged label Jul 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Refactor] Decouple Flint index monitor from Flint Spark API layer #435

[Refactor] Decouple Flint index monitor from Flint Spark API layer #435

dai-chen commented Jul 18, 2024

[Refactor] Decouple Flint index monitor from Flint Spark API layer #435

[Refactor] Decouple Flint index monitor from Flint Spark API layer #435

Comments

dai-chen commented Jul 18, 2024