Chore: Add Pipeline Obs averageRunTime #24979

SumanMaharana · 2025-12-23T15:22:08Z

Describe your changes:

Chore Add Pipeline Obs averageRunTime
In pipelines/name/fqn API we only send the latest taskStatus

Type of change:

Checklist:

I have read the CONTRIBUTING document.
My PR title is Fixes <issue-number>: <short explanation>
I have commented on my code, particularly in hard-to-understand areas.
For JSON Schema changes: I updated the migration scripts or explained why it is not needed.

Summary by Gitar

Schema extension:
- Added averageRunTime field to pipelineObservability.json for tracking average pipeline execution duration in milliseconds
New calculation logic:
- calculateAverageRuntime in PipelineRepository.java computes 30-day average using task-level timing with pipeline-level fallback
SQL query optimization:
- Refactored execution trend queries with CTEs for both MySQL and PostgreSQL to support task-based runtime calculation

_{This will update automatically on new commits.}

github-actions · 2025-12-23T15:28:13Z

TypeScript types have been updated based on the JSON schema changes in the PR

github-actions · 2026-01-05T11:08:17Z

TypeScript types have been updated based on the JSON schema changes in the PR

github-actions · 2026-01-05T11:32:49Z

Jest test Coverage

UI tests summary

Lines	Statements	Branches	Functions
	65.25% (52217/80024)	43.14% (25833/59879)	46.53% (8094/17394)

gitar-bot · 2026-01-08T17:41:18Z

openmetadata-service/src/main/java/org/openmetadata/service/jdbi3/PipelineRepository.java

            "Failed to find pipeline status for %s at %s", pipeline.getName(), timestamp));
  }

+  private Long calculateActualDuration(PipelineStatus pipelineStatus) {
+    if (pipelineStatus.getTaskStatus() == null || pipelineStatus.getTaskStatus().isEmpty()) {
+      return null;
+    }
+
+    Long minStartTime =
+        pipelineStatus.getTaskStatus().stream()
+            .map(task -> task.getStartTime())
+            .filter(java.util.Objects::nonNull)
+            .min(Long::compare)
+            .orElse(null);
+
+    Long maxEndTime =
+        pipelineStatus.getTaskStatus().stream()
+            .map(task -> task.getEndTime())
+            .filter(java.util.Objects::nonNull)
+            .max(Long::compare)
+            .orElse(null);
+


⚠️ Performance: Duplicated runtime calculation logic in two methods

Details

The runtime calculation logic (finding min startTime and max endTime from task statuses) is duplicated in three places:

calculateActualDuration() (lines 425-446)

calculateAverageRuntime() (lines 679-748)

The SQL CTEs in CollectionDAO.java

The Java implementations have slightly different behaviors:

calculateActualDuration() filters startTime and endTime independently

calculateAverageRuntime() requires both startTime AND endTime to be non-null for a task

This inconsistency could lead to different results for the same pipeline data. Consider extracting a shared utility method to ensure consistent calculation across all usages.

Suggested fix: Create a single utility method like calculateRuntimeFromTasks(List<Status> taskStatus) that both methods can use.

gitar-bot · 2026-01-08T17:41:18Z

openmetadata-service/src/main/java/org/openmetadata/service/jdbi3/PipelineRepository.java

+        pipelineStatus.getTaskStatus().stream()
+            .map(task -> task.getEndTime())
+            .filter(java.util.Objects::nonNull)
+            .max(Long::compare)
+            .orElse(null);
+


💡 Edge Case: calculateActualDuration doesn't fallback to pipeline-level timing

Details

The calculateActualDuration() method returns null when task-level timing data is unavailable, but calculateAverageRuntime() falls back to pipeline-level timing (status.getEndTime() - status.getTimestamp()).

This inconsistency means:

Duration filtering will exclude pipeline statuses that only have pipeline-level timing

Average runtime calculation will include those same statuses

For consistency, consider adding the same fallback to calculateActualDuration():

if (minStartTime != null && maxEndTime != null && maxEndTime >= minStartTime) { return maxEndTime - minStartTime; } // Fallback to pipeline-level timing if (pipelineStatus.getEndTime() != null && pipelineStatus.getTimestamp() != null) { return pipelineStatus.getEndTime() - pipelineStatus.getTimestamp(); } return null;

gitar-bot · 2026-01-08T17:46:29Z

openmetadata-service/src/main/java/org/openmetadata/service/jdbi3/PipelineRepository.java

            "Failed to find pipeline status for %s at %s", pipeline.getName(), timestamp));
  }

+  private Long calculateActualDuration(PipelineStatus pipelineStatus) {
+    if (pipelineStatus.getTaskStatus() == null || pipelineStatus.getTaskStatus().isEmpty()) {
+      return null;
+    }
+
+    Long minStartTime =
+        pipelineStatus.getTaskStatus().stream()
+            .map(task -> task.getStartTime())
+            .filter(java.util.Objects::nonNull)
+            .min(Long::compare)
+            .orElse(null);
+
+    Long maxEndTime =
+        pipelineStatus.getTaskStatus().stream()
+            .map(task -> task.getEndTime())
+            .filter(java.util.Objects::nonNull)
+            .max(Long::compare)
+            .orElse(null);
+


💡 Code Quality: Duplicated runtime calculation logic

Details

calculateActualDuration() and calculateAverageRuntime() both implement the same core logic: finding min(startTime) and max(endTime) from taskStatus. The implementations differ (stream-based vs imperative loop) which makes it harder to maintain consistency.

Consider extracting the common logic into a single shared method:

private Long calculateRuntimeFromTasks(List<Status> taskStatus) { if (taskStatus == null || taskStatus.isEmpty()) return null; Long minStart = taskStatus.stream() .map(Status::getStartTime) .filter(Objects::nonNull) .min(Long::compare) .orElse(null); Long maxEnd = taskStatus.stream() .map(Status::getEndTime) .filter(Objects::nonNull) .max(Long::compare) .orElse(null); return (minStart != null && maxEnd != null && maxEnd >= minStart) ? maxEnd - minStart : null; }

Both methods can then use this shared helper, reducing duplication and ensuring consistent behavior.

gitar-bot · 2026-01-09T15:27:08Z

🔍 CI failure analysis for 096965e: Integration tests fail due to double /api prefix in PR code (fix by removing /api). Maven PostgreSQL and SonarCloud CI timed out after 3-4.5 hours. Playwright jobs failed due to infrastructure issues. Retry non-test failures.

Integration Test Failures (Related to PR)

Two CI jobs failed with identical test errors in PipelineResourceIT:

integration-tests-postgres-opensearch (job 59924727114)
integration-tests-mysql-elasticsearch (job 59924727206)

All three tests fail in both jobs:

test_pipelineStatusDurationFilters_200_OK (line 1131)
test_pipelineStatusDurationCalculation_200_OK (line 1210)
test_pipelineStatusNullDurationHandling_200_OK (line 1263)

Root Cause

The test code constructs API paths that include the /api/v1 prefix, but the HTTP client automatically prepends /api, resulting in a double prefix: /api/api/v1/pipelines/...

Solution

The fix should remove the /api prefix from the path construction in the three failing test methods at lines 1129, 1158, and 1269 in PipelineResourceIT.java, changing from /api/v1/pipelines/%s/status?... to /v1/pipelines/%s/status?...

Infrastructure Failures (Unrelated to PR)

Maven Build Timeouts

maven-postgresql-ci (job 59924727529): Timed out after 3 hours (15:21 - 18:17)
maven-sonarcloud-ci (job 59924727511): Timed out after 4.5 hours (15:21 - 19:54)

Both jobs show the "Build with Maven" step stuck in in_progress status with no completion time, indicating they exceeded GitHub Actions' job timeout limits and were forcibly terminated. This is an infrastructure/CI timeout issue unrelated to PR changes.

Playwright CI Failures

playwright-ci-postgresql (1, 6) (job 59924727119): Failed during Docker build due to network timeout connecting to public.dhe.ibm.com for IBM iAccess ODBC driver
playwright-ci-postgresql (3, 6) (job 59924727127): All 454 tests passed but job failed with exit code 1, indicating a workflow infrastructure issue

Solution for Infrastructure Failures

All infrastructure failures (Maven timeouts, Playwright network/workflow issues) should be resolved by retrying the failed jobs.

Code Review ⚠️ Changes requested

The PR adds averageRunTime to pipeline observability with proper implementation, but calculateActualDuration lacks the pipeline-level fallback that other methods have, creating an inconsistency in duration filtering.

⚠️

Bug: calculateActualDuration lacks pipeline-level timing fallback

📄 openmetadata-service/src/main/java/org/openmetadata/service/jdbi3/PipelineRepository.java:414-438

The calculateActualDuration method (used for duration filtering in getPipelineStatuses()) returns null when task statuses are empty or lack timing data. However, unlike calculateAverageRuntime (which properly falls back to pipeline-level endTime - timestamp), this method doesn't use the pipeline-level timing as a fallback.

This inconsistency means:

Duration filtering will exclude pipelines that have valid pipeline-level timing but no task-level timing
The behavior differs from calculateAverageRuntime and the SQL queries in CollectionDAO, which all implement the fallback

Suggested fix:
Add a fallback to pipeline-level timing at the end of calculateActualDuration:

private Long calculateActualDuration(PipelineStatus pipelineStatus) {
  // ... existing task-level logic ...
  
  if (minStartTime != null && maxEndTime != null && maxEndTime >= minStartTime) {
    return maxEndTime - minStartTime;
  }

  // Fallback to pipeline-level timing
  if (pipelineStatus.getEndTime() != null && pipelineStatus.getTimestamp() != null) {
    return pipelineStatus.getEndTime() - pipelineStatus.getTimestamp();
  }

  return null;
}

Consider extracting shared logic into a utility method to reduce duplication with calculateAverageRuntime.

⚠️

Bug: calculateActualDuration lacks pipeline-level timing fallback

📄 openmetadata-service/src/main/java/org/openmetadata/service/jdbi3/PipelineRepository.java:408-432

The calculateActualDuration method returns null when task-level timing is unavailable, but it should fall back to pipeline-level timing (endTime - timestamp) for consistency with:\n\n1. calculateAverageRuntime method (lines 523-525) which DOES have this fallback\n2. The SQL queries in CollectionDAO.java which also fall back to pipeline-level timing\n\nImpact: The duration filter in getPipelineStatuses will exclude pipelines that have valid pipeline-level timing but no task-level timing, causing inconsistent behavior.\n\nSuggested fix:\njava\nprivate Long calculateActualDuration(PipelineStatus pipelineStatus) {\n // ... existing task-level logic ...\n \n if (minStartTime != null && maxEndTime != null && maxEndTime >= minStartTime) {\n return maxEndTime - minStartTime;\n }\n\n // Fallback to pipeline-level timing\n if (pipelineStatus.getEndTime() != null && pipelineStatus.getTimestamp() != null) {\n return pipelineStatus.getEndTime() - pipelineStatus.getTimestamp();\n }\n\n return null;\n}\n

⚠️

Bug: calculateActualDuration missing pipeline-level fallback

📄 openmetadata-service/src/main/java/org/openmetadata/service/jdbi3/PipelineRepository.java:425-446

The calculateActualDuration() method only calculates duration from task-level timing and returns null when taskStatus is empty or lacks timing data. However, calculateAverageRuntime() (lines 679-748) has a fallback to pipeline-level timing (status.getEndTime() - status.getTimestamp()).

This inconsistency causes a bug: duration filtering in getPipelineStatuses() will exclude pipeline runs that have pipeline-level timing but no task-level timing, while the average runtime calculation includes them.

Fix: Add the same fallback to calculateActualDuration():

private Long calculateActualDuration(PipelineStatus pipelineStatus) {
  // ... existing task-level calculation ...
  
  // Fallback to pipeline-level timing
  if (result == null && pipelineStatus.getEndTime() != null && pipelineStatus.getTimestamp() != null) {
    return pipelineStatus.getEndTime() - pipelineStatus.getTimestamp();
  }
  return result;
}

⚠️

Performance: Duplicated runtime calculation logic in two methods

📄 openmetadata-service/src/main/java/org/openmetadata/service/jdbi3/PipelineRepository.java:425-446 📄 openmetadata-service/src/main/java/org/openmetadata/service/jdbi3/PipelineRepository.java:679-748

The runtime calculation logic (finding min startTime and max endTime from task statuses) is duplicated in three places:

calculateActualDuration() (lines 425-446)
calculateAverageRuntime() (lines 679-748)
The SQL CTEs in CollectionDAO.java

The Java implementations have slightly different behaviors:

calculateActualDuration() filters startTime and endTime independently
calculateAverageRuntime() requires both startTime AND endTime to be non-null for a task

This inconsistency could lead to different results for the same pipeline data. Consider extracting a shared utility method to ensure consistent calculation across all usages.

Suggested fix: Create a single utility method like calculateRuntimeFromTasks(List<Status> taskStatus) that both methods can use.

More details 💡 3 suggestions

💡 Code Quality: Duplicated runtime calculation logic across two methods

📄 openmetadata-service/src/main/java/org/openmetadata/service/jdbi3/PipelineRepository.java:408-432 📄 openmetadata-service/src/main/java/org/openmetadata/service/jdbi3/PipelineRepository.java:679-748

The runtime calculation logic (finding min startTime and max endTime from tasks) is duplicated in two places:\n\n1. calculateActualDuration (lines 408-432) - uses streams\n2. calculateAverageRuntime (lines 697-720) - uses imperative loops\n\nBoth compute the same thing: max(task.endTime) - min(task.startTime).\n\nImpact: Code duplication increases maintenance burden and risk of bugs when one is updated but not the other (as evidenced by the missing fallback in calculateActualDuration).\n\nSuggested fix: Refactor to use a single helper method:\njava\nprivate Long calculateRuntimeFromStatus(PipelineStatus status) {\n // Calculate from task-level timing first\n if (status.getTaskStatus() != null && !status.getTaskStatus().isEmpty()) {\n // ... common logic ...\n }\n // Fallback to pipeline-level timing\n if (status.getEndTime() != null && status.getTimestamp() != null) {\n return status.getEndTime() - status.getTimestamp();\n }\n return null;\n}\n\nThen reuse in both calculateActualDuration and calculateAverageRuntime.

💡 Code Quality: Duplicated runtime calculation logic

📄 openmetadata-service/src/main/java/org/openmetadata/service/jdbi3/PipelineRepository.java:425-446 📄 openmetadata-service/src/main/java/org/openmetadata/service/jdbi3/PipelineRepository.java:679-748

calculateActualDuration() and calculateAverageRuntime() both implement the same core logic: finding min(startTime) and max(endTime) from taskStatus. The implementations differ (stream-based vs imperative loop) which makes it harder to maintain consistency.

Consider extracting the common logic into a single shared method:

private Long calculateRuntimeFromTasks(List<Status> taskStatus) {
  if (taskStatus == null || taskStatus.isEmpty()) return null;
  
  Long minStart = taskStatus.stream()
      .map(Status::getStartTime)
      .filter(Objects::nonNull)
      .min(Long::compare)
      .orElse(null);
      
  Long maxEnd = taskStatus.stream()
      .map(Status::getEndTime)
      .filter(Objects::nonNull)
      .max(Long::compare)
      .orElse(null);
      
  return (minStart != null && maxEnd != null && maxEnd >= minStart) 
      ? maxEnd - minStart : null;
}

Both methods can then use this shared helper, reducing duplication and ensuring consistent behavior.

💡 Edge Case: calculateActualDuration doesn't fallback to pipeline-level timing

📄 openmetadata-service/src/main/java/org/openmetadata/service/jdbi3/PipelineRepository.java:441-446

The calculateActualDuration() method returns null when task-level timing data is unavailable, but calculateAverageRuntime() falls back to pipeline-level timing (status.getEndTime() - status.getTimestamp()).

This inconsistency means:

Duration filtering will exclude pipeline statuses that only have pipeline-level timing
Average runtime calculation will include those same statuses

For consistency, consider adding the same fallback to calculateActualDuration():

if (minStartTime != null && maxEndTime != null && maxEndTime >= minStartTime) {
  return maxEndTime - minStartTime;
}

// Fallback to pipeline-level timing
if (pipelineStatus.getEndTime() != null && pipelineStatus.getTimestamp() != null) {
  return pipelineStatus.getEndTime() - pipelineStatus.getTimestamp();
}

return null;

What Works Well

The calculateAverageRuntime method properly implements task-level timing with pipeline-level fallback. The SQL queries in CollectionDAO are well-structured using CTEs with the same fallback logic. Comprehensive integration tests cover multi-task and null duration scenarios.

Recommendations

Extract the shared runtime calculation logic into a utility method to maintain consistency between calculateActualDuration, calculateAverageRuntime, and the SQL queries. This would prevent future divergence in behavior.

Tip

Comment Gitar fix CI or enable auto-apply: gitar auto-apply:on

Options

Auto-apply is off Gitar will not commit updates to this branch.
Display: compact Hiding non-applicable rules.

Comment with these commands to change:

`Auto-apply`	`Compact`
`gitar auto-apply:on`	`gitar display:verbose`

_{Was this helpful? React with 👍 / 👎 | This comment will update automatically (Docs)}

sonarqubecloud · 2026-01-09T15:53:20Z

Quality Gate passed for 'open-metadata-ui'

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

sonarqubecloud · 2026-01-09T17:25:02Z

Quality Gate passed for 'open-metadata-ingestion'

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

Chore Add Pipeline Obs averageRunTime

b36906b

SumanMaharana requested a review from a team as a code owner December 23, 2025 15:22

SumanMaharana had a problem deploying to test December 23, 2025 15:22 — with GitHub Actions Error

github-actions bot added Ingestion safe to test Add this label to run secure Github workflows on PRs labels Dec 23, 2025

Merge branch 'main' into fix-pipelinesobs-api

dbbda25

SumanMaharana temporarily deployed to test December 23, 2025 15:24 — with GitHub Actions Inactive

SumanMaharana had a problem deploying to test December 23, 2025 15:24 — with GitHub Actions Failure

SumanMaharana temporarily deployed to test December 23, 2025 15:24 — with GitHub Actions Inactive

Update generated TypeScript types

709a7dd

github-actions bot requested a review from a team as a code owner December 23, 2025 15:28

Merge branch 'main' into fix-pipelinesobs-api

8c6d0fe

SumanMaharana had a problem deploying to test January 5, 2026 11:04 — with GitHub Actions Error

SumanMaharana temporarily deployed to test January 5, 2026 11:04 — with GitHub Actions Inactive

SumanMaharana had a problem deploying to test January 5, 2026 11:04 — with GitHub Actions Error

SumanMaharana temporarily deployed to test January 5, 2026 11:04 — with GitHub Actions Inactive

Update generated TypeScript types

0f53efa

SumanMaharana had a problem deploying to test January 8, 2026 17:39 — with GitHub Actions Failure

gitar-bot bot reviewed Jan 8, 2026

View reviewed changes

Merge branch 'main' into fix-pipelinesobs-api

b8e19cf

SumanMaharana temporarily deployed to test January 9, 2026 06:00 — with GitHub Actions Inactive

SumanMaharana had a problem deploying to test January 9, 2026 06:00 — with GitHub Actions Failure

Merge branch 'main' into fix-pipelinesobs-api

5ebf7f8

SumanMaharana temporarily deployed to test January 9, 2026 11:45 — with GitHub Actions Inactive

SumanMaharana had a problem deploying to test January 9, 2026 11:45 — with GitHub Actions Failure

SumanMaharana temporarily deployed to test January 9, 2026 11:45 — with GitHub Actions Inactive

fix tests

096965e

SumanMaharana dismissed mohityadav766’s stale review via 096965e January 9, 2026 15:21

SumanMaharana had a problem deploying to test January 9, 2026 15:21 — with GitHub Actions Failure

SumanMaharana temporarily deployed to test January 9, 2026 15:21 — with GitHub Actions Inactive

SumanMaharana had a problem deploying to test January 9, 2026 15:21 — with GitHub Actions Failure

SumanMaharana temporarily deployed to test January 9, 2026 15:21 — with GitHub Actions Inactive

Chore: Add Pipeline Obs averageRunTime #24979

Are you sure you want to change the base?

Chore: Add Pipeline Obs averageRunTime #24979

Uh oh!

Conversation

SumanMaharana commented Dec 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Describe your changes:

Type of change:

Checklist:

Summary by Gitar

Uh oh!

github-actions bot commented Dec 23, 2025

Uh oh!

github-actions bot commented Jan 5, 2026

Uh oh!

github-actions bot commented Jan 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Jest test Coverage

UI tests summary

Uh oh!

gitar-bot bot Jan 8, 2026

Choose a reason for hiding this comment

Uh oh!

gitar-bot bot Jan 8, 2026

Choose a reason for hiding this comment

Uh oh!

gitar-bot bot Jan 8, 2026

Choose a reason for hiding this comment

Uh oh!

gitar-bot bot commented Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Integration Test Failures (Related to PR)

Root Cause

Solution

Infrastructure Failures (Unrelated to PR)

Maven Build Timeouts

Playwright CI Failures

Solution for Infrastructure Failures

What Works Well

Recommendations

Uh oh!

sonarqubecloud bot commented Jan 9, 2026

Quality Gate passed for 'open-metadata-ui'

Uh oh!

sonarqubecloud bot commented Jan 9, 2026

Quality Gate passed for 'open-metadata-ingestion'

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

SumanMaharana commented Dec 23, 2025 •

edited

Loading

github-actions bot commented Jan 5, 2026 •

edited

Loading

gitar-bot bot commented Jan 9, 2026 •

edited

Loading