feat: Add bigframes.pandas.job_history() API to track BigQuery jobs by shuoweil · Pull Request #2435 · googleapis/python-bigquery-dataframes

shuoweil · 2026-02-05T05:40:39Z

This PR is not ready for review. I need it for colab notebook testing.

This PR introduces a new function bigframes.pandas.job_history() that allows users to retrieve a pandas DataFrame listing the BigQuery jobs initiated by BigFrames in the current Python session. This provides visibility into the underlying BigQuery execution, including query text, resource usage, and job duration, which is invaluable for monitoring and optimization.

Key Changes:

New API: Added bigframes.pandas.job_history() which returns a local pandas DataFrame containing details of all BigQuery jobs run in the current session.
Metrics Tracking: Extended ExecutionMetrics and JobMetadata in bigframes/session/metrics.py to store individual job records (metadata only) rather than just aggregate statistics. This avoids holding references to full Job objects to prevent memory leaks.
Comprehensive Job Support: Updated count_job_stats to handle LoadJobs (data ingestion) in addition to the existing QueryJobs and fast-path RowIterators.
Detailed Metadata: The history captures:
- Identifiers: Job ID, Query ID (for fast path), Location, Project.
- Timings: Creation time, Start time, End time, Duration.
- Status: Job state (DONE, etc.), Error details.
- Query Info: SQL text, Destination table, Cache hit status.
- Resources: Total bytes processed, Slot milliseconds.
- Load Stats: Input files, Input bytes, Output rows (for load jobs).
- Links: A direct URL to the job in the Google Cloud Console.
Loader Integration: Updated bigframes/session/loader.py and internal query paths to ensure that internal management queries (such as index uniqueness checks) and data loading operations are correctly recorded.
Testing: Added tests/unit/session/test_job_history.py to verify metric collection for various job types and the public API. Added a manual test notebook notebooks/dataframes/job_history.ipynb for demonstration.
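The metadata-only tracking described under Metrics Tracking can be pictured roughly as follows. This is a minimal sketch, not the code in bigframes/session/metrics.py: the field subset, the `record` method, and the `to_rows` helper are assumptions made for illustration, and a plain `SimpleNamespace` stands in for a BigQuery job object.

```python
import dataclasses
import datetime
from types import SimpleNamespace
from typing import Any, Dict, List, Optional


@dataclasses.dataclass(frozen=True)
class JobMetadata:
    """Plain-data snapshot of one job (hypothetical subset of fields)."""

    job_id: Optional[str]
    job_type: str
    total_bytes_processed: Optional[int] = None
    duration_seconds: Optional[float] = None


@dataclasses.dataclass
class ExecutionMetrics:
    """Keeps aggregate counters plus one small record per job."""

    execution_count: int = 0
    bytes_processed: int = 0
    jobs: List[JobMetadata] = dataclasses.field(default_factory=list)

    def record(self, job: Any, job_type: str) -> None:
        # Copy scalar fields only, so no reference to the job object is kept.
        started = getattr(job, "started", None)
        ended = getattr(job, "ended", None)
        duration = (ended - started).total_seconds() if started and ended else None
        total_bytes = getattr(job, "total_bytes_processed", None)
        self.jobs.append(
            JobMetadata(getattr(job, "job_id", None), job_type, total_bytes, duration)
        )
        self.execution_count += 1
        self.bytes_processed += total_bytes or 0

    def to_rows(self) -> List[Dict[str, Any]]:
        # One dict per job, ready to be handed to a DataFrame constructor.
        return [dataclasses.asdict(job) for job in self.jobs]


# Stand-in for a finished query job; real jobs come from the BigQuery client.
t0 = datetime.datetime(2026, 2, 5, 5, 0, 0)
fake_job = SimpleNamespace(
    job_id="job-1",
    started=t0,
    ended=t0 + datetime.timedelta(seconds=2),
    total_bytes_processed=1024,
)

metrics = ExecutionMetrics()
metrics.record(fake_job, "query")
rows = metrics.to_rows()
```

Passing `rows` to `pandas.DataFrame` would give a local table of the kind job_history() is described as returning.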

Usage Example:

    import bigframes.pandas as bpd
    import pandas as pd

    # ... run some bigframes operations ...
    df = bpd.read_gbq("SELECT 1")

    # Upload some local data (triggers a Load Job)
    bpd.read_pandas(pd.DataFrame({'a': [1, 2, 3]}))

    # Get a DataFrame of all BQ jobs run in this session
    history = bpd.job_history()

    # Inspect recent queries, their costs, and durations
    print(history[['job_id', 'job_type', 'total_bytes_processed', 'duration_seconds', 'query']])

Verified in a VS Code notebook: screen/8u2yhaRV9iHbDbF

Fixes #<481840739> 🦕

review-notebook-app · 2026-02-05T05:40:44Z

Check out this pull request on ReviewNB.

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

TrevorBergeron · 2026-02-05T18:33:34Z

bigframes/session/metrics.py

+@dataclasses.dataclass
+class JobMetadata:

Can we add a static factory method to build this from an SDK query job object?

shuoweil · 2026-03-24

Done! Added a from_job classmethod (and from_row_iterator) to build the metadata object directly from the jobs.
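As an illustration of the factory methods mentioned in this reply, here is a minimal sketch. Only the names JobMetadata, from_job, and from_row_iterator come from the thread; the field subset and the duck-typed attribute lookups are assumptions, and `SimpleNamespace` objects stand in for SDK job and row-iterator objects.

```python
import dataclasses
from types import SimpleNamespace
from typing import Any, Optional


@dataclasses.dataclass(frozen=True)
class JobMetadata:
    job_id: Optional[str] = None
    query_id: Optional[str] = None
    location: Optional[str] = None
    project: Optional[str] = None
    state: Optional[str] = None
    total_bytes_processed: Optional[int] = None

    @classmethod
    def from_job(cls, job: Any) -> "JobMetadata":
        """Build a record from a job-like object; missing fields become None."""
        return cls(
            job_id=getattr(job, "job_id", None),
            location=getattr(job, "location", None),
            project=getattr(job, "project", None),
            state=getattr(job, "state", None),
            total_bytes_processed=getattr(job, "total_bytes_processed", None),
        )

    @classmethod
    def from_row_iterator(cls, rows: Any) -> "JobMetadata":
        """Fast-path results may carry a query ID and no job ID."""
        return cls(
            job_id=getattr(rows, "job_id", None),
            query_id=getattr(rows, "query_id", None),
            location=getattr(rows, "location", None),
            project=getattr(rows, "project", None),
            total_bytes_processed=getattr(rows, "total_bytes_processed", None),
        )


# Hypothetical stand-ins for a completed query job and a fast-path result.
job = SimpleNamespace(job_id="job-1", location="US", project="my-project",
                      state="DONE", total_bytes_processed=2048)
fast_path = SimpleNamespace(job_id=None, query_id="query-9", location="US",
                            project="my-project", total_bytes_processed=0)

from_job = JobMetadata.from_job(job)
from_rows = JobMetadata.from_row_iterator(fast_path)
```

Reading attributes with `getattr` and a `None` default keeps the sketch tolerant of job types that lack a given field, such as load jobs without a query ID.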

TrevorBergeron · 2026-02-05T18:36:52Z

bigframes/session/metrics.py

+    error_result: Optional[Mapping[str, Any]] = None
+    cached: Optional[bool] = None
+    job_url: Optional[str] = None
+    query: Optional[str] = None

I do worry that, at a certain point, storing all of the query text generated by the session might clog up memory.

shuoweil · 2026-03-24

Good point! To prevent memory bloat during long sessions, I have added truncation so that stored query text is capped at a maximum of 1024 characters.
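A cap like the one described here could look like the sketch below. The function name, the constant name, and the choice of a hard cut with no ellipsis marker are assumptions; only the 1024-character limit comes from the reply.

```python
from typing import Optional

MAX_QUERY_CHARS = 1024  # limit stated in the reply above


def truncate_query(query: Optional[str], limit: int = MAX_QUERY_CHARS) -> Optional[str]:
    """Return the query text cut to at most `limit` characters."""
    if query is None or len(query) <= limit:
        return query
    # Hard cut; whether the real code appends a marker is not stated in the thread.
    return query[:limit]


short_text = truncate_query("SELECT 1")
long_text = truncate_query("SELECT " + "x" * 5000)
missing = truncate_query(None)
```

With a cap of this kind, the memory held per job record is bounded regardless of how large the generated SQL is.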

sycai · 2026-02-05

I have a concern about placing job_history under the bigframes.pandas package. We could consider the bigframes root module or Session instances as its home instead, mainly because functionality under bigframes.ml and bigframes.bigquery can also trigger jobs but does not belong to bigframes.pandas.

chalmerlowe · 2026-03-02T15:07:45Z

Migration Notice: This library is moving to the google-cloud-python monorepo soon.

We closed this PR due to inactivity to ensure a clean migration. Please re-open this work in the new monorepo once the migration is complete!

shuoweil · 2026-03-24T00:55:45Z

I have a concern about placing job_history under the bigframes.pandas package. We could consider the bigframes root module or Session instances as its home instead, mainly because functionality under bigframes.ml and bigframes.bigquery can also trigger jobs but does not belong to bigframes.pandas.

I agree with you. I have fully moved it out of bf.pandas. The API is now renamed to execution_history() to better reflect the broadened abstraction and is directly exposed via the root module (bigframes.execution_history()) and on the Session instance.
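The two entry points described in this reply (a root-level function and a Session method) might be wired together along these lines. This is a sketch under stated assumptions: the `Session` class here is a toy, `_record` and `get_global_session` are hypothetical helpers, and records are plain dicts rather than a DataFrame.

```python
from typing import Any, Dict, List, Optional


class Session:
    """Toy session that collects one record per job it triggers."""

    def __init__(self) -> None:
        self._job_records: List[Dict[str, Any]] = []

    def _record(self, **fields: Any) -> None:
        # Hypothetical hook called by any sub-package that runs a job.
        self._job_records.append(dict(fields))

    def execution_history(self) -> List[Dict[str, Any]]:
        # Return copies so callers cannot mutate the session's own records.
        return [dict(record) for record in self._job_records]


_global_session: Optional[Session] = None


def get_global_session() -> Session:
    """Create the default session lazily on first use."""
    global _global_session
    if _global_session is None:
        _global_session = Session()
    return _global_session


def execution_history() -> List[Dict[str, Any]]:
    """Root-level entry point that delegates to the default session."""
    return get_global_session().execution_history()


# Jobs triggered from different sub-packages land in the same history.
session = get_global_session()
session._record(job_id="job-1", source="bigframes.pandas")
session._record(job_id="job-2", source="bigframes.ml")
history = execution_history()
```

Keeping the records on the session and having the root function delegate means jobs started from any sub-package show up in one place, which is the concern raised in the review.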

shuoweil added 2 commits February 5, 2026 05:20

feat: Add bigframes.pandas.job_history() API to track BigQuery jobs

9c2513e

docs: Add manual test notebook for job_history()

7221178

shuoweil requested review from TrevorBergeron and tswast February 5, 2026 05:40

shuoweil self-assigned this Feb 5, 2026

shuoweil requested a review from a team as a code owner February 5, 2026 05:40

shuoweil requested a review from a team February 5, 2026 05:40

product-auto-label bot added the size: xl (pull request size is extra large) and api: bigquery (issues related to the googleapis/python-bigquery-dataframes API) labels Feb 5, 2026

TrevorBergeron reviewed Feb 5, 2026

sycai requested changes Feb 5, 2026

shuoweil marked this pull request as draft February 5, 2026 19:36

shuoweil added 2 commits February 5, 2026 19:39

Merge branch 'main' into shuowei-job-history

117cb17

fix: fix mypy error

29b4e12

chalmerlowe closed this Mar 2, 2026

Merge branch 'main' into shuowei-job-history

8d3b0c5

shuoweil reopened this Mar 23, 2026

shuoweil force-pushed the shuowei-job-history branch from 2fbbfa1 to 8d3b0c5 Compare March 23, 2026 22:59

product-auto-label bot added the size: l (pull request size is large) label and removed the size: xl (pull request size is extra large) label Mar 23, 2026

shuoweil added 4 commits March 23, 2026 23:06

feat: rename job_history API to execution_history

501af1c

feat(session): refactor JobMetadata extraction and truncate query strings

d0afa91

feat(core): export execution_history from base namespace instead of pandas

6557d90

refactor: update execution history API and tests

d2dc5f6

product-auto-label bot added the size: xl (pull request size is extra large) label and removed the size: l (pull request size is large) label Mar 24, 2026

shuoweil added 2 commits March 24, 2026 18:04

Merge branch 'main' into shuowei-job-history

0845048

fix: exclude execution_history from pandas test

f5c9cdd

shuoweil wants to merge 11 commits into main from shuowei-job-history
