[codex] Implement KPI dataset v2 instrumentation #334
Co-authored-by: OpenAI Codex <codex@openai.com>
Claude finished @lewtun's task in 4m 34s

KPI dataset v2 instrumentation: review

No blocking issues: 1 P1, 3 P2.

The PII story is sound: salt is mandatory in the offline builder (

P1: Every hourly run re-downloads and re-writes the entire month-to-date of session facts

So on day N of the month, each :05 cron tick re-downloads N days of

Suggested fix: write

P2: Scheduler silently no-ops when
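To put the P1 finding above in numbers, here is a rough back-of-the-envelope sketch. It assumes one tick per hour and counts work in "day-partitions fetched"; both function names are hypothetical, and the incremental variant is only an illustrative baseline, not necessarily the fix the review proposed.

```python
def month_to_date_fetches(days_in_month: int, ticks_per_day: int = 24) -> int:
    """Day-partitions fetched if every tick on day N re-reads days 1..N."""
    return ticks_per_day * sum(range(1, days_in_month + 1))


def incremental_fetches(days_in_month: int, ticks_per_day: int = 24) -> int:
    """Day-partitions fetched if every tick reads only the current day (assumed baseline)."""
    return ticks_per_day * days_in_month


if __name__ == "__main__":
    full = month_to_date_fetches(30)
    inc = incremental_fetches(30)
    print(full, inc, full / inc)
```

Under these assumptions the month-to-date pattern grows quadratically with the length of the month, while the per-day baseline grows linearly.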
Claude finished @lewtun's task in 1m 15s

KPI v2 re-review

No blocking issues: prior findings addressed, no new P0.

This is a re-review of the

One observation, not blocking and not a regression: with

Note: I couldn't execute the test suite here (sandbox blocked
Addressed the review feedback in
Summary
Implements the KPI dataset v2 plan for ML Intern analytics.
- v2/, including session facts plus hourly, daily, and monthly rollups.
- `KPI_USER_HASH_SALT` and avoids writing raw user IDs, session IDs, or Hub artifact repo IDs.
- `usage_metrics` as the billing source instead of legacy `llm_call.cost_usd` semantics.

Validation

- `uv run --no-sync --with ruff ruff format --check .`
- `uv run --no-sync --with ruff ruff check .`
- `uv run --no-sync --with pytest --with pytest-asyncio pytest tests/unit/test_build_kpis.py tests/unit/test_telemetry_usage.py tests/unit/test_session_uploader.py tests/unit/test_hub_artifacts.py tests/unit/test_dataset_uploads.py`

Result: 54 passed, 1 warning.
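
For readers unfamiliar with the salted-hash requirement mentioned in the summary, the idea might look roughly like the sketch below. This is not the PR's code: the function name `hash_user_id` is hypothetical, HMAC-SHA256 is an assumed choice of keyed hash, and reading `KPI_USER_HASH_SALT` from the environment is also an assumption. The point is only that a missing salt fails loudly and that the raw ID never reaches the output.

```python
import hashlib
import hmac
import os
from typing import Optional


def hash_user_id(user_id: str, salt: Optional[str] = None) -> str:
    """Return a keyed hash of user_id; refuse to run without a salt.

    Assumption: the salt comes from the KPI_USER_HASH_SALT environment
    variable unless passed explicitly. HMAC-SHA256 is an illustrative choice.
    """
    if salt is None:
        salt = os.environ.get("KPI_USER_HASH_SALT")
    if not salt:
        # Fail instead of silently writing unsalted or raw identifiers.
        raise RuntimeError("KPI_USER_HASH_SALT must be set")
    return hmac.new(
        salt.encode("utf-8"), user_id.encode("utf-8"), hashlib.sha256
    ).hexdigest()


if __name__ == "__main__":
    print(hash_user_id("example-user", salt="example-salt"))
```

The same input and salt always map to the same digest, so rollups can still count distinct users, while a different salt yields unrelated digests.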