Agent Skills Library
View source
Data governance

Analytics Data Governance

Field-level privacy policy, retention targets, redaction rules, and LLM recommendation boundaries.

Analytics Data Governance

Skills analytics uses a strict-minimum data policy. The dashboard should be useful for team workflow improvement without storing secrets, raw prompts, raw transcripts, local machine details, deployment logs, work item descriptions, or full Azure Boards comments.

See Agent Analytics Metadata Contract for parser rules and Skills Metric Dictionary for dashboard metric meanings.

Classification Model

ClassificationMeaningHandling
AllowedSafe enough for MVP analytics when sourced from metadata or narrow Azure Boards fields.May be stored in normalized records and returned by authorized dashboard APIs.
ControlledUseful, but may need role-aware display, retention limits, or additional review.Store only when explicitly listed and keep out of public surfaces.
DerivedCalculated from allowed/controlled fields.Store or return with source evidence and no raw sensitive inputs.
ProhibitedSensitive, high-risk, or too broad for the MVP purpose.Reject, omit, redact, or keep transient only long enough to parse safely.

Field-Level Policy

Field Or DataClassificationStorage RuleNotes
schema, eventType, occurredAtAllowedStore in normalized records.Required for parser and metric grouping.
workItemId, workItemType, parentId, stateAllowedStore in normalized records and dashboard APIs.IDs and workflow state are core dashboard evidence.
Azure Boards work item URLAllowedStore normalized URL only.Do not include auth tokens or query strings with credentials.
Work item titleControlledAllowed for MVP signed-in dashboard roles; do not show on public pages.If a project later treats titles as sensitive, mask by role or use title hashes.
Work item description, acceptance criteria, full discussion commentProhibitedDo not store in metadata, normalized records, logs, or LLM inputs.The source adapter may hold comments transiently in memory only to extract marked metadata blocks.
skillName, skillVersion, agentHost, modelAllowedStore in normalized records.Unknown/unsupported values should be unknown, not inferred.
actorKind, createdByKind, completedByKind, attributionKind, confidence, reasonAllowedStore normalized attribution fields.Attribution should stay conservative when evidence is incomplete.
tokenEstimateControlledStore only best-effort estimates with isBillingRecord=false.Unavailable estimates must omit token counts. Never treat as invoices or billing records.
Cost estimateDerivedStore only after a versioned pricing table is approved.Cost is directional and must cite pricing version/date.
summaryControlledAllow short sanitized summaries up to the parser contract limit.No raw prompt text, transcript text, description excerpts, or secrets.
source.commentId, commentCreatedAt, parser offsetAllowedStore safe source pointers.Enables audit/debugging without raw comment content.
Parse failure code and messageAllowedStore safe code/message only.Do not include raw invalid JSON or surrounding comment text.
Azure Boards user display name, email, descriptorControlledOmit from MVP analytics records.If later needed, use approved mapping or a non-reversible project-scoped hash.
Cognito username, email, groups, ID token claimsControlledUse for API authorization; do not copy into analytics events.Admin audit trails are separate from analytics records.
PATs, passwords, API keys, bearer tokens, client secrets, auth headersProhibitedReject/omit and never log.This includes .env*, .mcp.json, local auth files, and deployment output that may contain credentials.
Raw prompts, raw transcripts, session logs, meeting recordingsProhibitedDo not store or send to recommendation jobs.Token counters may be read locally, but raw session content must not be persisted.
Local filesystem paths, machine names, OS/user profile detailsProhibitedDo not store.Host support should be represented by agentHost, not local paths.
Deployment logsProhibitedDo not store in analytics.Deployment evidence can be summarized in tickets without raw log content.
Recommendation title, summary, evidence metrics, confidenceDerivedStore recommendation records when generated from normalized/aggregate evidence.Recommendations must not quote raw comments, prompts, or descriptions.

User Identity Rule

MVP analytics should not store direct human user identifiers in event records. Azure Boards identities may be used transiently for conservative attribution, and Cognito claims may be used for authorization, but dashboard analytics records should use actor categories such as agent, human, mixed, and unknown.

If future metrics require user-level reporting, add a separate design decision before implementation. The default future path is an approved mapping table or a project-scoped non-reversible hash, not raw email addresses in analytics rows.

Retention Targets

These targets are policy inputs for storage, API, collector, and operations work. If infrastructure cannot enforce one yet, track the enforcement gap before production launch.

Data SetTarget RetentionNotes
Normalized analytics records in DynamoDB18 monthsLong enough for trend review across releases without becoming broad history.
Normalized accepted-record archive in S3180 daysArchive stores normalized accepted records, not raw Azure Boards comments.
Collector run health records180 daysNeeded for freshness and operational trend checks.
Parse failure records90 daysStore safe codes/messages only; use for cleanup and training.
CloudWatch Lambda/API logs30 daysExisting infrastructure uses short operational log retention. Logs must avoid raw comments and credentials.
Stored recommendation records18 monthsKeep generated guidance and evidence metrics with the same trend horizon as analytics.
LLM recommendation request payloadsTransient onlyDo not persist raw request bodies unless they contain only approved aggregate evidence and have explicit retention.
Sanitized local fixtures and docs examplesRepository lifetimeFixtures must remain sanitized and credential-free.

Redaction Rules

Title Visibility Decision

For MVP, work item titles are acceptable on authenticated Skills dashboard surfaces for the configured Cognito groups. Public library and documentation pages must not expose live Azure Boards titles. Role-based title masking is not required for MVP, but the API should keep title handling centralized so a future project can mask or hash titles for narrower roles.

LLM Recommendation Boundaries

Recommendation jobs may use aggregate counts, normalized event fields, safe work item IDs/links, sanitized summaries, token estimate availability, parse failure codes, and collector run metrics. They must not use raw Azure Boards comments, work item descriptions, raw prompts, transcripts, local paths, credentials, deployment logs, or direct user identifiers.

Recommendation outputs must cite dashboard evidence rather than source-sensitive text. Suggested process changes remain recommendations for human review; the system must not automatically rewrite skills, board states, or team workflows from LLM output.

Guidance For Authors And Operators