Phase 6: Rollout and Monitoring of Conflict Resolver Agent

Overview

Gradual rollout of the Conflict Resolver agent with monitoring, feedback collection, and final enablement.

Tasks

Stage 1: Internal Testing (Days 1-2)

  • Enable feature flag for GitLab team (User.where(admin: true))
  • Announce to internal team via Slack
  • Collect internal feedback via survey/issue
  • Monitor error rates and logs
  • Address critical bugs immediately
  • Track usage metrics

Stage 2: Beta Rollout (Days 3-4)

  • Enable for 10% of users via percentage rollout
  • Monitor key metrics (usage, errors, performance)
  • Set up Grafana dashboards for monitoring
  • Review user feedback channels
  • Identify and fix issues
  • Gradually increase to 25%, then 50%

Stage 3: General Availability (Day 5)

  • Enable feature flag globally
  • Announce in release notes
  • Publish blog post (coordinate with Marketing)
  • Share on social media
  • Monitor for issues
  • Have rollback plan ready

Monitoring Setup

  • Create Grafana dashboard for:
    • Button click rate
    • Chat sessions initiated
    • Tool call success rate
    • Tool call latency (p50, p95, p99)
    • Agent response time
    • Error rates by type
    • User ratings (thumbs up/down)
  • Set up alerts for error thresholds
  • Configure logging for debugging

Metrics Collection

  • Track button impressions (conflict page views)
  • Track button clicks
  • Track chat sessions completed
  • Track successful resolutions (MR becomes mergeable)
  • Track average conversation length
  • Track return usage rate

Feedback Collection

  • Set up user feedback mechanism (in-chat ratings)
  • Monitor support channels for issues
  • Create feedback issue template
  • Schedule post-launch retrospective

Communication

  • Draft release notes entry
  • Draft blog post (coordinate with Marketing)
  • Prepare social media posts
  • Update handbook/internal docs
  • Notify customer success team

Success Criteria Evaluation

  • Evaluate against planned metrics
  • Compare actual vs expected usage
  • Assess user satisfaction
  • Identify improvements for iteration

Rollout Schedule

Week 3:

  • Day 1-2: Internal testing
  • Day 3-4: Beta rollout (10% → 50%)
  • Day 5: General availability

Acceptance Criteria

  • Feature successfully rolled out to 100% of users
  • No critical bugs or performance issues
  • Monitoring dashboards operational
  • Success metrics being tracked
  • Release communications published
  • Positive user feedback
  • Rollback plan tested and ready

Monitoring Thresholds

Alerts if:

  • Error rate >5%
  • Tool call latency p95 >10s
  • Agent response time p95 >30s
  • User rating <60% positive

Rollback Plan

If critical issues:

  1. Disable feature flag immediately
  2. Notify stakeholders
  3. Investigate and fix issue
  4. Test fix thoroughly
  5. Re-enable with monitoring

Success Metrics (End of Week 1 Post-GA)

Target:

  • 15-25% of conflict page visitors use AI
  • 80% tool call success rate

  • 70% positive user ratings

  • <5s p95 tool call latency
  • 50% successful resolutions (MR becomes mergeable after)

Files Changed

  • Feature flag state (via Rails console/ChatOps)
  • Monitoring configurations
  • Communication materials

Timeline

5 days (Week 3)

Related to epic &20688

Assignee Loading
Time tracking Loading