  • #45740
Issue created Apr 25, 2018 by Stan Hu (@stanhu, Owner)

Ship additional GitLab Prometheus alerts

We will be shipping AlertManager with GitLab 10.8 (omnibus-gitlab#2999 (closed)). I think we should begin shipping default alerts for GitLab administrators. Which metrics would be most useful to add as alerts ASAP for most GitLab users/customers?

Some ideas:

| Component | Exporter Endpoint | Prometheus metric |
|-----------|-------------------|-------------------|
| Unicorn | http://localhost:8080/-/metrics | unicorn_active_connections |
| Unicorn | http://localhost:8080/-/metrics | unicorn_queued_connections |
| Unicorn | http://localhost:8080/-/metrics | job_register_attempts_failed_total |
| Sidekiq | http://localhost:9168/sidekiq | sidekiq_queue_size |
| Gitaly | http://localhost:9236/metrics | grpc_server_handled_total{grpc_code="ResourceExhausted"} |
| PostgreSQL | http://localhost:9187/metrics | pg_stat_database_deadlocks |
| PostgreSQL | http://localhost:9187/metrics | pg_stat_database_conflicts_confl_deadlock |
| Redis | http://localhost:9121/metrics | redis_up |
| Workhorse | http://localhost:9229/metrics | ? |
| Pages | http://localhost:9101/metrics | gitlab_pages_domains_updated_total |
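
To make the discussion concrete, here is a rough sketch of how a few of the metrics above might look as alerting rules. It assumes the Prometheus 2.x YAML rule file format. The group name, alert names, `for` durations, severity labels, and every threshold are placeholders I made up for illustration, not proposed defaults. Only the metric names come from the list above.

```yaml
# Hypothetical rule file; all names and thresholds are illustrative assumptions.
groups:
  - name: gitlab-example.rules
    rules:
      # redis_up is a gauge that is 0 when the exporter cannot reach Redis.
      - alert: RedisDown
        expr: redis_up == 0
        for: 5m              # assumed duration
        labels:
          severity: critical # assumed label
        annotations:
          summary: "Redis exporter reports Redis as unreachable"

      # Any increase in the deadlock counter over the window fires the alert.
      - alert: PostgresDeadlocks
        expr: increase(pg_stat_database_deadlocks[5m]) > 0
        labels:
          severity: warning
        annotations:
          summary: "PostgreSQL reported deadlocks in the last 5 minutes"

      # Gitaly rejecting requests because a limit was hit.
      - alert: GitalyResourceExhausted
        expr: rate(grpc_server_handled_total{grpc_code="ResourceExhausted"}[5m]) > 0
        for: 5m
        labels:
          severity: warning
        annotations:
          summary: "Gitaly is returning ResourceExhausted responses"

      # Placeholder threshold; a sensible value depends on the installation.
      - alert: SidekiqQueueBacklog
        expr: sidekiq_queue_size > 1000
        for: 10m
        labels:
          severity: warning
        annotations:
          summary: "A Sidekiq queue has a sustained backlog"
```

Gauges such as `unicorn_queued_connections` would likely need per-installation thresholds too, which is part of what makes picking sensible defaults hard.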

/cc: @ayufan, @bjk-gitlab, @nick.thomas, @_stark, @dblessing, @jacobvosmaer-gitlab, @zj

Edited Sep 06, 2018 by Ben Kochie