Encoding::UndefinedConversionError: Sidekiq/DetectRepositoryLanguagesWorker

Sentry error: https://new-sentry.gitlab.net/organizations/gitlab/issues/1773141

Summary

Repository language detection fails when processing references containing Unicode characters that cannot be converted from UTF-8 to ASCII-8BIT.

Problem Description

The DetectRepositoryLanguagesWorker crashes with Encoding::UndefinedConversionError: U+0130 from UTF-8 to ASCII-8BIT when encountering certain Unicode characters.

The error originates in lib/gitlab/gitaly_client/commit_service.rb:400 during string initialization, preventing repositories from having their programming languages detected.

Encoding::UndefinedConversionError: U+0130 from UTF-8 to ASCII-8BIT
  from lib/gitlab/gitaly_client/commit_service.rb:400:in `initialize'
  ...
  from app/workers/detect_repository_languages_worker.rb:22:in `block in perform'

Proposed Fix

Update lib/gitlab/gitaly_client/commit_service.rb:400 to handle UTF-8 encoding properly without forcing ASCII-8BIT conversion

Edited by 🤖 GitLab Bot 🤖