Encoding::UndefinedConversionError: Sidekiq/DetectRepositoryLanguagesWorker
Sentry error: https://new-sentry.gitlab.net/organizations/gitlab/issues/1773141
Summary
Repository language detection fails when processing references containing Unicode characters that cannot be converted from UTF-8 to ASCII-8BIT.
Problem Description
The DetectRepositoryLanguagesWorker crashes with Encoding::UndefinedConversionError: U+0130 from UTF-8 to ASCII-8BIT when encountering certain Unicode characters.
The error originates in lib/gitlab/gitaly_client/commit_service.rb:400 during string initialization, preventing repositories from having their programming languages detected.
Encoding::UndefinedConversionError: U+0130 from UTF-8 to ASCII-8BIT
from lib/gitlab/gitaly_client/commit_service.rb:400:in `initialize'
...
from app/workers/detect_repository_languages_worker.rb:22:in `block in perform'
Proposed Fix
Update lib/gitlab/gitaly_client/commit_service.rb:400 to handle UTF-8 encoding properly without forcing ASCII-8BIT conversion
Edited by 🤖 GitLab Bot 🤖