Internal Developer Testing for Duo Context Feature

Learnings (from 3 testers)

User testing was conducted with three developers. The primary use cases tested were debugging import errors and generating test cases.

🎯 What's Working Well

  1. Quality of AI Responses
    • Generated high-quality test cases for Vue components
    • Provided detailed debugging assistance
    • Developers found the outputs accurate and valuable
  2. File Selection
    • Users could successfully select individual files
    • The UI gives a clear indication that files were included as context

🚨 Pain Points

  1. File Selection UX
    • Files need to be re-selected if the file window is closed
    • File paths are truncated, making it difficult to identify correct files
      • Chat window width doesn't accommodate full file names
    • Confusion about time frame/context of previously shown files
    • Natural language file selection doesn't work; users must use the /include command
      • One tester tried to copy and paste a URL into the /include command
  2. UI/Visual Issues
    • In dark mode, the Files icon popup (blue) is too similar in color to the chat buttons
    • Insufficient color contrast on context references, hurting accessibility
    • Duo Chat icon lacks GitLab branding, causing confusion
      • One user couldn't figure out from the documentation where to find Duo Chat and assumed it was located in Duo Workflow. This suggests users may confuse Duo Chat and Duo Workflow access once launched publicly. The user also wanted to know why the Duo Chat icon doesn't carry the GitLab logo: "I would expect the Gitlab logo to open all the things"
  3. File Discovery
    • Difficulty finding files in larger repositories
    • Search functionality not consistently returning expected files
      • One tester couldn't find a file she had recently opened, while files she had just closed were showing up in results
    • Confusion about file locations when multiple files share the same name
  4. Response Format
    • Responses sometimes too verbose
    • Need better formatting for readability

Positive feedback

  • One developer who previously avoided AI tools became enthusiastic after seeing the quality of generated tests
  • Potential significant time savings in test writing
  • Effective at complex tasks like debugging import errors

Negative feedback

  • Workflow interruptions due to file selection issues
  • Time lost searching for files in large repositories
  • Extra steps required for basic file inclusion
    • One tester noted they must first select Files, then select the actual file, which feels like an extra step

Opportunities

  1. Improve file selection
    • Improve natural language file selection
    • Consider options like "Include all open files"
    • Display full file paths
    • Persist file selection across window closures
  2. Enhance file discovery
    • Optimize search for large repositories
    • Add file path filtering options
    • Auto-suggest including related files (e.g., package.json when generating tests)
  3. UI improvements
    • Increase contrast for better accessibility
    • Adjust window width to show full file paths
    • Distinguish UI elements with different colors in dark mode
  4. Context Awareness
    • Maintain history of commonly used file combinations
  5. Response Formatting
    • Implement collapsible sections for lengthy responses


Instructions for developer testing:

Link to test: https://app.usertesting.com/pp/308dcedf-da2f-40b0-a832-5f2a99ce80bd

You'll find instructions on the link too, but I'll copy them here:

  1. Set up the latest GitLab Workflow Extension for VS Code and open a repo in VS Code (currently not available in JetBrains): https://docs.gitlab.com/ee/user/gitlab_duo_chat/#use-gitlab-duo-chat-in-vs-code
  2. Pick a task in your workflow that requires multiple files to complete
  3. Open Duo Chat
  4. Use the slash command /include
  5. Select the file(s) to add as context (the feature only supports files in a Git repo, not local files)
  6. Complete your task using Duo Chat and the selected files

As you work through your task please talk through it, narrating your choices, what you expected, what surprised you, etc.

Objective

Conduct unmoderated testing sessions with internal developers to evaluate the effectiveness and user experience of the Duo context feature in the IDE

Background

We need to gather more data on how developers interact with and utilize the context feature in real-world scenarios. This will help us identify potential issues, understand user workflows, and improve the overall user experience.

Methodology

  1. Create unmoderated tests using UserTesting platform
  2. Recruit internal developers as participants
  3. Have participants use Duo context feature to complete a task in their IDE
  4. Have participants record their screens and think aloud while completing tasks using the context feature
  5. Collect and analyze the recorded sessions

Test Setup

  • Create the test in UserTesting
  • Use links to invite internal users to participate
  • @nicollem to provide test content and structure

Participant Recruitment

  • @bvenker will rally internal developers to participate in the study

Key Areas to Explore

  1. Effectiveness of the context feature in maintaining conversational context
  2. Impact on developer efficiency and context switching
  3. Integration with workflows involving multiple files
  4. Overall user experience and performance

Deliverables

  1. Recorded sessions of developers using the context feature
  2. Summary of findings and insights
  3. Recommendations for feature improvements

Future Considerations

  • Conduct moderated sessions for deeper exploration of specific issues or hypotheses
  • Expand testing to external developers if needed

CC: @katiemacoy @NickHertz

Please update this issue with any additional information or changes to the plan.

Edited by Nicolle Merrill