283 Commits

Author SHA1 Message Date
1grzyb1
527f612e75 VIM-3918 Run split mode tests using github actions
To make it work same way as ui tests I moved split mode tests to run on gh also
2026-03-26 09:51:15 +01:00
1grzyb1
7290a85166 Make Claude Code Review workflow manual and context-aware 2026-01-26 08:49:45 +01:00
Alex Plate
2f83606662 Change UI tests schedule to every 30 minutes
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-22 12:35:24 +02:00
Alex Plate
9394788e34 Add UI test debugging principles to autofix prompt
Add general guidance for fixing flaky UI tests:
- Flaky = race condition, not timeout issue
- Wait for unique state identifiers, not shared elements
- Understand framework built-in waits (findText already waits)
- Trace causality backwards to find correct wait condition
- State transitions have intermediate states

These principles should improve autofix success rate for UI test failures.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-22 12:34:50 +02:00
Alex Plate
b7607934a1 Change IntelliJ UI tests schedule from every 5 minutes to daily
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-22 11:50:28 +02:00
Alex Plate
ed0f74d85c Run UI tests every 5 minutes to test stability
Testing the UI tests stability with more frequent runs.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-16 15:08:59 +02:00
Alex Plate
add0c5b327 Run UI tests every 30 minutes instead of daily
Increase frequency to verify test stability.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-16 14:55:17 +02:00
Alex Plate
9ef7ce1d36 Add Neovim installation to Claude Code workflows
Install Neovim in workflows that run tests:
- testsMaintenance.yml: deals with @TestWithoutNeovim annotations
- codebaseMaintenance.yml: can run gradle tests
- youtrackAutoAnalysis.yml: uses TDD for bug fixes and features

Also add guidance in testsMaintenance to verify actual Neovim behavior
when working with skip reasons, and allow nvim/echo bash commands.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-13 07:05:27 +02:00
Alex Plate
64dfcdb9fa Remove clean task from PR verification workflow
Allows Gradle caching to be effective.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-09 19:05:34 +02:00
Alex Plate
eaeb3eeb79 Fix artifact upload paths in PR verification workflow
Remove incorrect ideavim/ prefix from paths - the checkout is at repo root.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-09 17:22:09 +02:00
Alex Plate
b24dd33e3b Temporarily disable macOS UI tests in IJ workflow
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-08 17:15:59 +02:00
Alex Plate
0bdecdc4c2 Fix disk space issues in Linux UI test workflows
Add disk space cleanup and reduce video recording size for Linux runners:
- Add "Free up disk space" step to remove .NET, Android SDK, GHC, CodeQL
- Reduce Xvfb resolution from 1920x1080 to 1280x720
- Optimize ffmpeg: 15fps, ultrafast preset, crf 28

This aligns with the existing Rider Linux workflow settings.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-08 17:15:59 +02:00
Alex Plate
64eca1ca11 Fix missing dependencies and git staging rule in UI test analysis jobs
Added missing dependencies:
- Java 21 setup (required for compilation verification)
- FFmpeg installation (required for ffprobe video analysis)

Added critical git staging rule to all UI test workflows:
- NEVER use `git add -A` or `git add .` when creating fix branches
- Always add modified files explicitly by path
- This prevents accidentally staging unrelated files that could cause
  push failures due to GitHub App workflow permission restrictions

Affected workflows:
- runUiTestsIJ.yml (unified IntelliJ IDEA tests)
- runUiPyTests.yml (PyCharm macOS tests)
- runUiPyTestsLinux.yml (PyCharm Linux tests)
- runUiRdTests.yml (Rider macOS tests)
- runUiRdTestsLinux.yml (Rider Linux tests)
- runUiOctopusTests.yml (Non-Octopus tests)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-08 07:32:43 +02:00
Alex Plate
54ec06ecf2 Fix Gradle cache for macOS Rider and PyCharm UI tests
Add explicit cache-read-only: false to ensure Gradle cache
is properly written and read on macOS builds, matching the
configuration used in the unified IntelliJ workflow.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-07 23:32:36 +02:00
Alex Plate
59696e1b75 Add UI test best practices to prompts: prefer cause-effect over timeouts 2026-01-07 23:24:36 +02:00
Alex Plate
d7d9774dca Add java and which commands to allowed tools in UI test workflows 2026-01-07 23:14:07 +02:00
Alex Plate
e1c7301802 Fix PyCharm IDE type from PU to PY in UI test workflows 2026-01-07 23:11:44 +02:00
Alex Plate
5543790620 Fix Gradle cache for Linux in unified IntelliJ IDEA workflow 2026-01-07 23:01:26 +02:00
Alex Plate
1df726e510 Change PyCharm IDE type from PC to PU 2026-01-07 22:54:17 +02:00
Alex Plate
4944053e51 Add mandatory compilation and test verification to UI test workflows 2026-01-07 22:54:17 +02:00
Alex Plate
a819f84dc5 Replace separate UI test workflows with unified IntelliJ IDEA workflow 2026-01-07 18:39:32 +02:00
Alex Plate
fe55795f71 Add disk cleanup and reduce screen recording size for Rider Linux tests 2026-01-07 18:22:02 +02:00
Alex Plate
8be630ed68 Add full tool permissions for automated PR creation in UI tests
Added Edit, Bash(git:*), Bash(gh:*), Bash(gradle:*), and Bash(./gradlew:*)
to the allowed-tools list in all UI test workflows.

This enables Claude Code to fully automate the PR creation process:
- Edit: Modify source files to apply fixes
- Bash(git:*): Create branches, add files, commit, push
- Bash(gh:*): Create pull requests
- Bash(gradle:*) & Bash(./gradlew:*): Run tests to verify fixes

Without these permissions, Claude Code would get "command requires approval"
errors when trying to create branches or edit files, preventing automated
PR creation.

Updated workflows:
- runUiTests.yml (IntelliJ macOS)
- runUiTestsLinux.yml (IntelliJ Linux)
- runUiTestsUnified.yml (IntelliJ unified)
- runUiRdTests.yml (Rider macOS)
- runUiRdTestsLinux.yml (Rider Linux)
- runUiPyTests.yml (PyCharm macOS)
- runUiPyTestsLinux.yml (PyCharm Linux)
- runUiOctopusTests.yml (Octopus macOS)

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-07 18:00:45 +02:00
Alex Plate
45525ea215 Enable Gradle configuration cache for Linux UI tests
Added --configuration-cache flag to buildPlugin task in all Linux
UI test workflows to significantly improve build performance.

The configuration cache stores the result of the configuration phase
and reuses it for subsequent builds, avoiding the need to re-evaluate
build scripts when inputs haven't changed. This is particularly
beneficial for UI tests that run frequently on schedule.

Note on "Gradle User Home cache not found" message: This is expected
when the commit SHA changes, as the gradle-home cache key includes it
for security. However, the more important dependency and transform
caches use content-based hashing and are restored correctly across
commits, which is why builds still benefit from caching.

Updated workflows:
- runUiTestsLinux.yml
- runUiRdTestsLinux.yml
- runUiPyTestsLinux.yml
- runUiTestsUnified.yml (both jobs)

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-07 17:57:20 +02:00
Alex Plate
6f3fec2b1a Instruct Claude Code to create PRs automatically without asking
Added explicit instruction to all UI test failure analysis prompts:
"IMPORTANT: If you have a concrete suggestion for fixing the test,
ALWAYS proceed with creating a branch and PR. Never ask for
permission - just do it."

This ensures Claude Code will automatically create fix PRs when it
identifies concrete solutions, eliminating the need for user
confirmation and enabling fully automated test maintenance.

Updated workflows:
- runUiTests.yml (IntelliJ macOS)
- runUiTestsLinux.yml (IntelliJ Linux)
- runUiTestsUnified.yml (IntelliJ unified)
- runUiRdTests.yml (Rider macOS)
- runUiRdTestsLinux.yml (Rider Linux)
- runUiPyTests.yml (PyCharm macOS)
- runUiPyTestsLinux.yml (PyCharm Linux)
- runUiOctopusTests.yml (Octopus macOS)

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-07 17:45:21 +02:00
Alex Plate
6f8fe2bb22 Add unified IntelliJ UI tests workflow for cross-platform analysis
Created a unified workflow that runs macOS and Linux tests in parallel,
then performs a single AI analysis of failures from both platforms.

Key features:
1. Parallel execution: test-macos and test-linux run simultaneously
2. Separate artifact uploads: macos-reports and linux-reports
3. Unified analysis job that:
   - Triggers if either platform fails
   - Downloads artifacts from both platforms
   - Provides Claude Code with context from both test runs
   - Identifies common vs platform-specific issues
   - Creates a single PR for common issues
   - Clearly labels platform-specific fixes

Benefits:
- Single unified fix for issues affecting both platforms
- Better context for AI analysis by comparing across platforms
- Reduced PR noise (one PR instead of two for common issues)
- Cost efficiency (one AI analysis instead of two)

The analyze-failures job has git and gh tools enabled to allow
automatic branch creation and PR submission when fixes are identified.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-07 17:35:14 +02:00
Alex Plate
b738c17c3e Remove Windows UI tests workflow
Windows UI tests have been removed. The project will continue to run
UI tests on macOS and Linux platforms, which provide sufficient coverage
for UI testing across different operating systems.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-07 17:26:11 +02:00
Alex Plate
c6460ab515 Add macOS screen recording permission dialog automation to remaining workflows
Added the "Auto-click Allow button for screen recording permission" step
to all macOS UI test workflows that were missing it:
- runUiRdTests.yml (Rider macOS)
- runUiPyTests.yml (PyCharm macOS)
- runUiOctopusTests.yml (Octopus macOS)

This step automatically dismisses the macOS screen recording permission
dialog that appears when ffmpeg starts recording. Without this automation,
the dialog blocks the test execution and causes timeouts.

The step tries multiple coordinate positions using both cliclick and
AppleScript fallback to handle different screen resolutions and dialog
positions.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-07 17:20:09 +02:00
Alex Plate
e73dff9d9a Add automated fix workflow to UI test failure analysis
Updated all UI test workflow prompts to instruct Claude Code to
automatically create fixes when concrete solutions are identified:

1. Create a branch with descriptive name
2. Apply the suggested fix to the codebase
3. Run the specific failing test to verify the fix works
4. Create a PR if the test passes with clear documentation

Each workflow includes the appropriate test command for its IDE type:
- IntelliJ/Octopus: gradle :tests:ui-ij-tests:testUi --tests "..."
- Rider: gradle :tests:ui-rd-tests:testUi --tests "..."
- PyCharm: gradle :tests:ui-py-tests:testUi --tests "..."

This enables fully automated test fix proposals without manual
intervention, reducing the feedback loop for fixing flaky or broken
UI tests caused by platform changes.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-07 17:01:37 +02:00
Alex Plate
3f4da7ab8a Add macOS labels and create Linux variants for Rider/PyCharm UI tests
Clarified that existing UI tests without OS specification run on macOS
by updating workflow names to include "macOS" suffix.

Created Linux versions of Rider and PyCharm UI tests:
- runUiRdTestsLinux.yml: Rider tests on Linux with Xvfb setup
- runUiPyTestsLinux.yml: PyCharm tests on Linux with Xvfb and Python 3.10

Both new workflows follow the same Linux setup pattern as the existing
runUiTestsLinux.yml workflow, using x11grab for screen recording and
appropriate IDE type parameters (-PideaType=RD/PC).

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-07 16:59:14 +02:00
Alex Plate
86c94cbac1 Add hierarchy file guidance to UI test failure analysis prompts
When UI tests fail due to timeouts but the element is visible in the
video/screenshot, the failure may be caused by renamed properties or
changed class names. The hierarchy HTML file contains the actual UI
structure and can help identify these issues.

Updated all UI test workflows (macOS, Linux, Windows, Rider, PyCharm,
and Octopus) to instruct Claude Code to check build/reports/hierarchy-ideaVimTest.html
and suggest updated queries when this scenario occurs.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-07 16:22:59 +02:00
Alex Plate
4a7e59401b Remove Python CGEvent code from macOS permission dialog automation
Keep only cliclick and AppleScript methods for clicking the Allow button.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-06 23:47:42 +02:00
Alex Plate
32ab7d888b Add Bash(rm:*) to allowed tools for AI analysis in UI test workflows
Allows the AI to remove temporary files when creating thumbnail grids
from screen recordings for better video analysis.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-06 23:44:27 +02:00
Alex Plate
5a9d1fd0a3 Fix YAML syntax in macOS workflow - use heredoc for Python code 2026-01-06 23:39:33 +02:00
Alex Plate
c817436e2a Try multiple click methods and coordinates for macOS permission dialog
Attempts to auto-click the "Allow" button for screen recording permission using:
- cliclick (most reliable, installed via brew)
- AppleScript with coordinates
- Python CGEvent (low-level click simulation)

Tests multiple coordinate positions to handle different screen resolutions:
- 512,367 (standard dialog position)
- 960,540 (higher resolution)
- 640,400 and 800,450 (fallback positions)

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-06 23:32:24 +02:00
Alex Plate
7f0d24965f Add Linux UI test workflow with screen recording and AI analysis
Creates a separate workflow for running UI tests on Ubuntu with:
- Xvfb virtual display for headless GUI testing
- FFmpeg screen recording using x11grab
- Claude Code AI analysis of test failures
- Artifact upload for test reports and recordings

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-06 23:22:26 +02:00
Alex Plate
0d5db19301 Fix Windows workflow to use bash shell instead of PowerShell
Changed Windows UI test workflow from PowerShell to bash:
- Run Idea: Now properly runs in background with `&`
- Screen recording: Uses bash syntax for background process and PID capture
- Stop recording: Uses standard bash kill command

Problem: PowerShell Start-Process completed immediately instead of
keeping gradle running in background, causing IntelliJ to never start
and health check to fail.

Solution: Use bash shell (available via Git Bash on Windows runners)
which properly handles background processes with `&` syntax.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-06 23:03:16 +02:00
Alex Plate
66e682dd55 Auto-click Allow button for ffmpeg screen recording permission on macOS
Added AppleScript automation to automatically grant screen recording
permission when the macOS permission dialog appears:
- Waits 2 seconds after ffmpeg starts
- Tries up to 10 times (1 second intervals) to click Allow button
- Uses SecurityAgent process to interact with system permission dialog
- Fails gracefully if dialog doesn't appear

This eliminates the manual permission prompt that was blocking
automated screen recording on macOS GitHub Actions runners.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-06 22:48:26 +02:00
Alex Plate
0c7323e4fa Add Windows UI tests workflow with screen recording
Created separate workflow for Windows UI tests:
- Runs on windows-latest
- FFmpeg installation via Chocolatey
- Screen recording using gdigrab (Windows GDI Grabber)
- No permission dialogs on Windows (unlike macOS)
- PowerShell for process management
- Gradle caching enabled
- Only runs IJ tests for now
- AI analysis on test failures
- Records at 30fps with H.264 codec

This provides an alternative platform for UI testing with
working screen recording capabilities.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-06 12:04:07 +02:00
Alex Plate
c3486a6b4e Remove --no-configuration-cache flag from UI test workflows
Removed --no-configuration-cache flag from all runIdeForUiTests commands
to enable Gradle configuration cache and improve build performance.

This works together with the gradle/actions/setup-gradle@v4 action
to provide optimal caching.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-06 11:41:55 +02:00
Alex Plate
41f6bd1782 Add Gradle caching to UI test workflows
Enabled gradle/actions/setup-gradle@v4 for all UI test workflows:
- runUiTests.yml
- runUiPyTests.yml
- runUiRdTests.yml
- runUiOctopusTests.yml

This action automatically caches:
- Gradle wrapper
- Dependencies
- Gradle build cache

This should significantly speed up the "Build Plugin" step on
subsequent runs.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-06 11:37:16 +02:00
Alex Plate
77fbe1ae2b Replace Gradle video recorder with CI-level screen recording
Extended macOS CI screen recording to all UI test workflows:
- Added FFmpeg screen recording to PyCharm, Rider, and Octopus tests
- Removed Linux UI tests workflow (runUiTestsLinux.yml)

Removed video-recorder-junit5 library from codebase:
- Removed dependency from all UI test modules
- Removed video recorder system properties from Gradle configs
- Removed @Video annotations and imports from all test files
- Removed "Move video" steps from all workflows

Updated Claude AI analysis prompts:
- Changed from dual video sources to single CI recording
- Added helpful ffmpeg commands for video analysis:
  * Extract frames at specific times
  * Create thumbnail grids
  * Get video duration
  * Extract last N seconds

This simplifies the video recording setup by relying solely on
CI-level screen capture, which provides complete coverage of the
entire test run.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-06 11:21:46 +02:00
Alex Plate
f6f4caaf5c Fix screen recording device index for macOS CI
Changed FFmpeg device from "1:0" to "0:none" for proper screen capture.
Added device listing step for debugging.

The previous device index caused "Invalid device index" error.
Using "0:none" captures screen 0 without audio, which is the correct
format for avfoundation screen capture on macOS runners.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-06 10:48:52 +02:00
Alex Plate
1424e200dc Add CI-level screen recording to macOS UI tests
Added FFmpeg screen recording at CI level using avfoundation:
- Captures full screen at 30fps starting when IntelliJ launches
- Saves to build/reports/ci-screen-recording/screen-recording.mp4
- Complements Gradle test video with full session recording
- Updated AI analysis prompt to reference both video sources

This provides a complete recording of the entire test run, which may
catch issues that occur outside the focused Gradle test recording.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-06 10:31:09 +02:00
Alex Plate
ebc9840fb9 Add separate Linux UI tests workflow
Created new runUiTestsLinux.yml with modern Linux configuration:
- Uses ubuntu-latest with Xvfb for headless display
- FFmpeg for video recording
- Same test suite as macOS (ui-ij-tests only)
- AI analysis on test failures
- Updated to Java 21 and latest action versions

Removed old commented-out Linux job from runUiTests.yml.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-06 10:08:36 +02:00
Alex Plate
49ebe672cb Remove --auto-approve-tools flag, keep just --allowed-tools 2026-01-05 23:48:52 +02:00
Alex Plate
4a613b5c83 Add missing bash commands and auto-approve Write for build/reports 2026-01-05 23:45:57 +02:00
Alex Plate
ba4913fb0a Add allowed tools to Claude Code AI analysis step 2026-01-05 23:39:54 +02:00
Alex Plate
70305d2f0e Add id-token permission to UI test workflows for Claude Code action 2026-01-05 23:06:21 +02:00
Alex Plate
ed0c292736 Add Claude Code AI analysis step to UI test workflows
When UI tests fail, Claude Code now automatically analyzes the failure by examining test reports, video recordings, and IDE logs. The analysis is saved to build/reports/ai-analysis/analysis.txt and included in uploaded artifacts.

Requires ANTHROPIC_API_KEY secret to be configured in GitHub repository settings.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-01-05 22:36:09 +02:00