Hack Week experiment: unit tests with claude by ancorgs · Pull Request #2922 · agama-project/agama

ancorgs · 2025-12-01T14:36:27Z

DISCLAIMER: This is not intended for production. Claude.ai has not been audited by neither SUSE or the openSUSE project, so I have no intention to introduce any code generated by such a tool into any production-ready branch of the Agama repository.

Problem

Adapting the unit tests manually when we refactor the code takes a lot of time.

Experiment

During this Hack Week we want to explore how AI can speed-up the process. See https://hackweek.opensuse.org/projects/ai-powered-unit-test-automation-for-agama

This is a first rough experiment using the free version of Claude.ai. It includes the rationale applied by the tool as markdown files. It is written in Spanish because I used the web console (the only interaction method available for free) and my browser is configured to request pages in Spanish.

What I did was:

Asking Claude to adapt the current unit tests to the changes introduced in the code.

It did a great job. I tried it with 5 different test files and 3 of them worked out of the box. 2 of them needed trivial fixes (committed separately). The changes introduced make sense to me at first sight (I still have to review them carefully). It was not too intrusive, it respected the approach of the original tests. Although it decided to expand some parts.

Asking Claude to write a difficult test (LvmPage) from scratch.

At first sight, the test seems to be correct and very comprehensive (it even allowed me to fix a bug in the code). Claude shows very good understanding on what the component is supposed to do.

Initially the data of the mocks contained errors because Claude does not have full access to the api-v2 repo branch. But as soon as I gave it the correct type definitions it was able to fix those errors itself and produce a fully working test (except one minor fix that is 100% understandable).

This error is also in the master branch, nothing new or introduced by Claude. Nevertheless, I asked it to fix it... and it came with an explanation and a fix that really didn't help.

ancorgs added 18 commits December 1, 2025 14:10

Changes proposed by Claude

91e43da

Add Claude rationale for changes (in Spanish)

5d9b8e8

Fix errors in a test proposed by Claude

e597063

More changes proposed by Claude

a270d52

Fix errors in a test proposed by Claude

726ac4b

Test written from scratch by Claude. Attempt 1 (imperfect mockups)

3bc5ab6

Mock structure fixed by Claude itself

7e70b7b

Fix for the test generated by Claude

ee4294d

Fix a bug detected by the Claude-generated tests

5813d0f

More conversations

ffc68ad

First attempt of Claude to adapt FormattableDevicePage.test.tsx

b74a2c3

Second Claude attempt to adapt FormattableDevicePage.test.tsx

c46b86b

Failed attemp of Claude to fix a minor error in console.log

ddec807

This error is also in the master branch, nothing new or introduced by Claude. Nevertheless, I asked it to fix it... and it came with an explanation and a fix that really didn't help.

Manual fixes to FormattableDevicePage.test.tsx

43299b3

Conversations about FormattableDevicePage.test.rationale.es.md

1b2d01f

First attempt of Claude to adapt PartitionPage.test.tsx

638c90a

Minor manual fixes for PartitionPage.test.tsx

a6e4e89

Added rationale for PartitionPage

9c6dc4e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hack Week experiment: unit tests with claude#2922

Hack Week experiment: unit tests with claude#2922
ancorgs wants to merge 18 commits intoapi-v2from
hack-week-claude-attempt1

ancorgs commented Dec 1, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ancorgs commented Dec 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Experiment

Asking Claude to adapt the current unit tests to the changes introduced in the code.

Asking Claude to write a difficult test (LvmPage) from scratch.

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ancorgs commented Dec 1, 2025 •

edited

Loading