One way to do acceptance tests in Go

An outer circle labelled 'Shell' surrounds an inner circle labelled 'Core logic'. An ellipse with 'Unit tests' points at the 'Core logic', while another ellipse with 'Acceptance tests' points at the 'Shell' — A simple application architecture, and how to test it.
This diagram was produced with *pic*, a "little language" dedicated to producing diagrams (see the source)

Give me six hours to chop down a tree,
and I will spend the first four sharpening the axe
— Attributed to Abraham Lincoln

Acceptance tests are tests that prove that a feature works as intended. Unlike unit tests, acceptance tests are always written in terms of what the business originally requested, and for this reason they are also known as customer tests. They are one of the original 12 practices of Extreme Programming.

The 12 practices of XP, arranged in 3 concentric circles (from Ron Jeffries' website; acceptance tests are called *customer tests* here)

I was watching this excellent presentation by Russ Cox about testing in Go, and I was struck by the emphasis on acceptance tests, though he does not call them that. He was presenting the tests for the “go” command, which is the command-line entry point for a number of Go utility commands. For instance, you build a Go program with the command go build, you format a Go program with go fmt, you test a Go program with go test, and so on.

The tests for the go program are written as text files, the simplest of which is

! go asdf
stderr '^go asdf: unknown command'

The first line means that when you execute the command go asdf, where asdf stands for any non-existent command name, the command should return a failure status code (the ! means “expect this command to fail”). The second line means that we expect the message “go asdf: unknown command” to be printed on standard error; the argument is a regular expression, so the ^ anchors the match at the start of a line.

It’s that simple! Adding a new test just requires writing a new text file; as of Go 1.22, there are 806 of them. Another example:

! go list a.go
! stdout .
stderr 'invalid import path'
! stderr panic

-- a.go --
package a

import ""

Here the bottom part after -- a.go -- means “create a file with name a.go with the following contents.” You can guess what the rest means.
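To give an idea of the work involved in interpreting such a file, here is a minimal sketch in Go that separates the command lines from the embedded files. The function name splitScript is my own invention and the parsing is deliberately simplified; it is not the parser that the go command actually uses.

```go
package main

import (
	"fmt"
	"strings"
)

// splitScript separates the command lines of a test script from the files
// embedded after "-- name --" marker lines. This is a simplified, hypothetical
// reading of the format, not the parser used by the go command.
func splitScript(text string) (commands []string, files map[string]string) {
	files = map[string]string{}
	current := "" // name of the file being collected; empty while reading commands
	for _, line := range strings.Split(strings.TrimSuffix(text, "\n"), "\n") {
		trimmed := strings.TrimSpace(line)
		isMarker := len(trimmed) > 6 &&
			strings.HasPrefix(trimmed, "-- ") && strings.HasSuffix(trimmed, " --")
		switch {
		case isMarker:
			// Everything after this line belongs to the named file.
			current = strings.TrimSpace(trimmed[3 : len(trimmed)-3])
			files[current] = ""
		case current != "":
			files[current] += line + "\n"
		case trimmed != "":
			commands = append(commands, trimmed)
		}
	}
	return commands, files
}

func main() {
	script := "! go list a.go\n! stdout .\nstderr 'invalid import path'\n! stderr panic\n\n" +
		"-- a.go --\npackage a\n\nimport \"\"\n"
	commands, files := splitScript(script)
	for _, c := range commands {
		fmt.Println("command:", c)
	}
	for name, body := range files {
		fmt.Printf("file %s:\n%s", name, body)
	}
}
```

Splitting is the easy part; the real effort goes into executing each command and checking its status and output.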

It is a very powerful way to create tests; however, it takes a certain amount of work to interpret the text files. In general, Russ Cox encourages us to create small domain-specific languages for testing our things. As with many things in Go, this suggestion is very much in the tradition of the Unix world: it reminds me of the old article by Jon Bentley about little languages, which I read long ago and which got me hooked on the pic diagramming language (but this is another story).

* *
*

Now the other day I was working on the first exercise in the excellent Gophercises course by Jon Calhoun. It’s about creating a little console interactive quiz application, and I thought, why not test it in the same way? So I started looking at https://pkg.go.dev/rsc.io/script, which is an open-source extraction of the script-interpretation language for the go command; but it turned out to be too difficult to adapt to my needs, at least for me at this time.

I would have liked something like

What is 3+4? 
> 7
Correct!
What is 2-1? 
> 2
Wrong!
Your score is 50% (1/2)

where the lines starting with > are user input, and other lines are output from the program; but I could not see how to do it with rsc.io/script.
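For what it is worth, reading such a transcript in plain Go does not look hard. The sketch below is not something I built for the quiz; parseTranscript is a hypothetical helper, and it assumes that a prompt line directly followed by a > line is printed without a trailing newline, so that the program's reply continues on the same output line.

```go
package main

import (
	"fmt"
	"strings"
)

// parseTranscript splits a transcript into the text to feed to the program's
// standard input and the text expected on its standard output.
// Lines starting with "> " are user input; all other lines are program output.
func parseTranscript(transcript string) (stdin, expected string) {
	lines := strings.Split(strings.TrimSuffix(transcript, "\n"), "\n")
	var in, out strings.Builder
	for i, line := range lines {
		if strings.HasPrefix(line, "> ") {
			in.WriteString(strings.TrimPrefix(line, "> ") + "\n")
			continue
		}
		out.WriteString(line)
		// Assumption: a prompt that is followed by typed input is printed
		// without a newline, so the reply lands on the same output line.
		nextIsInput := i+1 < len(lines) && strings.HasPrefix(lines[i+1], "> ")
		if !nextIsInput {
			out.WriteString("\n")
		}
	}
	return in.String(), out.String()
}

func main() {
	transcript := "What is 3+4? \n> 7\nCorrect!\nWhat is 2-1? \n> 2\nWrong!\nYour score is 50% (1/2)\n"
	stdin, expected := parseTranscript(transcript)
	fmt.Printf("stdin:    %q\n", stdin)
	fmt.Printf("expected: %q\n", expected)
}
```

The two strings could then be used to drive the program and to check what it prints.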

So I asked myself, what’s the next simplest thing I could do? The obvious way to test a console program is through a Bash script. After a few iterations, I came up with something like this:

1  go build -o quiz || exit 1
2  printf "7\n2\n" | ./quiz -q testdata/two-questions.csv > /tmp/actual.txt
3
4  expected=\
5  'What is 3+4? Correct!
6  What is 2-1? Wrong!
7  Your score is 50% (1/2)'
8
9  diff <(echo "$expected") /tmp/actual.txt

In line 1, we build the program and terminate with an error if the build fails.

In line 2, the printf command sends all the answers to the program under test, which will read them one at a time and produce output that is captured in the file /tmp/actual.txt.

Then in line 9 we compare what we got to the expected output, thanks to the diff utility. If diff finds no difference, it will print nothing and the script will return a success status code. If diff finds a difference, it returns a failure status code and prints explanatory output such as

1  1c1
2  < What is 3+4? Correct!
3  ---
4  > What is 3+4? OK

The above means: “I expected the text at line 2, but got the text at line 4”.

The funky <(echo "$expected") syntax is Bash process substitution: it makes the output of echo "$expected" readable through a file name, and passes that name to diff. It is a quick way to pass the contents of a Bash variable to a utility that expects a filename.
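The same pipe-and-compare idea can also be written directly in Go with os/exec. This is only a sketch of an alternative, not what I ended up using; runWithInput is a hypothetical helper, and cat stands in for the quiz binary so that the example runs on any Unix-like system.

```go
package main

import (
	"bytes"
	"fmt"
	"os/exec"
	"strings"
)

// runWithInput starts the named program, feeds it input on standard input,
// and returns everything the program wrote to standard output.
func runWithInput(input, name string, args ...string) (string, error) {
	cmd := exec.Command(name, args...)
	cmd.Stdin = strings.NewReader(input)
	var stdout bytes.Buffer
	cmd.Stdout = &stdout
	err := cmd.Run()
	return stdout.String(), err
}

func main() {
	// "cat" is a stand-in that echoes its input. For the quiz, the call would
	// be something like:
	//   runWithInput("7\n2\n", "./quiz", "-q", "testdata/two-questions.csv")
	got, err := runWithInput("7\n2\n", "cat")
	if err != nil {
		fmt.Println("run failed:", err)
		return
	}
	want := "7\n2\n"
	if got != want {
		fmt.Printf("expected %q, got %q\n", want, got)
		return
	}
	fmt.Println("output matches")
}
```

Compared with the Bash script, this loses the readable diff output, which is one reason to stay with diff.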

This style of testing is much less declarative, but it is still quite effective; it’s not as readable as I’d like, but it serves the purpose of proving that the feature works. For example, here is how we test another feature: “if the user takes too much time answering, the application terminates with a ’timeout’ message”

1  go build -o quiz || exit 1
2  sleep 5 | ./quiz -t 1 -q testdata/two-questions.csv > /tmp/actual.txt
3
4  expected=\
5  'What is 3+4? Timeout!
6  Your score is 0% (0/2)'
7
8  diff <(echo "$expected") /tmp/actual.txt

Here in line 2 we add a -t 1 flag, which is supposed to tell our program to “set the timeout to one second”. The sleep 5 | simulates a user who does nothing: it writes nothing to the pipe, so the program’s input simply blocks for 5 seconds.

If our program works correctly, it should exit after one second with a “Timeout!” message, and print the score accordingly. Since my implementation does not yet implement this feature, the test fails after 5 seconds with this error message:

1,2c1,6
< What is 3+4? Timeout!
< Your score is 0% (0/2)
---
> What is 3+4? panic: EOF
>
> goroutine 1 [running]:
> main.main()
>   /Users/matteo/..../2024-03-05-gophesercises-quiz/quiz.go:83 +0x24e
> exit status 2

and this proves that the acceptance test is doing its job!

* *
*

The one thing that is left is to connect the acceptance tests to the standard Go test mechanism, so that they will run, together with all the other tests, when we execute go test. It turns out to be quite easy:

 1  func TestAll(t *testing.T) {
 2    matches, err := filepath.Glob("testdata/*.sh")
 3    if err != nil {
 4      t.Fatal(err)
 5    }
 6    for _, match := range matches {
 7      t.Run(match, func(t *testing.T) {
 8        cmd := exec.Command("bash", match)
 9        output := new(bytes.Buffer)
10        cmd.Stdout = output
11        cmd.Stderr = output
12        err := cmd.Run()
13        if err != nil {
14          t.Errorf("%s\n%s", err, output.String())
15        }
16      })
17    }
18  }

At line 2, we scan the testdata directory for all the files matching *.sh. Then, for each of these files, we run it as a separate test, named with the name of the file (line 7). The test file needs to import bytes, os/exec, path/filepath and testing.

In each test, we execute the Bash file (lines 8-12). At lines 9-11, we collect the standard output and error from the process in a buffer.

If the script fails, it exits with a non-zero status code, which cmd.Run() reports as a non-nil err. In this case, we use the contents of the byte buffer to create an explanatory error message. For instance, if I now run the go test command, I get:

$ go test
--- FAIL: TestAll (6.16s)
    --- FAIL: TestAll/testdata/timeout.sh (5.22s)
        acceptance_test.go:26: exit status 1
            1,2c1,5
            < What is 3+4? Timeout!
            < Your score is 0% (0/2)
            ---
            > What is 3+4? panic: EOF
            >
            > goroutine 1 [running]:
            > main.main()
            > 	/Users/matteo/..../2024-03-05-gophesercises-quiz/quiz.go:83 +0x24e
FAIL
exit status 1
FAIL	github.com/xpmatteo/gophesercises-quiz	6.518s

* *
*

Why are these acceptance tests important? I see several reasons.

One is that they clarify what the end goal of a user story is: they push me to define the final expected behaviour.

Another reason is that they bridge the gap from the unit tests to the “outer shell” of the application. You see, unit tests are great for developing the core business logic of the app. In this core, I don’t have I/O and I don’t have timeouts. The business logic is simply an object that has the following methods:

Start() (prompt string)
Answer(ans string) (response string, more bool)
Timeout() (response string)

The outer shell calls Start() when the application starts, calls Answer() when there is input from the user, and calls Timeout() when time is up. The outer shell deals with all the messy real-world things. So while the core business logic is pure Platonic logic, very easy to test, the outer shell is very hard to test with conventional unit tests. This is where these acceptance tests shine: they treat the whole application as a black box, and check that the outer shell and the core logic together behave the way they should.

My conclusions?

Bash test scripts make viable acceptance tests for console applications
It’s easy to collect these scripts and run them with the standard Go test library
It’s good to write at least one acceptance test before starting to TDD
It is worth investing the time to get the acceptance tests right

Keep on testing!
