Test all the things! :tada:

Question

Test all the things! :tada:

Opened this issue a year ago · 42 comments

jeaye commented a year ago

General process

Anyone is welcome to join in and write tests. The process goes like this:

Pick a function from one of the milestones:
a. clojure.core: https://github.com/jank-lang/clojure-test-suite/milestone/1
Leave a comment on the function's ticket to claim that function
Add a new test file using bb (documented here)
Work through the testing questions, implementing tests for each
Add any additional tests for all the edge cases you can think of. Try to write tests that will challenge the runtime as much as possible. Put some thought into it.
Keep your tests dialect-independent (i.e. wrap any Java interop in a reader conditional and provide CLJS equivalents, avoid using :default for reader conditionals)
Make a PR to add your new tests!

Testing questions

Common cases

What happens when the input is nil? (apply to all inputs)
What happens if it's given all valid inputs? (this will require some manual work to identify edge cases)
Are there any special cases for inputs?
What happens when the transducer arity is called?
Is metadata preserved through the function?

Edge cases

What happens when the input is an incorrect shape (i.e. a number instead of a sequence)? (apply to all inputs)
If the input accepts a sequence, what happens when it's an infinite sequence?
If the input accepts a map, what happens with both array maps and hash maps?
If the input accepts a set, what happens with both sorted sets and hash sets?
If the function accepts unboxed inputs/outputs, what happens with different combinations?

Things we don't need to test

Invalid arity (too many/too few args, the runtime does this for us, not the fn itself)

Keep your tests small

Try to only use the var you're testing and the testing framework. If you can avoid using other vars in your test, try to do so. This will keep each test focused and make it easier for new Clojure dialects to test what they have without implementing all of clojure.core.

Generative and property-based testing

We're not looking to add these types of tests right now. We want to get good coverage of all core functions using intentional, manually written tests. This will make the test suite easier to run for new Clojure dialects which may not have a ton of functionality yet.

Answer 1 · 2024-10-19T16:48:41.000Z

Claiming * and *'

Answer 2 · 2024-10-21T14:42:19.000Z

Claimed add-tap, remove-tap and tap>

Note: Included some short calls to Thread/sleep since taps respond asynchronously and code can race.

Answer 3 · 2024-10-29T14:05:54.000Z

Claiming or: #8

Answer 4 · 2024-10-30T18:01:36.000Z

Claiming some? and not: #11

Answer 5 · 2024-11-30T06:20:42.000Z

Claiming even?, odd?, and nil?

Answer 6 · 2024-12-01T20:21:37.000Z

Claiming inc and dec

Answer 7 · 2024-12-01T20:49:29.000Z

Claiming = (this might take a bit to finish)

Answer 8 · 2024-12-09T15:31:36.000Z

Claiming:

identical?
zero?
pos?
neg?
number?
ratio?
rational?
integer?
int?
pos-int?
nat-int?
decimal?
float?
double?
true?
false?
boolean

These are all fairly easy to do as a group since the tests are similar.

Answer 9 · 2024-12-09T18:02:42.000Z

Claiming:

keyword
symbol
name
intern
namespace
keyword?
symbol?
ident?
simple-keyword?
simple-symbol?
simple-ident?
qualified-keyword?
qualified-symbol?
qualified-ident?

Answer 10 · 2024-12-10T15:21:44.000Z

Claiming:

char
char?
format
pr-str
print-str
println-str
prn-str
str
string?
subs
with-out-str

Answer 11 · 2024-12-11T15:45:49.000Z

Claiming:

byte
short
int
long
float
double
bigint
bigdec
num
rationalize

Answer 12 · 2024-12-20T23:54:59.000Z

Claiming:

-
/
quot
rem
mod
inc
dec
max
min
with-precision
numerator
denominator
rand
rand-int

Answer 13 · 2024-12-21T15:21:45.000Z

Claiming

compare

Answer 14 · 2024-12-21T22:26:00.000Z

Claiming

seq #26

Answer 15 · 2024-12-25T18:22:58.000Z

Claiming fnil: #29

Answer 16 · 2024-12-31T14:40:15.000Z

Claiming partial: #30

Answer 17 · 2025-01-08T00:38:36.000Z

Claiming binding: #33

Answer 18 · 2025-01-18T15:31:41.000Z

Claiming: bound-fn: #42

Answer 19 · 2025-01-27T04:31:00.000Z

Claiming:

drop
drop-last
drop-while
take
take-last
take-while

Answer 20 · 2025-01-28T17:21:19.000Z

Claiming:

first
second
rest
next
nth
nthrest
nthnext

Answer 21 · 2025-01-29T15:23:51.000Z

I'll take zipmap.

Answer 22 · 2025-01-29T16:08:53.000Z

Claiming:

count
get
butlast
sequential?
associative?
sorted?
counted?
reversible?
seqable?
coll?
seq?
vector?
list?
map?
set?

Answer 23 · 2025-01-31T11:24:59.000Z

Claiming sort #67

Answer 24 · 2025-02-01T02:31:52.000Z

Claiming:

interleave
interpose

Answer 25 · 2025-02-04T20:24:02.000Z

Claiming shuffle

Answer 26 · 2025-02-20T17:06:24.000Z

Claiming:

==
<
>
<=
>=

Answer 27 · 2025-02-20T17:09:25.000Z

@quoll , any chance you could look into the tap code and fix the intermittent failure there? I think you originally wrote that, right? I got the intermittent failure again last night when running tests locally.

Answer 28 · 2025-03-08T00:45:05.000Z

Claiming reduce

Answer 29 · 2025-04-22T00:16:39.000Z

I've disabled the taps tests for now, due to the intermittent failures.

The list in this ticket has been updated. We're currently sitting at 20% coverage of all Clojure vars. Let's get that to 80%! 🚀

Answer 30 · 2025-06-10T11:38:22.000Z

claiming boolean?

Answer 31 · 2025-06-12T04:27:27.000Z

Claiming empty #84

Answer 32 · 2025-06-13T20:32:07.000Z

Claiming #86

ffirst
fnext
last
nfirst
nnext

Answer 33 · 2025-06-17T22:20:52.000Z

Claiming #87

hash-map
hash-set
set

Answer 34 · 2025-06-30T21:48:00.000Z

Claiming #89

empty?
get-in
find
contains?

Answer 35 · 2025-07-04T22:07:13.000Z

Claiming #90

parse-boolean
parse-long
parse-double
parse-uuid

Answer 36 · 2025-07-23T20:36:53.000Z

I claim update #92
NB: coll? is already added

Answer 37 · 2025-09-01T13:06:41.000Z

Claiming realized?

Answer 38 · 2025-09-05T09:17:52.000Z

claiming atom, constantly, fn?, ifn?

Answer 39 · 2025-09-05T18:13:24.000Z

We need a bot for what's claimed lol

Answer 40 · 2025-09-05T19:35:05.000Z

Github docs say markdown task lists are "retired" and that we should use sub tasks instead. So I tried creating sub tasks for all of these, but I had to do it one by one. That was annoying, but I was stubborn enough to do it. Until I hit cycle, which was the 100th sub task. Now any more sub tasks fail, since Github has a limit of 100 sub tasks per issue. 😑

So I'll use normal issues instead of sub tasks, but I don't think there's a way to convert the 100 sub tasks I have into normal issues. 🤔

Answer 41 · 2025-10-28T18:46:25.000Z

Ex-post conditionally (on AI acceptance) claiming all missing clojure.core functions starting with 'v' or 'w'.

Answer 42 · 2025-10-28T19:48:36.000Z

Overview

Ex-post conditionally (on AI acceptance) claiming all missing clojure.core functions starting with 'v' or 'w'.

Thanks for the interest and for the PR: #770 I appreciate that you'd like to help fill out these tests and I'm glad that you made clear that AI was used so that we can discuss it.

In short, I am not open to accepting AI writing these tests, at this point. The primary practical reason is over testing, along with the controversial philosophical reasoning. Yes, it takes effort to come up with good, concise coverage of a function. Most things worth well doing take effort. This is worth doing well.

Good unit tests have minimal to no redundancy. Each test is specifically chosen to handle one case, which is clear from reading it. With AI generated tests, this is rarely the case. To demonstrate my point, let's analyze the tests in your PR.

Numbers

Hypothetically, if we were to finish the remaining 455 tests using AI, let's take a look at the amount of code we'd be dealing with.

Based on your PR, containing 23 test files, the median file has 102 lines of code.
Based on the existing 180 test files currently in the repo, the median file has 30 lines of code.

I don't think that your PR contains three times the test coverage per file, but I would grant that an AI may come up with some cases which we might miss. The rest is likely to be over testing. So, using those numbers, that means, for the remaining 455 untested functions, we'd end up with:

The AI way: 46,410 lines of new code (grand total of 54,808)
The manual way: 13,650 lines of new code (grand total of 22,048)

The manual way will take longer, and will involve more effort, but we'll end up with less than a third of the lines of new code. When we're talking about a difference of 33k lines of new code, this is serious business. I need to maintain this code, not you, and not the AI you used.

Dialects

Furthermore, comparing your submitted code to what we have in main, I suspect we'll need a great more reader conditionals in order to properly handle ClojureScript, Clojure CLR, and babashka. Your submitted tests are taking CLJS into account, but are likely missing nuances between each dialect which are more likely found through experimentation. This will only increase the median line count per AI test file, which will only make the whole thing less appealing.