Implement Record/Replay functionality for btest #185

Merged
merged 19 commits into from Jul 2, 2025

Conversation

rexim
Member

@rexim rexim commented Jun 30, 2025

2025-06-30-133752_1312x789_scrot

@rexim
Member Author

rexim commented Jun 30, 2025

I'm gonna wait 1 day for the feedback and then merge it

@rexim rexim changed the title from "Implement Record/Replay functionality for best" to "Implement Record/Replay functionality for btest" Jun 30, 2025
@deniska
Contributor

deniska commented Jun 30, 2025

Two potential bikeshedding points:

Do we expect different outputs for different architectures? If we do, that can potentially lead to some targets having a recorded "semi-broken" behavior which will look green in the test runner, and you wouldn't know it without a deeper inspection ("vibe check") of the particular test output. If the test is expected to be broken on the particular architecture (especially runtime broken, rather than build time broken), it should be marked more clearly that this particular architecture for this particular test is "special".

And if we go with "different test output per target", perhaps put them in different files, so that diffs fixing codegen don't conflict with each other?

@Miezekatze64
Contributor

Miezekatze64 commented Jun 30, 2025

For me the ./build/btest -c asm_func_fasm_x86_64_linux -t fasm-x86_64-windows test gives the correct output on wine, although it is expected to give an incorrect output (which makes sense, because the test is linux-only, but the test system currently reports this as unexpected)

image

It seems like the JSON currently contains

"fasm-x86_64-windows": {
    "status": "RunSuccess",
    "stdout": "8070071\r\n"
  }

although for some reason it works correctly for me (wine version: wine-10.9).
(exact same behaviour for asm_func_gas_x86_64_linux on gas-x86_64-windows)

Maybe we could introduce a way to say that particular test should only be tested for some architectures instead of asserting failure?

@rexim
Member Author

rexim commented Jun 30, 2025

@deniska I think pretty-printing the json files makes things less conflict- and error-prone: c919af6. What do you think?
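For reference, a stable pretty-printed layout (the kind of thing c919af6 presumably produces; this Python sketch is illustrative, not the PR's actual serializer) can be generated like so:

```python
import json

# Hypothetical expected-output record for one test, keyed by target name.
snapshot = {
    "fasm-x86_64-windows": {
        "status": "RunSuccess",
        "stdout": "8070071\r\n",
    },
}

# One key per line with a fixed key order: edits touching different
# targets land on different lines, so concurrent branches rarely conflict.
print(json.dumps(snapshot, indent=2, sort_keys=True))
```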

@deniska
Contributor

deniska commented Jun 30, 2025

Pretty-printing the json makes the diff problem quite a bit less severe, sure.

But there's still the conceptual problem of "by default we expect every architecture to have a different output".

My idea for a system like this would be to have toupper.json with the output it's supposed to have, for all targets, and then to have toupper.uxn.json if for some reason the uxn target can't do the usual output on the toupper test, with the overrides (maybe different output, or maybe it doesn't compile at all, whatever it does, reflected in the overridden json file).

Contributor

@nullnominal nullnominal left a comment

LGTM, other than tests/ being bloated with the JSON files.

We think it would be a good idea to have them in a sub-folder.

@rexim
Member Author

rexim commented Jul 1, 2025

For me the ./build/btest -c asm_func_fasm_x86_64_linux -t fasm-x86_64-windows test gives the correct output on wine, although it is expected to give an incorrect output (which makes sense, because the test is linux-only, but the test system currently reports this as unexpected)

image

It seems like the JSON currently contains

"fasm-x86_64-windows": {
    "status": "RunSuccess",
    "stdout": "8070071\r\n"
  }

although for some reason it works correctly for me (wine version: wine-10.9). (exact same behaviour for asm_func_gas_x86_64_linux on gas-x86_64-windows)

Maybe we could introduce a way to say that particular test should only be tested for some architectures instead of asserting failure?

@Miezekatze64 Thank you for your feedback! I changed up the concept at 4b79ae0. Now build and runtime failures are not something that we expect and you can also disable tests on certain targets right in the json. Let me know what you think.
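As a rough illustration of what such a record might look like, with one target disabled right in the json (key names here are guesses, not necessarily what 4b79ae0 actually uses):

```json
{
  "fasm-x86_64-linux": {
    "status": "RunSuccess",
    "stdout": "8070071\n"
  },
  "fasm-x86_64-windows": {
    "status": "Disabled",
    "comment": "asm_func is linux-only"
  }
}
```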

2025-07-01-104255_1310x807_scrot

@nullnominal
Contributor

LGTM, other than tests/ being bloated with the JSON files.

We think it would be a good idea to have them in a sub-folder.

Created a PR for this

@Miezekatze64
Contributor

For me the ./build/btest -c asm_func_fasm_x86_64_linux -t fasm-x86_64-windows test gives the correct output on wine, although it is expected to give an incorrect output (which makes sense, because the test is linux-only, but the test system currently reports this as unexpected)
image
It seems like the JSON currently contains

"fasm-x86_64-windows": {
    "status": "RunSuccess",
    "stdout": "8070071\r\n"
  }

although for some reason it works correctly for me (wine version: wine-10.9). (exact same behaviour for asm_func_gas_x86_64_linux on gas-x86_64-windows)
Maybe we could introduce a way to say that particular test should only be tested for some architectures instead of asserting failure?

@Miezekatze64 Thank you for your feedback! I changed up the concept at 4b79ae0. Now build and runtime failures are not something that we expect and you can also disable tests on certain targets right in the json. Let me know what you think.

2025-07-01-104255_1310x807_scrot

Looks good now.

@rexim
Member Author

rexim commented Jul 2, 2025

LGTM, other than tests/ being bloated with the JSON files.

We think it would be a good idea to have them in a sub-folder.

@nullnominal merged them all into a single file at b48c7e2. Let me know what you think.

@nullnominal
Contributor

LGTM, other than tests/ being bloated with the JSON files.
We think it would be a good idea to have them in a sub-folder.

@nullnominal merged them all into a single file at b48c7e2. Let me know what you think.

Yeah, it's good.

@rexim
Member Author

rexim commented Jul 2, 2025

Thank you everybody for the feedback! Sorry that it took so long. This functionality is very important to me, and I can't continue merging other PRs without it. Thank you again!

@rexim rexim merged commit dfc20e5 into main Jul 2, 2025
1 check passed
@rexim rexim deleted the rere branch July 2, 2025 21:53