lyng/pi_spigot_perf_baseline.md at 30e56946a011081fa38aa1c20f1c78e8a5c58e8a

sergeych d8454a11fc optimize arithmetics

2026-04-04 04:01:43 +03:00

Pi Spigot JVM Baseline

Saved on April 4, 2026 before the List<Int> indexed-access follow-up fix.

Benchmark target:

Execution path:

Python: python3 examples/pi-bench.py
Lyng JVM: ./gradlew :lyng:runJvm --args='/home/sergeych/dev/lyng/examples/pi-bench.lyng'
Constraint: do not use Kotlin/Native lyng CLI for perf comparisons

Baseline measurements:

Baseline ratio:

Primary finding at baseline:

The hot reminders[j] accesses in piSpigot were still lowered through boxed object index ops and boxed arithmetic.
Newly added GET_INDEX_INT and SET_INDEX_INT only reached pi, not reminders.
Root cause: initializer element inference handled list literals, but not List.fill(boxes) { 2 }, so reminders did not become known List<Int> at compile time.

Follow-up change:

Verification:

piSpigot disassembly now contains typed ops for reminders, for example:
- GET_INDEX_INT s5(reminders), s10(j), ...
- SET_INDEX_INT s5(reminders), s10(j), ...

Post-change measurements using jlyng:

Observed improvement vs baseline:

Residual gap vs Python baseline:

Full script: Lyng JVM is still about 3.9x slower than Python (655.8 ms vs 167 ms)
Warm function: Lyng JVM is still about 2.3x slower than Python (286.2 ms vs 126.126 ms)

Current benchmark-test snapshot (n=200, JVM test harness):

optimized-int-division-rval-off: 135 ms
optimized-int-division-rval-on: 125 ms
piSpigot bytecode now contains:
- LIST_FILL_INT for both pi and reminders
- GET_INDEX_INT / SET_INDEX_INT for the hot indexed loop