Add benchmarking tests (close #300) by mscwilson · Pull Request #301 · snowplow/snowplow-java-tracker

Miranda Wilson (mscwilson) · 2022-02-01T11:07:16Z

For issue #300.

Java comes with a benchmarking tool called Java Microbenchmarking Harness, JMH. It allows relatively accurate measuring of parts of the code. I integrated it as part of the main project build.gradle, and was able to measure the average time for Tracker.track(PageView) to complete, in the local 0.12.0-alpha.0 version. It takes a very consistent 1 microsecond.

That's not the time taken to send the event to the mocked HttpClientAdapter - that happens asynchronously, so Futures would be needed to measure that. Since we have had no reports of performance problems, that is almost certainly overkill.

I then wanted to measure the speed of Tracker.track(PageView) in the previous version 0.10.1 and 0.11.0. This has been problematic in multiple ways.

Problem 1: to interrupt the tracker's threads at the end of each test, jmh calls tracker.close(). This is a new method added in #298. Therefore to test previous versions, this has to be changed to emitter.close().

Problem 2: I had set up JMH manually as a gradle task, defining the tracker as a dependency (jmhImplementation 'com.snowplowanalytics:snowplow-java-tracker:0.11.0'. This dependency was required - removing it caused jmh to fail because it couldn't resolve any of the Snowplow components. I added this print statement at the beginning of the test run.

@Setup
public void doSetUp() {
    System.out.println("Using tracker version: " + tracker.getTrackerVersion());
}

It was always tracker version: 0.12.0-alpha.0 i.e. my local version currently being worked on. Presumably the Java compiler is somehow using both versions - local and from mavenCentral - but preferentially using the local one. So it was impossible to measure an old version.

Therefore, I created a brand new Gradle project inside the examples folder, and set up JMH again, this time using the popular JMH gradle plugin. I can now use mavenLocal and mavenCentral to resolve the snowplow-java-tracker version of my choice. Btw, Gradle strongly recommends never using mavenLocal.

Problem 3: When running the benchmarking on version 0.10.1 or 0.11.0, it gradually gets slower and slower for each test, until I have to manually cancel the run after a few minutes wait. So the first few iterations will take e.g. 500 ns, which will increase to e.g. 0.4 ms before I stop it.

Perhaps it's not worth measuring the old versions after all? We could just start benchmarking from this version onwards.

Miranda Wilson (mscwilson) · 2022-02-01T11:09:37Z

Currently, the JMH additions are present in both the main project and the separate project. I will delete one set once we decide what to do.

AlexBenny

Thanks for the clear explanation.

I prefer the benchmarking in a separate project because it simplifies the build.gradle of the tracker (it's already quite complex) and it also simplifies the switching between current tracker version and old tracker versions.

I don't think mavenLocal would be a problem for our CI. I think it would be a problem if used for the regular development in a large project. However, we can always add an issue as a reminder where we suggest to move to Ivy (it seems preferred in the doc you linked).

I think keeping the benchmarking data stored somewhere is a good idea. We can monitor for dangerous regressions. IMO, for now, a simple manual check with the old tracker would be enough. No need to keep those data stored somewhere. We can think at this for the future versions (maybe addressing this improvement in a future task).

I don't have an opinion about the Problem 3 (degrading performance of previous trackers). Maybe the old tracker is not fully shut down when the new tracker is instanced?!

Miranda Wilson (mscwilson) · 2022-02-01T19:50:08Z

I've improved the JMH code so that it recreates and then close()s the tracker/emitter for every iteration. This has fixed a large part of the memory problems (Problem 3). I can now run the benchmarking test successfully for version 0.11.0, with intermittent "out of heap space" errors for version 0.10.1.

Edit: to be clear, all the following tests were performed on my MacBook Pro M1.

The benchmarking test measures the average time to complete Tracker.track(pageView), over 500ms of testing. The results:

Tracker version	Time (mean), ns/op	S.D.	n
0.10.1	390.138	32.9	60
0.11.0	375.125	27.2	100
0.12.0-alpha.0	972.818	28.2	100

There were only 60 tests for version 0.10.1 because two of the forks were dropped due to "out of heap space".

I also tested using JMH's singleShotTime setting, which is supposed to measure the time for a single operation. The error was huge - some standard deviation results bigger than the mean. It claimed that a single attempt in version 0.12.0-alpha.0 took ~200 +- 70 microseconds. This obviously is at odds with the averageTime ns/operation result of ~1 microsecond.

I also tested manually using System.nanoTime(). The code is not uploaded here. I adapted the simple-console demo to create an ArrayList containing two of each of the six event types. I then looped over the list 200 times, calling Tracker.track() on each event (24000 events tracked in total), and measured how long it took. NB these are ms, not ns.

Tracker version	Time (mean), ms	S.D.	n
0.10.1	69.2	19.2	5
0.11.0	49.2	12.8	5
0.12.0-alpha.0	134.6	18.7	5

I don't think it's worth reading anything into the specific times measured: we can only compare the relative differences between the tracker versions. It's clear that the new version of the tracker is twice as slow as previous versions.

AlexBenny · 2022-02-02T15:50:44Z

These numbers are interesting. The fact that the new version is slower is quite expected. Being slower 2 or 3 times might not be a problem, after all we are talking of a microsecond. Mostly depends on how/where this tracker is used.
I think much of the slowness is due to the work needed to instance a thread for each event. If we want to reduce this time we can adopt the same buffer approach before the tracker threadpool. In practice the track method puts the event in a LinkedBlockingQueue and the tracker threadpool consumes the queue. It should make the time spent on the track method almost the same as the v0.11.

Paul Boocock (@paulboocock), thoughts on this?

Miranda Wilson (mscwilson) · 2022-02-10T15:16:20Z

We've decided to merge this in now, and revisit the discussion of a producer/consumer queue for the Tracker later.

Miranda Wilson (mscwilson) requested a review from AlexBenny February 1, 2022 11:07

Snowplow CLA bot (snowplowcla) added the cla:no [Auto generated] Snowplow Contributor License Agreement has not been signed. label Feb 1, 2022

Snowplow (snowplow) deleted a comment from Snowplow CLA bot (snowplowcla) Feb 1, 2022

Miranda Wilson (mscwilson) removed the cla:no [Auto generated] Snowplow Contributor License Agreement has not been signed. label Feb 1, 2022

AlexBenny reviewed Feb 1, 2022

View reviewed changes

Comment thread examples/benchmarking/src/jmh/java/com/snowplowanalytics/TrackerBenchmark.java Outdated

Comment thread src/jmh/java/com/snowplowanalytics/snowplow/jmh/TrackerBenchmark.java Outdated

Miranda Wilson (mscwilson) added 4 commits February 1, 2022 12:28

Set up JMH testing in main project

0c6d6dc

Add benchmark test

8b9c11c

Create separate project for jmh

21abe14

Remove JMH from main project

185fb14

Miranda Wilson (mscwilson) force-pushed the issue/300-performance_testing branch from 2b3c601 to 185fb14 Compare February 1, 2022 12:28

Fix memory leaks

d1c04b9

Add comments and readme

942b4e0

Miranda Wilson (mscwilson) marked this pull request as ready for review February 10, 2022 15:16

Miranda Wilson (mscwilson) merged commit f9cab8c into release/0.12.0 Feb 10, 2022

Miranda Wilson (mscwilson) deleted the issue/300-performance_testing branch February 10, 2022 15:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add benchmarking tests (close #300)#301

Add benchmarking tests (close #300)#301
Miranda Wilson (mscwilson) merged 6 commits intorelease/0.12.0from
issue/300-performance_testing

Miranda Wilson (mscwilson) commented Feb 1, 2022

Uh oh!

Miranda Wilson (mscwilson) commented Feb 1, 2022

Uh oh!

AlexBenny left a comment

Uh oh!

Uh oh!

Uh oh!

Miranda Wilson (mscwilson) commented Feb 1, 2022 •

edited

Loading

Uh oh!

AlexBenny commented Feb 2, 2022

Uh oh!

Miranda Wilson (mscwilson) commented Feb 10, 2022

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

3 participants

Conversation

Miranda Wilson (mscwilson) commented Feb 1, 2022

Uh oh!

Miranda Wilson (mscwilson) commented Feb 1, 2022

Uh oh!

AlexBenny left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Miranda Wilson (mscwilson) commented Feb 1, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AlexBenny commented Feb 2, 2022

Uh oh!

Miranda Wilson (mscwilson) commented Feb 10, 2022

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

3 participants

Miranda Wilson (mscwilson) commented Feb 1, 2022 •

edited

Loading