Reuse the existing benchmark harness code to test end-to-end query scenarios, checking behavior for cases such as:
Keeping the tests at the level of "send request, check the resulting external behavior and response" should keep them fairly resilient to refactoring.
This can later be expanded to validate server+client functionality for e.g. #21 (COOKIE) and #35 (NSID).
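
For illustration, here is a rough sketch of what one "send request, check response" scenario could look like. It assumes the queries are DNS messages over UDP (inferred from the COOKIE and NSID references), and everything in it is hypothetical: `start_stub_server` is only a stand-in for launching the real server, and names like `build_query` and `run_scenario` are not taken from the existing harness.

```python
import socket
import struct
import threading


def build_query(qname, qtype=1, txid=0x1234):
    """Encode a one-question DNS query in RFC 1035 wire format."""
    # Header: ID, flags (only RD set), QDCOUNT=1, ANCOUNT=NSCOUNT=ARCOUNT=0.
    header = struct.pack("!HHHHHH", txid, 0x0100, 1, 0, 0, 0)
    labels = b"".join(
        bytes([len(label)]) + label.encode("ascii")
        for label in qname.rstrip(".").split(".")
    )
    # QNAME ends with the root label, then QTYPE and QCLASS=IN (1).
    return header + labels + b"\x00" + struct.pack("!HH", qtype, 1)


def parse_header(message):
    """Decode the fixed 12-byte DNS header into the fields the checks need."""
    txid, flags, qdcount, ancount, _ns, _ar = struct.unpack("!HHHHHH", message[:12])
    return {
        "id": txid,
        "qr": flags >> 15,
        "rcode": flags & 0x000F,
        "qdcount": qdcount,
        "ancount": ancount,
    }


def start_stub_server():
    """Hypothetical stand-in for the real server: answers one query, then exits."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    sock.bind(("127.0.0.1", 0))  # port 0: let the OS pick a free port
    address = sock.getsockname()

    def serve_once():
        data, client = sock.recvfrom(512)
        txid, flags = struct.unpack("!HH", data[:4])
        # Echo the query back with the QR (response) bit set, RCODE left at 0.
        sock.sendto(struct.pack("!HH", txid, flags | 0x8000) + data[4:], client)
        sock.close()

    threading.Thread(target=serve_once, daemon=True).start()
    return address


def send_query(server, payload, timeout=2.0):
    """Send one UDP request and return the raw response bytes."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as client:
        client.settimeout(timeout)
        client.sendto(payload, server)
        response, _ = client.recvfrom(512)
        return response


def run_scenario(qname="example.com"):
    """One end-to-end scenario: send a query, return what was observed."""
    server = start_stub_server()
    query = build_query(qname)
    response = send_query(server, query)
    # Only externally visible behavior is inspected: the header fields and
    # whether the question section came back unchanged.
    return parse_header(response), response[12:] == query[12:]
```

A scenario then boils down to calling `run_scenario(...)` and asserting on the returned header fields. Since nothing here touches server internals, swapping the stub for the real server process should not require changing the assertions.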