ACCU World of Code

I still use the mutt email client when I’m remoted into some of my FreeBSD servers. It might not be the most eye-pleasing email client ever, but it’s powerful, lightweight and fast. Mutt has a very powerful feature that allows you to tag messages via regular expressions. It has a couple of special pattern modifiers that allow you to apply the regex to certain mail headers only. I can never remember them, so I’m starting a list of the ones I tend to use most in the hope that I’ll either remember them eventually or can refer back to this post.

December 15, 2016

Migrate MelatiSite from CVS to github

Tim Pizey from Tim Pizey

Re-visiting http://tim-pizey.blogspot.co.uk/2011/10/cvs-to-github.html (why did I not complete this at the time?)

Following How to export revision history from mercurial or git to cvs?

On hanuman I created an id file, git_authors, mapping CVS ids to GitHub name and email format for all contributors:


timp=Tim Pizey<timp@paneris.org>

Then create a repository on GitHub (melati in this example; I have already uploaded my SSH public key for this machine):


cd ~
git cvsimport -d /usr/cvsroot -C MelatiSite -r cvs -k -A git_authors MelatiSite

cd melati
echo A jdbc to java object relational mapping system. 1999-2011 > README.txt
git add README.txt
git commit -m "Initial" README.txt
git remote add origin git@github.com:timp21337/melati.git
git push -u origin master

See https://github.com/timp21337/melati.

December 13, 2016

Converting files from DOS to UNIX file formats using Emacs

The Lone C++ Coder's Blog from The Lone C++ Coder's Blog

What do you do if you don’t have dos2unix, need to convert a file or three from DOS (or Mac) format to UNIX format, but all you have is Emacs? Well, of course you use Emacs for the file conversion, what else?

December 12, 2016

Surprising Defaults – HttpClient ExpectContinue

Chris Oldwood from The OldWood Thing

One of the things you quickly discover when moving from building services on-premise to “the cloud” is quite how many more bits of wire and kit suddenly sit between you and your consumer. Performance-wise this already elongated network path can then be further compounded when the framework you’re using invokes unintuitive behaviour by default [1].

The Symptoms

The system was a new REST API built in C# on the .Net framework (4.6) and hosted in the cloud with AWS. This AWS endpoint was then further fronted by Akamai for various reasons. The initial consumer was an on-premise adaptor (also written in C#) which itself had to go through an enterprise grade web proxy to reach the outside world.

Naturally monitoring was added in fairly early on so that we could start to get a feel for how much added latency moving to the cloud would bring. Our first order approximation to instrumentation allowed us to tell how long the HTTP requests took to handle along with a breakdown of the major functions, e.g. database queries and 3rd party requests. Outside the service we had some remote monitoring too that could tell us the performance from a more customer-like position.

When we integrated with the 3rd party service some poor performance stats caused us to look closer into our metrics. The vast majority of big delays were outside our control, but it also raised some other questions as the numbers didn’t quite add up. We had expected the following simple formula to account for virtually all the time:

HTTP Request Time ~= 3rd Party Time + Database Time

However we were seeing a 300 ms discrepancy in many (but not all) cases. It was not our immediate concern as there were bigger fish to fry but some extra instrumentation was added to the OWIN pipeline and we did a couple of quick local profile runs to look out for anything obviously out of place. The finger seemed to point to time lost somewhere in the Nancy part of the pipeline, but that didn’t entirely make sense at the time so it was mentally filed away and we moved on.
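The post does not show the instrumentation that was added, so the following is only a minimal sketch of what timing a request inside an OWIN pipeline could look like. It uses the raw OWIN application delegate (an environment dictionary in, a Task out) and the standard "owin.RequestPath" environment key; the class name, the log callback and the message format are assumptions made for illustration.

```csharp
using System;
using System.Collections.Generic;
using System.Diagnostics;
using System.Threading.Tasks;

// The raw OWIN application delegate: environment dictionary in, Task out.
using AppFunc = System.Func<System.Collections.Generic.IDictionary<string, object>, System.Threading.Tasks.Task>;

// Hypothetical middleware; the name and log format are illustrative only.
public sealed class RequestTimingMiddleware
{
    private readonly AppFunc next;
    private readonly Action<string> log;

    public RequestTimingMiddleware(AppFunc next, Action<string> log)
    {
        this.next = next;
        this.log = log;
    }

    public async Task Invoke(IDictionary<string, object> environment)
    {
        var stopwatch = Stopwatch.StartNew();
        try
        {
            // Everything further down the pipeline (e.g. Nancy) runs inside this call.
            await next(environment);
        }
        finally
        {
            stopwatch.Stop();

            // "owin.RequestPath" is one of the standard OWIN environment keys.
            object path;
            environment.TryGetValue("owin.RequestPath", out path);

            log(string.Format("{0} took {1} ms", path, stopwatch.ElapsedMilliseconds));
        }
    }
}
```

Comparing the figure logged here with the time measured inside the handler itself is one way to see how much time is spent before the handler is ever invoked.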

Serendipity Strikes

Whilst talking to the 3rd party about our performance woes with their service they came back to us and asked if we could stop sending them an “Expect: 100-Continue” header in our HTTP requests.

This wasn’t something anyone in the team was aware of and as far as we could see from the various RFCs and blog posts it was something “naturally occurring” on the internet. We also didn’t know if it was us adding it or one of the many proxies in between us and them.

We discovered how to turn it off, and did, but it made little difference to the performance problems we had with them, which were in the order of seconds, not milliseconds. Feeling uncomfortable about blindly switching settings off without really understanding them we reverted the change.

The mention of this header also cropped up when we started investigating some errors we were getting from Akamai that seemed to be more related to a disparity in idle connection timeouts.

Eventually, as we learned more about this mysterious header someone in the team put two-and-two together and realised this was possibly where our missing time was going too.

The Cause

Our REST API uses PUT requests to add resources and it appears that the default behaviour of the .Net HttpClient class is to enable the sending of this “Expect: 100-Continue” header for those types of requests. Its purpose is to tell the server that the headers have been sent but that it will delay sending the body until it receives a 100-Continue style response. At that point the client sends the body, the server can then process the entire request and the response is handled by the client as per normal.

Yes, that’s right, it splits the request up so that it takes two round trips instead of one!

Now you can probably begin to understand why our request handling time appeared elongated and why it also appeared to be consumed somewhere within the Nancy framework. The request processing is started and handled by the OWIN middleware as that only depends on the headers, it then enters Nancy which finds a handler, and so requests the body in the background (asynchronously). When it finally arrives the whole request is then passed to our Nancy handler just as if it had been sent all as a single chunk.

The Cure

When you google this problem with relation to .Net you’ll see that there are a couple of options here. We were slightly nervous about choosing the nuclear option (setting it globally on the ServicePointManager) and instead added an extra line into our HttpClient factory so that it was localised:

var client = new HttpClient(...);
...
client.DefaultRequestHeaders.ExpectContinue = false;
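For comparison, here is a rough sketch of the scopes at which the header can be switched off on the .NET Framework. The per-client form is the one used above and the global form is the ServicePointManager setting already mentioned; the per-request form is an extra variation not taken from the post. The class and method names are made up for the example.

```csharp
using System;
using System.Net;
using System.Net.Http;

// Hypothetical helper class; only the three property assignments matter.
public static class ExpectContinueOptions
{
    // Per client: applies to every request sent through this HttpClient.
    public static HttpClient CreateClient()
    {
        var client = new HttpClient();
        client.DefaultRequestHeaders.ExpectContinue = false;
        return client;
    }

    // Per request: applies to this one message only.
    public static HttpRequestMessage CreatePut(Uri uri, HttpContent body)
    {
        var request = new HttpRequestMessage(HttpMethod.Put, uri) { Content = body };
        request.Headers.ExpectContinue = false;
        return request;
    }

    // Process-wide (the "nuclear option"): changes the default for the whole
    // process, including code you did not write.
    public static void DisableGlobally()
    {
        ServicePointManager.Expect100Continue = false;
    }
}
```

The narrower scopes leave other code in the same process untouched, which is the reason given above for avoiding the global switch.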

We re-deployed our services, checked our logs to ensure the header was no longer being sent, and then checked the various metrics to see if the time was now all accounted for, and it was.

Epilogue

In hindsight this all seems fairly obvious, at least, once you know what this header is supposed to do, and yet none of the people in my team (who are all pretty smart) joined up the dots right away. When something like this goes astray I like to try and make sense of why we didn’t pick it up as quickly as perhaps we should have.

In the beginning there were so many new things for the team to grasp. The difference in behaviour between our remote monitoring and on-premise adaptor was assumed to be one of infrastructure especially when we had already battled the on-premise web proxy a few times [2]. We saw so many other headers in our requests that we never added so why would we assume this one was any different (given none of us had run across it before)?

Given the popularity and maturity of the Nancy framework we surmised that no one would use it if there were the kind of performance problems we were seeing, so once again we were confused as to how the time could appear to be lost inside it. Although we were all aware of what the async/await construct does none of us had really spent any serious time trying to track down performance anomalies in code that used it so liberally and so once again we had difficulties understanding what the tool was really telling us.

Ultimately though the default behaviour just seems so utterly wrong that none of us could imagine the out-of-the-box settings would cause the HttpClient to behave this way. By choosing this default we are in essence optimising PUT requests for the scenario where the body does not need sending, which we all felt is definitely the exception not the norm. Aside from large file uploads or massive write contention we were struggling to come up with a plausible use case.

I don’t know what forces caused this decision to be made as I clearly wasn’t there and I can’t find any obvious sources that might explain it either. The internet and HTTP have evolved so much over the years that it’s possible this behaviour provides the best compatibility with web servers out-of-the-box. My own HTTP experience only covers the last few years along with a few more around the turn of the millennium, but my colleagues easily cover the decades I’m missing so I don’t feel I’m missing anything obvious.

Hopefully some kind soul will use the comments section to link to the rationale so we can all get a little closure on the issue.

[1] Violating The Principle of Least Astonishment for configuration settings was something I covered more generally before in “Sensible Defaults”.

[2] See “The Curse of NTLM Based HTTP Proxies”.

December 6, 2016

the design and implementation of cyber-dojo

Jon Jagger from less code, more software

At the excellent Agile on the Beach conference in Cornwall I did a presentation outlining some of the history, design and implementation of cyber-dojo. The video has just gone live on youtube.

December 3, 2016

What happened to XEmacs?

The Lone C++ Coder's Blog from The Lone C++ Coder's Blog

I used XEmacs quite a lot in the 2000s before I switched back to the more stable GNU Emacs. That was back before GNU Emacs offered a stable official Windows build when XEmacs did, and at the time I was doing a lot of Windows development. Out of curiosity and for some research I tried to look into the current state of the project and found that www.xemacs.org appears to be unreachable.

December 2, 2016

Bit the bullet and upgraded my Mac Pro’s CPU

The Lone C++ Coder's Blog from The Lone C++ Coder's Blog

I’ve been an unashamed fan of the old “cheese grater” Mac Pro due to its sturdiness and expandability. Yes, they’re not the most elegant bit of kit out there but they are well built. And most importantly for me, they are expandable by plugging things inside the case, not by creating a Gordian Knot of hubs, Thunderbolt cables, USB cables and stacks of external disks all evenly scattered around a trash can. Oh, and they’re designed to go under a desk. Where mine happens to live, right next to my dual boot Linux/Windows development box.

November 19, 2016

Switching to Manjaro Linux and getting an AMD RX 470 to work

The Lone C++ Coder's Blog from The Lone C++ Coder's Blog

I’ve been a Xubuntu user for years after switching from OpenSuse. I liked its simplicity and the fact that it just worked out of the box, but I was getting more and more disappointed with Ubuntu packages being out of date, sorry, stable. Having to rebuild a bunch of packages on every install was getting a little old. Well, they did provide material for all those “build XXX on Ubuntu” posts. Recently I’ve been playing with Manjaro Linux in a VM as I had been looking for an Arch Linux based distribution that gave me the right balance between DIY and convenience. I ended up liking it so much that I did a proper bare metal install on my main desktop. The install was pretty smooth apart from an issue with getting my AMD RX 470 graphics card to work.

November 17, 2016

Overly Prescriptive Tests

Chris Oldwood from The OldWood Thing

In my recent post “Tautologies in Tests” I adapted one of Einstein’s apocryphal sayings and suggested that tests should be “as precise as possible, but not too precise”. But what did I mean by that? How can you be too precise, in fact isn’t that the point?

Mocking

One way is to be overly specific when tracking the interactions with mocks. It’s very easy when using a mocking framework to go overboard with your expectations, just because you can. My personal preference (detailed before in “Mock To Test the Outcome, Not the Implementation”) is to keep the details of any interactions loose, but be specific about the outcomes. In other words what matters most is (usually) the observable behaviour, not necessarily how it’s achieved.

For example, rather than set up detailed instructions on a mock that cover all the expected parameters and call counts I’ll mostly use simple hand-crafted mocks [1] where the method maps to a delegate where I’ll capture only the salient details. Then in the assertions at the end I verify whatever I need to in the same style as the rest of the test. Usually though the canned response is test case specific and so rarely needs any actual logic.
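As a rough illustration of that style, the sketch below shows a hand-crafted mock whose single method forwards to a settable delegate. The interface, the consumer and every name in it are invented for the example; none of them come from the post.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical dependency, invented for illustration.
public interface IMessageSender
{
    void Send(string recipient, string body);
}

// Hand-crafted mock: the interface method simply forwards to a delegate
// that each test can replace. The default does nothing.
public sealed class FakeMessageSender : IMessageSender
{
    public Action<string, string> OnSend = (recipient, body) => { };

    public void Send(string recipient, string body)
    {
        OnSend(recipient, body);
    }
}

// Hypothetical code under test.
public sealed class Notifier
{
    private readonly IMessageSender sender;

    public Notifier(IMessageSender sender)
    {
        this.sender = sender;
    }

    public void NotifyAll(IEnumerable<string> recipients, string body)
    {
        foreach (var recipient in recipients)
            sender.Send(recipient, body);
    }
}

public static class NotifierExample
{
    // Captures only the salient detail (who was notified), not call order,
    // call counts or the exact message body.
    public static List<string> RecipientsNotified()
    {
        var notified = new List<string>();
        var sender = new FakeMessageSender
        {
            OnSend = (recipient, body) => notified.Add(recipient)
        };

        new Notifier(sender).NotifyAll(new[] { "alice", "bob" }, "hello");

        return notified;
    }
}
```

In a real test the captured list would be checked with the same assertion style as the rest of the test, which leaves the implementation free to change how and when it calls the dependency.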

In essence what I’m creating some people prefer to call stubs as they reserve the term “mocks” for meatier test fakes that record interactions for you. I’d argue that using the more complex form of mock is largely unnecessary and will hurt in the long run. To date (anecdotally speaking) I’ve wasted too much time “fixing” broken tests that overused mocks by specifying every little detail and were never written to give the implementation room to manoeuvre, e.g. during refactoring. In fact an automated refactoring tool is mandatory on code like this because the methods are referenced in so many tests it would take forever to fix up manually.

I often feel that some of the interactions with dependencies I’ve seen in the past have felt analogous to testing private methods. Another of my previous posts that was inspired by mocking hell is “Don’t Pass Factories, Pass Workers”. Naturally there is a fine line here and maybe I’ve just not seen enough of it done well to appreciate how this particular tool can be used effectively.

White-Box Testing

The other form of overly specific test I’ve seen comes from what I believe is relying too much on a white-box testing approach so that the tests express the output exactly.

The problem with example based tests is that they are often taken literally, which I guess is kind of the point, but as software engineers we should try and see past the rigid examples and verify the underlying behaviour instead, which is what we’re really after.

For example, consider a pool of numbers [2] up to some predefined limit, say, 10. A naïve approach to the problem might test the pool by asserting a very specific sequence, i.e. the starting one:

[Test]
public void returns_sequence_up_to_limit()
{
    var pool = new NumberPool(10);
    var expected = new[] { 1, 2, 3, ... , 9, 10 };

    foreach (var number in expected)
        Assert.That(pool.Acquire(), Is.EqualTo(number));
}

From a white-box testing approach we can look inside the NumberPool and probably see that it’s initially generating numbers using the ++ operator. The implementation might eagerly generate that sequence in the constructor, add them to the end of a queue, and then divvy out the front of the queue.

From a “programmer’s test” point of view (aka unit test) it does indeed verify that, if my expectation is that the implementation should return the exact sequence 1..10, then it will. But how useful is that for the maintainer of this code? I’d argue that we’ve over-specified the way this unit should be allowed to behave.
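The NumberPool class itself is never shown, so the following is only a guess at the queue-based implementation described above: the numbers are generated eagerly in the constructor and handed out from the front of a queue. The Release method and the choice of exception are assumptions added to make the sketch complete.

```csharp
using System;
using System.Collections.Generic;

// A guess at the shape of the implementation; not the author's actual code.
public sealed class NumberPool
{
    private readonly Queue<int> available = new Queue<int>();

    public NumberPool(int limit)
    {
        // Eagerly generate 1..limit and add each number to the end of the queue.
        for (var number = 1; number <= limit; ++number)
            available.Enqueue(number);
    }

    public int Acquire()
    {
        if (available.Count == 0)
            throw new InvalidOperationException("The pool is exhausted.");

        // Divvy out whatever is at the front of the queue.
        return available.Dequeue();
    }

    // Assumed API: hands a number back so that it can be acquired again.
    public void Release(int number)
    {
        available.Enqueue(number);
    }
}
```

The ascending order that the test above pins down is a side effect of how the queue is filled, not something a caller of Acquire needs.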

Verify Behaviours

And that, I think, lies at the heart of the problem. For tests to be truly effective they should not describe exactly what they do, but should describe how they need to behave. Going back to our example above the NumberPool class does not need to return the exact sequence 1..10, it needs to satisfy some looser constraints, such as not returning a duplicate value (until re-acquired), and limiting the range of numbers to between 1 and 10.

[Test]
public void sequence_will_be_unique()
{
    var pool = new NumberPool(10);
    var sequence = new List<int>();

    foreach (var i in Enumerable.Range(1, 10))
        sequence.Add(pool.Acquire());

    Assert.That(sequence.Distinct().Count(),
                Is.EqualTo(10));
}

[Test]
public void sequence_only_contains_one_to_limit()
{
    var pool = new NumberPool(10);
    var sequence = new List<int>();

    foreach (var i in Enumerable.Range(1, 10))
        sequence.Add(pool.Acquire());

    Assert.That(sequence.Where(n => (n < 1) || (n > 10)),
                Is.Empty);
}

With these two tests we are free to change the implementation to generate a random sequence in the constructor instead if we wanted, and they would still pass, because it conforms to the looser, albeit still well defined, behaviour. (It may have unpredictable performance characteristics but that is a different matter.)

Once again we are beginning to enter the realm of property based testing which forces us to think harder about what behaviours our code exhibits rather than what it should do in one single scenario.
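To make that concrete, here is a small sketch of an alternative pool that shuffles its numbers in the constructor, together with a hand-rolled check of the two properties across a range of limits. The loop is only a stand-in for a real property based testing library (there is no input shrinking, for instance), and all of the names are hypothetical.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical alternative implementation: same contract, random order.
public sealed class ShuffledNumberPool
{
    private readonly Queue<int> available;

    public ShuffledNumberPool(int limit, Random random)
    {
        var numbers = Enumerable.Range(1, limit).ToArray();

        // Fisher-Yates shuffle of 1..limit.
        for (var i = numbers.Length - 1; i > 0; --i)
        {
            var j = random.Next(i + 1); // 0 <= j <= i
            var swap = numbers[i];
            numbers[i] = numbers[j];
            numbers[j] = swap;
        }

        available = new Queue<int>(numbers);
    }

    public int Acquire()
    {
        return available.Dequeue();
    }
}

public static class PoolProperties
{
    // True when 'limit' acquisitions are all unique and all within 1..limit.
    public static bool HoldsFor(int limit, Random random)
    {
        var pool = new ShuffledNumberPool(limit, random);
        var sequence = Enumerable.Range(1, limit)
                                 .Select(_ => pool.Acquire())
                                 .ToList();

        return sequence.Distinct().Count() == limit
            && sequence.All(n => n >= 1 && n <= limit);
    }

    // Checks the same properties for every limit from 1 to maxLimit.
    public static bool HoldsForLimitsUpTo(int maxLimit)
    {
        var random = new Random();
        return Enumerable.Range(1, maxLimit).All(limit => HoldsFor(limit, random));
    }
}
```

Neither check mentions the order in which the numbers come out, so the same two properties hold for an ascending, queue-based pool as well.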

This does not mean there is no place for tests that take a specific set of inputs and validate the result against a known set of outputs. On the contrary they are an excellent starting point for thinking about what the real test should do. They are also important in scenarios where you need some smoke tests that “kick the tyres” or you are naturally handling a very specific scenario.

Indicative Inputs

Sometimes we don’t intend to make our test look specific but it just turns out that way to the future reader. For example in our NumberPool tests above what is the significance of the number “10”? Hopefully in this example it is fairly obvious that it is an arbitrary value as the test names only talk about “a limit”. But what about a test for code that handles, say, an HTTP error?

[Test]
public void client_throws_when_service_unavailable()
{
    using (FakeServer.Returns(InternalServerError))
    {
        var client = new RestClient(. . .);

        Assert.That(() => client.SendRequest(. . .),
                    Throws.InstanceOf<RequestException>());
    }
}

In this test we have a mock (nay stub) HTTP server that will return a non-2XX style result code. Now, what is the significance of the InternalServerError result code returned by the stub? Is it a specific result code we’re handling here, or an indicative one in the 5XX range? The test name uses the term “service unavailable” which maps to the more specific HTTP code 503, so is this in fact a bug in the code or test?

Unless the original author is around to ask (and even remembers) we don’t know. We can surmise what they probably meant by inspecting the production code and seeing how it processes the result code (e.g. a direct comparison or a range based one). From there we might choose to see how we can avoid the ambiguity by refactoring the test. In the case where InternalServerError is merely indicative we can use a suitably named constant instead, e.g.

[Test]
public void throws_when_service_returns_5xx_code()
{
    const int CodeIn5xxRange = InternalServerError;

    using (FakeServer.Returns(CodeIn5xxRange))
    {
        var client = new RestClient(. . .);

        Assert.That(() => client.SendRequest(. . .),
                    Throws.InstanceOf<RequestException>());
    }
}
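The production code is not shown, so as a hedged illustration of the two styles of check mentioned earlier (a direct comparison versus a range based one), the hypothetical helpers below show how differently they treat the same result code.

```csharp
using System.Net;

// Hypothetical helpers; the names are invented for this example.
public static class ResultCodeChecks
{
    // Direct comparison: only 503 counts as "service unavailable".
    public static bool IsServiceUnavailable(HttpStatusCode code)
    {
        return code == HttpStatusCode.ServiceUnavailable;
    }

    // Range based comparison: any code in the 5XX range counts as a server error.
    public static bool IsServerError(HttpStatusCode code)
    {
        var value = (int)code;
        return value >= 500 && value <= 599;
    }
}
```

Under the first check a stubbed InternalServerError (500) would not count as a failure at all, whereas under the second it is as good an indicative value as any other 5XX code, which is why the reader needs to know which one the test intends.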

A clue that there is a disconnect is when the language used in the test name isn’t correctly reflected in the test body itself. So if the name isn’t specific then nor should the test be, but also vice-versa, if the name is specific then expect the test to be. A corollary to this is that if your test name is vague don’t be surprised when the test itself turns out equally vague.

Effective Tests

For a suite of tests to be truly effective you need them to remain quietly in the background until you change the code in a way that raises your awareness around some behaviour you didn’t anticipate. The fact that you didn’t anticipate it means that you’ll be relying heavily on the test rather than the code you just changed to make sense of the original intended behaviour.

When it comes under the spotlight (fails) a test needs to convince you that it was well thought out and worthy of your consideration. To be effective a guard dog has to learn the difference between friend and foe and when we write tests we need to learn how to leave enough room for safe manoeuvring without forgetting to bark loudly when we exceed our remit.

[1] When you keep your interfaces simple and focused this is pretty easy given how much a modern IDE can generate for you when using a statically typed language.

[2] This example comes from a real one where the numbers were identifiers used to distinguish compute engines in a grid.

November 17, 2016

Take the 10% code reduction challenge!

The Lone C++ Coder's Blog from The Lone C++ Coder's Blog

It might sound paradoxical, but in general, writing more code is easier than writing less code that accomplishes the same goals. Even if your code starts out clean, compact and beautiful, the code that is added later to cover the corner cases nobody thought of usually takes care of the code being well designed, elegant and beautiful. Agile programming offers a solution, namely constant refactoring, but who has time for that? That’s why I occasionally give myself the 10% code reduction challenge and I encourage you to do the same.

Posts

Mutt regex pattern modifiers

Migrate MelatiSite from CVS to github

Converting files from DOS to UNIX file formats using Emacs

Surprising Defaults – HttpClient ExpectContinue

the design and implementation of cyber-dojo

What happened to XEmacs?

Bit the bullet and upgraded my Mac Pro’s CPU

Switching to Manjaro Linux and getting an AMD RX 470 to work

Overly Prescriptive Tests

Take the 10% code reduction challenge!