No bugs journey episode 2: It’s a matter of values…
Read episode 1 first.
Here we are at episode 2, and time for another question. Of the three, which one do you value the most?
- Shipping on schedule
- Shipping with a given set of features
- Shipping with high quality
Write down your answer.
Ha ha! It was a trick question. Pretty much everybody is going to say, “it depends”, because it’s a tradeoff between these three. Let’s try an alternate expression:
- Build it fast
- Build a great product
- Build it right
All of these are obviously important. Hmm. Let’s try a different tact…
If you get in a situation where you need to choose between schedule, features, and quality, which one would you choose?
- Shipping on schedule with the right set of features but adequate quality
- Shipping late with the right set of features and high quality.
- Shipping on schedule and with high quality but with fewer features than we had hoped
Write down your answer.
Let’s talk about option #1. First off, it doesn’t exist. The reason it doesn’t exist is that there is a minimum quality level at which you can survive and grow as a company. If you are trying to hold features and schedule constant, as you develop features you are forcing the quality level down – it *has to* go down because something has to give if you get behind (short estimates, etc.). That means you start with a product that you just shipped, and then it degrades in quality as you do features, and then at some point you want to ship, so you realize you need quality, so you focus on getting back to *adequate*. Unfortunately, fixing bugs is the activity with the most schedule uncertainty, so there is no way you are shipping on time.
It’s actually worse than this. Typically, you’ve reached a point where your quality is poor and you know that there is no way that you are going to get the features done and reach acceptable quality, so you cut features.
And you end up with a subset of features that ships late with barely adequate quality. There’s an old saying in software that goes, “schedule/features/quality – pick 2”, but what few people realize is that it’s very easy to pick zero.
I hope I’ve convinced you that the first approach doesn’t work, and given that I’m saying this series is about “no bugs”, you probably aren’t surprised. Let’s examine the other options.
I’ve see #2 work for teams; they had a feature target in their head, built their software to high quality standards, and then shipped when they were done. This was common in the olden days when box products were common, when the only way to get your product out there was produce a lot of CDs (or even diskettes) and ship them out to people. It worked, in the sense that companies that used the approach were successful.
It is, however, pretty disruptive on the business as a whole; it’s hard to run a business where you don’t know:
- If new revenue is going to show up 1 year or 3 years from now
- When you’ll be spending money on sales and marketing
- Whether the market has any interest in buying what you build
Not to mention making it tough on customers who don’t know when they’ll get that bug fix and enhancement they requested.
Which leaves option #3, which through an amazing coincidence, is the one that is the most aligned with agile philosophy. Here’s one way to express it:
Given that:
- It is important to the rest of the business and our customers that we be predictable in our shipping schedule
- Bugs slow us down, make customers unhappy, and pose significant schedule risk
- The accuracy at which we can make estimates of the time to develop features is poor
- Work always expands (people leave the team or get sick, we need to spend time troubleshooting customer issues, engineering systems break), forcing us to flex somewhere
The only rational option that we have is to flex on features. If we are willing to accept that features don’t show up as quickly as we would like (or had originally forecast), it is possible to ship on time with high quality.
I chose the phrase “it is possible” carefully; it is possible to build such a system but not get the desired result.
Flexing on features effectively
Time for another exercise. Studies have shown that you will get a much better review of a plan if you spend some focused time thinking about how it might go wrong.
Thinking about the world I just described, what could go wrong? What might keep us from shipping on time with high quality?
Write down your answers.
Here’s my list:
- At the time we need to decide to cut features, we might have 10 features that are 80% done. If we move out all 10 of them to the next iteration, we have nothing to ship.
- We might have a hard time tracking where we are early enough to make decisions; most people have seen a case where all the features were on schedule until a week before shipping and suddenly 25% were behind by a week or more. If this happens, it may be too late to adapt.
- We might have teams that are dependent on each other; the only way to make my schedule is to cut work from team A, but that means that teams B & C can’t finish their work as planned, and they will have to adjust, using time we don’t have.
- This release was feature A and a few bugfixes, and feature A isn’t on schedule. We’ll look silly if we just shipped bugfixes.
(At this point I’m really hoping that you don’t have something important on your list that I’ve missed. If so, that’s what comments are for…)
How can we mitigate? Well, for the first one, we can focus on getting one feature done before we move on to the next one. That means that we would have 8 features 100% done instead of 10 features 80% done. This is one of the main drivers for the agile “work together as a team” approach.
This mitigation works for the second one as well. If we make our tracking “is the feature fully complete and ready to ship”, we can tell where we are (3/10 features current done and ready to ship (“done done” in agile terminology)) and we have a better chance of predicting where we are going. This is another driver for “work together as a team”. Note that for both the first and the second one, the more granular our features are, the easier it is to make work; it works great if the team has 5-10 items per iteration but poorly if it only has two. This is the driver for “small self-contained stories” and “velocity measurement” in agile.
I have a few thoughts on the third one. You can mitigate by using short cycles and having teams B and C wait until A is done with their work. You can try to break the work A does into parts so the part the other teams need can be delivered further. Or you can go with a “vertical team” approach, which works great.
For the fourth one, the real problem is that we put all of our eggs in one basket. Chopping feature A up will give us some granularity and the chance to get part way there. I also think that a shorter cycle will be our friend; if we are giving our customers updates every month, they will probably be fine with a message that says, “this month’s update only contains bugfixes”.
To summarize, if we have small stories (a few days or less) and we work on them sequentially (limiting how many we are working on at one time), our problems about tracking and having something good enough to ship become much more tractable. We can predict early what features are not going to make it, and simply shift them to the next cycle (iteration). That is our pressure-relief valve, the way that we can make sure we have enough time to get features completed on the shipping schedule with great quality.
The world isn’t quite as simple as I’ve described it here, but I’ve also omitted a number of advanced topics that help out with that, so I think it’s a pretty fair overview.
Before I go on, I’d like to address one comment I’ve heard in relation to this approach.
If we flex on features, then I’m not going to be able to forecast where we will be in 6 months
The reality is that nobody has ever been able to do that, you were just pretending. And when you tried, you were often spending time on things that weren’t the most important ones because of new priorities.
Instead, how about always working on whatever is the highest priority for the business, and having the ability to adjust that on a periodic basis? How does that sound?
The culture of commitment
Time for another question:
What are the biggest barriers to making this work in your organization?
Write down your answers.
I have a list, but I’m only going to talk about one, because it is so much more important than the rest. It’s about a culture of commitment.
Does your organization ask development teams to *commit* to being done in the time that they estimated? Does it push back when developer estimates are too large? Do developers get emails or visits from managers telling them they need to be done on time?
If so, you are encouraging them to write crappy code. You have trained them that being on the “not done” list is something to be avoided, and they will do their best not to avoid it. They can either work harder/longer – which has limited effectiveness and is crappy for the company in other ways – or they can cut corners. That’s all they can do.
Pressure here can be pretty subtle. If my team tracks “days done” and I have to update my estimate to account for the fact that things were harder than I thought, that puts pressure on me to instead cut corners.
This is one reason agile uses story points for estimation; it decouples the estimation process from the work process.
Changing the culture
Here are my suggestions:
- Get rid of any mention the words “committed” or “scheduled” WRT work. The team is *planning* what work they will attempt in the next iteration.
- Change the “are you going to be done?” interaction to ask, “is there anything you would like to move out to the next iteration?” Expect that initially, people aren’t going to want to take you up on this, and you may have to have some personal interactions to convince somebody that it’s really okay to do this.
- When you have somebody decide to move something out, make sure that you take public note of it.
“To adjust the amount of work to the capacity of the team and maintain shippable quality, the paypal feature has been moved out to the next iteration.”
- When teams/individuals check in unfinished features (buggy/not complete) instead of letting those features make it into the common build, force them to remove them or disable them. Make this part of the team-wide status report for the iteration (“the ‘email invoice’ feature did not meet quality standards and has been disabled for this iteration”).
- I highly recommend switching to a team-ownership approach. Teams are much more likely to change how they work and improve over time than individuals.
You will need to be patient; people are used to working in the old ways.
This cultural change is the most important thing you can do to reduce the number of bugs that you have.
These are all well and good – until you work in an industry that is controlled outside of the company on when you MUST ship. In a number of industries, the controlling factor is the government which mandates when something is due. Note that the government is not the client nor even the sponsor – just a controlling body. You end up with "Build it fast" as a mantra that no amount of work within the company can change.
What's episode 3 title?
I'm really enjoying this series so far, I believe it's a very important topic. I even wrote an article myself some time ago, describing various ways to ensure the team develops features that are complete. I wonder what would be your opinion on it: blog.rafalb.com/fighting-almost-done-syndrome
David – I don't have any relevant experience in that sort of environment, so you should probably ignore what I'm about to say. My experience is that building software with "no bugs" is faster than building it without, so I think it gives you a better chance of hitting a hard deadline.
Pavel – No idea yet. I think I have something like 10 topics hanging around.
Rafal – I'm glad you're enjoying it. I read your article, and I agree with pretty much all that you have there. I think I would focus a little more on the benefits of the team ownership, and I differ a bit on assigning all the small tasks out; I think it's good to enumerate all of them to make sure the team remembers, but I find that just putting a sticky note up on the kanban board is sufficient.
David, in the situation you describe, where ship date is fixed, is the set of features also fixed?
Even in that situation, I agree with Eric that no bugs is faster, so your high quality increases your chance of meeting the requirements. As Brian Ghiesler puts it, "quality is speed".
Of course, the *pressure* to meet the requirements will tend to compromise quality, which is unfortunate for everyone involved.
I agree 100% with David. Most of the software developers I know work in agencies who have contracted commitments to clients to deliver a given specification within a given timescale. They clients are not interested in quality vs features vs timescale – they want it all.
I have tried to persuade clients to adopt an ajile approach – and never yet succeeded. I've managed to persuade clients that there are risks on both sides, that it's not in their interests if they insist on a fixed price and we go bust because that API they want us to use is badly documented, doesn't work, and needs endless experimentation. But none have *EVER* decided it's fair to spread that risk. Mention time and materials to a client, and generally, they put the phone down on you.
And the problem is extreme: if you run an agency type business, you're always one project away from going bust. I say this because generally, clients want the security of knowing that they will pay a fixed price for the project. As a business delivering software development to external clients, this is incredibly challenging because there are so many reasons beyond your control that mean that your original time estimate that you based your quote will be junk. You play russian roulette every time.
Having gone bust once, and suffered very severe personal financial penalties because of this predicament, I ended up changing my business model completely. We now offer clients an upfront figure for an off-the-shelf-solution based on on an open source framework with established functionality, and ask them to take out a support contract (i.e. a monthly retainer) that we then use to add bespoke functionality on a time and materials basis. Eighteen months in, and we are at last making money, doing a great job for clients, and have a measure of security.
I've read a great deal on development methodologies, and never yet come across a method that addresses and embraces the cold, harsh realities of working in an agency environment where the CUSTOMER dictates the commercials. And until there is a method out there that is acceptable to both agencies and clients, then for me, they are so much hot air.