Once upon a time

A few months back, someone sent me this “pixar format” of storytelling.

While it makes sense, I have deep-seated insecurities regarding this format, going back to when I was in “upper kindergarten” (about 5 years old).

Until I was 14 or so, I had a pronounced stutter. It was very rare until then that I would win any prizes in speaking events even though I was comfortably the class topper in academics – basically I couldn’t speak. The mystery got unlocked when some teacher wondered if I stuttered because I “thought faster than I could speak”. That one remark made me conscious, and helped me slow down, and I remember pretty much cleaning up the speaking events prizes in school the following year.

Anyways, ten years before that I couldn’t speak. On top of that I couldn’t remember. I mean I could remember obscure things (for a five year old) such as the capital of Angola or the inventor of the telescope, but I couldn’t remember a coherent passage of text.

And one such passage of text that I first needed to mug up (and remember) and then speak it out (double nightmare) happened to be in the above (Pixar) format. There was a storytelling session in school for which we had to mug up stories and then tell it out in class.

I don’t exactly remember the text of the story (well I couldn’t remember it in 1987-88, so what chance do I have of remembering it now?), but it went something like this.

Once upon a time, there were four cows who lived in the jungle.

Every day, they grazed together. So if a tiger attacked, they could get together and chase it away.

One day, the cows quarrelled among one another.

Because of that, they started grazing separately.

Because of that, it was now possible for the tiger to take them on one-on-one.

Until finally, one day, the tiger attacked the cows one by one and ate up all of them.

Don’t ask me how a tiger could eat four cows in a day. I remember struggling like crazy to remember this story and speak it out. I remember that my father tried to make me mug it up several times during one weekend, after which I was supposed to speak it out in school.

I don’t remember how well or badly I spoke it out. However, what lasted was that this kind of stories started giving me nightmares. From then on, I developed a fear of the phrase “once upon a time”. Any story that started with “once upon a time” were scary to me.

I remember this one day in school when one classmate was asked to narrate a story. He went up to the front of the class and started with “one day … “. That was liberating – that not every story needed to start with once upon a time was a massive relief to me.

It’s funny the kind of things we remember from childhood, and the kind of seemingly innocuous things that have a long-term impact on us.

Telling stories with data

I’m about 20% through with The Verdict by Prannoy Roy and Dorab Sopariwala. It’s a fascinating book, except for one annoyance – it is full of tables that serve no purpose but to break the flow of text.

I must mention that I’m reading the book on the Kindle, which means that the tables can pose a major annoyance. Text breaks off midway through one page, and the next couple of pages involve a table or two, with several lines of text explaining what’s in the table. And then the text continues. It makes for a rather disruptive reading experience. And some of the tables have just one data point – making one wonder why it has been inserted there at all.

This is not the first book that I’ve noticed that makes this mistake. Some of the sports analytics books I’ve read in recent times, such as The Numbers Game also make the same error (I read that in print, and still had the same disruption). Bhagwati and Panagariya’s Why Growth Matters is similarly unreadable. Tables abruptly inserted into the middle of text, leading to the reader losing flow in the reading.

Telling a data story in book length is a completely different challenge to telling one in article length. And telling a story with data is a complete art form. When you’re putting a table there, you need to be able to explain why that table is important to the story – rather than putting it there just because it seems more rigorous.

Also the exact placement of the table (something that can’t be controlled well in Kindle, but is easy to fix in either HTML or print) matters –  the table should be relevant to the piece of text immediately preceding and succeeding it, in a way that it doesn’t disrupt the reader’s flow. More importantly, the table should be able to add value at that particular point – perhaps building on something that has been described in the previous paragraph.

Book length makes it harder because people don’t normally expect tables and figures to disturb their reading flow when reading something of book length. Also, the book format means that it is not always possible to insert a table at a precise point (even in print, where pagination is an issue).

So how do you tell a book length story with data? Firstly, be very stingy about the data that you want to show – anything that doesn’t immediately add value should be banished to the appendix. Even the rigour, which academics might be particular about, can be pushed to the end notes (not footnotes, since those can be disruptive to flow as well, turning pages into half pages).

Then, once you know that showing a particular table or graph is inevitable to telling the story, put it either in the beginning or the end of a chapter. This way, it doesn’t break the reader’s flow. Then, refer to individual numbers in the middle of the text without having to put the entire table in there. Unless each and every data point in the table is important, banish it to the endnotes.

One other common mistake (I did it in my piece in Forbes published yesterday) is to put a big table and not talk about it. It only seeks to confuse the reader, who starts looking for explanations for everything in the table in later parts.

I guess authors and analysts tend to get possessive. If you have worked hard to produce insights from data, you seek to share as much of it as possible. And this can mean simply dumping data all the data in the piece without a regard for what the reader will do with it.

I’m making a note to myself to not repeat this mistake in future.