SK – Page 4 – Pertinent Observations

Bayes Theorem and Respect

Regular readers of this blog will know very well that I keep talking about how everything in life is Bayesian. I may not have said it in those many words, but I keep alluding to it.

For example, when I’m hiring, I find the process to be Bayesian – the CV and the cover letter set a prior (it’s really a distribution, not a point estimate). Then each round of interview (or assignment) gives additional data that UPDATES the prior distribution. The distribution moves around with each round (when there is sufficient mass below a certain cutoff there are no more rounds), until there is enough confidence that the candidate will do well.

In hiring, Bayes theorem can also work against the candidate. Like I remember interviewing this guy with an insanely spectacular CV, so most of the prior mass was to the “right” of the distribution. And then when he got a very basic question so badly wrong, the updation in the distribution was swift and I immediately cut him.

On another note, I’ve argued here about how stereotypes are useful – purely as a Bayesian prior when you have no more information about a person. So you use the limited data you have about them (age, gender, sex, sexuality, colour of skin, colour of hair, education and all that), and the best judgment you can make at that point is by USING this information rather than ignoring it. In other words, you need to stereotype.

However, the moment you get more information, you ought to very quickly update your prior (in other words, the ‘stereotype prior’ needs to be a very wide distribution, irrespective of where it is centred). Else it will be a bad judgment on your part.

In any case, coming to the point of this post, I find that the respect I have for people is also heavily Bayesian (I might have alluded to this while talking about interviewing). Typically, in case of most people, I start with a very high degree of respect. It is actually a fairly narrowly distributed Bayesian prior.

And then as I get more and more information about them, I update this prior. The high starting position means that if they do something spectacular, it moves up only by a little. If they do something spectacularly bad, though, the distribution moves way left.

So I’ve noticed that when there is a fall, the fall is swift. This is again because of the way the maths works – you might have a very small probability of someone being “bad” (left tail). And then when they do something spectacularly bad (well into that tail), there is no option but to update the distribution such that a lot of the mass is now in this tail.

Once that has happened, unless they do several spectacular things, it can become irredeemable. Each time they do something slightly bad, it confirms your prior that they are “bad” (on whatever dimension), and the distribution narrows there. And they become more and more irredeemable.

It’s like “you cannot unsee” the event that took their probability distribution and moved it way left. Soon, the end is near.

RG

Last night some colleagues and I were discussing the case of the Titan Submersible. For people who will be reading this after the news cycle has passed, this is basically a submersible that took people to see the debris of the Titanic, and then disappeared.

At the time of discussion, there was reportedly “20 hours of oxygen left” in the vessel, which meant rescue operations had to go on quickly. Then again, I’m writing this 23 hours after our conversation and there is no update yet, so I don’t know what that “20 hours means”.

In any case, someone in the group said “the worst thing that will happen is if someone panics. At that point, the rest of the people will have no option but to just kill this person”. I took a while to figure out what was happening, and then someone mentioned that when you panic, you tend to consume more oxygen.

The “20 hours of oxygen” was at “ground state”, with everyone remaining calm and consuming the average human amount of oxygen. However, if someone panicked, their rate of consumption of oxygen would go much higher, meaning the oxygen reserves will get drawn down much faster, thus lessening the chance of the others to be found.

So, from an expected value basis, it is rational for the rest of the people to kill the panicker, and give themselves a better chance of being found.

There was nobody from my JEE coaching factory in the group, so I didn’t talk about this there, but I got reminded of this story back from 1999 (I wrote JEE in 2000).

Our JEE factory had been making efforts to “imbibe us with fire in the belly”. As one of the teachers in the factory had told us in class, “naavu Kannadigarige aambode mosaranna koTTbiTTre khushhyaagiddbiDtivi” (if someone gives us Kannadigas falafel and curd rice, we’ll live happily forever, and we will forget about working hard).

And so there was this feeling that we need to be taught to be more competitive and ruthless, and part of the factory process involved giving us inspirational lectures to that effect.

“Ning kOpa baralva?” (“don’t you get angry?”), they would ask. They would ask us to imagine something that would make us angry, and then “channel that anger towards cracking JEE”. We needed to have that killer instinct, they would say.

Again, in the context of yesterday’s discussion on the Titan submersible and limited oxygen supplies, I got reminded of yet another of these inspirational speeches from our factory, about the killer instinct.

Remember that this was 1999. The Kargil War had just ended, and was still on everyone’s minds. I’m paraphrasing what one of the teachers said.

“Imagine you are in the army. There is a very good friend with you. You went through the defence academy together, and have always served together. Now you are at war.

The fight isn’t going very well and you both are hiding somewhere. And then your friend gets hit badly. He is alive but very very badly hurt and can’t move. And he can’t help but groan, and that means there is the risk of giving away your location to the enemy.

So what do you do? You put a bullet in his back and put him out of your misery. Yes, he is your friend. You have both served together for the longest time. But at that moment, you should be willing to shoot him because that is your only chance of survival.”

I don’t know what impact it had on us. The only impact it had on me is that it got etched in my super-normal long term memory. And in a very different, but sort of related context, I remembered it yesterday.

Oh, and when we went to IIT, we found that there was a term for this – “RG”, from “relative grading”. Because grading in most courses was relative, one way of getting better grades was to make sure others performed worse than you (even if you couldn’t perform better).

This took bizarre forms – hiding books in the library so nobody could find them; refusing to share your notes with your classmates; doing much more than required in your course assignments and term papers (this was very very common in my Computer Science class); flattening the tyres of your classmates’ cycles on exam days; teaching others the wrong formulae; and so on.

So in that sense, our factory teachers knew what they were prepping us for!

Hybrid work

I’m in a job that can broadly be described as “hybrid”. The mandate from HR is that we are are “expected to be in office three days a week, and live in the same city as the office”. Nobody really checks how often people go in to office, though I do end up going three times a week on average.

Of late, some tech “gurus” have taken on dunking on hybrid work. DHH of 37signals / Basecamp (I quite like his blog, in general) wrote that “hybrid combines the worst of in-person and remote“. Then, Paul Graham wrote some tweets on remote work. I quite like this one:

Why were all these smart people fooled? Partly I think because remote work does work initially, if you start with a system already healthy from in-person work. (It's like communism in that respect.) And partly because it seemed to solve recruiting, which is always a bottleneck.

— Paul Graham (@paulg) June 10, 2023

Back to hybrid work – I’m in a hybrid role now, where I go into office about three days a week on average, and stay home the other two days (in general, because Monday is crowded with long online meetings, and another day to do some “thinking work”). Different people in my company have different such strategies, and all come into office on their own schedules.

This is not the first time I’m doing “hybrid”. During my rather long independent consulting career, I largely worked from home but travelled to clients’ offices ever so often (once a week if in Bangalore; one week a month if not; on average). It was about getting the best combination of focussed work and collaboration. It worked then, and it works now.

In fact, as far back as 2007 I was in a hybrid office. I was in what is now called a “global capability centre”, and interacting with headquarters in Texas meant being available for calls later in the evening. Consequently, we could work from home a few days a week as long as we were available for these calls.

Coming as it did at the beginning of my career, it was a disaster. I slacked like nobody’s business. Less time spent in office meant less time understanding parts of the business not directly concerned with what I was working on. Most of my development in that period happened due to my independent reading and writing, rather than due to my work.

Now, once again, I’m in a company with “multiple headquarters”. This means that irrespective of where you are, you end up spending a considerable amount of time on video calls with people in other locations. According to DHH, video calls when you are in office is a waste of office time. I agree with him there. The way I manage is through my schedule.

Of course, it helps that I have a reputation in office that I don’t like to do unnecessary meetings – and all matters need to be resolved to the extent possible in text messages or email. This means I spend less time on video calls than many of my colleagues, and when I find a lot of them appearing on a day, i spend that day at home.

Also, I have an unspoken agreement with my (rather small) team on days of the week when we’ll meet in office, and so the technical discussions I find so difficult to have online can be had in person.

Hybrid primarily works because of optionality (a rather underappreciated concept). In my line of business, things can get so technical that there is a limit on the complexity of discussions that can be had online. Similarly, things can get so technical that we need undisturbed alone time to think through some of the solutions.

Hybrid works because it allows for both – it allows you to have your me time for your deep thinking, and the optionality of summoning a teammate to office “tomorrow” for some deep collaboration. The former is unavailable in an all-in office; the latter is not possible if you’re fully remote (I’ve experienced this during the pandemic years).

Yes, hybrid means you need to live within commuting distance of office (sometimes during interviews, I see candidates furiously googling for “richmond circle” or “residency road” when I tell them our office is there. It’s a strong signal that they’re not going to join 😛 ). However, that you only need to commute twice a week (rather than 5 times a week) means you can choose to live a little bit farther.

Yes, it does make hiring harder (compared to all-remote), but once hired, people can be far more productive in a hybrid model. With the option of doing deep work without the danger / fear of someone poking you (this literally happened to me yesterday) when you’re in the middle of deep work!

So yes, put me down as someone who likes the hybrid model of work.

Inverse Endorsements

The main purpose of a brand endorsing an entity – either a person or a team or an event – is so that people who associate themselves with, or simply follow, the latter, will gain awareness of the brand. For example, if you think of “Philips top 10”, every time you think of song countdown shows on prime time TV, you think of Philips.

A lot of times it works. For example, in 2005 (after the Champions League Semis first leg) I started following Liverpool FC. I quickly found that their shirt sponsor was the Danish beer brand Carlsberg. A couple of weeks later, I’d gone for drinks with my then colleagues, and was asked what beer I would have. Having no basis to make my decision (I wasn’t much of a beer drinker then), I went for Carlsberg, which was “my (newfound) team”‘s brand.

This is all basic stuff.

However, sometimes the causation can flow the other way as well. This especially has to do with little-known brands that are largely in the viewers’ minds because of their association with one single entity. Long back I had written about “triangle marketing” – where people will notice an entity if they learn of it from two or more independent sources. In the absence of a second source in which you learn about a company or brand, your only association of it is due to the endorsement, and you start associating the two together.

I started watching the English Premier League sometime in 2006 – before that most of my football watching had been restricted to World Cups and European Championships (and the semis of the 2005 champions league). Since it was from a foreign country (i’d interned in London in 2005 but then chose to take up a job in India post my graduation in 2006), I wasn’t aware of many of the brands who had their logos on the teams’ shirts. And so there was no other way to learn about the brands, and I started instantly associating them.

For example, I’ve never been into running and the likes, so it wasn’t until 2012 or so till I learnt of Garmin as being a very good fitness band. However, I’d seen plenty of the brand in the mid-noughties, on the Middlesbrough jersey.

Even now, when I see Garmin, I first think of Middlesbrough. Because my mind associated these two brands, but not the causal direction. In other words, the mind registers the correlation, not the causation.

Then there is the Indian dairy brand Akshayakalpa. I like their ghee and cheddar, but find their Paneer inferior to Milky Mist. Nevertheless, a few years back I first heard of them when they sponsored this young Indian grandmaster named Nihal Sarin. Now every time I see Akshayakalpa (even when I’m buying their ghee or cheddar, or paneer), I think of Nihal Sarin.

There are many other such examples that I think of from time to time – when I see the sponsoring brand and think of the sponsored brand, but I’m not able to remember those right now, so I’ll stop here.

PS: I remembered now what the other inverse endorsement is. I was watching Ponniyan Selvan 2 (an atrocious movie) last weekend, and saw it was by “Lyca Productions”. My immediate thought was “this is the company behind Lyca Kovai Kings“

JEE Math!!

Of late I’ve been feeling a little short in terms of intellectual stimulation. Maybe it was my decision at work to hunker down and focus on execution and tying up loose ends this quarter, rather than embarking on fresh exploratory work. Maybe it’s just that I’m not meeting too many people.

The last time I REMEMBER feeling this way was in May-June 2007. I clearly remember the drive (I was in my old Zen, driving past Urvashi Theatre on an insanely rainy Sunday afternoon, having met friends for lunch) where I felt this way. Back then, I had responded by massively upping my reading – that was the era of blogs and I had subscribed to hundreds on my bloglines (remember that?). I clearly remember feeling much better about myself by the end of that year.

Now, I continue to read, and read fairly insightful stuff. I’m glad that Substack has taken the place that blogs had in the noughties (after the extreme short-form-dominated 2010s), and have subscribed (for free) to a whole bunch of fairly interesting newsletters.

What I miss, though, is the stimulation in conversations. Maybe it’s just that I’m having way fewer of them, and not a reflection of the average quality of conversations I’ve been having. I’ve come to a stage where I don’t even know who I should meet or what I should talk about to stimulate me.

With that background, I was really happy to come across my (2000) JEE maths paper on Twitter. Baal sent it to me this afternoon when I was at work. Having got home, had dinner and dessert and sent off the daughter to bed, I got to it.

Memories of that Sunday morning in Malleswaram came flooding back to me. Looking back, I’m impressed with my seventeen year old self in terms of the kind of prep I did for the exam. For the JEE screening that January, I had felt I had peaked a week too early, so I took an entire week off after my board exams so that I could peak at the right time.

For a few days before the exam, I practiced waking up really early, so that I could change my shit rhythm (the exam started at 8am in Malleswaram, meaning we would have to leave home by 7. Back then, you didn’t want to go to any toilets outside of home). The menu for the day had been carefully pre-planned (breakfast after the maths exam, lunch after physics).

The first fifteen minutes or so of this maths paper I had blanked out. And then slowly started working my way from the first question. I remember coming out of the exam feeling incredibly happy. “I’m surely getting in, if I don’t screw up the other papers”, I remember telling some friends.

Anyway, having seen this paper, I HAD to attempt it. I didn’t bother with any “exam conditions”. I put on a “heavy metal” playlist on spotify, took out my iPad and pencil, and started looking at the questions.

Again courtesy https://twitter.com/ravihanda

I took 15 minutes for the first part of the first question. While I was clearly rusty, this was a decent start. Then I started with the second part of the first question, got stuck and gave up.

I started browsing Twitter but decided the paper is more interesting. The second question was relatively easy. I left the third one (forgotten my trigonometry), but found the fourth one quite easy (and I remember from my JEE about encountering Manhattan Distance ). The second half I didn’t focus so much on today, but was surprised to see the eighth question – with full benefit of hindsight, it’s way too easy to make it to the JEE!

I didn’t bother attempting all the questions, of “completing the paper” in any way. I didn’t need that. I haven’t decayed THAT MUCH in 23 years. And this was some nice intellectual stimulation for a weekday evening!

PS: I don’t think I’ll feel remotely as kicked if I encounter my physics or chemistry IIT-JEE papers.

PS2: Now one of my school and IIT classmates is pinging me on WhatsApp discussing questions. And i’m finding bugs in my (today’s) answers

New blood joins this team

I intended to write this a year ago, when Sadio Mane left Liverpool after six brilliant years at the club. There was much heartbreak among the club fan base about Mane leaving, and a lot of people saw it as a failure on the part of the management and ownership in terms of not being able to keep him.

Now, a year on, I admit that Darwin Nunez hasn’t quite set the club on fire (though I personally quite like him), but as a general principle, this kind of “freshening up” is a highly necessary process in a team, if you need to avoid stagnation.

A month or two back, I was watching some YouTube video on “Liverpool’s greatest Premier League goals against Manchester City” (this was just before the 4-1 hammering at the Etihad). As the goals were shown one by one, I kept trying to guess which season and game it was in.

There were important clues – whether Firmino wore 9 or 11, whether Mane wore 19 or 10, the identity of some players, the length of Trent Alexander Arnold’s hair, my memory of the scoreline from that game, etc. (Liverpool always wear the home Red at the Etihad, so the colour of the away kit wasn’t a clue).

However, for one goal I simply wasn’t able to figure out which season it was. There was TAA wearing 66, Fabinho, Henderson, the fab front three (Firmino-Mane-Salah, wearing 9-10-11 respectively) and Robertson. That’s when it hit me that for a fairly long time, a large part of Liverpool’s team had stayed constant! There was very little change at the club.

Now, there are benefits to having a consistently settled team (as the fabulous 2021-22 season showed), but there is also the danger of stasis. In something like football where careers are short, you don’t want the whole team “getting old together”. In the corporate world, people can get into too much of a comfort zone. And cynicism can set in.

Good new employees are always buzzing with ideas, fearless about what has been rejected before and who thinks how. As people spend longer in the organisation, though, colleagues become predictable and certain ways of doing things become institutionalised. Sooner than you know it, you would have become a “company man”, (figuratively) wearing the same white shirt and blue suits as your fellow company men, and socialising with your colleagues at the (figurative) company club.

There can be different kinds of companies here – some companies allow people to retain a lot of their individuality; and there the “decay” into company-manhood is slower. In this kind of a place, the same set of people can stay together for longer and still continue to innovate and add significant value to one another.

Other companies are less forgiving, and you very quickly assimilate, and lose part of your idiosyncrasy. Insofar as innovation comes out of fresh ideas and thinking and unusual connections, these companies are not very good at it. And in such companies, pretty much the only way to keep the innovative wheel going and continue to add value is by bringing in fresh blood well-at-a-faster-rate.

Putting it another way, if you are a cohesive kind of company, some attrition may not actually be a bad thing (unless you are growing rapidly enough to expand your team rapidly). To grow and innovate, you need people to think different.

And you get there either by having the sort of superior culture where existing employees continue to think different long after they’ve been exposed to one another’s thoughts; or by continuing to bring in fresh employees.

There is no other way.

Jordan “visa interview on arrival”

The peak-end hypothesis means that we’ve come back from our trip to Jordan really happy. It was a brilliant and diverse experience, involving Roman History (Jerash, Amman Citadel), Christian Theology (Mount Nebo, Madaba), hill climbing (at Petra – more on that later), wilderness (Wadi Rum) and a resort and floating on water (Dead Sea).

However, preceding all this was an absolutely atrocious “process” that we had to go through at the Amman airport. I waited to return to India to write this.

Nominally Jordan has “visa on arrival” for Indians. This means you don’t need to get a visa before you travel. However, what they don’t really tell you is that it doesn’t work the same way as visas on arrival in other countries – such as Hong Kong or Thailand or Maldives (based on my limited experience), where you enter the passport control, get your passport stamped, maybe pay a fee and move on.

In Jordan that’s not the way it works. We had pre-bought a “Jordan Pass” that includes fees for the visa and to some of the historic attractions in the country. Upon landing at Amman Airport, we encountered a line saying “for jordan pass / visa on arrival”. And that’s where the arbitrariness started.

Firstly, it is the “border police” who man this, unlike India where it’s bureaucrats from the external affairs ministry. More importantly, there is no “process”. You go to the window where the person there leafs through the passport looking for active visas – if you have a valid US or UK or Schengen or even Saudi visa, your visa gets printed on a paper and you get waved on to passport control. In the absence of all this, you are asked to “wait there”, without any further direction.

Then we were asked to go to “police in room 1”, which was some 200m away. This is where we had our first cultural shock of the trip – there was a heavy smell of cigarettes there, and we entered to see cops smoking there as they were talking to us.

The same process repeated – the cops leafing through the passport to see if there are any other valid visas, and then when not finding anything, asking us to “wait”. Again there was no definite timeline or process. We waited for a bit (during which the cops did namaz, and presumably stopped smoking while doing so), and then went in again and asked. Again we were asked to “wait”.

The cops all had identical uniforms so it was impossible for us to know who was “superior” or to escalate. After a few rounds of such waiting, my wife finally put senti saying we have a small child who is hungry (thankfully our daughter managed to produce a reasonably sad face at that time, though she was unable to cry), and finally they started considering our application.

We had printed out all our hotel reservations (I’d read on some forum that it might be required at “immigration” – though those fora didn’t mention how arbitrary the process is) and handed them over to the police, who went through them. One cop got convinced (I don’t know if it helped that we had booked in a few expensive hotels; he even asked us for our salaries and what work we do, etc.) and we got sent to another one. Yet again, and this was not the first time we were encountering him, he started the process all from the beginning, looking for valid visa stamps in our passports!

And then he started filling out some application. It was the first time I had seen someone actually write right to left, so it was mildly amusing (and it’s interesting that finally he stapled all our documents at the top RIGHT corner). He asked for our return tickets, which we hadn’t printed out, so I showed him on the phone. He took the phone and put it on the xerox machine and took a “copy” of the tickets! And then he stapled everything together and asked us to “wait”. Apparently his “boss” was supposed to call him (this guy took a picture of the application he had written and sent it to someone).

Then five minutes later, he gave us a small chit of paper and asked us to go back to the Visa On Arrival counter. I assumed we were almost through and messaged our driver that “we should be out soon”.

I don’t know if the guy at the visa on arrival counter was incompetent, but it’s not funny how many times he entered details of the same passports. In the middle of this, one lady walked near his counter, and he got busy talking to her while “processing” our stuff. And entered details many more times.

He got thoroughly confused because we had two Jordan Passes, and had to pay for our daughter’s visa (since she didn’t need a ticket to see the monuments this made more sense). In the middle he suddenly picked up all our passports and walked over to the police room. By now I was thoroughly psyched and had already swallowed my panic attack pill.

After yet another inordinate delay, he printed out our visas and sent us to passport control (a few metres away). Again we thought we were done, only to be told he had printed out my visa wrong (remember I said he entered details multiple times). Since the distance there was short, the passport control officer called the visa on arrival guy over and he took my passport YET AGAIN, and started entering details on his computer.

Another ten minutes later, he brought over my passport and visa to the passport control, where my passport was duly stamped and we were sent on our way.

Our bag was there in one corner, and we picked it up and walked out, feeling glad that we had booked a driver for the length of the trip who would be available for any further interfacing with Jordanian cops.

Overall, the whole process was rather bizarre. I’ve waited hours in line at Heathrow to be let in. I’ve visited the US, again waiting for a long time at JFK and even being pulled over for a customs check. None of that was even remotely comparable to our experience at Queen Alia International Airport last Tuesday.

If Jordan wants to outsource its visa process to more developed countries, that is fine, but they need to make it explicit. Turkey, for example, offers visa on arrival to Indians with a valid US or Schengen visa, but everyone else is expected to apply for a visa before travel.

Jordan says no such thing, and instead subjects people to arbitrary waits without any process in a smoky police station in the airport. Which is really really bizarre.

Round Tables

One of the “features” of being in a job is that you get invited to conferences and “industry events”. I’ve written extensively about one of them in the past – the primary purpose of these events is for people to be able to sell their companies’ products, their services and even themselves (job-hunting) to other attendees.

Now, everyone knows that this is the purpose of these events, but it is one of those things that is hard to admit. “I’m going to this hotel to get pitched to by 20 vendors” is not usually a good enough reason to bunk work. So there is always a “front” – an agenda that makes it seemingly worthy for people to attend these events.

The most common one is to have talks. This can help attract people at two levels. There are some people who won’t attend talks unless they have also been asked to talk, and so they get invited to talk. And then there are others who are happy to just attend and try to get “gyaan”, and they get invited as the audience. The other side of the market soon appears, paying generous dollars to hold the event at a nice venue, and to be able to sell to all the speakers and the audience.

Similarly, you have panel discussions. Organisers in general think this is one level better than talks – instead of the audience being bored by ONE person for half an hour, they are bored by about 4-5 people (and one moderator) for an hour. Again there is the hierarchy here – some people won’t want to attend unless they have been put on the panel. And who gets to be on the panel is a function of how desperate one or more sponsors is to sell to the potential panelists.

The one thing most of these events get right is to have sufficient lunch and tea breaks for people to talk to each other. Then again, these are brilliant times for sponsors to be able to sell their wares to the attendees. And it has the positive externality that people can meet and “network” and talk among themselves – which is the best value you can get out of an event like this one.

However, there is one kind of event that I’ve attended a few times, but I can’t understand how they work. This is the “round table”. It is basically a closed room discussion with a large number of invited “panellists”, where everyone just talks past each other.

Now, at one level I understand this – this is a good way to get a large number of people to sell to without necessarily putting a hierarchy in terms of “speakers” / “panellists” and “audience”. The problem is that what they do with these people is beyond my imagination.

I’ve attended two of these events – one online and one offline. The format is the same. There is a moderator who goes around the table (not necessarily in any particular order), with one question to each participant (the better moderators would have prepared well for this). And then the participant gives a long-winded answer to that question, and the answer is not necessarily addressed at any of the other participants.

The average length of each answer and the number of participants means that each participant gets to speak exactly once. And then it is over.

The online version of this was the most underwhelming event I ever attended – I didn’t remember anything from what anyone spoke, and assumed that the feeling was mutual. I didn’t even bother checking out these people on LinkedIn after the event was over.

The offline version I attended was better in the way that at least we could get to talk to each other after the event. But the event itself was rather boring – I’m pretty sure I bored everyone with my monologue when it was my turn, and I don’t remember anything that anyone else said in this event. The funny thing was – the event wasn’t recorded, and there was hardly anyone from the organising team at the discussion. There existed just no point of all of us talking for so long. It was like people who organise Satyanarayana Poojes to get an excuse to have a party at home.

I’m wondering how this kind of event can be structured better. I fully appreciate the sponsors and their need to sell to the lot of us. And I fully appreciate that it gives them more bang for the buck to have 20 people of roughly equal standing to sell to – with talks or panels, the “potential high value customers” can be fewer.

However – wouldn’t it be far more profitable to them to be able to spend more time actually talking to the lot of us and selling, rather than getting all of us to waste time talking nonsense to each other? Like – maybe just a party or a “lunch” would be better?

Then again – if you want people to travel inter-city to attend this, a party is not a good enough excuse for people to get their employers to sponsor their time and travel. And so something inane like the “round table” has to be invented.

PS: There is this school of thought that temperatures in offices and events are set at a level that is comfortable for men but not for women. After one recent conference I attended I have a theory on why this is the case. It is because of what is “acceptable formal wear” for men and women.

Western formal wear for men is mostly the suit, which means dressing up in lots of layers, and maybe even constraining your neck with a tie. And when you are wearing so many clothes, the environment better be cool else you’ll be sweating.

For women, however, formal wear need not be so constraining – it is perfectly acceptable to wear sleeveless tops, or dresses, for formal events. And the temperatures required to “air” the suit-wearers can be too cold for women.

At a recent conference I was wearing a thin cotton shirt and could thus empathise with the women.

Shrinking deadlines

I’m reminded of this old joke/riddle, which also happened to feature in Gowri Ganesha. “If a 1 metre long sari takes 1 hour to dry in the sun, how long will and 8 metre long sari take to dry?”.

The instinctive answer, of course, is 8 hours, while if you think about it (and assume that you have enough clothesline space to not need to fold), the correct answer is likely to be 1 hour.

Now this riddle is completely unconnected to do with the point of the post, except that both have to do with time.

And then one day you find, ten years have got behind you.
No one told you when to run. You missed the starting gun.

Ok enough distractions. I’m now home, home again.

Modern workspaces are synonymous with tight deadlines. Even when you give a conservative estimate on how long something will take, you get asked to compress the timelines further. If you protest too much and say that there is a lot to be done, sometimes you might get asked to “put one more person on the job and get it done quickly”.

This might work for routine, or “fighter” jobs – for example, if your job is to enter and copy data for (let’s say) 1000 records, you can easily put another person on the job, and the entire job will be done in about half the time (allowing for a little time for the new person to learn the job and for coordination).

As the job gets more complex, the harder it gets. At one level, there is more time to be spent by the new person coming into the job. Then, as the job gets more complex, it gets harder to divide and conquer, or to “specialise”. This means there is lesser impact to the new person coming in.

And then when you get closer and closer to the stud end of the spectrum, the advantage of putting more people to get the work done faster get lesser and lesser. There comes a point when the extra person actively becomes a liability. Again – I’m reminded of my childhood when occasionally I would ask my mother if she needed help in cooking. “Yes, the best way for you to help is for you to stay out of the kitchen”, she would say.

And then when the job gets really creative, there is a further limit on compression – a lot of the work is done “offline”. I keep telling people about how I finally discovered the proof of Ramsey’s numbers (3,3) while playing table tennis in my hostel, or how I had solved a tough assignment problem while taking a friend’s new motorcycle for a ride.

When you want to solve problems “offline” (to let the insight come to you rather than going hunting for it – I had once written about this) – there is no way to shorten the process. You need to let the problem stew in your head, and hope that some time it will get solved.

There is nothing that can be done here. The more you hurry up, the less the chances you give yourself of solving the problem. Everything needs to take its natural course.

I got reminded of it when we missed a deadline last Friday, and I decided to not think about it through the weekend. And then, an hour before I got to work on Monday, an idea occurred in the shower which fixed the problem. Even if I’d stressed myself (and my team) out on Friday, or done somersaults, the problem would not have been solved.

As I’d said in 2004, quality takes time.

Pre-trained models

On Sunday evening, we were driving to a relative’s place in Mahalakshmi Layout when I almost missed a turn. And then I was about to miss another turn and my wife said “how bad are you with directions? You don’t even know where to turn!”.

“Well, this is your area”, I told her (she grew up in Rajajinagar). “I had very little clue of this part of town till I married you, so it’s no surprise I don’t know how to go to your cousin’s place”.

“But they moved into this house like six months ago, and every time we’ve gone there together. So if I know the route, why can’t you”, she retorted.

This gave me a trigger to go off on a rant on pre-trained models, and I’m going to inflict that on you now.

For a long time, I didn’t understand what the big deal was on pre-trained machine learning models. “If it’s trained on some other data, how will it even work with my data”, I wondered. And then recently I started using GPT4 and other similar large language models. And I started reading blogposts on how with very little finetuning these models can do “gymnastics”.

Having grown up in North Bangalore, my wife has a “pretrained model” of that part of town in her head. This means she has sufficient domain knowledge, even if she doesn’t have any specific knowledge. Now, with a small amount of new specific information (the way to her cousins’s new house, for example), it is easy for her to fit in the specific information to her generic knowledge and get a clear idea on how to get there.

(PS: I’m not at all suggesting that my wife’s intelligence is artificial here)

On the other hand, my domain knowledge of North Bangalore is rather weak, despite having lived there for two years. For the longest time, Mallewaram was a Chakravyuha – I would know how to go there, but not how to get back. Given this lack of domain knowledge, the little information on the way to my wife’s cousin’s new house is not sufficient for me to find my way there.

It is similar with machines. LLMs and other pre-trained models have sufficient “generic domain knowledge” in lots of things, thanks to the large amounts of data they’ve been trained on. As a consequence, if you can train them on fairly small samples of specific data, they are able to generalise around this specific data and learn around them.

More pertinently, in real life, depending upon our “generic domain knowledge” of different domains, the amount of information that you and I will need to learn a certain amount about a certain domain can be very very different.

Everything is context-sensitive!