When is AIML worth it?

Summary: Heuristic microeconomic models that show under what conditions ML/AI increases profits.

Recently I’ve been exploring opportuntities in AI/ML X Biotechnology in the context of startups. Several companies have been founded on the premise that artificial intelligence and machine learning can help speed up the discovery process of both drugs, delivery vehicles and targets. If you read twitter or the blogosphere, the potential of new AI/ML techniques to delivery value seems obvious. However, when I talk to biologists on the ground, it seems like we’re still pretty far from actually using ML systems for drug discovery and it’s not totally clear if it will actually generate that much economic value above current techniques (namely random screens or a biologist/chemist using their knowledge to design molecules).

I wanted to try to understand where and when the ML tools that have been developed create “economic value,” in the context of increasing profits. What will follow is an attempt to quantiatively model what properties a ML pipeline needs to improve profits. There is a tradeoff between improved accuracy increasing revenut and the costs of training and serving computational models.

We’ll explore two cases where AI/ML systems are currently seen as viable strategies: targeting advertisements and optimizing drug design. In both we’ll start with a general model with generally defined functions to model the profit change from implmenting such a system. Then we’ll plug in some simple functions to get some order-of-magnitude estimates for what conditions are required to make an AI/ML system profitable.

Targeting Advertisement with Classifiers

Every time we open the internet we are bombarded with advertisements: at the top of google results, in the middle of a recipe, or at the bottom of an article. The internet advertising economy is massive, powering the profits of the major tech companies for the last two decades. Can ML push those profits higher? The massive investment in research by firms such as Google and Microsoft seems to suggest so. Let’s dig in!

A general model

The profit, $ P $, a firm generates is the difference between the revenues, $R$, and the costs $C$.

\[P = R - C\]

Let’s start with the revenue side. For simplicity, lets say a company sells one product. It can serve ads to customer $i \in 1,…,N$. Lets say the company has $K$ different advertisements and can serve each customer any $a \in 1,…,K$. Each customer has a probability of clicking through on the ad served and purchasing the product $p_{i,a}$,and will spend some amount $q_i$. So under some strategy $\mathcal{S}$ that maps customer $i$ to ad $a(\mathcal{S},i)$, the revenue $R(\mathcal{S})$ is:

\[R(\mathcal{S}) = \sum_{i = 0}^{N} p_{i, a(\mathcal{S},i)} q_i\]

What we’re really interested in the change in profit, and so the change in revenue (and later costs). If we switch from some old strategy $\mathcal{S_0}$ to a new “AI” strategy $\mathcal{S_{AI}}$, the change in revenue, assuming the purchasing amount remains constant, is:

\[\Delta R = R(\mathcal{S_{AI}}) - R(\mathcal{S_0}) = \sum_{i = 0}^{N} (p_{i, a(\mathcal{S_{AI}},i)} - p_{i, a(\mathcal{S_0},i)}) q_i\]

Now let’s turn our attention to the cost side. A general model of the cost of AI is hard to pin down specifically. For this exercise, we’ll decompose the costs of some AI strategy into three parts:

Data collection and storage, $DC$
Model training, $MT$
Model deployment, $MD$

The total cost is then:

\[C(\mathcal{S_{AI}}) = DC + MT + MD\]

Each part can take on different forms depending on the strategy. At a high level, we’d expect data collection and storage to increase as the amount of data points increases but with decreasing marginal costs. Model training could depend on the number of datapoints, but likely at a lower rate than data collection. Model deployment we might expect to scale linearly with the number of customers we serve. I’m not an expert in how ad services work, so for this exercise lets abstract the original costs as $C(\mathcal{S_0})$.

So when is an AI strategy, $\mathcal{S_{AI}}$ beneficial? From a business perspective, we want the change in profit to be positive:

\[\Delta P = \Delta R - \Delta C > 0\]

\[\Delta R > \Delta C\]

Mostly obvious stuff! Plugging in our terms:

\[\sum_{i = 0}^{N} (p_{i, a(\mathcal{S_{AI}},i)} - p_{i, a(\mathcal{S_0},i)}) q_i > DC(\mathcal{S_{AI}}) + MT(\mathcal{S_{AI}}) + MD(\mathcal{S_{AI}}) - C(\mathcal{S_{0}})\]

We now have a “framework” (s/o consulting interviews) to decide if switching to an “AI strategy” is a good business decision! A business might hire some analysts to project the costs and revenues given some real life models to make these decisions. Going forward, I’ll try some basic functional forms for these terms to get a high-level understanding of which conditions lead to more or less favorable results for our AI strategy.

Targeting Advertisement with Classifiers

A general model

A toy model