Category Archives: Data Science

Your ICP Starts as an Aspiration and Becomes a Regression

Posted on August 30, 2025 | 3 comments

The concept of an ideal customer profile (ICP) has been around for a long time, but like its cousin, the minimum viable product (MVP), it is often misunderstood. In this post, I’ll offer some background commentary on the ICP concept and then build into one of my favorite sayings: your ICP starts out as an aspiration and becomes a regression.

There are four common questions around ICPs. Here they are, along with my answers:

Is the ICP about a person or a firm? Both. It should include firmographic as well as role (or persona) information. Example: VPs of sales at technology companies between $500M and $2B in revenues. Here, we included the size and industry of the company along with the target buyer’s title.
Should an ICP include a problem to be solved? Yes. VPs of sales have lots of different problems from recruiting to training to pipeline management to forecasting, just to name a few. Thus, your ICP should include the ideal person at the ideal company and the problem you’re looking to solve for them.
Should the ICP include adjacent systems? Yes. Deciding at the outset if you want to focus on customers using specific, adjacent systems is often critical (e.g., NetSuite vs. Oracle vs. Xero for core financials, Salesforce vs. HubSpot for CRM). The alternative is drowning in integration work while never having the time to support the idiosyncrasies of a given package which, when you do it, is usually adored by customers.
Should an ICP include sales qualification criteria? No. The ICP is about the buyer: this is the person we’re looking for. They have this job at this kind of company. Whether they’re out shopping right now, whether they have budget, whether they have a buying timeframe and purchasing authority are all important qualification questions, but they are not part of the ICP itself. People differ on this, I know [1].

Because the world is imperfect and it’s difficult to find “Mr. or Ms. Right” every time, it’s useful to think of the ICP as a bullseye. The absolute perfect customer is in ring 0, the next level off in ring 1, after that ring 2, et cetera. Note that I have no religion about the things you vary across the rings, but the usual candidates are: job title, industry, size, adjacent systems, and problem (aka use-case). And you might do them in unusual combinations. For example, if you think a director of finance with a budgeting problem is about as good as a manager of finance at a bigger company with an operational reporting problem, then you can put them both in ring 2.

The idea is to give you a simple and flexible model to agree on who to target and who to prioritize across sales, marketing, and product.

For a zero-to-one startup, you might focus exclusively on ring 0. As you grow you will typically get more use-cases, more industries, more adjacent systems, and thus more rings. That’s fine as long as you’re defining the rings clearly and triaging them into: hot pursue, pursue, and slow-roll or some similar encoding system.

With a few clearly established tiers you are now ready to report on ARR and pipeline by ICP tier to see if “you’re walking the talk” when it comes to your ICP. At many companies, you will find the majority of the ARR and pipeline [2] outside ring 2 or 3. In these cases, you simply aren’t living your ICP and instead suffering from a faux focus. The usual cause is an inability to control the sales force and prevent their default “chase anything” behavior [3].

The ICP is typically born in the founder’s head as an hypothesis. Think: I bet if we can build something like this, it will solve a problem like that. By the time a company has been founded and a product built, it becomes an aspiration. Think: I want to sell to people like this to solve a problem like that. So you sharpen your definitions of this and that, and add some additional targeting criteria like company size, industry, or adjacent systems. And then you go off and sell.

Let’s say it works. One day you look up and you’re now $50M or $100M in ARR. Congratulations. Should your ICP still be an intuition-driven aspiration? No. It should be a regression. Reality happened. Let’s find out what reality is telling us.

Are the people in our ICP ring 0 really our best customers?

Well, what do we mean by best? Do they have higher ASPs? Do they have shorter sales cycles? Do they renew at higher rates? Do they expand at higher rates (e.g., NRR)? Do we win new deals at higher rates? Do they give us higher CSAT scores?

At the first order, these are all just simple segmented metrics calculations that you can and should do. Your QBR and board decks should show these key metrics segmented by ICP tier [4]. And — since not all these metrics can be important — your e-team also needs to have the conversation about “what do we mean by best” so you can have a common, precise definition of the “best” customers that you are trying to target [5].

But the best answers to these questions are not performed using segment analysis [6]. Segment analysis is great for finding anomalies — e.g., why do we have a higher win rate in ICP tier 3 than tier 1? But it’s not a great technique for actually finding the impact of different variables on the success criteria.

For that, we need regression analysis. Regression analysis will tell us which variables most strongly correlate with the outcomes we want (e.g., that the strongest predictor of renewal is company size, not CSAT) [7]. A good regression analysis will tell you not only which factors most correlate with the outcome, but it can also be the best way to bucket those variables (e.g., the real breakpoint is at 250 employees, but your initial segment went from 0 to 500).

Odds are, when we do this kind of analysis we’ll find lots of surprises. Some of your intuition will be proven correct, but some won’t. And you’ll likely find entirely new variables (e.g., number of data scientists) that you didn’t even consider in your initial ICP exercises.

So this is why I like to say that your ICP starts as an aspiration — about who you want to sell to — and over time becomes a regression. Because one day you will have lots of data to analyze to determine who your best customers are — subject to your definition of best, of course — as opposed to who you thought they would be.

# # #

Notes

[1] Regardless of where you land at least be aware there are two types of criteria: those that change slowly or not at all (e.g., company size, adjacent systems, industry) and those that can change overnight (e.g., out shopping, budget, authority). My analogy here is dating: you can meet the right person at the wrong time. It doesn’t change the fact that they’re the right person. (And that’s why God made nurture tracks.)

[2] Think of pipeline as a potential leading indicator of ARR. Well, it should be, at least.

[3] Using the ICP in territory and compensation plan definitions can help with that. Think: you only earn commissions on customers in ICP rings 1 through 3 within your geographic territory. That will get your sellers’ attention.

[4] Note that I’m kind of using ICP tier and ring synonomously here and that’s generally OK. However, in cases where you have lots of rings, I would then sort those rings into tiers, so ring is the more specific and tier the more general term. For me, because I like simplicity, I want to see ICP segmentation in at most 3-4 buckets, so if there are N rings underlying those, I’d prefer to hide those by using 3-4 tiers.

[5] You probably don’t want marketing targeting high LTV prospects when sales wants to target high win rate ones. We should all be on the same targeting page.

[6] One of the key problems being that the segments themselves were somewhat arbitrarily chosen. Sure, we did our best to guess who’d be our best customers. But who are they actually? We may have used not only the wrong bucket boundaries (e.g., 100 emps vs. 500 emps) but even the wrong dimensions (e.g., maybe company size is a poor predictor and industry or use-case a powerful one).

[7] I cheated here on purpose to see if you were paying attention. Thus far, we’ve largely said the ICP is about a company (firmographics) and a role/persona. But here I’ve said that company size is a better predictor or renewal than CSAT — and CSAT isn’t a ICP-style criteria. The reality is these tools can do precisely that, looking across a wide range of input variables to see which most influence the output. Obviously, for marketing targeting purposes we don’t want CSAT to be an input variable to the model, but for renewals analysis we sure would.

3 Comments

Posted in Customer Success, Data Science, Entrepreneurship, Marketing, Sales, Scaling, Startups

Tagged ICP

Appearance on Data Radicals: Frameworks and the Art of Simplification

Posted on August 16, 2023 | 2 comments

This is a quick post to highlight my recent appearance on the Data Radicals podcast (Apple, Spotify), hosted by Alation founder and CEO, Satyen Sangani. I’ve worked with Alation for a long time in varied capacities — e.g., as an angel investor, advisor, director, interim executive, skit writer, and probably a few other ways I can’t remember. This is a company I know well. They’re in a space I’m passionate about — and one that I might argue is a logical second generation of the semantic-layer-based BI market where I spent nearly ten years as CMO of Business Objects.

Satyen is a founder for whom I have a ton of respect, not only because of what he’s created, but because of the emphasis on culture and values reflected in how did it. Satyen also appreciates a good intellectual sparring match when making big decisions — something many founders pretend to enjoy, few actually do, and fewer still seek out.

This is an episode like no other I’ve done because of that history and because of the selection of topics that Satyen chose to cover as a result. This is not your standard Kellblog “do CAC on a cash basis,” “use pipeline expected value as a triangulation forecast,” or “align marketing with sales” podcast episode. Make no mistake, I love those too — but this is just noteably different content from most of my other appearances.

Here, we talk about:

The history and evolution of the database and tools market
The modern data stack
Intelligent operational applications vs. analytic applications
Why I feel that data can often end up an abstraction contest (and what to do about that)
Why I think in confusing makets that the best mapmaker wins
Who benefits from confusion in markets — and who doesn’t
Frameworks, simplification, and reductionism
Strategy and distilling the essence of a problem
Layering marketing messaging using ternary trees
The people who most influenced my thinking and career
The evolution of the data intelligence category and its roots in data governance and data catalogs
How tech markets are like boxing matches — you win a round and your prize is to earn the chance to fight in the next one
Data culture as an ultimate benefit and data intelligence as a software category

I hope you can listen to the episode, also available on Apple podcasts and Spotify. Thanks to Satyen for having me and I wish Alation continuing fair winds and following seas.

2 Comments

Posted in Data Science, Decision Making, Silicon Valley, Startups

Tagged data intelligence, messaging, Podcasts

Joining the Profisee Board of Directors

Posted on August 27, 2019 | Leave a comment

We’re announcing today that I’m joining the board of directors of Profisee, a leader in master data management (MDM). I’m doing so for several reasons, mostly reflecting my belief that successful technology companies are about three things: the people, the space, and the product.

I like the people at both an investor and management level. I’m old friends with a partner at ParkerGale, the private equity (PE) firm backing Profisee, and I quite like the people at ParkerGale, the culture they’ve created, their approach to working with companies, and of course the lead partner on Profisee, Kristina Heinze.

The management team, led by veteran CEO and SAP alumnus Len Finkle, is stocked with domain experts from larger companies including SAP, Oracle, Hyperion, and Informatica. What’s more, Gartner VP and analyst Bill O’Kane recently joined the company. Bill covered the space at Gartner for over 8 years and has personally led MDM initiatives at companies including MetLife, CA Technologies, Merrill Lynch, and Morgan Stanley. It’s hard to read Bill’s decision to join the team as anything but a big endorsement of the company, its leadership, and its strategy.

These people are the experts. And instead of working at a company where MDM is an element of an element of a suite that no one really cares about anymore, they are working at a focused market leader that worries about MDM — and only MDM – all day, every day. Such focus is powerful.

I like the MDM space for several reasons:

It’s a little obscure. Many people can’t remember if MDM stands for metadata management or master data management (it’s the latter). It’s under-penetrated; relatively few companies who can benefit from MDM use it. Historically the market has been driven by “reluctant spend” to comply with regulatory requirements. Megavendors don’t seem to care much about MDM anymore, with IBM losing market share and Oracle effectively exiting the market. It’s the perfect place for a focused specialist to build a team of people who are passionate about the space and build a market-leading company.

It’s substantial. It’s a $1B market today growing at 5%. You can build a nice company stealing share if you need to, but I think there’s an even bigger opportunity.

It’s teed up to grow. On the operational side, I think that single source of truth, digital transformation, and compliance initiatives will drive the market. On the analytical side, if there’s one thing 20+ years in and around business intelligence (BI) has taught me, it’s GIGO (garbage in, garbage out). If you think the GIGO rule was important in traditional BI, I’d argue it’s about ten times more important in an artificial intelligence and machine learning (AI/ML) world. Garbage data in, garbage model and garbage predictions out. Data quality is the Achilles’ heel of modern analytics.

I like Profisee’s product because:

It’s delivering well for today’s customers.
It has the breadth to cover a wide swath of MDM domains and use-cases.
It provides a scalable platform with a broad range of MDM-related functionality, as opposed to a patchwork solution set built through acquisition.
It’s easy to use and makes solving complex problems simple.
It’s designed for rapid implementation, so it’s less costly to implement and faster to get in production which is great for both committed MDM users and — particularly important in an under-penetrated market – those wanting to give MDM a try.

I look forward to working with Len, Kristina, and the team to help take Profisee to the next level, and beyond.

Now, before signing off, let me comment on how I see Profisee relative to my existing board seat at Alation. Alation defined the catalog space, has an impressive list of enterprise customers, raised a $50M round earlier this year, and has generally been killing it. If you don’t know the data space well you might see these companies as competitive; in reality, they are complementary and I think it’s synergistic for me to work with both.

Data catalogs help you locate data and understand the overall data set. For example, with a data catalog you can find all of the systems and data sets where you have customer data across operational applications (e.g., CRM, ERP, FP&A) and analytical systems (e.g., data warehouses, data lakes).

MDM helps you rationalize the data across your operational and analytical systems. At its core, MDM solves the problem of IBM being entered in your company’s CRM system as “Intl Business Machines,” in your ERP system as “International Business Machines,” and in your planning system as “IBM Corp,” to give a simple example. Among other approaches, MDM introduces the concept of a golden record which provides a single source of truth of how, in this example, the customer should be named.

In short, data catalogs help you find the right data and MDM ensures the data is clean when you find it. You pretty obviously need both.

My Appearance on DisrupTV Episode 100

Posted on April 3, 2018 | 1 comment

Last week I sat down with interviewers Doug Henschen, Vala Afshar, and a bit of Ray Wang (live from a 777 taxiing en route to Tokyo) to participate in Episode 100 of DisrupTV along with fellow guests DataStax CEO Billy Bosworth and big data / science recruiter Virginia Backaitis.

We covered a full gamut of topics, including:

The impact of artificial intelligence (AI) and machine learning (ML) on the enterprise performance management (EPM) market.
Why I joined Host Analytics some 5 years ago.
What it’s like competing with Oracle … for basically your entire career.
What it’s like selling enterprise software both upwind and downwind.
How I ended up on the board of Alation and what I like about data catalogs.
What I learned working at Salesforce (hint: shoshin)
Other lessons from BusinessObjects, MarkLogic, and even Ingres.

DisrupTV Episode 100, Featuring Dave Kellogg, Billy Bosworth, Virginia Backaitis from Constellation Research on Vimeo.

1 Comment

Posted in BI, Cloud, Data Science, Enterprise, Leadership, Management, SaaS, Uncategorized

Kellblog’s 2017 Predictions

Posted on January 2, 2017 | 1 comment

New Year’s means three things in my world: (1) time to thank our customers and team at Host Analytics for another great year, (2) time to finish up all the 2017 planning items and approvals that we need to get done before the sales kickoff (including the one most important thing to do before kickoff), and time to make some predictions for the coming year.

Before looking at 2017, let’s see how I did with my 2016 predictions.

2016 Predictions Review

The great reckoning begins. Correct/nailed. As predicted, since most of the bubble was tied up in private companies owned by private funds, the unwind would happen in slow motion. But it’s happening.
Silicon Valley cools off a bit. Partial. While IPOs were down, you couldn’t see the cooling in anecdotal data, like my favorite metric, traffic on highway101.
Porter’s five forces analysis makes a comeback. Partial. So-called “momentum investing” did cool off, implying more rational situation analysis, but you didn’t hear people talking about Porter per se.
Cyber-cash makes a rise. Correct. Bitcoin more doubled on the year (and Ethereum was up 8x) which perversely reinforced my view that these crypto-currencies are too volatile — people want the anonymity of cash without a highly variable exchange rate. The underlying technology for Bitcoin, blockchain, took off big time.
Internet of Things goes into trough of disillusionment. Partial. I think I may have been a little early on this one. Seems like it’s still hovering at the peak of inflated expectations.
Data science rises as profession. Correct/easy. This continues inexorably.
SAP realizes they are a complex enterprise application company. Incorrect. They’re still “running simple” and talking too much about enabling technology. The stock was up 9% on the year in line with revenues up around 8% thus far.
Oracle’s cloud strategy gets revealed – “we’ll sell you any deployment model you want as long as your annual bill goes up.” Partial. I should have said “we’ll sell you any deployment model you want as long as we can call it cloud to Wall St.”
Accounting irregularities discovered at one or more unicorns. Correct/nailed. During these bubbles the pattern always repeats itself – some people always start breaking the rules in order to stand out, get famous, or get rich. Fortune just ran an amazing story that talks about the “fake it till you make it” culture of some diseased startups.
Startup workers get disappointed on exits. Partial. I’m not aware of any lawsuits here but workers at many high flyers have been disappointed and there is a new awareness that the “unicorn party” may be a good thing for founders and VCs, but maybe not such a good thing for rank-and-file employees (and executive management).
The first cloud EPM S-1 gets filed. Incorrect. Not yet, at least. While it’s always possible someone did the private filing process with the SEC, I’m guessing that didn’t happen either.
2016 will be a great year for Host Analytics. Correct. We had a strong finish to the year and emerged stronger than we started with over 600 great customers, great partners, and a great team.

Now, let’s move on to my predictions for 2017 which – as a sign of the times – will include more macro and political content than usual.

The United States will see a level of divisiveness and social discord not seen since the 1960s. Social media echo chambers will reinforce divisions. To combat this, I encourage everyone to sign up for two publications/blogs they agree with and two they don’t lest they never again hear both sides of an issue. (See map below, coutesy of Ninja Economics, for help in choosing.) On an optimistic note, per UCSD professor Lane Kenworthy people aren’t getting more polarized, political parties are.

news

Social media companies finally step up and do something about fake news. While per a former Facebook designer, “it turns out that bullshit is highly engaging,” these sites will need to do something to filter, rate, or classify fake news (let alone stopping to recommend it). Otherwise they will both lose credibility and readership – as well as fail to act in a responsible way commensurate with their information dissemination power.

Gut feel makes a comeback. After a decade of Google-inspired heavily data-driven and A/B-tested management, the new US administration will increasingly be less data-driven and more gut-feel-driven in making decisions. Riding against both common sense and the big data / analytics / data science trends, people will be increasingly skeptical of purely data-driven decisions and anti-data people will publicize data-driven failures to popularize their arguments. This “war on data” will build during the year, fueled by Trump, and some of it will spill over into business. Morale in the Intelligence Community will plummet.

Under a volatile leader, who seems to exhibit all nine of the symptoms of narcissistic personality disorder, we can expect sharp reactions and knee-jerk decisions that rattle markets, drive a high rate of staff turnover in the Executive branch, and fuel an ongoing war with the media. Whether you like his policies or not, Trump will bring a high level of volatility the country, to business, and to the markets.

With the new administration’s promises of $1T in infrastructure spending, you can expect interest rates to raise and inflation to accelerate. Providing such a stimulus to already strong economy might well overheat it. One smart move could be buying a house to lock in historic low interest rates for the next 30 years. (See my FAQ for disclaimers, including that I am not a financial advisor.)

Huge emphasis on security and privacy. Election-related hacking, including the spearfishing attack on John Podesta’s email, will serve as a major wake-up call to both government and the private sector to get their security act together. Leaks will fuel major concerns about privacy. Two-factor authentication using verification codes (e.g., Google Authenticator) will continue to take off as will encrypted communications. Fear of leaks will also change how people use email and other written electronic communications; more people will follow the sage advice in this quip:

Dance like no one’s watching; E-mail like it will be read in a deposition

In 2015, if you were flirting on Ashley Madison you were more likely talking to a fembot than a person. In 2016, the same could be said of troll bots. Bots are now capable of passing the Turing Test. In 2017, we will see more bots for both good uses (e.g., customer service) and bad (e.g., trolling social media). Left unchecked by the social media powerhouses, bots could damage social media usage.

Artificial intelligence hits the peak of inflated expectations. If you view Salesforce as the bellwether for hyped enterprise technology (e.g., cloud, social), then the next few years are going to be dominated by artificial intelligence. I’ve always believed that advanced analytics is not a standalone category, but instead fodder that vendors will build into smart applications. They key is typically not the technology, but the problem to which to apply it. As Infer founder Vik Singh said of Jim Gray, “he was really good at finding great problems,” the key is figuring out the best problems to solve with a given technology or modeling engine. Application by application we will see people searching for the best problems to solve using AI technology.

The IPO market comes back. After a year in which we saw only 13 VC-backed technology IPOs, I believe the window will open and 2017 will be a strong year for technology IPOs. The usual big-name suspects include firms like Snap, Uber, AirBnB, and Spotify. CB Insights has identified 369 companies as strong 2017 IPO prospects.

Megavendors mix up EPM and ERP or BI. Workday, which has had a confused history when it comes to planning, acquired struggling big data analytics vendor Platfora in July 2016, and seems to have combined analytics and EPM/planning into a single unit. This is a mistake for several reasons: (1) EPM and BI are sold to different buyers with different value propositions, (2) EPM is an applications sale, BI is a platform sale, and (3) Platfora’s technology stack, while appropriate for big data applications is not ideal for EPM/planning (ask Tidemark). Combining the two together puts planning at risk. Oracle combined their EPM and ERP go-to-market organizations and lost focus on EPM as a result. While they will argue that they now have more EPM feet on the street, those feet know much less about EPM, leaving them exposed to specialist vendors who maintain a focus on EPM. ERP is sold to the backward-looking part of finance; EPM is sold to the forward-looking part. EPM is about 1/10^th the market size of ERP. ERP and EPM have different buyers and use different technologies. In combining them, expect EPM to lose out.

And, as usual, I must add the bonus prediction that 2017 proves to be a strong year for Host Analytics. We are entering the year with positive momentum, the category is strong, cloud adoption in finance continues to increase, and the megavendors generally lack sufficient focus on the category. We continue to be the most customer-focused vendor in EPM, our new Modeling product gained strong momentum in 2016, and our strategy has worked very well for both our company and the customers who have chosen to put their faith in us.

I thank our customers, our partners, and our team and wish everyone a great 2017.

# # #

1 Comment

Posted in BI, Bubbles, Data Science, Decision Making, Disruption, Enterprise, IPO, Predictions, Silicon Valley, Startups, VC

Category Archives: Data Science

Your ICP Starts as an Aspiration and Becomes a Regression

Appearance on Data Radicals: Frameworks and the Art of Simplification

Joining the Profisee Board of Directors

My Appearance on DisrupTV Episode 100

Kellblog’s 2017 Predictions

Pages

Follow Blog via Email

Popular Posts

Categories

Search Kellblog

Archives