Category Archives: Data Science

Kellblog’s 2017 Predictions  

New Year’s means three things in my world:  (1) time to thank our customers and team at Host Analytics for another great year, (2) time to finish up all the 2017 planning items and approvals that we need to get done before the sales kickoff (including the one most important thing to do before kickoff), and time to make some predictions for the coming year.

Before looking at 2017, let’s see how I did with my 2016 predictions.

2016 Predictions Review

  1. The great reckoning begins. Correct/nailed.  As predicted, since most of the bubble was tied up in private companies owned by private funds, the unwind would happen in slow motion.  But it’s happening.
  2. Silicon Valley cools off a bit. Partial.  While IPOs were down, you couldn’t see the cooling in anecdotal data, like my favorite metric, traffic on highway101.
  3. Porter’s five forces analysis makes a comeback. Partial.  So-called “momentum investing” did cool off, implying more rational situation analysis, but you didn’t hear people talking about Porter per se.
  4. Cyber-cash makes a rise. CorrectBitcoin more doubled on the year (and Ethereum was up 8x) which perversely reinforced my view that these crypto-currencies are too volatile — people want the anonymity of cash without a highly variable exchange rate.  The underlying technology for Bitcoin, blockchain, took off big time.
  5. Internet of Things goes into trough of disillusionment. Partial.  I think I may have been a little early on this one.  Seems like it’s still hovering at the peak of inflated expectations.
  6. Data science rises as profession. Correct/easy.  This continues inexorably.
  7. SAP realizes they are a complex enterprise application company. Incorrect.  They’re still “running simple” and talking too much about enabling technology.  The stock was up 9% on the year in line with revenues up around 8% thus far.
  8. Oracle’s cloud strategy gets revealed – “we’ll sell you any deployment model you want as long as your annual bill goes up.”  Partial.  I should have said “we’ll sell you any deployment model you want as long as we can call it cloud to Wall St.”
  9. Accounting irregularities discovered at one or more unicorns. Correct/nailed.  During these bubbles the pattern always repeats itself – some people always start breaking the rules in order to stand out, get famous, or get rich.  Fortune just ran an amazing story that talks about the “fake it till you make it” culture of some diseased startups.
  10. Startup workers get disappointed on exits. Partial.  I’m not aware of any lawsuits here but workers at many high flyers have been disappointed and there is a new awareness that the “unicorn party” may be a good thing for founders and VCs, but maybe not such a good thing for rank-and-file employees (and executive management).
  11. The first cloud EPM S-1 gets filed. Incorrect.  Not yet, at least.  While it’s always possible someone did the private filing process with the SEC, I’m guessing that didn’t happen either.
  12. 2016 will be a great year for Host Analytics. Correct.  We had a strong finish to the year and emerged stronger than we started with over 600 great customers, great partners, and a great team.

Now, let’s move on to my predictions for 2017 which – as a sign of the times – will include more macro and political content than usual.

  1. The United States will see a level of divisiveness and social discord not seen since the 1960s. Social media echo chambers will reinforce divisions.  To combat this, I encourage everyone to sign up for two publications/blogs they agree with and two they don’t lest they never again hear both sides of an issue. (See map below, coutesy of Ninja Economics, for help in choosing.)  On an optimistic note, per UCSD professor Lane Kenworthy people aren’t getting more polarized, political parties are.

news

  1. Social media companies finally step up and do something about fake news. While per a former Facebook designer, “it turns out that bullshit is highly engaging,” these sites will need to do something to filter, rate, or classify fake news (let alone stopping to recommend it).  Otherwise they will both lose credibility and readership – as well as fail to act in a responsible way commensurate with their information dissemination power.
  1. Gut feel makes a comeback. After a decade of Google-inspired heavily data-driven and A/B-tested management, the new US administration will increasingly be less data-driven and more gut-feel-driven in making decisions.  Riding against both common sense and the big data / analytics / data science trends, people will be increasingly skeptical of purely data-driven decisions and anti-data people will publicize data-driven failures to popularize their arguments.  This “war on data” will build during the year, fueled by Trump, and some of it will spill over into business.  Morale in the Intelligence Community will plummet.
  1. Under a volatile leader, who seems to exhibit all nine of the symptoms of narcissistic personality disorder, we can expect sharp reactions and knee-jerk decisions that rattle markets, drive a high rate of staff turnover in the Executive branch, and fuel an ongoing war with the media.  Whether you like his policies or not, Trump will bring a high level of volatility the country, to business, and to the markets.
  1. With the new administration’s promises of $1T in infrastructure spending, you can expect interest rates to raise and inflation to accelerate. Providing such a stimulus to already strong economy might well overheat it.  One smart move could be buying a house to lock in historic low interest rates for the next 30 years.  (See my FAQ for disclaimers, including that I am not a financial advisor.)
  1. Huge emphasis on security and privacy. Election-related hacking, including the spearfishing attack on John Podesta’s email, will serve as a major wake-up call to both government and the private sector to get their security act together.  Leaks will fuel major concerns about privacy.  Two-factor authentication using verification codes (e.g., Google Authenticator) will continue to take off as will encrypted communications.  Fear of leaks will also change how people use email and other written electronic communications; more people will follow the sage advice in this quip:

Dance like no one’s watching; E-mail like it will be read in a deposition

  1. In 2015, if you were flirting on Ashley Madison you were more likely talking to a fembot than a person.  In 2016, the same could be said of troll bots.  Bots are now capable of passing the Turing Test.  In 2017, we will see more bots for both good uses (e.g., customer service) and bad (e.g., trolling social media).  Left unchecked by the social media powerhouses, bots could damage social media usage.
  1. Artificial intelligence hits the peak of inflated expectations. If you view Salesforce as the bellwether for hyped enterprise technology (e.g., cloud, social), then the next few years are going to be dominated by artificial intelligence.  I’ve always believed that advanced analytics is not a standalone category, but instead fodder that vendors will build into smart applications.  They key is typically not the technology, but the problem to which to apply it.  As Infer founder Vik Singh said of Jim Gray, “he was really good at finding great problems,” the key is figuring out the best problems to solve with a given technology or modeling engine.  Application by application we will see people searching for the best problems to solve using AI technology.
  1. The IPO market comes back. After a year in which we saw only 13 VC-backed technology IPOs, I believe the window will open and 2017 will be a strong year for technology IPOs.  The usual big-name suspects include firms like Snap, Uber, AirBnB, and SpotifyCB Insights has identified 369 companies as strong 2017 IPO prospects.
  1. Megavendors mix up EPM and ERP or BI. Workday, which has had a confused history when it comes to planning, acquired struggling big data analytics vendor Platfora in July 2016, and seems to have combined analytics and EPM/planning into a single unit.  This is a mistake for several reasons:  (1) EPM and BI are sold to different buyers with different value propositions, (2) EPM is an applications sale, BI is a platform sale, and (3) Platfora’s technology stack, while appropriate for big data applications is not ideal for EPM/planning (ask Tidemark).  Combining the two together puts planning at risk.  Oracle combined their EPM and ERP go-to-market organizations and lost focus on EPM as a result.  While they will argue that they now have more EPM feet on the street, those feet know much less about EPM, leaving them exposed to specialist vendors who maintain a focus on EPM.  ERP is sold to the backward-looking part of finance; EPM is sold to the forward-looking part.  EPM is about 1/10th the market size of ERP.  ERP and EPM have different buyers and use different technologies.  In combining them, expect EPM to lose out.

And, as usual, I must add the bonus prediction that 2017 proves to be a strong year for Host Analytics.  We are entering the year with positive momentum, the category is strong, cloud adoption in finance continues to increase, and the megavendors generally lack sufficient focus on the category.  We continue to be the most customer-focused vendor in EPM, our new Modeling product gained strong momentum in 2016, and our strategy has worked very well for both our company and the customers who have chosen to put their faith in us.

I thank our customers, our partners, and our team and wish everyone a great 2017.

# # #

 

So You Wanna Be a Data Scientist

I’ve often said that “data science” is the new “plastics,” hearkening back to that famous scene in The Graduate where a neighbor gives cryptic one-word career advice to the young graduate Benjamin Braddock, portrayed by Dustin Hoffman.

I’ve told my own son data science numerous times as well.  (Yes, that’s to the one in college, not grade school, but I suppose it’s never too early to start.)

The question this begs is how to become a data scientist.  Few schools have a data science major, per se, but many schools are starting to offer related majors at both the undergraduate and graduate level.  Some, like Northwestern, even do this online.

The other day, I found this great post on the subject from Zipfian Academy  and I not only tweeted it on the spot, but wanted to blog about it here.

Here’s the introduction:

There are plenty of articles and discussions on the web about what data science is, what qualities define a data scientist, how to nurture them, and how you should position yourself to be a competitive applicant. There are far fewer resources out there about the steps to take in order to obtain the skills necessary to practice this elusive discipline. Here we will provide a collection of freely accessible materials and content to jumpstart your understanding of the theory and tools of Data Science.

The full post is here.

Freakonomics: A Must Read

Book cover of Every once in a while I sneak the odd book review into my blog, and this time the selection is Freakonomics: A Rogue Economist Explores the Hidden Side of Everything by Steven D. Levitt and Stephen J. Duber. This is not exactly a timely review; the book was published in 2005 and, as of 2007, it had sold over 3M copies.

As it turns out, Freaknomics is the first book that I’ve read cover-to-cover (so to speak) on my Kindle. While I’ve purchased a few Kindle books (e.g., a useless tax guide, a David Baldacci novel, and Against the Gods) and several Kindle French newspapers (you can score what’s usually tomorrow’s issue of Le Monde for $0.75), for whatever reason, I’d never before managed to get all the way through a work on my Kindle. It could be the device, it could be the books. Time and experience will tell.

Levitt describes himself as a rogue economist, but I think of him more as a rogue data junkie or data miner. The guy loves to pour through data and look for interesting and often non-obvious correlations, sometimes over very long periods of time. Among other issues, the book investigates questions such as:

  • What is the effect of information asymmetry between a client and a Realtor?
  • How can you discover cheating through mining large data volumes — either in schools or amongst Sumo wrestlers?
  • Why do crack cocaine dealers tolerate such low pay and poor (i.e., often deadly) working conditions?
  • What explains the relatively sharp drop-off in violent crime in the 1990s? What percent of the drop is explained by more police? What explains the rest? (And, boy, will you find the answer controversial.)
  • Does “good parenting” matter? Do Mommy-and-me and other head-start programs actually increase test scores later in life? What about habits like daily oral reading?
  • What is the sociological impact of a child’s name?

One of the bittersweet tales surrounding Levitt is that he and his wife lost a baby to Meningitis several years ago, prompting him to join a support group for parents who’d lost children. During his involvement with that group, he was shocked at the number of children who drowned in the family pool, prompting him to start up his data mining engine, eventually arriving at the following conclusion: if you have both a gun and a swimming pool, your child is 100x more likely to die from the pool than the gun.

Here is the Freaknomonics book website. Here is the Freakonomics blog, hosted by the New York Times.

Reblog this post [with Zemanta]